|
5eecda7e28
|
cleanup log activations
|
2025-07-20 09:10:05 +05:30 |
|
|
a713c92b82
|
cleanup hook model outputs
|
2025-07-20 09:02:34 +05:30 |
|
|
5bdedcffec
|
remove labml_helpers dep
|
2025-07-20 08:56:03 +05:30 |
|
|
1b702523b9
|
remove labml_helpers dependency: replace Module with nn.Module
|
2025-07-18 10:32:36 +05:30 |
|
|
e09ee89f36
|
Transformer experiment logs (#130)
|
2022-06-27 14:11:44 +05:30 |
|
|
0ce65adf9e
|
RoPER (#126)
|
2022-06-03 21:29:41 +05:30 |
|
|
aa311eb30d
|
experiemnt logs
|
2022-05-03 09:12:18 +01:00 |
|
|
a7a7a3bdb7
|
RETRO (#110)
|
2022-03-12 15:44:35 +05:30 |
|
|
6cd0ed168e
|
update torch
|
2021-10-29 09:05:59 +05:30 |
|
|
76ae56ff15
|
highlight code in notes
|
2021-10-22 18:34:59 +05:30 |
|
|
dc4da2106b
|
headings
|
2021-10-21 15:15:05 +05:30 |
|
|
c5b13162cf
|
anchors
|
2021-10-21 15:01:58 +05:30 |
|
|
d4b4c28840
|
typo fixes
|
2021-10-19 19:17:51 +05:30 |
|
|
6615c7158a
|
Primer EZ (#96)
|
2021-09-21 16:01:26 +05:30 |
|
|
02992a43ab
|
cleanup
|
2021-08-27 20:49:33 +05:30 |
|
|
a4c720debf
|
short long
|
2021-08-27 17:00:00 +05:30 |
|
|
eaa248c9e6
|
__call__ -> forward
|
2021-08-19 15:45:59 +05:30 |
|
|
876845d0c2
|
ResNet (#68)
|
2021-07-16 08:35:46 +05:30 |
|
|
1a9f15eebb
|
Distillation (#65)
|
2021-07-03 14:01:17 +05:30 |
|
|
7f05ed043b
|
FNet (#53)
|
2021-05-26 10:56:42 +05:30 |
|
|
2edf17fa8c
|
✨ group norm
|
2021-04-20 13:40:26 +05:30 |
|
|
983286e216
|
📚 batch norm
|
2021-02-01 14:43:11 +05:30 |
|
|
20d2e27a3c
|
📚 ffn notes
|
2021-01-25 22:09:11 +05:30 |
|
|
2927aa217b
|
gpt notes
|
2021-01-14 09:37:12 +05:30 |
|
|
4c769128cb
|
autoregression trainer notes
|
2021-01-14 08:46:41 +05:30 |
|
|
774a72e0bf
|
gpt
|
2021-01-13 12:54:43 +05:30 |
|
|
ce190701e8
|
global step
|
2021-01-07 21:11:50 +05:30 |
|
|
c124348b14
|
gradient clipping
|
2021-01-07 13:33:32 +05:30 |
|
|
dfefde657b
|
no loss smoothing
|
2021-01-07 12:02:19 +05:30 |
|
|
b9da12ee3f
|
d_model
|
2021-01-06 17:24:09 +05:30 |
|
|
4fe8392a8d
|
auto regression common exp
|
2020-12-27 07:30:20 +05:30 |
|
|
c64b98e390
|
basic auto regression experiment for reuse
|
2020-12-26 21:12:44 +05:30 |
|