32 Commits

Author SHA1 Message Date
5eecda7e28 cleanup log activations 2025-07-20 09:10:05 +05:30
a713c92b82 cleanup hook model outputs 2025-07-20 09:02:34 +05:30
5bdedcffec remove labml_helpers dep 2025-07-20 08:56:03 +05:30
1b702523b9 remove labml_helpers dependency: replace Module with nn.Module 2025-07-18 10:32:36 +05:30
e09ee89f36 Transformer experiment logs (#130) 2022-06-27 14:11:44 +05:30
0ce65adf9e RoPER (#126) 2022-06-03 21:29:41 +05:30
aa311eb30d experiemnt logs 2022-05-03 09:12:18 +01:00
a7a7a3bdb7 RETRO (#110) 2022-03-12 15:44:35 +05:30
6cd0ed168e update torch 2021-10-29 09:05:59 +05:30
76ae56ff15 highlight code in notes 2021-10-22 18:34:59 +05:30
dc4da2106b headings 2021-10-21 15:15:05 +05:30
c5b13162cf anchors 2021-10-21 15:01:58 +05:30
d4b4c28840 typo fixes 2021-10-19 19:17:51 +05:30
6615c7158a Primer EZ (#96) 2021-09-21 16:01:26 +05:30
02992a43ab cleanup 2021-08-27 20:49:33 +05:30
a4c720debf short long 2021-08-27 17:00:00 +05:30
eaa248c9e6 __call__ -> forward 2021-08-19 15:45:59 +05:30
876845d0c2 ResNet (#68) 2021-07-16 08:35:46 +05:30
1a9f15eebb Distillation (#65) 2021-07-03 14:01:17 +05:30
7f05ed043b FNet (#53) 2021-05-26 10:56:42 +05:30
2edf17fa8c group norm 2021-04-20 13:40:26 +05:30
983286e216 📚 batch norm 2021-02-01 14:43:11 +05:30
20d2e27a3c 📚 ffn notes 2021-01-25 22:09:11 +05:30
2927aa217b gpt notes 2021-01-14 09:37:12 +05:30
4c769128cb autoregression trainer notes 2021-01-14 08:46:41 +05:30
774a72e0bf gpt 2021-01-13 12:54:43 +05:30
ce190701e8 global step 2021-01-07 21:11:50 +05:30
c124348b14 gradient clipping 2021-01-07 13:33:32 +05:30
dfefde657b no loss smoothing 2021-01-07 12:02:19 +05:30
b9da12ee3f d_model 2021-01-06 17:24:09 +05:30
4fe8392a8d auto regression common exp 2020-12-27 07:30:20 +05:30
c64b98e390 basic auto regression experiment for reuse 2020-12-26 21:12:44 +05:30