|
5eecda7e28
|
cleanup log activations
|
2025-07-20 09:10:05 +05:30 |
|
|
a713c92b82
|
cleanup hook model outputs
|
2025-07-20 09:02:34 +05:30 |
|
|
5bdedcffec
|
remove labml_helpers dep
|
2025-07-20 08:56:03 +05:30 |
|
|
1b702523b9
|
remove labml_helpers dependency: replace Module with nn.Module
|
2025-07-18 10:32:36 +05:30 |
|
|
9a42ac2697
|
arxiv.org links
|
2023-10-24 14:42:32 +01:00 |
|
|
b43fb807a8
|
sophia speed up
|
2023-07-15 08:30:41 +05:30 |
|
|
0101618de6
|
links
|
2023-07-14 21:27:44 +05:30 |
|
|
8db330dd22
|
sophia-g docs
|
2023-07-14 21:25:08 +05:30 |
|
|
7c02294e7c
|
sophia exp
|
2023-07-14 16:44:45 +05:30 |
|
|
f45ca5ee69
|
sophia wip
|
2023-07-14 15:40:26 +05:30 |
|
|
c5685c9ffe
|
remove app.labml.ai links
|
2023-04-02 12:10:18 +05:30 |
|
|
980a84ed4f
|
Zero3 memory optimizations (#140)
|
2022-08-11 15:44:13 +05:30 |
|
|
c5d9235280
|
Typo fixes (#125)
|
2022-06-03 08:38:12 +05:30 |
|
|
31e4b36070
|
typo fix
|
2021-12-14 16:13:25 +05:30 |
|
|
8fab7f5389
|
fix katex error
|
2021-11-02 14:18:41 +05:30 |
|
|
a8954c1cbb
|
fix math align
|
2021-10-24 17:26:28 +05:30 |
|
|
76ae56ff15
|
highlight code in notes
|
2021-10-22 18:34:59 +05:30 |
|
|
7f74eeba77
|
\textcolor
|
2021-10-21 17:32:54 +05:30 |
|
|
dc4da2106b
|
headings
|
2021-10-21 15:15:05 +05:30 |
|
|
c5b13162cf
|
anchors
|
2021-10-21 15:01:58 +05:30 |
|
|
d4b4c28840
|
typo fixes
|
2021-10-19 19:17:51 +05:30 |
|
|
3b3f4d9471
|
fix adam paper link
|
2021-08-21 11:15:50 +05:30 |
|
|
996b58be04
|
paper links
|
2021-08-17 14:12:33 +05:30 |
|
|
876845d0c2
|
ResNet (#68)
|
2021-07-16 08:35:46 +05:30 |
|
|
c2107755bb
|
labml app links
|
2021-02-27 17:54:11 +05:30 |
|
|
4f31570f92
|
📚 optimizers readme
|
2021-02-23 17:49:00 +05:30 |
|
|
5388e807e1
|
layer norm
|
2021-02-02 11:18:09 +05:30 |
|
|
8800c43509
|
Minor fixes
|
2021-01-30 17:24:05 +05:30 |
|
|
9b09a5f3d2
|
pytorch link
|
2021-01-30 13:38:15 +05:30 |
|
|
2927aa217b
|
gpt notes
|
2021-01-14 09:37:12 +05:30 |
|
|
774a72e0bf
|
gpt
|
2021-01-13 12:54:43 +05:30 |
|
|
b05faa9b98
|
default warm up
|
2020-12-23 13:57:34 +05:30 |
|
|
634809af3c
|
noam chart fix
|
2020-12-23 13:52:08 +05:30 |
|
|
8078061c51
|
🐛 ada belief fix
|
2020-12-16 14:12:38 +05:30 |
|
|
91d917ae00
|
optimizers path fix
|
2020-12-14 10:23:58 +05:30 |
|
|
ef922321be
|
optimizers
|
2020-12-14 09:33:07 +05:30 |
|
|
5b1897b792
|
optimizers
|
2020-12-14 09:31:11 +05:30 |
|
|
19ed54c6e1
|
radam plot
|
2020-12-10 16:02:01 +05:30 |
|
|
14bed4c432
|
radam
|
2020-12-10 14:18:07 +05:30 |
|
|
10ee239a14
|
unoptimized adam
|
2020-12-10 10:51:19 +05:30 |
|
|
4d58757671
|
unoptimized adam
|
2020-12-10 10:50:18 +05:30 |
|
|
443458e812
|
summaries
|
2020-12-10 08:42:06 +05:30 |
|
|
b7d5c5db75
|
optimizer links
|
2020-12-08 07:12:17 +05:30 |
|
|
22fb0b79a2
|
colab
|
2020-12-07 10:15:14 +05:30 |
|
|
116f1645f8
|
amsgrad
|
2020-12-07 10:07:10 +05:30 |
|
|
ca1cea7009
|
amsgrad synthetic experiment
|
2020-12-06 15:23:57 +05:30 |
|
|
264bdc8eeb
|
math fix
|
2020-12-06 08:54:21 +05:30 |
|
|
dc48f0a4e1
|
notes
|
2020-12-06 08:14:56 +05:30 |
|
|
c71a5c5ae2
|
adam comments
|
2020-12-06 07:22:36 +05:30 |
|
|
874c238651
|
📚 adam notes
|
2020-12-05 11:21:11 +05:30 |
|