|
5bdedcffec
|
remove labml_helpers dep
|
2025-07-20 08:56:03 +05:30 |
|
|
6df1d798c0
|
version
|
2024-08-24 14:34:44 +05:30 |
|
|
ba58ad9720
|
version
|
2023-11-07 09:06:49 +00:00 |
|
|
24c1f8684a
|
Update setup.py encoding='utf-8'
>> py .\setup.py
Traceback (most recent call last):
File "annotated_deep_learning_paper_implementations\setup.py", line 4, in <module>
^^^^^^^^
UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 951: illegal multibyte sequence
|
2023-08-26 06:05:36 +08:00 |
|
|
b43fb807a8
|
sophia speed up
|
2023-07-15 08:30:41 +05:30 |
|
|
594f89c8cc
|
version
|
2023-07-14 21:28:14 +05:30 |
|
|
d69b809cac
|
version
|
2022-09-24 15:35:31 +05:30 |
|
|
4c47086b0f
|
links
|
2022-09-15 14:55:48 +05:30 |
|
|
7d1550dd67
|
fix ddpm attn
|
2022-09-12 08:30:27 +05:30 |
|
|
3f2a42bc3a
|
version
|
2022-08-20 11:15:03 +05:30 |
|
|
7c2a9105f8
|
version
|
2022-08-15 08:45:21 +05:30 |
|
|
980a84ed4f
|
Zero3 memory optimizations (#140)
|
2022-08-11 15:44:13 +05:30 |
|
|
72669d0526
|
ALiBi (#134)
|
2022-07-17 09:28:32 +05:30 |
|
|
ee5a34aa59
|
experiment links transformer
|
2022-06-28 19:02:20 +05:30 |
|
|
e09ee89f36
|
Transformer experiment logs (#130)
|
2022-06-27 14:11:44 +05:30 |
|
|
58b24f6c83
|
Diffusion Notebook (#127)
|
2022-06-09 14:43:17 +05:30 |
|
|
0ce65adf9e
|
RoPER (#126)
|
2022-06-03 21:29:41 +05:30 |
|
|
6a41c82b30
|
FTA (#115)
|
2022-05-23 22:26:39 +05:30 |
|
|
45171c71c3
|
version
|
2022-05-03 09:17:33 +01:00 |
|
|
0aff72f970
|
DeepNorm (#114)
|
2022-04-10 08:08:55 +05:30 |
|
|
a7a7a3bdb7
|
RETRO (#110)
|
2022-03-12 15:44:35 +05:30 |
|
|
201ad98ef4
|
📇 dependencies
|
2021-12-20 11:33:29 +05:30 |
|
|
6cd0ed168e
|
update torch
|
2021-10-29 09:05:59 +05:30 |
|
|
6464269933
|
dqn images
|
2021-10-16 14:30:41 +05:30 |
|
|
9e430d2dba
|
ppo experiment configs
|
2021-10-02 13:57:47 +05:30 |
|
|
c7fb3f7f4c
|
version
|
2021-09-21 16:02:29 +05:30 |
|
|
583f8cfc81
|
version
|
2021-09-17 12:07:42 +05:30 |
|
|
103cf81a13
|
📇 versions
|
2021-08-29 14:59:11 +05:30 |
|
|
7d41961f2e
|
version
|
2021-08-28 14:26:26 +05:30 |
|
|
b6607524b8
|
Evidential Deep Learning to Quantify Classification Uncertainty (#85)
|
2021-08-21 10:25:32 +05:30 |
|
|
ff0d5c065d
|
ponder net highlighted paper
|
2021-08-15 16:58:18 +05:30 |
|
|
068225aa16
|
📇 version
|
2021-08-13 16:30:58 +05:30 |
|
|
0a8a26b220
|
📇 version
|
2021-08-13 16:27:30 +05:30 |
|
|
0e3d47c051
|
version
|
2021-08-08 08:37:02 +05:30 |
|
|
0171892afc
|
slack
|
2021-08-08 08:19:41 +05:30 |
|
|
48329bd64d
|
📇 version
|
2021-08-08 08:18:59 +05:30 |
|
|
f038ab673d
|
vit
|
2021-07-17 15:24:17 +05:30 |
|
|
e8a89ec994
|
remove loop for google colab
|
2021-06-21 17:33:13 +05:30 |
|
|
9b97aa25cf
|
version
|
2021-06-21 17:10:38 +05:30 |
|
|
f6e430a9b3
|
MLP Mixer (#59)
|
2021-06-16 09:36:13 +05:30 |
|
|
ebce404402
|
Masked Language Model (#56)
|
2021-06-06 15:12:11 +05:30 |
|
|
d449497222
|
readme
|
2021-05-21 15:05:02 +05:30 |
|
|
7526ec4f4c
|
Improved Training of Wasserstein GANs (#50)
* ✨ gradient penalty
* gp experiment
* 📚 gradient penalty
|
2021-05-09 14:11:43 +05:30 |
|
|
8a4222c36b
|
Weight standardization (#47)
* 🚧 weight standardization
* 🐛 small fixes
* 📚🚧 weight standardization
* 📚 weight standardization
* 📚 weight standardization experiment
* 📚 batch channel norm
* ✍️ corrections
* 📚 experiment links
|
2021-04-28 10:44:50 +05:30 |
|
|
e7e817ce20
|
📚 group norm
|
2021-04-24 14:44:38 +05:30 |
|
|
ba5c7200e8
|
📇 version
|
2021-04-04 13:22:16 +05:30 |
|
|
3d87b0b485
|
♻️ dynamic hp
|
2021-03-27 11:59:19 +05:30 |
|
|
e0e7f15da1
|
📇 version
|
2021-03-27 11:54:32 +05:30 |
|
|
2b3220bd10
|
fast weights experiment
|
2021-03-14 08:02:17 +05:30 |
|
|
a1b1550245
|
📚 compressive transformer
|
2021-02-19 08:34:17 +05:30 |
|