91 Commits

Author SHA1 Message Date
5bdedcffec remove labml_helpers dep 2025-07-20 08:56:03 +05:30
6df1d798c0 version 2024-08-24 14:34:44 +05:30
ba58ad9720 version 2023-11-07 09:06:49 +00:00
24c1f8684a Update setup.py encoding='utf-8'
>> py .\setup.py

Traceback (most recent call last):
  File "annotated_deep_learning_paper_implementations\setup.py", line 4, in <module>
                       ^^^^^^^^
UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 951: illegal multibyte sequence
2023-08-26 06:05:36 +08:00
b43fb807a8 sophia speed up 2023-07-15 08:30:41 +05:30
594f89c8cc version 2023-07-14 21:28:14 +05:30
d69b809cac version 2022-09-24 15:35:31 +05:30
4c47086b0f links 2022-09-15 14:55:48 +05:30
7d1550dd67 fix ddpm attn 2022-09-12 08:30:27 +05:30
3f2a42bc3a version 2022-08-20 11:15:03 +05:30
7c2a9105f8 version 2022-08-15 08:45:21 +05:30
980a84ed4f Zero3 memory optimizations (#140) 2022-08-11 15:44:13 +05:30
72669d0526 ALiBi (#134) 2022-07-17 09:28:32 +05:30
ee5a34aa59 experiment links transformer 2022-06-28 19:02:20 +05:30
e09ee89f36 Transformer experiment logs (#130) 2022-06-27 14:11:44 +05:30
58b24f6c83 Diffusion Notebook (#127) 2022-06-09 14:43:17 +05:30
0ce65adf9e RoPER (#126) 2022-06-03 21:29:41 +05:30
6a41c82b30 FTA (#115) 2022-05-23 22:26:39 +05:30
45171c71c3 version 2022-05-03 09:17:33 +01:00
0aff72f970 DeepNorm (#114) 2022-04-10 08:08:55 +05:30
a7a7a3bdb7 RETRO (#110) 2022-03-12 15:44:35 +05:30
201ad98ef4 📇 dependencies 2021-12-20 11:33:29 +05:30
6cd0ed168e update torch 2021-10-29 09:05:59 +05:30
6464269933 dqn images 2021-10-16 14:30:41 +05:30
9e430d2dba ppo experiment configs 2021-10-02 13:57:47 +05:30
c7fb3f7f4c version 2021-09-21 16:02:29 +05:30
583f8cfc81 version 2021-09-17 12:07:42 +05:30
103cf81a13 📇 versions 2021-08-29 14:59:11 +05:30
7d41961f2e version 2021-08-28 14:26:26 +05:30
b6607524b8 Evidential Deep Learning to Quantify Classification Uncertainty (#85) 2021-08-21 10:25:32 +05:30
ff0d5c065d ponder net highlighted paper 2021-08-15 16:58:18 +05:30
068225aa16 📇 version 2021-08-13 16:30:58 +05:30
0a8a26b220 📇 version 2021-08-13 16:27:30 +05:30
0e3d47c051 version 2021-08-08 08:37:02 +05:30
0171892afc slack 2021-08-08 08:19:41 +05:30
48329bd64d 📇 version 2021-08-08 08:18:59 +05:30
f038ab673d vit 2021-07-17 15:24:17 +05:30
e8a89ec994 remove loop for google colab 2021-06-21 17:33:13 +05:30
9b97aa25cf version 2021-06-21 17:10:38 +05:30
f6e430a9b3 MLP Mixer (#59) 2021-06-16 09:36:13 +05:30
ebce404402 Masked Language Model (#56) 2021-06-06 15:12:11 +05:30
d449497222 readme 2021-05-21 15:05:02 +05:30
7526ec4f4c Improved Training of Wasserstein GANs (#50)
*  gradient penalty

* gp experiment

* 📚 gradient penalty
2021-05-09 14:11:43 +05:30
8a4222c36b Weight standardization (#47)
* 🚧 weight standardization

* 🐛 small fixes

* 📚🚧 weight standardization

* 📚 weight standardization

* 📚 weight standardization experiment

* 📚 batch channel norm

* ✍️  corrections

* 📚 experiment links
2021-04-28 10:44:50 +05:30
e7e817ce20 📚 group norm 2021-04-24 14:44:38 +05:30
ba5c7200e8 📇 version 2021-04-04 13:22:16 +05:30
3d87b0b485 ♻️ dynamic hp 2021-03-27 11:59:19 +05:30
e0e7f15da1 📇 version 2021-03-27 11:54:32 +05:30
2b3220bd10 fast weights experiment 2021-03-14 08:02:17 +05:30
a1b1550245 📚 compressive transformer 2021-02-19 08:34:17 +05:30
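
Commit 24c1f8684a above reports a UnicodeDecodeError raised by the 'gbk' codec when running setup.py, and names encoding='utf-8' as the fix. The following is a minimal sketch of that failure mode and remedy, not the repository's actual setup.py. The file name `readme_sample.md`, the sample text, and the helper names are hypothetical, and it assumes the file being read is UTF-8 text containing non-ASCII characters.

```python
import os
import tempfile

# Hypothetical sample text (assumption: the real file contains UTF-8 encoded
# non-ASCII characters, which is what trips a 'gbk' default codec).
SAMPLE_TEXT = "caf\u00e9 \u4e2d\u6587 \U0001F4DA"


def write_sample(path: str) -> None:
    """Write the sample text to `path` as UTF-8 bytes."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(SAMPLE_TEXT)


def read_utf8(path: str) -> str:
    """Read `path` with an explicit encoding, independent of the OS locale."""
    with open(path, "r", encoding="utf-8") as f:
        return f.read()


def decodes_correctly_as(path: str, codec: str) -> bool:
    """Return True only if the file's bytes decode to the original text."""
    with open(path, "rb") as f:
        raw = f.read()
    try:
        return raw.decode(codec) == SAMPLE_TEXT
    except UnicodeDecodeError:
        # The codec rejected the byte sequence outright.
        return False


def demo() -> tuple:
    """Return (text read as UTF-8, whether 'gbk' decoding gives the same text)."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "readme_sample.md")  # hypothetical file name
        write_sample(path)
        return read_utf8(path), decodes_correctly_as(path, "gbk")
```

When `open()` is called in text mode without an `encoding` argument, Python uses the locale's preferred encoding, which can be a GBK code page on a Chinese-locale Windows machine. Passing `encoding="utf-8"` explicitly makes the read behave the same on every platform.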