|  | ee0ad9bac6 | version | 2025-08-08 20:00:50 +05:30 |  | 
			
				
					|  | 9262c57f18 | flash attention | 2025-08-08 19:57:57 +05:30 |  | 
			
				
					|  | 5bdedcffec | remove labml_helpers dep | 2025-07-20 08:56:03 +05:30 |  | 
			
				
					|  | 6df1d798c0 | version | 2024-08-24 14:34:44 +05:30 |  | 
			
				
					|  | ba58ad9720 | version | 2023-11-07 09:06:49 +00:00 |  | 
			
				
					|  | 24c1f8684a | Update setup.py encoding='utf-8' >> py .\setup.py
Traceback (most recent call last):
  File "annotated_deep_learning_paper_implementations\setup.py", line 4, in <module>
                       ^^^^^^^^
UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 951: illegal multibyte sequence | 2023-08-26 06:05:36 +08:00 |  | 
			
				
					|  | b43fb807a8 | sophia speed up | 2023-07-15 08:30:41 +05:30 |  | 
			
				
					|  | 594f89c8cc | version | 2023-07-14 21:28:14 +05:30 |  | 
			
				
					|  | d69b809cac | version | 2022-09-24 15:35:31 +05:30 |  | 
			
				
					|  | 4c47086b0f | links | 2022-09-15 14:55:48 +05:30 |  | 
			
				
					|  | 7d1550dd67 | fix ddpm attn | 2022-09-12 08:30:27 +05:30 |  | 
			
				
					|  | 3f2a42bc3a | version | 2022-08-20 11:15:03 +05:30 |  | 
			
				
					|  | 7c2a9105f8 | version | 2022-08-15 08:45:21 +05:30 |  | 
			
				
					|  | 980a84ed4f | Zero3 memory optimizations (#140) | 2022-08-11 15:44:13 +05:30 |  | 
			
				
					|  | 72669d0526 | ALiBi (#134) | 2022-07-17 09:28:32 +05:30 |  | 
			
				
					|  | ee5a34aa59 | experiment links transformer | 2022-06-28 19:02:20 +05:30 |  | 
			
				
					|  | e09ee89f36 | Transformer experiment logs (#130) | 2022-06-27 14:11:44 +05:30 |  | 
			
				
					|  | 58b24f6c83 | Diffusion Notebook (#127) | 2022-06-09 14:43:17 +05:30 |  | 
			
				
					|  | 0ce65adf9e | RoPER (#126) | 2022-06-03 21:29:41 +05:30 |  | 
			
				
					|  | 6a41c82b30 | FTA (#115) | 2022-05-23 22:26:39 +05:30 |  | 
			
				
					|  | 45171c71c3 | version | 2022-05-03 09:17:33 +01:00 |  | 
			
				
					|  | 0aff72f970 | DeepNorm (#114) | 2022-04-10 08:08:55 +05:30 |  | 
			
				
					|  | a7a7a3bdb7 | RETRO (#110) | 2022-03-12 15:44:35 +05:30 |  | 
			
				
					|  | 201ad98ef4 | 📇 dependencies | 2021-12-20 11:33:29 +05:30 |  | 
			
				
					|  | 6cd0ed168e | update torch | 2021-10-29 09:05:59 +05:30 |  | 
			
				
					|  | 6464269933 | dqn images | 2021-10-16 14:30:41 +05:30 |  | 
			
				
					|  | 9e430d2dba | ppo experiment configs | 2021-10-02 13:57:47 +05:30 |  | 
			
				
					|  | c7fb3f7f4c | version | 2021-09-21 16:02:29 +05:30 |  | 
			
				
					|  | 583f8cfc81 | version | 2021-09-17 12:07:42 +05:30 |  | 
			
				
					|  | 103cf81a13 | 📇 versions | 2021-08-29 14:59:11 +05:30 |  | 
			
				
					|  | 7d41961f2e | version | 2021-08-28 14:26:26 +05:30 |  | 
			
				
					|  | b6607524b8 | Evidential Deep Learning to Quantify Classification Uncertainty (#85) | 2021-08-21 10:25:32 +05:30 |  | 
			
				
					|  | ff0d5c065d | ponder net highlighted paper | 2021-08-15 16:58:18 +05:30 |  | 
			
				
					|  | 068225aa16 | 📇 version | 2021-08-13 16:30:58 +05:30 |  | 
			
				
					|  | 0a8a26b220 | 📇 version | 2021-08-13 16:27:30 +05:30 |  | 
			
				
					|  | 0e3d47c051 | version | 2021-08-08 08:37:02 +05:30 |  | 
			
				
					|  | 0171892afc | slack | 2021-08-08 08:19:41 +05:30 |  | 
			
				
					|  | 48329bd64d | 📇 version | 2021-08-08 08:18:59 +05:30 |  | 
			
				
					|  | f038ab673d | vit | 2021-07-17 15:24:17 +05:30 |  | 
			
				
					|  | e8a89ec994 | remove loop for google colab | 2021-06-21 17:33:13 +05:30 |  | 
			
				
					|  | 9b97aa25cf | version | 2021-06-21 17:10:38 +05:30 |  | 
			
				
					|  | f6e430a9b3 | MLP Mixer (#59) | 2021-06-16 09:36:13 +05:30 |  | 
			
				
					|  | ebce404402 | Masked Language Model (#56) | 2021-06-06 15:12:11 +05:30 |  | 
			
				
					|  | d449497222 | readme | 2021-05-21 15:05:02 +05:30 |  | 
			
				
					|  | 7526ec4f4c | Improved Training of Wasserstein GANs (#50) * ✨ gradient penalty
* gp experiment
* 📚 gradient penalty | 2021-05-09 14:11:43 +05:30 |  | 
			
				
					|  | 8a4222c36b | Weight standardization (#47) * 🚧 weight standardization
* 🐛 small fixes
* 📚🚧 weight standardization
* 📚 weight standardization
* 📚 weight standardization experiment
* 📚 batch channel norm
* ✍️  corrections
* 📚 experiment links | 2021-04-28 10:44:50 +05:30 |  | 
			
				
					|  | e7e817ce20 | 📚 group norm | 2021-04-24 14:44:38 +05:30 |  | 
			
				
					|  | ba5c7200e8 | 📇 version | 2021-04-04 13:22:16 +05:30 |  | 
			
				
					|  | 3d87b0b485 | ♻️  dynamic hp | 2021-03-27 11:59:19 +05:30 |  | 
			
				
					|  | e0e7f15da1 | 📇 version | 2021-03-27 11:54:32 +05:30 |  |