767 Commits

Author SHA1 Message Date
f00ba4a70f paper url fix 2024-06-21 19:01:16 +05:30
09d09379c2 fix value pe double rotation 2024-06-20 12:53:09 +05:30
2236f6383c fix rope test code 2024-06-20 12:49:27 +05:30
cf565bcc1d cleanup 2024-06-18 11:09:02 +05:30
999f2036a5 RWKV docs 2024-03-17 17:47:51 +05:30
418d1ec44a RWKV docs 2024-03-17 17:47:39 +05:30
df9e1af615 RWKV docs 2024-03-17 17:45:08 +05:30
7db6e92376 RWKV (#222)
* rwkv-init

* annotations

* Re-added docs

* make dir if not exist

* Add RWKV paper and update doc index

* add train loop

* experiment

---------

Co-authored-by: Jacob Hatef <hatef.4@buckeyemail.buckeyemail.osu.edu>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
2024-03-17 17:36:15 +05:30
285cb3735b uodate docs 2024-03-02 14:33:53 +05:30
5ec0f70855 Fix formula typo in Relative MHA (#242)
${(\textcolor{lightgreen}{\mathbf{A + C}})}_{i,j} = Q_i^\top K_j + \textcolor{orange}{v^\top} K_j$
2024-03-02 14:19:06 +05:30
fea91b9699 Cleanup group norm Cifar experiment (#240)
This experiment is not written in the previous format in group normalization and appears longer. With modifications, this code will normalize and be shorter
2024-03-02 14:17:39 +05:30
84ad3f9783 Update unet.py (#239) fix typo 2024-03-02 14:13:34 +05:30
a0679ecd90 title 2024-01-12 13:21:54 +05:30
84fab839c2 fix typo chineese translation 2024-01-12 13:19:14 +05:30
45dc127061 Merge pull request #235 from qiangxinglin/master
Fix typo chineese translation
2024-01-12 13:18:50 +05:30
81cf808d05 rope typo 2024-01-12 13:17:39 +05:30
083988f411 Merge pull request #232 from Etienne248/patch-1
Fix a typo in the formula of RoPE
2024-01-12 13:17:19 +05:30
4135eda943 title 2024-01-12 13:15:04 +05:30
fdd4da6401 Fix typo 2023-12-24 12:08:10 +08:00
bc5565b84c Fix a typo in the formula of RoPE 2023-12-08 15:50:21 +01:00
b18a82213d docs 2023-11-17 19:40:59 +00:00
36a374ed76 Merge pull request #226 from MrYxJ/patch-1
Fix a typo in the formula of ALiBi.
2023-11-17 17:42:17 +00:00
07b5782a48 Merge pull request #224 from callanwu/master
Add backticks to mask'shape
2023-11-17 17:41:05 +00:00
830161b299 Update __init__.py
This formula is wrong, there is one symbol '-' missing in front of the 1, which will affect people's understanding when reading. What is expressed here is that the position of the ith token is increasing from -(i-1) to 0, so it should be -1.
2023-11-14 00:30:26 +08:00
4d922e838f Add backticks to mask'shape 2023-11-10 19:51:37 +08:00
f42c0e9cf4 right shift example comment fix 2023-11-07 09:28:22 +00:00
a2d6e805a6 fix beta expression in ddpm docs 2023-11-07 09:13:19 +00:00
f26c603648 update docs 2023-11-07 09:10:13 +00:00
ba58ad9720 version 2023-11-07 09:06:49 +00:00
334fb05fb2 Merge pull request #221 from lizhuoq/fix
fix: fix cls_token bug in vit.
2023-11-07 09:06:12 +00:00
a8ddc7d1d9 Merge pull request #223 from tatsuookubo/master
Very minor typos in the type hint and text description
2023-11-07 09:05:33 +00:00
3a759f45b0 minor typo: instance norm -> group norm 2023-11-06 21:50:35 +08:00
6576e98498 minor typo in the type hint of ddpm/unet 2023-11-06 21:48:39 +08:00
ffafaf1df7 fix: fix cls_token bug in vit. 2023-11-06 11:28:45 +08:00
9a42ac2697 arxiv.org links 2023-10-24 14:42:32 +01:00
1159ecfc63 Merge pull request #206 from xBoyMinemc/master
Update setup.py encoding='utf-8'
2023-10-24 18:56:24 +05:30
293cee404d Merge pull request #214 from Mirmix/patch-1
Minor typo DDPM->DDIM
2023-10-15 16:49:02 +05:30
732aedcfc6 Minor typo DDPM->DDIM 2023-09-21 18:15:15 +02:00
d957d361d1 Merge pull request #204 from gstamatiou/master-1
Fix typo in latex
2023-09-10 17:31:29 +05:30
24c1f8684a Update setup.py encoding='utf-8'
>> py .\setup.py

Traceback (most recent call last):
  File "annotated_deep_learning_paper_implementations\setup.py", line 4, in <module>
                       ^^^^^^^^
UnicodeDecodeError: 'gbk' codec can't decode byte 0xa8 in position 951: illegal multibyte sequence
2023-08-26 06:05:36 +08:00
51955fdf89 Update __init__.py
Corrected typo in latex code
2023-08-15 20:59:13 +03:00
ec3c7366b7 labml update 2023-08-10 19:37:55 +05:30
753fbd3446 typo fix docs 2023-07-15 08:35:33 +05:30
b01848a2a0 Merge pull request #196 from YangWu1227/fix-comment-typo
fix: potential typo in resnet comments
2023-07-15 08:35:09 +05:30
b43fb807a8 sophia speed up 2023-07-15 08:30:41 +05:30
594f89c8cc version 2023-07-14 21:28:14 +05:30
0101618de6 links 2023-07-14 21:27:44 +05:30
8db330dd22 sophia-g docs 2023-07-14 21:25:08 +05:30
7c02294e7c sophia exp 2023-07-14 16:44:45 +05:30
f45ca5ee69 sophia wip 2023-07-14 15:40:26 +05:30