cbc38bb26b
GPT 2 implementation
2024-07-26 09:41:13 +05:30
89a3ae8882
Merge pull request #264 from Seas0/patch-1
2024-07-16 11:11:46 +05:30
66e92edb04
Fix typo in Wasserstein GAN
2024-07-15 13:06:40 +08:00
f6e913eb09
transformer mha chinese translation
2024-06-27 19:35:37 +05:30
d3f0bd305a
Merge pull request #259 from pengchzn/master (Transformer MHA Chinese Translation)
...
Refine Chinese translation
2024-06-27 19:28:45 +05:30
e03dbc17b6
Refine Chinese translation
2024-06-26 19:03:38 +08:00
1446bb124a
Refine Chinese translation
2024-06-25 21:49:51 +08:00
730046c9c1
Merge branch 'labmlai:master' into master
2024-06-25 21:49:12 +08:00
391fa39167
cleanup notebooks
2024-06-24 16:17:09 +05:30
26e64a8827
zh
2024-06-24 15:59:56 +05:30
20494ae94c
fix gae formula
2024-06-24 15:58:03 +05:30
a78ca14532
refine translation of /__init__.zh.json
2024-06-23 11:42:50 +08:00
4699c514f5
refine translation of /transformers/__init__.zh.json
2024-06-23 11:42:41 +08:00
d858f2eec0
remove tranding papers link
2024-06-21 19:35:22 +05:30
0bb4be3ff9
zh translation
2024-06-21 19:28:14 +05:30
a631e73b42
Merge pull request #258 from pengchzn/master
...
Refine Chinese translation
2024-06-21 19:20:22 +05:30
7ad78f40a0
Merge branch 'master' into master
2024-06-21 19:19:11 +05:30
bf8a491250
chineese translation
2024-06-21 19:09:13 +05:30
f00ba4a70f
paper url fix
2024-06-21 19:01:16 +05:30
df09205605
Refine Chinese translation
2024-06-21 13:52:15 +08:00
09d09379c2
fix value pe double rotation
2024-06-20 12:53:09 +05:30
2236f6383c
fix rope test code
2024-06-20 12:49:27 +05:30
cf565bcc1d
cleanup
2024-06-18 11:09:02 +05:30
999f2036a5
RWKV docs
2024-03-17 17:47:51 +05:30
418d1ec44a
RWKV docs
2024-03-17 17:47:39 +05:30
df9e1af615
RWKV docs
2024-03-17 17:45:08 +05:30
7db6e92376
RWKV ( #222 )
...
* rwkv-init
* annotations
* Re-added docs
* make dir if not exist
* Add RWKV paper and update doc index
* add train loop
* experiment
---------
Co-authored-by: Jacob Hatef <hatef.4@buckeyemail.buckeyemail.osu.edu>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
2024-03-17 17:36:15 +05:30
285cb3735b
uodate docs
2024-03-02 14:33:53 +05:30
5ec0f70855
Fix formula typo in Relative MHA ( #242 )
...
${(\textcolor{lightgreen}{\mathbf{A + C}})}_{i,j} = Q_i^\top K_j + \textcolor{orange}{v^\top} K_j$
2024-03-02 14:19:06 +05:30
fea91b9699
Cleanup group norm Cifar experiment ( #240 )
...
This experiment is not written in the previous format in group normalization and appears longer. With modifications, this code will normalize and be shorter
2024-03-02 14:17:39 +05:30
84ad3f9783
Update unet.py ( #239 ) fix typo
2024-03-02 14:13:34 +05:30
a0679ecd90
title
2024-01-12 13:21:54 +05:30
84fab839c2
fix typo chineese translation
2024-01-12 13:19:14 +05:30
45dc127061
Merge pull request #235 from qiangxinglin/master
...
Fix typo chineese translation
2024-01-12 13:18:50 +05:30
81cf808d05
rope typo
2024-01-12 13:17:39 +05:30
083988f411
Merge pull request #232 from Etienne248/patch-1
...
Fix a typo in the formula of RoPE
2024-01-12 13:17:19 +05:30
4135eda943
title
2024-01-12 13:15:04 +05:30
fdd4da6401
Fix typo
2023-12-24 12:08:10 +08:00
bc5565b84c
Fix a typo in the formula of RoPE
2023-12-08 15:50:21 +01:00
b18a82213d
docs
2023-11-17 19:40:59 +00:00
36a374ed76
Merge pull request #226 from MrYxJ/patch-1
...
Fix a typo in the formula of ALiBi.
2023-11-17 17:42:17 +00:00
07b5782a48
Merge pull request #224 from callanwu/master
...
Add backticks to mask'shape
2023-11-17 17:41:05 +00:00
830161b299
Update __init__.py
...
This formula is wrong, there is one symbol '-' missing in front of the 1, which will affect people's understanding when reading. What is expressed here is that the position of the ith token is increasing from -(i-1) to 0, so it should be -1.
2023-11-14 00:30:26 +08:00
4d922e838f
Add backticks to mask'shape
2023-11-10 19:51:37 +08:00
f42c0e9cf4
right shift example comment fix
2023-11-07 09:28:22 +00:00
a2d6e805a6
fix beta expression in ddpm docs
2023-11-07 09:13:19 +00:00
f26c603648
update docs
2023-11-07 09:10:13 +00:00
ba58ad9720
version
2023-11-07 09:06:49 +00:00
334fb05fb2
Merge pull request #221 from lizhuoq/fix
...
fix: fix cls_token bug in vit.
2023-11-07 09:06:12 +00:00
a8ddc7d1d9
Merge pull request #223 from tatsuookubo/master
...
Very minor typos in the type hint and text description
2023-11-07 09:05:33 +00:00