810 Commits

Author SHA1 Message Date
863772e04a rename layers 2024-08-18 01:04:04 +05:30
f3465ac926 Chineese translation 2024-08-16 16:35:25 +05:30
edf875aa70 LoRA experiment notes 2024-08-16 16:25:19 +05:30
d69f1c1058 LoRA experiment notes 2024-08-16 16:14:52 +05:30
64959fdff9 Merge pull request #267 from pengchzn/master
Refine and fix Chinese typo
2024-08-16 15:45:01 +05:30
5d384d6be7 Merge pull request #268 from lakshith-403/LoRA
LoRA experiment
2024-08-07 09:51:17 +05:30
61d32f4696 create LoRA experiment
- remove global configs
- Do weight loading inside experiment
- remove train and transform notebooks
2024-08-07 09:49:35 +05:30
4aa1bdb810 Merge remote-tracking branch 'origin/master'
# Conflicts:
#	translate_cache/transformers/feed_forward.zh.json
2024-08-06 16:21:51 +08:00
dc26e6c06d Fix Chinese typo 2024-08-06 16:12:04 +08:00
d4af40b595 LoRA notes 2024-08-03 16:59:15 +05:30
eb9337e949 Clean up LoRA 2024-08-02 15:33:45 +05:30
dc4762161d Clean up LoRA 2024-08-02 15:32:02 +05:30
957ade6d67 Merge pull request #266 from lakshith-403/LoRA 2024-07-31 21:06:28 +05:30
bc32b507ea clear notebook outputs 2024-07-31 20:39:46 +05:30
77d00f089b Add LoRA to GPT2 2024-07-31 18:29:24 +05:30
0f2a9be6d2 training loop 2024-07-29 23:01:06 +05:30
23b7e2ee8e create experiment notebook and refactoring 2024-07-29 19:41:24 +05:30
c82529ce67 move LoRA to labml.nn 2024-07-29 11:17:38 +05:30
8e756f292b lora layers 2024-07-28 11:22:27 +05:30
d1e8daa121 replace convo1D layers with linear 2024-07-28 08:51:03 +05:30
50c3cc4eab keep only required configs 2024-07-27 22:01:21 +05:30
106e72605d remove droput layers 2024-07-27 21:30:15 +05:30
b3aedf3093 remove gelu custom impl and use pytorch impl 2024-07-27 21:28:07 +05:30
cbc38bb26b GPT 2 implementation 2024-07-26 09:41:13 +05:30
89a3ae8882 Merge pull request #264 from Seas0/patch-1 2024-07-16 11:11:46 +05:30
66e92edb04 Fix typo in Wasserstein GAN 2024-07-15 13:06:40 +08:00
7d7863c080 Fix Chinese typo 2024-07-09 15:31:57 +08:00
f6e913eb09 transformer mha chinese translation 2024-06-27 19:35:37 +05:30
d3f0bd305a Merge pull request #259 from pengchzn/master (Transformer MHA Chinese Translation)
Refine Chinese translation
2024-06-27 19:28:45 +05:30
e03dbc17b6 Refine Chinese translation 2024-06-26 19:03:38 +08:00
1446bb124a Refine Chinese translation 2024-06-25 21:49:51 +08:00
730046c9c1 Merge branch 'labmlai:master' into master 2024-06-25 21:49:12 +08:00
391fa39167 cleanup notebooks 2024-06-24 16:17:09 +05:30
26e64a8827 zh 2024-06-24 15:59:56 +05:30
20494ae94c fix gae formula 2024-06-24 15:58:03 +05:30
a78ca14532 refine translation of /__init__.zh.json 2024-06-23 11:42:50 +08:00
4699c514f5 refine translation of /transformers/__init__.zh.json 2024-06-23 11:42:41 +08:00
d858f2eec0 remove tranding papers link 2024-06-21 19:35:22 +05:30
0bb4be3ff9 zh translation 2024-06-21 19:28:14 +05:30
a631e73b42 Merge pull request #258 from pengchzn/master
Refine Chinese translation
2024-06-21 19:20:22 +05:30
7ad78f40a0 Merge branch 'master' into master 2024-06-21 19:19:11 +05:30
bf8a491250 chineese translation 2024-06-21 19:09:13 +05:30
f00ba4a70f paper url fix 2024-06-21 19:01:16 +05:30
df09205605 Refine Chinese translation 2024-06-21 13:52:15 +08:00
09d09379c2 fix value pe double rotation 2024-06-20 12:53:09 +05:30
2236f6383c fix rope test code 2024-06-20 12:49:27 +05:30
cf565bcc1d cleanup 2024-06-18 11:09:02 +05:30
999f2036a5 RWKV docs 2024-03-17 17:47:51 +05:30
418d1ec44a RWKV docs 2024-03-17 17:47:39 +05:30
df9e1af615 RWKV docs 2024-03-17 17:45:08 +05:30