90 Commits

Author SHA1 Message Date
Varuna Jayasiri
25e169843e link to jax transformer 2025-11-11 09:22:38 +00:00
Varuna Jayasiri
9262c57f18 flash attention 2025-08-08 19:57:57 +05:30
Varuna Jayasiri
5731bff586 LoRA docs 2024-08-24 10:50:02 +05:30
Varuna Jayasiri
d858f2eec0 remove tranding papers link 2024-06-21 19:35:22 +05:30
Varuna Jayasiri
cf565bcc1d cleanup 2024-06-18 11:09:02 +05:30
Varuna Jayasiri
df9e1af615 RWKV docs 2024-03-17 17:45:08 +05:30
Jacob Hatef
7db6e92376 RWKV (#222) 2024-03-17 17:36:15 +05:30
* rwkv-init
* annotations
* Re-added docs
* make dir if not exist
* Add RWKV paper and update doc index
* add train loop
* experiment
Co-authored-by: Jacob Hatef <hatef.4@buckeyemail.buckeyemail.osu.edu>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
Varuna Jayasiri
a0679ecd90 title 2024-01-12 13:21:54 +05:30
Varuna Jayasiri
4135eda943 title 2024-01-12 13:15:04 +05:30
Varuna Jayasiri
9a42ac2697 arxiv.org links 2023-10-24 14:42:32 +01:00
Varuna Jayasiri
0101618de6 links 2023-07-14 21:27:44 +05:30
Varuna Jayasiri
ce0cdb676f ja translation 2023-06-30 16:03:07 +05:30
Varuna Jayasiri
d4c3335525 link to translations 2023-04-02 14:03:23 +05:30
Varuna Jayasiri
c0b4a37870 remove sponsors 2023-02-24 18:40:20 +05:30
Varuna Jayasiri
4c47086b0f links 2022-09-15 14:55:48 +05:30
Varuna Jayasiri
05632f9f8e docs 2022-09-07 10:33:13 +05:30
Varuna Jayasiri
c0004c9e8e html 2022-08-26 18:06:08 +05:30
Varuna Jayasiri
db4dd4905b links 2022-08-20 13:41:51 +05:30
Varuna Jayasiri
980a84ed4f Zero3 memory optimizations (#140) 2022-08-11 15:44:13 +05:30
Varuna Jayasiri
4cf1d74e6d sampling links 2022-08-08 12:27:11 +05:30
Varuna Jayasiri
948e436854 U-Net (#137) 2022-07-25 16:55:06 +05:30
Varuna Jayasiri
465d01ef77 html template 2022-07-18 09:01:01 +05:30
Varuna Jayasiri
27a7fec943 sponsor button 2022-07-17 13:10:49 +05:30
Varuna Jayasiri
72669d0526 ALiBi (#134) 2022-07-17 09:28:32 +05:30
Varuna Jayasiri
2891504f52 og:site_name 2022-06-29 13:42:32 +05:30
Varuna Jayasiri
e09ee89f36 Transformer experiment logs (#130) 2022-06-27 14:11:44 +05:30
Varuna Jayasiri
f7262109c6 deploy html 2022-06-22 18:25:56 +05:30
Varuna Jayasiri
6a41c82b30 FTA (#115) 2022-05-23 22:26:39 +05:30
Varuna Jayasiri
bf0c4e36d7 highlighted paper links 2022-05-03 09:30:50 +01:00
Varuna Jayasiri
0aff72f970 DeepNorm (#114) 2022-04-10 08:08:55 +05:30
Varuna Jayasiri
d2229b6edd katex fixes 2022-03-21 09:16:52 +05:30
Varuna Jayasiri
1536c6ec5e links 2022-03-12 15:51:10 +05:30
Varuna Jayasiri
62c5786d31 html 2022-02-23 15:15:59 +05:30
Varuna Jayasiri
c1a1556c6e url fix 2021-11-06 14:36:50 +05:30
Varuna Jayasiri
3a54a2099d links 2021-11-06 14:29:38 +05:30
Varuna Jayasiri
5e56ba1964 update docs 2021-10-29 09:32:09 +05:30
Varuna Jayasiri
90c755bc39 click to highlight 2021-10-23 16:56:48 +05:30
Varuna Jayasiri
76ae56ff15 highlight code in notes 2021-10-22 18:34:59 +05:30
Varuna Jayasiri
582469a6ad code formatting 2021-10-22 17:10:13 +05:30
Varuna Jayasiri
f8481e3fb4 katex generated 2021-10-21 18:00:58 +05:30
Varuna Jayasiri
623662c355 🤦‍♂️ revert 2021-10-21 15:03:00 +05:30
Varuna Jayasiri
c5b13162cf anchors 2021-10-21 15:01:58 +05:30
Varuna Jayasiri
77bf55e03a 🤦‍♂️ fix 2021-10-21 11:47:05 +05:30
Varuna Jayasiri
8aa83ddf7b unescape * 2021-10-21 11:46:06 +05:30
Varuna Jayasiri
7ce8eeb6d2 light 2021-10-16 14:35:06 +05:30
Varuna Jayasiri
72a197542c dark photo 2021-10-16 14:33:37 +05:30
Varuna Jayasiri
6464269933 dqn images 2021-10-16 14:30:41 +05:30
Varuna Jayasiri
f8abd6a276 Conv mixer (#100) 2021-10-14 18:41:37 +05:30
Varuna Jayasiri
e2be5ddf35 Denoising Diffusion Probabilistic Models (#98) 2021-10-08 21:33:04 +05:30
Varuna Jayasiri
58cda113e0 primer links 2021-09-21 16:06:34 +05:30