Varuna Jayasiri
25e169843e
link to jax transformer
2025-11-11 09:22:38 +00:00
Varuna Jayasiri
9262c57f18
flash attention
2025-08-08 19:57:57 +05:30
Varuna Jayasiri
5731bff586
LoRA docs
2024-08-24 10:50:02 +05:30
Varuna Jayasiri
d858f2eec0
remove trending papers link
2024-06-21 19:35:22 +05:30
Varuna Jayasiri
cf565bcc1d
cleanup
2024-06-18 11:09:02 +05:30
Varuna Jayasiri
df9e1af615
RWKV docs
2024-03-17 17:45:08 +05:30
Jacob Hatef
7db6e92376
RWKV ( #222 )
* rwkv-init
* annotations
* Re-added docs
* make dir if not exist
* Add RWKV paper and update doc index
* add train loop
* experiment
---------
Co-authored-by: Jacob Hatef <hatef.4@buckeyemail.buckeyemail.osu.edu>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
2024-03-17 17:36:15 +05:30
Varuna Jayasiri
a0679ecd90
title
2024-01-12 13:21:54 +05:30
Varuna Jayasiri
4135eda943
title
2024-01-12 13:15:04 +05:30
Varuna Jayasiri
9a42ac2697
arxiv.org links
2023-10-24 14:42:32 +01:00
Varuna Jayasiri
0101618de6
links
2023-07-14 21:27:44 +05:30
Varuna Jayasiri
ce0cdb676f
ja translation
2023-06-30 16:03:07 +05:30
Varuna Jayasiri
d4c3335525
link to translations
2023-04-02 14:03:23 +05:30
Varuna Jayasiri
c0b4a37870
remove sponsors
2023-02-24 18:40:20 +05:30
Varuna Jayasiri
4c47086b0f
links
2022-09-15 14:55:48 +05:30
Varuna Jayasiri
05632f9f8e
docs
2022-09-07 10:33:13 +05:30
Varuna Jayasiri
c0004c9e8e
html
2022-08-26 18:06:08 +05:30
Varuna Jayasiri
db4dd4905b
links
2022-08-20 13:41:51 +05:30
Varuna Jayasiri
980a84ed4f
Zero3 memory optimizations ( #140 )
2022-08-11 15:44:13 +05:30
Varuna Jayasiri
4cf1d74e6d
sampling links
2022-08-08 12:27:11 +05:30
Varuna Jayasiri
948e436854
U-Net ( #137 )
2022-07-25 16:55:06 +05:30
Varuna Jayasiri
465d01ef77
html template
2022-07-18 09:01:01 +05:30
Varuna Jayasiri
27a7fec943
sponsor button
2022-07-17 13:10:49 +05:30
Varuna Jayasiri
72669d0526
ALiBi ( #134 )
2022-07-17 09:28:32 +05:30
Varuna Jayasiri
2891504f52
og:site_name
2022-06-29 13:42:32 +05:30
Varuna Jayasiri
e09ee89f36
Transformer experiment logs ( #130 )
2022-06-27 14:11:44 +05:30
Varuna Jayasiri
f7262109c6
deploy html
2022-06-22 18:25:56 +05:30
Varuna Jayasiri
6a41c82b30
FTA ( #115 )
2022-05-23 22:26:39 +05:30
Varuna Jayasiri
bf0c4e36d7
highlighted paper links
2022-05-03 09:30:50 +01:00
Varuna Jayasiri
0aff72f970
DeepNorm ( #114 )
2022-04-10 08:08:55 +05:30
Varuna Jayasiri
d2229b6edd
katex fixes
2022-03-21 09:16:52 +05:30
Varuna Jayasiri
1536c6ec5e
links
2022-03-12 15:51:10 +05:30
Varuna Jayasiri
62c5786d31
html
2022-02-23 15:15:59 +05:30
Varuna Jayasiri
c1a1556c6e
url fix
2021-11-06 14:36:50 +05:30
Varuna Jayasiri
3a54a2099d
links
2021-11-06 14:29:38 +05:30
Varuna Jayasiri
5e56ba1964
update docs
2021-10-29 09:32:09 +05:30
Varuna Jayasiri
90c755bc39
click to highlight
2021-10-23 16:56:48 +05:30
Varuna Jayasiri
76ae56ff15
highlight code in notes
2021-10-22 18:34:59 +05:30
Varuna Jayasiri
582469a6ad
code formatting
2021-10-22 17:10:13 +05:30
Varuna Jayasiri
f8481e3fb4
katex generated
2021-10-21 18:00:58 +05:30
Varuna Jayasiri
623662c355
🤦♂️ revert
2021-10-21 15:03:00 +05:30
Varuna Jayasiri
c5b13162cf
anchors
2021-10-21 15:01:58 +05:30
Varuna Jayasiri
77bf55e03a
🤦♂️ fix
2021-10-21 11:47:05 +05:30
Varuna Jayasiri
8aa83ddf7b
unescape *
2021-10-21 11:46:06 +05:30
Varuna Jayasiri
7ce8eeb6d2
light
2021-10-16 14:35:06 +05:30
Varuna Jayasiri
72a197542c
dark photo
2021-10-16 14:33:37 +05:30
Varuna Jayasiri
6464269933
dqn images
2021-10-16 14:30:41 +05:30
Varuna Jayasiri
f8abd6a276
Conv mixer ( #100 )
2021-10-14 18:41:37 +05:30
Varuna Jayasiri
e2be5ddf35
Denoising Diffusion Probabilistic Models ( #98 )
2021-10-08 21:33:04 +05:30
Varuna Jayasiri
58cda113e0
primer links
2021-09-21 16:06:34 +05:30