7db6e92376
RWKV (#222)
* rwkv-init
* annotations
* Re-added docs
* make dir if not exist
* Add RWKV paper and update doc index
* add train loop
* experiment
---------
Co-authored-by: Jacob Hatef <hatef.4@buckeyemail.buckeyemail.osu.edu>
Co-authored-by: Quentin Anthony <qganthony@yahoo.com>
2024-03-17 17:36:15 +05:30
e4747dee6d
flash attention pdf
2022-07-05 15:15:30 +05:30
1c3992f949
OPT: Open Pre-trained Transformer Language Models
2022-05-04 11:11:25 +01:00
4d4cbefe3e
Autoregressive Search Engines Highlighted paper (#121)
2022-04-27 20:31:57 +05:30
ce35cda7e0
Trillion Parameter models highlighted paper (#119)
2022-04-26 01:01:26 +05:30
4c908da357
Adding PDF (#116)
2022-04-19 13:27:00 +05:30
eddf936b3a
palm paper
2022-04-15 09:49:45 +01:00
243a581b42
dall-e 2 highlighted pdf
2022-04-12 17:10:21 +01:00
3aaae6ae93
star paper
2022-03-31 15:50:46 +05:30
e88904731f
retro paper
2021-12-28 15:59:08 +05:30
4fd9a102a3
nerf pdf
2021-12-07 17:24:22 +05:30
706e7397ea
reptile paper
2021-11-16 15:16:08 +05:30
7d80c2716e
transformer paper
2021-10-30 12:19:27 +05:30
e309638fea
ddpm highlighted paper
2021-10-07 16:08:23 +05:30
ea5439697d
primer paper
2021-09-29 19:40:20 +05:30
f81a88fc38
clip paper
2021-09-23 19:02:28 +05:30
a8b8e48c8a
The Sensory Neuron as a Transformer highlighted paper
2021-09-18 07:25:39 +05:30
c3ad4bb514
meta-gradients paper
2021-09-08 11:57:10 +05:30
e2516cc306
google maps eta highlighted paper
2021-09-06 13:31:24 +05:30
ff0d5c065d
ponder net highlighted paper
2021-08-15 16:58:18 +05:30
79505c4a89
muzero highlighted paper
2021-07-26 19:13:37 +05:30
bd1523e85c
paper
2021-07-23 14:10:07 +05:30
e81f44b883
paper pdf
2021-07-17 23:17:42 +05:30
f0bf8d39e4
resnet annotated paper
2021-07-16 08:56:53 +05:30
189770da92
highlighted paper
2021-07-03 14:04:11 +05:30