📚 annotations

Varuna Jayasiri
2020-10-25 09:49:23 +05:30
parent 20c2aa3174
commit 5ab7f32015
3 changed files with 3 additions and 3 deletions

@@ -34,7 +34,7 @@ and
#### ✨ [Reinforcement Learning](http://lab-ml.com/labml_nn/rl/)
* [Proximal Policy Optimization](http://lab-ml.com/labml_nn/rl/ppo/) with
[Generalized Advantage Estimation](http://lab-ml.com/labml_nn/rl/ppo/gae.html)
-* [Deep Q Netowrks](http://lab-ml.com/labml_nn/rl/dqn/) with
+* [Deep Q Networks](http://lab-ml.com/labml_nn/rl/dqn/) with
with [Dueling Network](http://lab-ml.com/labml_nn/rl/dqn/model.html),
[Prioritized Replay](http://lab-ml.com/labml_nn/rl/dqn/replay_buffer.html)
and Double Q Network.

@@ -1,5 +1,5 @@
"""
-# Neural Network Model for Deep Q Network (DQN)
+# Deep Q Network (DQN) Model
"""
import torch

@@ -33,7 +33,7 @@ and
#### ✨ [Reinforcement Learning](http://lab-ml.com/labml_nn/rl/)
* [Proximal Policy Optimization](http://lab-ml.com/labml_nn/rl/ppo/) with
[Generalized Advantage Estimation](http://lab-ml.com/labml_nn/rl/ppo/gae.html)
-* [Deep Q Netowrks](http://lab-ml.com/labml_nn/rl/dqn/) with
+* [Deep Q Networks](http://lab-ml.com/labml_nn/rl/dqn/) with
with [Dueling Network](http://lab-ml.com/labml_nn/rl/dqn/model.html),
[Prioritized Replay](http://lab-ml.com/labml_nn/rl/dqn/replay_buffer.html)
and Double Q Network.