mirror of
https://github.com/labmlai/annotated_deep_learning_paper_implementations.git
synced 2025-10-31 02:39:16 +08:00
paper links
This commit is contained in:
@ -17,7 +17,7 @@ class Model(Module):
|
||||
"""
|
||||
## Dueling Network ⚔️ Model for $Q$ Values
|
||||
|
||||
We are using a [dueling network](https://arxiv.org/abs/1511.06581)
|
||||
We are using a [dueling network](https://papers.labml.ai/paper/1511.06581)
|
||||
to calculate Q-values.
|
||||
Intuition behind dueling network architecture is that in most states
|
||||
the action doesn't matter,
|
||||
|
||||
Reference in New Issue
Block a user