mirror of
				https://github.com/labmlai/annotated_deep_learning_paper_implementations.git
				synced 2025-11-01 03:43:09 +08:00 
			
		
		
		
	📇 version
This commit is contained in:
		| @ -1,11 +1,11 @@ | ||||
| """ | ||||
| --- | ||||
| title: Proximal Policy Optimization (PPO) | ||||
| title: Proximal Policy Optimization - PPO | ||||
| summary: > | ||||
|  An annotated implementation of Proximal Policy Optimization (PPO) algorithm in PyTorch. | ||||
|  An annotated implementation of Proximal Policy Optimization - PPO algorithm in PyTorch. | ||||
| --- | ||||
|  | ||||
| # Proximal Policy Optimization (PPO) | ||||
| # Proximal Policy Optimization - PPO | ||||
|  | ||||
| This is a [PyTorch](https://pytorch.org) implementation of | ||||
| [Proximal Policy Optimization - PPO](https://arxiv.org/abs/1707.06347). | ||||
|  | ||||
| @ -1,4 +1,4 @@ | ||||
| # [Proximal Policy Optimization (PPO)](https://nn.labml.ai/rl/ppo/index.html) | ||||
| # [Proximal Policy Optimization - PPO](https://nn.labml.ai/rl/ppo/index.html) | ||||
|  | ||||
| This is a [PyTorch](https://pytorch.org) implementation of | ||||
| [Proximal Policy Optimization - PPO](https://arxiv.org/abs/1707.06347). | ||||
|  | ||||
		Reference in New Issue
	
	Block a user
	 Varuna Jayasiri
					Varuna Jayasiri