PPO-Implementation-Deep-Dive




Deprecation Notice

This repo is deprecated. Please visit our new repo at https://github.com/vwxyzjn/ppo-implementation-details and the improved ICLR 2022 blog post on PPO at https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/

PPO-Implementation-Deep-Dive

This repo contains the source code for the PPO Implementation Deep Dive tutorial series.

  1. Proximal Policy Optimization Implementation Deep Dive | 11 Core Implementation Details (youtu.be/MEt6rrxH8W4)
  2. Proximal Policy Optimization Implementation Deep Dive | 9 Atari-specific Details (youtu.be/05RMTj-2K_Y)
  3. Proximal Policy Optimization Implementation Deep Dive | 8 Details for Continuous Actions (youtu.be/BvZvx7ENZBw)


You can find out where these implementation details come from by visiting my blog post, which contains GitHub permanent links from each detail to the original implementation.

If you like this repo, consider also checking out CleanRL, my RL library based on single-file implementations.

Get started

Prerequisites: Python and Poetry, both of which the commands below invoke.

Install dependencies:

poetry install

Train agents:

poetry run python ppo.py

Train agents with experiment tracking:

poetry run python ppo.py --track --capture-video
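
The `--track` and `--capture-video` switches in the command above are boolean flags, and `--gym-id` (used in the MuJoCo commands of this README) takes an environment id. As a rough illustration of how a training script could expose such flags, here is a minimal sketch using `argparse`. The `parse_args` helper, the defaults, and the help strings are assumptions made for illustration and are not taken from `ppo.py`; check the script itself for the real definitions.

```python
import argparse


def parse_args(argv=None):
    """Parse a hypothetical subset of the training flags shown in this README."""
    parser = argparse.ArgumentParser(
        description="Illustrative flag parsing (not the real ppo.py)"
    )
    # Assumption: the environment id is a free-form string; this default is a placeholder.
    parser.add_argument("--gym-id", type=str, default="CartPole-v1",
                        help="id of the gym environment (default is assumed)")
    # Assumption: both switches are off unless passed on the command line.
    parser.add_argument("--track", action="store_true",
                        help="enable experiment tracking")
    parser.add_argument("--capture-video", action="store_true",
                        help="record videos of the agent")
    return parser.parse_args(argv)


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = parse_args(["--track", "--capture-video"])
print(f"gym_id={args.gym_id} track={args.track} capture_video={args.capture_video}")
```

Note that `argparse` converts the dashes in `--gym-id` and `--capture-video` to underscores, so the values are read as `args.gym_id` and `args.capture_video`.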

Atari

Install dependencies:

poetry install -E atari

Train agents:

poetry run python ppo_atari.py

Train agents with experiment tracking:

poetry run python ppo_atari.py --track --capture-video

Pybullet

Install dependencies:

poetry install -E pybullet

Train agents:

poetry run python ppo_continuous_action.py

Train agents with experiment tracking:

poetry run python ppo_continuous_action.py --track --capture-video

MuJoCo

Note: this installation method only works on Linux.

Install dependencies:

poetry install -E mujoco
poetry run python -c "import mujoco_py"
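
The second command above acts as a smoke test: it succeeds only if `mujoco_py` can be imported. A slightly more informative pre-check is sketched below. The `check_environment` helper is hypothetical, and it only verifies that the module can be located and that the operating system matches; it does not prove that `mujoco_py` builds or loads correctly, so the import command above remains the real test.

```python
import importlib.util
import platform


def check_environment(module_name="mujoco_py", required_os="Linux"):
    """Return a list of human-readable problems; an empty list means none were found."""
    problems = []
    # The README states that this installation method only works on Linux.
    if platform.system() != required_os:
        problems.append(f"expected {required_os}, found {platform.system()}")
    # find_spec returns None when a top-level module cannot be located.
    if importlib.util.find_spec(module_name) is None:
        problems.append(f"module {module_name!r} is not installed")
    return problems


for problem in check_environment():
    print("warning:", problem)
```

Run it inside the Poetry environment (for example with `poetry run python`) so that it inspects the same interpreter the training scripts use.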

Train agents:

poetry run python ppo_continuous_action.py --gym-id Hopper-v2

Train agents with experiment tracking:

poetry run python ppo_continuous_action.py --gym-id Hopper-v2 --track --capture-video
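
All of the training commands in this README follow the same pattern: `poetry run python <script>` followed by optional flags. When launching several runs, that pattern can be assembled programmatically. The sketch below only builds and prints an argument list. The `build_command` helper is hypothetical, and it uses only script names and flags that appear above; `--gym-id` is shown in this README only for `ppo_continuous_action.py`, so whether the other scripts accept it is an assumption to verify.

```python
import shlex


def build_command(script, gym_id=None, track=False, capture_video=False):
    """Assemble an argument list for one of the training commands in this README."""
    cmd = ["poetry", "run", "python", script]
    if gym_id is not None:
        cmd += ["--gym-id", gym_id]
    if track:
        cmd.append("--track")
    if capture_video:
        cmd.append("--capture-video")
    return cmd


cmd = build_command("ppo_continuous_action.py", gym_id="Hopper-v2",
                    track=True, capture_video=True)
# Prints the same command line as the last MuJoCo example above.
print(shlex.join(cmd))
```

To actually start a run, pass the list to `subprocess.run(cmd, check=True)` from the repository root.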