Please use hyper parameters from this readme. With other hyper parameters things might not work (it's RL after all)!
This repo contains a PyTorch implementation for the paper
@article{touati2020stable,
title={Stable Policy Optimization via Off-Policy Divergence Regularization},
author={Touati, Ahmed and Zhang, Amy and Pineau, Joelle and Vincent, Pascal},
journal={arXiv preprint arXiv:2003.04108},
year={2020}
}
- Python 3 (it might work with Python 2, but I didn't test it)
- PyTorch
- OpenAI baselines
In order to install requirements, follow:
# PyTorch
conda install pytorch torchvision -c soumith
# Baselines for Atari preprocessing
git clone https://github.com/openai/baselines.git
cd baselines
pip install -e .
# Other requirements
pip install -r requirements.txt
./run_local_atari.sh
./run_local.sh