GitXplorerGitXplorer
s

snapshotrl

public
3 stars
0 forks
0 issues

Commits

List of commits on branch main.
Verified
0f0d1924183d2a1b1ca776cf70ef8502dcaa439c

update wandb link

ssdpkjc committed 10 months ago
Verified
6fd3b11c53af630d77552ee1dba0b5ee87668f92

update website

ssdpkjc committed 10 months ago
Verified
02918ec8a272784eee946c9cbaf5cfdb76e8997f

init

ssdpkjc committed 10 months ago

README

The README file for this repository.

🚀 Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency

arxiv github models weights and biases

This repository contains the code for the paper "Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency" by Yanxiao Zhao, Yangge Qian, Tianyi Wang, Jingyang Shan, Xiaolin Qin.

snapshotrl

Running the code

The code is written in Python 3.9 and PyTorch 2.0.1. To install the required dependencies, run

pip install -r src/requirements.txt

To train the agents, run the following commands:

# for S3RL+TD3
python src/snapshot_td3_mujoco.py --env-id Hopper-v4 --n-clusters 6 --truncate-step 100 --seed 1

# for SnapshotRL+SC+TD3
python src/snapshot_td3_mujoco.py --env-id Hopper-v4 --n-clusters 6 --truncate-step 1000 --seed 1

# for SnapshotRL+STT+TD3
python src/snapshot_td3_mujoco_random.py --env-id Hopper-v4 --truncate-step 100 --seed 1

# for SnapshotRL+TD3
python src/snapshot_td3_mujoco_random.py --env-id Hopper-v4 --truncate-step 1000 --seed 1


# for S3RL+SAC
python src/snapshot_sac_mujoco.py --env-id Hopper-v4 --n-clusters 6 --truncate-step 100 --seed 1

# for S3RL+PPO
python src/snapshot_ppo_fix_mujoco.py --env-id Hopper-v4 --n-clusters 6 --truncate-step 100 --seed 1

Citing this work

@article{zhao2024snapshot,
  title={Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency},
  author={Zhao, Yanxiao and Qian, Yangge and Wang, Tianyi and Shan, Jingyang and Qin, Xiaolin},
  journal={arXiv preprint arXiv:2403.00673},
  year={2024}
}