GitXplorerGitXplorer
m

domrand

public
26 stars
4 forks
7 issues

Commits

List of commits on branch master.
Verified
c7cc6252656d88836037b86cafd8a88021f64ce4

Merge pull request #2 from matwilso/dependabot/pip/pyyaml-5.1

mmatwilso committed 6 years ago
Verified
ebf760c7dc08dd0b33add2cb511cfb0e827026c5

Bump pyyaml from 3.13 to 5.1

ddependabot[bot] committed 6 years ago
Verified
cebfa91bc3c29c135bc9fb754eab8f65c5efdfcb

fix xmls

mmatwilso committed 6 years ago
Unverified
3a8c6bdbff449b044e6e92ff9f76b1bcf281d5ec

few more touches

mmatwilso committed 6 years ago
Unverified
364f0e6d4000a43d38811f8536edec36834ad164

update readme and fix import

mmatwilso committed 6 years ago
Unverified
5c6aac322d797cdd3feed082abf4a95d9b9391e8

fix readme

mmatwilso committed 6 years ago

README

The README file for this repository.

Reproducing Domain Randomization for Sim-to-Real

Reproducing [Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World]

Intro

This repository contains my implementation of domain randomization setup for training an object localization model in simulation to adapt to the real world. I was implementing this as part of a larger research project, and decided to publish this in case others may find it useful.

I use the KUKA LBR4 arm (video) instead of the Fetch robot that was used in OpenAI's initial domain randomization work.

I show that I am able to get average 3.7cm accuracy in the real world with a model trained purely in simulation. I suspect I could improve this accuracy to be on par with the 1.5cm reported in the original paper, by doing better calibration to the real world and randomizing some more relevant aspects of the visuals. But I figured the results are good enough for purposes of showing it works, and 3.7cm is still quite good for the bigger table (with large range of positions) I was using.

Table of Contents

Results

Training

I randomize the textures, lighting, camera position and orientation (slightly around the real world), and I train the model to predict the cube XYZ coordinates (with fixed height, so Z is always the same, as in the paper).

Example training images

Evaluation

To evaluate my system, I collected 54 images from the real world with a cube placed in uniformly spaced positions on a table in front of the robot arm. I run the model trained in simulation on each of these images, and achieve an average error of 3.7cm. I show examples from various percentiles with 100 being the best prediction (under 1 cm error) and 0 being the worst prediction (45 cm error). Note that the worst results come from the object being on the edge of the table, and that despite a few outliers, the model is pretty accuracte.

NOTE: that I only use the XY coordinates in calculating error, since the height is always the same. This provides a more fair estimation of model accuracy.

Plots for percentiles of model accuracy (best = 100, worst = 0)

100 90 80 70
50 20 10 0

Setup

Follow the instructions at the mujoco_py repo. Installation can be a pain, and see the tips below for some GitHub issues to check out if you have any problems with installation.

You may have to install some extra packages (see my apt.txt for a (partial) list of these).

To install the python dependencies:

pip install -r requirements.txt

Running demos

Once you have mujoco_py installed, you should try running the randomization demos. These only depend on mujoco_py and can help you ensure that you have installed it correctly.

python3 demos/disco_kuka.py

# or

python3 demos/disco_fetch.py

Instructions

Generating data

python3 run_domrand.py

# or

python3 run_domrand.py --gui 1  # to visualize results, but not write to file

# And you can just kill it when you are done. 
# My code handles file closing for imcomplete writes, but can sometimes cause data corruptoins. 
# To be safe, after running the script, you can run:

python3 scripts/find_corruption.py

I generally generate and test with about 100 files (100k images) for good measure, but I have not done a thorough test of how many are really required for good performance. I suspect 50k images would give similar results and that you could likely get away with 20k.

Running model

python3 run_training.py --num_files=100

This will train a model in about 3 hours (about 8 epochs) on a GTX 1080 Ti GPU.

You can monitor training progress and results using TensorBoard. The default is to log into the checkpoint folder. See sim2real/define_flags.py or python3 run_training.py --help for script settings.

Mujoco Tips

  <visual>
    <map znear=0.01 /> 
  </visual>