Skip to content

Latest commit

 

History

History
35 lines (25 loc) · 1.28 KB

File metadata and controls

35 lines (25 loc) · 1.28 KB

Zero-Shot Transfer using Buckingham's Pi Theorem

This repo contains the code used to generate the figures of Zero-Shot Transfer in Reinforcement Learning using Buckingham's Pi Theorem.

Running the notebooks

  1. Setup the environment.
    python3 -m venv .venv
    . .venv/bin/activate
    python3 -m pip install -r requirements-exp.txt
    The requirements-exp.txt is the exact environment that was used to generate the figures. The requirements.txt contains the simplified "direct" dependencies of the environment.
  2. Open and run each notebook.

The notebooks

transfer_pendulum.ipynb

Generates the figures for the simulated pendulum. Note that the "Optimal" data generation takes a looong time (>30h) to compute (because of sub-optimal single threaded Python).

transfer_real_pendulum.ipynb

Generates the figure for the real pendulum. This notebook is not particularly long to run. The code used to run the pendulum to gather the experimental data is in experimental_code.zip. The raw data is contained in the bags/ directory as ROS2 bags.

transfer_halfcheetah.ipynb

Generates the figure for the HalfCheetah. This notebook takes about 1h to run. The make_cheetah.py module is used to generate custom HalfCheetah MuJoCo specs.