Zero-Shot Transfer using Buckingham's Pi Theorem

This repo contains the code used to generate the figures of Zero-Shot Transfer in Reinforcement Learning using Buckingham's Pi Theorem.

Running the notebooks

Setup the environment.
```
python3 -m venv .venv
. .venv/bin/activate
python3 -m pip install -r requirements-exp.txt
```
The requirements-exp.txt is the exact environment that was used to generate the figures. The requirements.txt contains the simplified "direct" dependencies of the environment.
Open and run each notebook.

The notebooks

transfer_pendulum.ipynb

Generates the figures for the simulated pendulum. Note that the "Optimal" data generation takes a looong time (>30h) to compute (because of sub-optimal single threaded Python).

transfer_real_pendulum.ipynb

Generates the figure for the real pendulum. This notebook is not particularly long to run. The code used to run the pendulum to gather the experimental data is in experimental_code.zip. The raw data is contained in the bags/ directory as ROS2 bags.

transfer_halfcheetah.ipynb

Generates the figure for the HalfCheetah. This notebook takes about 1h to run. The make_cheetah.py module is used to generate custom HalfCheetah MuJoCo specs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zero-Shot Transfer using Buckingham's Pi Theorem

Running the notebooks

The notebooks

transfer_pendulum.ipynb

transfer_real_pendulum.ipynb

transfer_halfcheetah.ipynb

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Zero-Shot Transfer using Buckingham's Pi Theorem

Running the notebooks

The notebooks

transfer_pendulum.ipynb

transfer_real_pendulum.ipynb

transfer_halfcheetah.ipynb