PPO : Proximal Policy Optimization

Introduction

This is our PPO implementation used for reinforcement learning of a controller in:

Emanuel Joos, Fabien Péan, Orcun Goksel:
"Reinforcement Learning of Musculoskeletal Control from Functional Simulations",
arXiv:2007.06669, 2020.

If you use this code, please cite the work above.
Installation instructions and a simple demo example are given below.

Set up python environment

Clone OpenAI gym and OpenAI baselines from:
https://github.com/openai/baselines
https://github.com/openai/gym/
Copy the test_envs.py to OpenAI/gym_shoulder/gym_OneMuscle/envs/ (Create the directory first)
Copy the init.py to the same directory
Copy the setup.py to OpenAI/gym_shoulder/
Download and install conda
Set up the virtual environment

conda create -n PPO python=3.5
conda activate PPO
pip3 install https://download.pytorch.org/whl/cpu/torch-1.1.0-cp35-cp35m-linux_x86_64.whl --user
pip3 install https://download.pytorch.org/whl/cpu/torchvision-0.3.0-cp35-cp35m-linux_x86_64.whl --user
pip install seaborn psutil matplotlib pandas tensorboard tensorboardX
pip install --user ipykernel
python -m ipykernel install --user --name=PPO
cd OpenAI/baselines
pip install -e .
cd OpenAI/gym
pip install -e .

Train a simple control example

source activate PPO
cd PPODummyExample
python main.py --vis True

Training is very slow if vis is set true. For all availabe arguments please have a look at PPODummyExample/main.py

Run the pretrained example for control

source activate PPO
cd PPODummyExample
python main.py --play True

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
PPODummyExample		PPODummyExample
videos		videos
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
setup.py		setup.py
test_envs.py		test_envs.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PPO : Proximal Policy Optimization

Introduction

Set up python environment

Train a simple control example

Run the pretrained example for control

About

Releases

Packages

Languages

License

CAiM-lab/PPO

Folders and files

Latest commit

History

Repository files navigation

PPO : Proximal Policy Optimization

Introduction

Set up python environment

Train a simple control example

Run the pretrained example for control

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages