Quickstart¶
0. Requirements¶
Python 3.8 or higher
(Optional) CUDA-compatible GPU for faster training
1. Installation¶
You can install Duo using pip:
pip install duo-ai
Alternatively, install from source:
git clone --recurse-submodules https://github.com/khanhptnk/duo-ai.git
pip install -e .
Check that Duo was installed correctly by running:
python -c "import duo_ai; print(duo_ai.__version__)"
2. Training a Random Coordination Policy for Procgen-Coinrun¶
If you have not already cloned the repository in the previous step, do so now. The repository already integrates the Procgen environments:
git clone --recurse-submodules https://github.com/khanhptnk/duo-ai.git
Install requirements for Procgen environments:
pip install -r requirements/requirements_procgen.txt
Next, train the coordination policy using the following command:
python examples/procgen_yrc.py \
--config configs/procgen_random.yaml \
--mode train \
--type coord \
overwrite=1
Training takes about 16 minutes to complete on an RTX 6000 GPU. You should expect a reward of around 5.73:
[0:16:12 INFO]: BEST test so far
[0:16:12 INFO]: Parameters: {'temperature': 1.0, 'threshold': -1.9764122247695923}
[0:16:12 INFO]: Steps: 18,792
Episode length: mean 73.41 min 17.00 max 256.00
Reward: mean 5.73 ± 0.58
Base Reward: mean 6.88 ± 0.57
Action 1 fraction: 0.36
Wandb logs of all methods on Procgen-Coinrun are available here.