Evaluating different engineering tricks that make RL work

Last update: Dec 26, 2022

Related tags

Overview

Reinforcement Learning Tricks, Index

This repository contains the code for the paper "Distilling Reinforcement Learning Tricks for Video Games".

Short story shorter: RL algorithms are neat and all, but to get it to work in video games (RL competitions and whatnot), there are some nifty little tricks involved that need bit of expertise in the domain. This includes reward shaping, curriculum learning, splitting task into subtasks by hand and guiding agent's actions. We took some of these tricks and tried them on three environments with DQN. With right setup you get more out of DQN.

Code authors: Anssi Kanervisto, Christian Scheller and Yanick Schraner.

The experiments in the three environments are split into three git branches:

vizdoom for ViZDoom Deathmatch experiments
minerl for MineRL ObtainDiamond experiments
gfootball for Football environment experiments

To run the experiments, checkout the repository you want to run experiments for with git checkout [branch name], and follow the instructions in the README file there.

After running all the experiments, collect the results as described the respective branches. You should have three directories

vizdoom-runs
minerl-runs
football-runs

After this, running python plot_paper.py should create a figures/learning_curves.pdf file which summarizes the results.

Evaluating different engineering tricks that make RL work

Related tags

Overview

Reinforcement Learning Tricks, Index

Owner

Anssi

Improving Machine Translation Systems via Isotopic Replacement

ICSS - Interactive Continual Semantic Segmentation

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Code for the paper Task Agnostic Morphology Evolution.

Keyhole Imaging: Non-Line-of-Sight Imaging and Tracking of Moving Objects Along a Single Optical Path

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Machine learning for NeuroImaging in Python

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Self-Supervised Learning

PyGCL: A PyTorch Library for Graph Contrastive Learning

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

The official repository for "Score Transformer: Generating Musical Scores from Note-level Representation" (MMAsia '21)

This repository contains python code necessary to replicated the experiments performed in our paper "Invariant Ancestry Search"

NR-GAN: Noise Robust Generative Adversarial Networks

✅ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.

a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

The Generic Manipulation Driver Package - Implements a ROS Interface over the robotics toolbox for Python

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

Evaluating different engineering tricks that make RL work

Related tags

Overview

Reinforcement Learning Tricks, Index

Owner

Anssi

Improving Machine Translation Systems via Isotopic Replacement

ICSS - Interactive Continual Semantic Segmentation

[NeurIPS'21] "AugMax: Adversarial Composition of Random Augmentations for Robust Training" by Haotao Wang, Chaowei Xiao, Jean Kossaifi, Zhiding Yu, Animashree Anandkumar, and Zhangyang Wang.

Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)

Code for the paper Task Agnostic Morphology Evolution.

Keyhole Imaging: Non-Line-of-Sight Imaging and Tracking of Moving Objects Along a Single Optical Path

Software that can generate photos from paintings, turn horses into zebras, perform style transfer, and more.

Machine learning for NeuroImaging in Python

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Self-Supervised Learning

PyGCL: A PyTorch Library for Graph Contrastive Learning

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

The official repository for "Score Transformer: Generating Musical Scores from Note-level Representation" (MMAsia '21)

This repository contains python code necessary to replicated the experiments performed in our paper "Invariant Ancestry Search"

NR-GAN: Noise Robust Generative Adversarial Networks

✅ How Robust are Fact Checking Systems on Colloquial Claims?. In NAACL-HLT, 2021.

a reimplementation of Optical Flow Estimation using a Spatial Pyramid Network in PyTorch

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

The Generic Manipulation Driver Package - Implements a ROS Interface over the robotics toolbox for Python

A PyTorch implementation for V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.