Do you want an RL agent that moves nicely on Atari?

Rainbow is all you need!

This is a step-by-step tutorial from DQN to Rainbow. Every chapter contains both theoretical background and an object-oriented implementation. Just pick any topic you are interested in, and learn! You can run the notebooks right away with Colab, even on your smartphone.

Please feel free to open an issue or a pull request if you have any ideas for improving it. :)

If you want a tutorial for policy gradient methods, please see PG is All You Need.

Contents

DQN [NBViewer] [Colab]
DoubleDQN [NBViewer] [Colab]
PrioritizedExperienceReplay [NBViewer] [Colab]
DuelingNet [NBViewer] [Colab]
NoisyNet [NBViewer] [Colab]
CategoricalDQN [NBViewer] [Colab]
N-stepLearning [NBViewer] [Colab]
Rainbow [NBViewer] [Colab]

Prerequisites

This repository is tested in an Anaconda virtual environment with Python 3.7+.

conda create -n rainbow-is-all-you-need python=3.7
conda activate rainbow-is-all-you-need
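
After activating the environment, a quick check along these lines can confirm that the interpreter meets the 3.7+ requirement stated above. This is only a minimal sketch: the helper name `meets_minimum` and the constant `MIN_PYTHON` are hypothetical and not part of the repository.

```python
import sys

# Minimum version taken from the Prerequisites section (assumption: 3.7 is the floor).
MIN_PYTHON = (3, 7)


def meets_minimum(version_info=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor, ...) version is at least `minimum`.

    Hypothetical helper for illustration; not provided by the repository.
    """
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor) so patch releases do not matter.
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    if meets_minimum():
        print(f"Python {current} satisfies the 3.7+ requirement.")
    else:
        print(f"Python {current} is older than 3.7; recreate the environment.")
```

Running the script inside the activated environment reports whether the active interpreter is recent enough before you install the packages.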

Installation

First, clone the repository.

git clone https://github.com/Curt-Park/rainbow-is-all-you-need.git
cd rainbow-is-all-you-need

Second, install the packages required to run the code:

make setup

Contributors

Thanks go to these wonderful people (emoji key):

Jinwoo Park (Curt)
Kyunghwan Kim
Wei Chen
WANG Lei
leeyaf
ahmadF

This project follows the all-contributors specification. Contributions of any kind are welcome!
