Deeprl - Standard DQN and dueling network for simple games

Last update: Apr 12, 2020

Overview

DeepRL

This code implements the standard deep Q-learning and dueling network with experience replay (memory buffer) for playing simple games.

DQN algorithm implemented in this code is from the Google DeepMind's paper Playing Atari with Deep Reinforcement Learning[link].

Dueling network is from the paper Dueling Network Architectures for Deep Reinforcement Learning [link]

Requirement

DeepRL is implemented with Torch and the packages of its ecosystem. This code is well worked on my Mac Pro with CPU (I haven't tested it on Linux and GPU). Install Torch7 firstly, then you should install the following packages by luarocks

luarocks install nn
luarocks install image
luarocks install qt
luarocks install optim

Running

You can run this code by tapping the command in the project dir.

qlua main.lua

The result looks like

DQN: I got the accuracy of 93.2% (932 success of 1000 epochs).

Dueling: I got the accuracy of 99.2% (992 success of 1000 epochs).

Code

The envir.lua indicates the environment in reinforcement learning stage, which receives the action and produces the states and a reward for agent.

The agent.lua is the implementation of agent which receives the states and reward to produce the action directed by the policy network.

The learner.lua is the learning algorithm of DQN with experience replay as the following.

MISC

I completed this code when I was an intern at Horizon Robotics. I will greatly thank the article of Andrej Karpathy and other implementations:SeanNaren's code and EderSantana's gist.

LICENSE

MIT

Deeprl - Standard DQN and dueling network for simple games

Related tags

Overview

DeepRL

Requirement

Running

Code

MISC

LICENSE

Owner

Yao Zhou

Synthesizing and manipulating 2048x1024 images with conditional GANs

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

Code for our TKDE paper "Understanding WeChat User Preferences and “Wow” Diffusion"

This is an official implementation for the WTW Dataset in "Parsing Table Structures in the Wild " on table detection and table structure recognition.

TextBPN Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)

Illuminated3D This project participates in the Nasa Space Apps Challenge 2021.

Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"

StarGAN - Official PyTorch Implementation (CVPR 2018)

MMRazor: a model compression toolkit for model slimming and AutoML

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

MPI Interest Group on Algorithms on 1st semester 2021

Rational Activation Functions - Replacing Padé Activation Units

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.

The code repository for "PyCIL: A Python Toolbox for Class-Incremental Learning" in PyTorch.

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

LIVECell - A large-scale dataset for label-free live cell segmentation

PyTorch Implementation of Spatially Consistent Representation Learning(SCRL)

A pure PyTorch batched computation implementation of "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition"

BboxToolkit is a tiny library of special bounding boxes.