Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Last update: Dec 26, 2022

Related tags

Deep Learning DQN-tensorflow

Overview

Human-Level Control through Deep Reinforcement Learning

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning.

This implementation contains:

Deep Q-network and Q-learning
Experience replay memory
- to reduce the correlations between consecutive updates
Network for Q-learning targets are fixed for intervals
- to reduce the correlations between target and predicted Q-values

Requirements

Python 2.7 or Python 3.3+
gym
tqdm
SciPy or OpenCV2
TensorFlow 0.12.0

Usage

First, install prerequisites with:

$ pip install tqdm gym[all]

To train a model for Breakout:

$ python main.py --env_name=Breakout-v0 --is_train=True
$ python main.py --env_name=Breakout-v0 --is_train=True --display=True

To test and record the screen with gym:

$ python main.py --is_train=False
$ python main.py --is_train=False --display=True

Results

Result of training for 24 hours using GTX 980 ti.

Simple Results

Details of Breakout with model m2(red) for 30 hours using GTX 980 Ti.

Details of Breakout with model m3(red) for 30 hours using GTX 980 Ti.

Detailed Results

[1] Action-repeat (frame-skip) of 1, 2, and 4 without learning rate decay

[2] Action-repeat (frame-skip) of 1, 2, and 4 with learning rate decay

[1] & [2]

[3] Action-repeat of 4 for DQN (dark blue) Dueling DQN (dark green) DDQN (brown) Dueling DDQN (turquoise)

The current hyper parameters and gradient clipping are not implemented as it is in the paper.

[4] Distributed action-repeat (frame-skip) of 1 without learning rate decay

[5] Distributed action-repeat (frame-skip) of 4 without learning rate decay

References

License

MIT License.

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Related tags

Overview

Human-Level Control through Deep Reinforcement Learning

Requirements

Usage

Results

Simple Results

Detailed Results

References

License

Owner

Devsisters Corp.

Contrastively Disentangled Sequential Variational Audoencoder

Co-GAIL: Learning Diverse Strategies for Human-Robot Collaboration

Picasso: a methods for embedding points in 2D in a way that respects distances while fitting a user-specified shape.

Spatial Action Maps for Mobile Manipulation (RSS 2020)

Python framework for Stochastic Differential Equations modeling

Request execution of Galaxy SARS-CoV-2 variation analysis workflows on input data you provide.

A PyTorch implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"

The repository for freeCodeCamp's YouTube course, Algorithmic Trading in Python

An official source code for "Augmentation-Free Self-Supervised Learning on Graphs"

Contrastive Learning for Compact Single Image Dehazing, CVPR2021

docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

tf2-keras implement yolov5

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

DRIFT is a tool for Diachronic Analysis of Scientific Literature.

MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images (ISBI 2021, MELBA 2021)

Neural Radiance Fields Using PyTorch

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification

Unsupervised Representation Learning by Invariance Propagation

IOT: Instance-wise Layer Reordering for Transformer Structures

Image Captioning using CNN and Transformers