DrQ-v2: Improved Data-Augmented Reinforcement Learning

Last update: Jan 01, 2023

DrQ-v2: Improved Data-Augmented RL Agent

Method

DrQ-v2 is a model-free off-policy algorithm for image-based continuous control. DrQ-v2 builds on DrQ, an actor-critic approach that uses data augmentation to learn directly from pixels. We introduce several improvements including:

Switch the base RL learner from SAC to DDPG.
Incorporate n-step returns to estimate TD error.
Introduce a decaying schedule for exploration noise.
Make implementation 3.5 times faster.
Find better hyper-parameters.

These changes allow us to significantly improve sample efficiency and wall-clock training time on a set of challenging tasks from the DeepMind Control Suite compared to prior methods. Furthermore, DrQ-v2 is able to solve complex humanoid locomotion tasks directly from pixel observations, previously unattained by model-free RL.
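
Two of the improvements listed above, n-step returns and a decaying exploration-noise schedule, can be illustrated with a minimal sketch. This is not the repository's code: the function names, parameter names, and the linear shape of the decay are assumptions made for illustration, and the bootstrap value is assumed to come from a target critic evaluated n steps ahead.

```python
from typing import Sequence


def n_step_return(rewards: Sequence[float], bootstrap_value: float, discount: float) -> float:
    """Compute an n-step TD target (hypothetical helper).

    Returns r_0 + g*r_1 + ... + g^(n-1)*r_(n-1) + g^n * bootstrap_value,
    where g is the discount and n = len(rewards).
    """
    target = bootstrap_value
    # Fold the rewards in from the last step back to the first.
    for reward in reversed(rewards):
        target = reward + discount * target
    return target


def linear_schedule(init: float, final: float, duration: int, step: int) -> float:
    """Anneal linearly from init to final over `duration` steps, then hold at final.

    The linear shape is an assumption; the text above only says "decaying".
    """
    mix = min(max(step / duration, 0.0), 1.0)
    return (1.0 - mix) * init + mix * final


if __name__ == "__main__":
    # Three-step target with a bootstrap value standing in for a critic estimate.
    print(n_step_return([1.0, 1.0, 1.0], bootstrap_value=10.0, discount=0.5))
    # Exploration noise scale at the start, midpoint, and end of the schedule.
    for step in (0, 50, 100):
        print(step, linear_schedule(1.0, 0.1, duration=100, step=step))
```

In a DDPG-style learner, the value returned by `n_step_return` would serve as the regression target for the critic, and the value returned by `linear_schedule` would set the standard deviation of the noise added to the actor's actions during exploration.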

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{yarats2021drqv2,
  title={Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning},
  author={Denis Yarats and Rob Fergus and Alessandro Lazaric and Lerrel Pinto},
  journal={arXiv preprint arXiv:},
  year={2021}
}

Instructions

Install dependencies:

conda env create -f conda_env.yml
conda activate drqv2

Train the agent:

python train.py task=quadruped_walk

Monitor results:

tensorboard --logdir exp_local

License

The majority of DrQ-v2 is licensed under the MIT license; however, portions of the project are available under separate license terms: DeepMind is licensed under the Apache 2.0 license.

Owner

Facebook Research