Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Last update: Sep 16, 2022

Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Official implementation of ACC, described in the paper "Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning". The source code is based on the PyTorch implementation of TQC, which in turn is based on TD3. We thank the authors for making their source code publicly available.

Requirements

Install MuJoCo

Download and install MuJoCo 1.50 from the MuJoCo website. We assume that the MuJoCo files are extracted to the default location (~/.mujoco/mjpro150).
Copy your MuJoCo license key (mjkey.txt) to ~/.mujoco/mjkey.txt.
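
Before installing the Python dependencies, it can help to confirm that both paths mentioned above exist. The following is a minimal sketch, not part of the repository; the helper name missing_mujoco_files is hypothetical, and it only checks for the presence of ~/.mujoco/mjpro150 and ~/.mujoco/mjkey.txt, not their contents.

```python
from pathlib import Path
from typing import List, Optional


def missing_mujoco_files(root: Optional[Path] = None) -> List[Path]:
    """Return the expected MuJoCo paths that do not exist under `root`.

    `root` defaults to ~/.mujoco, the default location assumed above.
    """
    root = Path.home() / ".mujoco" if root is None else Path(root)
    # The extracted MuJoCo 1.50 directory and the license key file.
    expected = [root / "mjpro150", root / "mjkey.txt"]
    return [path for path in expected if not path.exists()]


if __name__ == "__main__":
    missing = missing_mujoco_files()
    if missing:
        for path in missing:
            print(f"missing: {path}")
    else:
        print("MuJoCo files found in the default location")
```

An empty result means both paths are present; it does not guarantee that the license key is valid.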

Install

We recommend using an Anaconda environment. In our experiments we used Python 3.7 and the following dependencies:

pip install gym==0.17.2 mujoco-py==1.50.1.68 numpy==1.19.1 torch==1.6.0 torchvision==0.7.0
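
To check that an environment matches the pinned versions, something like the sketch below could be used. It is an illustration rather than part of the repository: the names PINNED, installed_version and version_mismatches are hypothetical, and on Python 3.7 it assumes the importlib_metadata backport is installed, since importlib.metadata is only in the standard library from Python 3.8.

```python
from typing import Callable, Dict, Optional, Tuple

# Versions copied from the pip command above.
PINNED = {
    "gym": "0.17.2",
    "mujoco-py": "1.50.1.68",
    "numpy": "1.19.1",
    "torch": "1.6.0",
    "torchvision": "0.7.0",
}


def installed_version(name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        from importlib.metadata import PackageNotFoundError, version  # Python 3.8+
    except ImportError:
        # Assumption: the importlib_metadata backport is available on Python 3.7.
        from importlib_metadata import PackageNotFoundError, version
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def version_mismatches(
    pinned: Dict[str, str],
    lookup: Callable[[str], Optional[str]] = installed_version,
) -> Dict[str, Tuple[str, Optional[str]]]:
    """Map each package that differs from its pin to (expected, found)."""
    mismatches = {}
    for name, expected in pinned.items():
        found = lookup(name)
        # Ignore a local build suffix such as "+cpu" when comparing.
        if found is None or found.split("+")[0] != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    for name, (expected, found) in version_mismatches(PINNED).items():
        print(f"{name}: expected {expected}, found {found}")
```

A package reported with found set to None is not installed in the active environment.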

Running ACC

To run ACC for TQC on one of the Gym continuous control environments, call:

python main.py --env "HalfCheetah-v3" --max_timesteps 5000000 --seed 0

To run the data-efficient variant with 4 critic update steps per environment step, call:

python main.py --env "HalfCheetah-v3" --max_timesteps 1000000 --num_critic_updates 4 --seed 0

Example scripts that run the experiments for 10 seeds and all environments are provided in run_experiment.sh and run_experiment_data_efficient.sh.
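
The contents of those shell scripts are not reproduced here. As a rough illustration of the same idea, the sketch below builds one main.py command per environment and seed, using only the flags shown above. The function name build_commands is hypothetical, and the environment list in the example is a placeholder rather than the list used in the paper.

```python
import shlex
from typing import Iterable, List, Optional


def build_commands(
    envs: Iterable[str],
    seeds: Iterable[int],
    max_timesteps: int,
    num_critic_updates: Optional[int] = None,
    python: str = "python",
) -> List[List[str]]:
    """Build one main.py argument list per (environment, seed) pair."""
    seeds = list(seeds)  # allow reuse across environments
    commands = []
    for env in envs:
        for seed in seeds:
            cmd = [
                python, "main.py",
                "--env", env,
                "--max_timesteps", str(max_timesteps),
                "--seed", str(seed),
            ]
            # Only the data-efficient variant sets this flag.
            if num_critic_updates is not None:
                cmd += ["--num_critic_updates", str(num_critic_updates)]
            commands.append(cmd)
    return commands


if __name__ == "__main__":
    # Dry run: print the commands for 10 seeds on a placeholder environment list.
    for cmd in build_commands(["HalfCheetah-v3"], range(10), 5000000):
        print(" ".join(shlex.quote(arg) for arg in cmd))
```

Each printed line can be run from the repository root, or each argument list can be passed to subprocess.run to launch the runs sequentially. Passing max_timesteps=1000000 and num_critic_updates=4 yields the data-efficient commands.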

You can speed up the experiments by using fewer networks in the TQC ensemble. This trades a small amount of performance for a faster runtime (see the Appendix of the paper). The number of networks is controlled with the flag --n_nets. For example:

python main.py --env "HalfCheetah-v3" --max_timesteps 5000000 --n_nets 2 --seed 0
