A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

Last update: Jan 05, 2023

Related tags

Overview

This is a re-implementation of the model-based RL algorithm MBPO in pytorch as described in the following paper: When to Trust Your Model: Model-Based Policy Optimization.

This code is based on a previous paper in the NeurIPS reproducibility challenge that reproduces the result with a tensorflow ensemble model but shows a significant drop in performance with a pytorch ensemble model. This code re-implements the ensemble dynamics model with pytorch and closes the gap.

Reproduced results

The comparison are done on two tasks while other tasks are not tested. But on the tested two tasks, the pytorch implementation achieves similar performance compared to the official tensorflow code.

Dependencies

MuJoCo 1.5 & MuJoCo 2.0

Usage

python main_mbpo.py --env_name 'Walker2d-v2' --num_epoch 300 --model_type 'pytorch'

python main_mbpo.py --env_name 'Hopper-v2' --num_epoch 300 --model_type 'pytorch'

Reference

Official tensorflow implementation: https://github.com/JannerM/mbpo
Code to the reproducibility challenge paper: https://github.com/jxu43/replication-mbpo

A pytorch reprelication of the model-based reinforcement learning algorithm MBPO

Related tags

Overview

Overview

Reproduced results

Dependencies

Usage

Reference

Owner

Xingyu Lin

UnpNet - Rethinking 3-D LiDAR Point Cloud Segmentation(IEEE TNNLS)

Some bravo or inspiring research works on the topic of curriculum learning.

SWA Object Detection

Dynamic hair modeling from monocular videos using deep neural networks

A strongly-typed genetic programming framework for Python

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

Multi-Stage Progressive Image Restoration

Use unsupervised and supervised learning to predict stocks

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly

DeepAL: Deep Active Learning in Python

Mini Software that give reminder to drink water as per your weight.

Pytorch implementation of "Neural Wireframe Renderer: Learning Wireframe to Image Translations"

Scheme for training and applying a label propagation framework

This is the repository for The Machine Learning Workshops, published by AI DOJO

Learn about Spice.ai with in-depth samples

An offline deep reinforcement learning library

Multi-atlas segmentation (MAS) is a promising framework for medical image segmentation

Feed forward VQGAN-CLIP model, where the goal is to eliminate the need for optimizing the latent space of VQGAN for each input prompt

ML-Ensemble – high performance ensemble learning