Generalized Decision Transformer for Offline Hindsight Information Matching

If you use this codebase for your research, please cite the paper:

@article{furuta2021generalized,
  title={Generalized Decision Transformer for Offline Hindsight Information Matching},
  author={Hiroki Furuta and Yutaka Matsuo and Shixiang Shane Gu},
  journal={arXiv preprint arXiv:2111.10364},
  year={2021}
}

Installation

Experiments require MuJoCo. Follow the instructions in the mujoco-py repo to install it. Then install the remaining dependencies with the following command:

conda env create -f conda_env.yml

Downloading datasets

Datasets are stored in the data directory. Install the D4RL repo by following the instructions there, then run the following script to download the datasets and save them in our format:

python download_d4rl_datasets.py
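
After the download finishes, it can help to sanity-check a saved file before training. The sketch below is only an illustration: this README does not describe the on-disk format, so it assumes each dataset is a pickled list of per-trajectory dicts holding per-step arrays (the convention used by the original Decision Transformer code). The file name and the "rewards" key are assumptions as well; adjust them to whatever the script actually writes into the data directory.

```python
import pickle
from pathlib import Path


def summarize_trajectories(path):
    """Return (number of trajectories, total timesteps, sorted field names)."""
    with open(path, "rb") as f:
        # Assumed format: a list of dicts, one dict per trajectory.
        trajectories = pickle.load(f)

    # Assumption: each trajectory has a "rewards" entry with one value per step.
    total_steps = sum(len(traj["rewards"]) for traj in trajectories)
    fields = sorted(trajectories[0].keys()) if trajectories else []
    return len(trajectories), total_steps, fields


if __name__ == "__main__":
    # Hypothetical file name; check the data directory for the real ones.
    dataset_path = Path("data") / "halfcheetah-medium-expert-v2.pkl"
    if dataset_path.exists():
        print(summarize_trajectories(dataset_path))
```

If the printed field names or step counts look wrong, the format assumption above probably does not hold for this codebase, and download_d4rl_datasets.py is the place to check.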

Run experiments

Run train_cdt.py to train Categorical DT:

python train_cdt.py --env halfcheetah --dataset medium-expert --gpu 0 --seed 0 --dist_dim 30 --n_bins 31 --condition 'reward' --save_model True

python train_cdt.py --env halfcheetah --dataset medium-expert --gpu 0 --seed 0 --dist_dim 30 --n_bins 31 --condition 'xvel' --save_model True

Run eval_cdt.py to evaluate CDT using the saved weights:

python eval_cdt.py --env halfcheetah --dataset medium-expert --gpu 0 --seed 0 --dist_dim 30 --n_bins 31 --condition 'reward' --save_rollout True
python eval_cdt.py --env halfcheetah --dataset medium-expert --gpu 0 --seed 0 --dist_dim 30 --n_bins 31 --condition 'xvel' --save_rollout True
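
To repeat the train and eval commands above across several seeds and both conditions, a small driver script can generate them. The following is a minimal sketch, not part of this repository: the helper names are hypothetical, seeds other than 0 are an assumption, and it uses only the flags and values shown above. It prints the commands by default and runs them only when dry_run is set to False.

```python
import shlex
import subprocess


def cdt_command(script, seed, condition, save_flag):
    """Build one command line, keeping the flag order used in this README."""
    return [
        "python", script,
        "--env", "halfcheetah",
        "--dataset", "medium-expert",
        "--gpu", "0",
        "--seed", str(seed),
        "--dist_dim", "30",
        "--n_bins", "31",
        "--condition", condition,
        save_flag, "True",
    ]


def build_cdt_commands(seeds, conditions):
    """Return a train command followed by its eval command per (condition, seed)."""
    commands = []
    for condition in conditions:
        for seed in seeds:
            commands.append(cdt_command("train_cdt.py", seed, condition, "--save_model"))
            commands.append(cdt_command("eval_cdt.py", seed, condition, "--save_rollout"))
    return commands


def run_all(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # stop at the first failing run


if __name__ == "__main__":
    # Seeds 1 and 2 are illustrative; the README itself only shows seed 0.
    run_all(build_cdt_commands(seeds=[0, 1, 2], conditions=["reward", "xvel"]))
```

Each eval command is placed directly after its matching train command, so the saved weights exist by the time evaluation starts.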

For Bi-directional DT, run train_bdt.py and eval_bdt.py:

python train_bdt.py --env halfcheetah --dataset medium-expert --gpu 0 --seed 0 --dist_dim 30 --n_bins 31 --z_dim 16 --save_model True
python eval_bdt.py --env halfcheetah --dataset medium-expert --gpu 0 --seed 0 --dist_dim 30 --n_bins 31 --z_dim 16 --save_rollout True

Reference

This repository is built on top of the original Decision Transformer.
