PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Last update: Dec 14, 2022

Related tags

Deep Learning HIGL

Overview

HIGL

This is a PyTorch implementation for our paper: Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning (NeurIPS 2021).

Our code is based on official implementation of HRAC (NeurIPS 2020) and Map-planner (NeurIPS 2019)

Installation

conda create -n higl python=3.6
conda activate higl
./install_all.sh

Also, to run the MuJoCo experiments, a license is required (see here).

Usage

Training & Evaluation

Point Maze

./scripts/point_maze_sparse.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/point_maze_sparse.sh dense 5e5 0 2
./scripts/point_maze_sparse.sh sparse 5e5 0 2

Ant Maze (U-shape)

./scripts/higl_ant_maze_u.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/higl_ant_maze_u.sh dense 10e5 0 2
./scripts/higl_ant_maze_u.sh sparse 10e5 0 2

Ant Maze (W-shape)

./scripts/higl_ant_maze_w.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/higl_ant_maze_w.sh dense 10e5 0 2
./scripts/higl_ant_maze_w.sh sparse 10e5 0 2

Reacher & Pusher

./scripts/higl_fetch.sh ${env} ${timesteps} ${gpu} ${seed}
./scripts/higl_fetch.sh Reacher3D-v0 5e5 0 2
./scripts/higl_fetch.sh Pusher-v0 10e5 0 2

Stochastic Ant Maze (U-shape)

./scripts/higl_ant_maze_u_stoch.sh ${reward_shaping} ${timesteps} ${gpu} ${seed}
./scripts/higl_ant_maze_u_stoch.sh dense 10e5 0 2
./scripts/higl_ant_maze_u_stoch.sh sparse 10e5 0 2

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Related tags

Overview

HIGL

Installation

Usage

Training & Evaluation

Owner

Junsu Kim

ONNX-GLPDepth - Python scripts for performing monocular depth estimation using the GLPDepth model in ONNX

Fuzzing JavaScript Engines with Aspect-preserving Mutation

A PyTorch implementation of Radio Transformer Networks from the paper "An Introduction to Deep Learning for the Physical Layer".

Tensorflow implementation of MIRNet for Low-light image enhancement

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Framework to build and train RL algorithms

QQ Browser 2021 AI Algorithm Competition Track 1 1st Place Program

bio_inspired_min_nets_improve_the_performance_and_robustness_of_deep_networks

DCGAN-tensorflow - A tensorflow implementation of Deep Convolutional Generative Adversarial Networks

Predicting the duration of arrival delays for commercial flights.

Pytorch Implementation of Residual Vision Transformers(ResViT)

Running AlphaFold2 (from ColabFold) in Azure Machine Learning

Crossover Learning for Fast Online Video Instance Segmentation (ICCV 2021)

Measures input lag without dedicated hardware, performing motion detection on recorded or live video

Adjusting for Autocorrelated Errors in Neural Networks for Time Series

Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework

This code is a near-infrared spectrum modeling method based on PCA and pls

[SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search

A complete end-to-end demonstration in which we collect training data in Unity and use that data to train a deep neural network to predict the pose of a cube. This model is then deployed in a simulated robotic pick-and-place task.

CapsuleVOS: Semi-Supervised Video Object Segmentation Using Capsule Routing