NovelD: A Simple yet Effective Exploration Criterion

Intro

This is an implementation of the method proposed in

NovelD: A Simple yet Effective Exploration Criterion and BeBold: Exploration Beyond the Boundary of Explored Regions

Citation

If you use this code in your own work, please cite our paper:

@article{zhang2021noveld,
  title={NovelD: A Simple yet Effective Exploration Criterion},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

@article{zhang2020bebold,
  title={BeBold: Exploration Beyond the Boundary of Explored Regions},
  author={Zhang, Tianjun and Xu, Huazhe and Wang, Xiaolong and Wu, Yi and Keutzer, Kurt and Gonzalez, Joseph E and Tian, Yuandong},
  journal={arXiv preprint arXiv:2012.08621},
  year={2020}
}

Installation

# Install Instructions
conda create -n ride python=3.7
conda activate noveld 
git clone [email protected]:tianjunz/NovelD.git
cd NovelD
pip install -r requirements.txt

Train NovelD on MiniGrid

OMP_NUM_THREADS=1 python main.py --model bebold --env MiniGrid-ObstructedMaze-2Dlhb-v0 --total_frames 500000000 --intrinsic_reward_coef 0.05 --entropy_cost 0.0005

Acknowledgements

Our vanilla RL algorithm is based on RIDE.

License

This code is under the CC-BY-NC 4.0 (Attribution-NonCommercial 4.0 International) license.

NovelD: A Simple yet Effective Exploration Criterion

Related tags

Overview

NovelD: A Simple yet Effective Exploration Criterion

Intro

Citation

Installation

Train NovelD on MiniGrid

Acknowledgements

License

Owner

Trading Gym is an open source project for the development of reinforcement learning algorithms in the context of trading.

Repository of Vision Transformer with Deformable Attention

StyleTransfer - Open source style transfer project, based on VGG19

UniFormer - official implementation of UniFormer

PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

FluxTraining.jl gives you an endlessly extensible training loop for deep learning

ProFuzzBench - A Benchmark for Stateful Protocol Fuzzing

A PyTorch Implementation of the Luna: Linear Unified Nested Attention

Rename Images with Auto Generated Neural Image Captions

[ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"

Interactive Image Generation via Generative Adversarial Networks

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Pytorch codes for "Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation"

This repository contains the code for "SBEVNet: End-to-End Deep Stereo Layout Estimation" paper by Divam Gupta, Wei Pu, Trenton Tabor, Jeff Schneider

Experiment about Deep Person Re-identification with EfficientNet-v2

Uncertainty Estimation via Response Scaling for Pseudo-mask Noise Mitigation in Weakly-supervised Semantic Segmentation

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"

Data-Driven Operational Space Control for Adaptive and Robust Robot Manipulation

A PyTorch implementation of EventProp [https://arxiv.org/abs/2009.08378], a method to train Spiking Neural Networks

This is a beginner-friendly repo to make a collection of some unique and awesome projects. Everyone in the community can benefit & get inspired by the amazing projects present over here.