Reinforcement learning library in JAX.

Last update: Oct 30, 2022

Overview

Magi RL library in JAX

Magi is an RL library built on top of Acme.

Note: Magi is in alpha development so expect breaking changes!

Installation

Create a new Python virtual environment

python3 -m venv venv
source venv/bin/activate

Install dependencies and the package in editable mode by running

pip install -U pip setuptools wheel
pip install -r requirements.txt  # Uses pinned dependencies; adjust them as needed.
pip install -e .

If installation fails, first check the GitHub Actions badge to see whether the latest CI run also fails. If CI is passing, the problem is most likely in your local environment setup. Refer to .github/workflows/ci.yaml as the official source for how to set up the environment.

Agents

Magi includes implementations of popular RL algorithms such as SAC, DrQ, SAC-AE, and PETS. Refer to magi/agents for the full list of agents.

Examples

See magi/examples for examples of running our RL agents on popular benchmark tasks.

Testing

On Linux, you can run tests with

JAX_PLATFORM_NAME=cpu pytest -n `grep -c ^processor /proc/cpuinfo` magi
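The command above reads the core count from /proc/cpuinfo, which exists only on Linux. As a minimal sketch of a portable alternative, the Python snippet below builds the same command using os.cpu_count(). The helper name build_pytest_command is hypothetical and not part of Magi; the sketch assumes pytest and pytest-xdist (which provides the -n option) are installed.

```python
import os
import shlex


def build_pytest_command(package="magi", workers=None):
    """Return (env, argv) for running the test suite on CPU with pytest-xdist.

    Hypothetical helper for illustration only; it is not part of Magi.
    """
    # os.cpu_count() can return None on unusual platforms, so fall back to 1.
    n_workers = workers if workers is not None else (os.cpu_count() or 1)
    # Same environment variable as the shell command in this README.
    env = {"JAX_PLATFORM_NAME": "cpu"}
    argv = ["pytest", "-n", str(n_workers), package]
    return env, argv


if __name__ == "__main__":
    env, argv = build_pytest_command()
    prefix = " ".join(f"{key}={value}" for key, value in env.items())
    # Print the command so it can be copied into a shell.
    print(prefix, shlex.join(argv))
```

pytest-xdist also accepts -n auto, which picks the worker count itself, so replacing the backtick expression with auto may be a simpler option on other platforms.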

Contributing

Refer to CONTRIBUTING.md.

Acknowledgements

Magi is inspired by many of the open-source RL projects out there. Here is a (non-exhaustive) list of related libraries and packages that Magi references:

License

Apache License 2.0

Citation

If you use Magi in your work, please cite us according to the CITATION file. You may learn more about the CITATION file from here.

Owner

Yicheng Luo
