Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments

Last update: Oct 29, 2022

Related tags

Overview

IROS21 information

To test the code and reproduce the experiments, follow the installation steps in Installation.md. Afterwards, follow the steps in Evaluations.md.

To test the different Waypoint Generators, follow the steps in waypoint_eval.md

DRL agents are located in the agents folder.

Arena-MARL

A flexible, high-performance 2D simulator with configurable agents, multiple sensors, and benchmark scenarios for testing robotic navigation in multi-agent settings.

Arena-MARL uses Flatland as the core simulator and is a modular high-level library for end-to-end experiments in embodied AI -- defining embodied AI tasks (e.g. navigation, obstacle avoidance, behavior cloning), training agents (via imitation or reinforcement learning, or no learning at all using conventional approaches like DWA, TEB or MPC), and benchmarking their performance on the defined tasks using standard metrics.


Before Training	After Training

What is this repository for?

Train DRL agents on ROS compatible simulations for autonomous navigation in highly dynamic environments. Flatland-DRL integration is inspired by Ronja Gueldenring's work: drl_local_planner_ros_stable_baselines. Test state of the art local and global planners in ROS environments both in simulation and on real hardware. Following features are included:

Setup to train a local planner with reinforcement learning approaches from stable baselines3
Training in simulator Flatland in train mode
Include realistic behavior patterns and semantic states of obstacles (speaking, running, etc.)
Include different obstacles classes (other robots, vehicles, types of persons, etc.)
Implementation of intermediate planner classes to combine local DRL planner with global map-based planning of ROS Navigation stack
Testing a variety of planners (learning based and model based) within specific scenarios in test mode
Modular structure for extension of new functionalities and approaches

Start Guide

We recommend starting with the start guide which contains all information you need to know to start off with this project including installation on Linux and Windows as well as tutorials to start with.

For Mac, please refer to our Docker.

1. Installation

Please refer to Installation.md for detailed explanations about the installation process.

1.1. Docker

We provide a Docker file to run our code on other operating systems. Please refer to Docker.md for more information.

2. Usage

DRL Training

Please refer to DRL-Training.md for detailed explanations about agent, policy and training setups.

Scenario Creation with the arena-scenario-gui

To create complex, collaborative scenarios for training and/or evaluation purposes, please refer to the repo arena-scenario-gui. This application provides you with an user interface to easily create complex scenarios with multiple dynamic and static obstacles by drawing and other simple UI elements like dragging and dropping. This will save you a lot of time in creating complex scenarios for you individual use cases.

Used third party repos:

Flatland: http://flatland-simulator.readthedocs.io
ROS navigation stack: http://wiki.ros.org/navigation
Pedsim: https://github.com/srl-freiburg/pedsim_ros
Pettingzoo: https://github.com/Farama-Foundation/PettingZoo
Supersuit: https://github.com/Farama-Foundation/SuperSuit

Multi Agent Reinforcement Learning for ROS in 2D Simulation Environments

Related tags

Overview

IROS21 information

Arena-MARL

What is this repository for?

Start Guide

1. Installation

1.1. Docker

2. Usage

DRL Training

Scenario Creation with the arena-scenario-gui

Used third party repos:

Owner

Official implementation of "Articulation Aware Canonical Surface Mapping"

Python-kafka-reset-consumergroup-offset-example - Python Kafka reset consumergroup offset example

An implementation of MobileFormer

PyTorch inference for "Progressive Growing of GANs" with CelebA snapshot

《Unsupervised 3D Human Pose Representation with Viewpoint and Pose Disentanglement》(ECCV 2020) GitHub: [fig9]

Histology images query (unsupervised)

3D cascade RCNN for object detection on point cloud

Modular Probabilistic Programming on MXNet

Goal of the project : Detecting Temporal Boundaries in Sign Language videos

Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"

A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

TensorFlow implementation of ENet

Bolt Online Learning Toolbox

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Implementation of average- and worst-case robust flatness measures for adversarial training.

QilingLab challenge writeup

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

This code provides various models combining dilated convolutions with residual networks

KwaiRec: A Fully-observed Dataset for Recommender Systems (Density: Almost 100%)

A PyTorch-based R-YOLOv4 implementation which combines YOLOv4 model and loss function from R3Det for arbitrary oriented object detection.