Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Last update: Nov 16, 2021

Related tags

Deep Learning marl-design

Overview

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Official implementation of:

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Shriram Chennakesavalu and Grant M. Rotskoff

https://arxiv.org/abs/2111.06875

Abstract: Experimental advances enabling high-resolution external control create new opportunities to produce materials with exotic properties. In this work, we investigate how a multi-agent reinforcement learning approach can be used to design external control protocols for self-assembly. We find that a fully decentralized approach performs remarkably well even with a "coarse" level of external control. More importantly, we see that a partially decentralized approach, where we include information about the local environment allows us to better control our system towards some target distribution. We explain this by analyzing our approach as a partially-observed Markov decision process. With a partially decentralized approach, the agent is able to act more presciently, both by preventing the formation of undesirable structures and by better stabilizing target structures as compared to a fully decentralized approach.

Installing prerequisites (using conda)

conda env create -f environment.yml -n marldesign
conda activate marldesign

Possible --centralize_approach values are ("plaquette", "all", "grid_n"), where 1 < n < region_num/2

Sample training commands

python train.py --active --centralize_states --centralize_approach plaquette
python train.py --active --centralize_rewards --centralize_approach all
python train.py --centralize_rewards --centralize_states --centralize_approach grid_1

Sample testing commands

python test.py --active --num_samples 10  --centralize_states --centralize_approach plaquette
python test.py --active --num_samples 10 --centralize_rewards --centralize_approach grid_1
python test.py --centralize_rewards --num_samples 10 --centralize_states --centralize_approach grid_2

For a more theoretical description of the systems described here, please visit https://github.com/rotskoff-group/dissipative-design

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Related tags

Overview

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Installing prerequisites (using conda)

Sample training commands

Sample testing commands

Owner

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

Civsim is a basic civilisation simulation and modelling system built in Python 3.8.

PyTorch ,ONNX and TensorRT implementation of YOLOv4

source code the paper Fast and Robust Iterative Closet Point.

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Official implementation for paper: Feature-Style Encoder for Style-Based GAN Inversion

LOFO (Leave One Feature Out) Importance calculates the importances of a set of features based on a metric of choice,

.NET bindings for the Pytorch engine

Variational autoencoder for anime face reconstruction

A simple, fully convolutional model for real-time instance segmentation.

Vision-and-Language Navigation in Continuous Environments using Habitat

An Unbiased Learning To Rank Algorithms (ULTRA) toolbox

ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data

Pretrained models for Jax/Haiku; MobileNet, ResNet, VGG, Xception.

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

Implements MLP-Mixer: An all-MLP Architecture for Vision.

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking

This is a repository with the code for the ACL 2019 paper

yolov5 deepsort 行人车辆跟踪检测计数

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Related tags

Overview

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

Installing prerequisites (using conda)

Sample training commands

Sample testing commands

Owner

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

Civsim is a basic civilisation simulation and modelling system built in Python 3.8.

PyTorch ,ONNX and TensorRT implementation of YOLOv4

source code the paper Fast and Robust Iterative Closet Point.

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

Official implementation for paper: Feature-Style Encoder for Style-Based GAN Inversion

LOFO (Leave One Feature Out) Importance calculates the importances of a set of features based on a metric of choice,

.NET bindings for the Pytorch engine

Variational autoencoder for anime face reconstruction

A simple, fully convolutional model for real-time instance segmentation.

Vision-and-Language Navigation in Continuous Environments using Habitat

An Unbiased Learning To Rank Algorithms (ULTRA) toolbox

ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data

Pretrained models for Jax/Haiku; MobileNet, ResNet, VGG, Xception.

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

Implements MLP-Mixer: An all-MLP Architecture for Vision.

Luminaire is a python package that provides ML driven solutions for monitoring time series data.

Part-aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking

This is a repository with the code for the ACL 2019 paper

yolov5 deepsort 行人 车辆 跟踪 检测 计数

yolov5 deepsort 行人车辆跟踪检测计数