Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Last update: Jan 01, 2023

Overview

Pearl

The Parallel Evolutionary and Reinforcement Learning Library (Pearl) is a pytorch based package with the goal of being excellent for rapid prototyping of new adaptive decision making algorithms in the intersection between reinforcement learning (RL) and evolutionary computation (EC). As such, this is not intended to provide template pre-built algorithms as a baseline, but rather flexible tools to allow the user to quickly build and test their own implementations and ideas. A technical report can be found here.

Main Features

Features	Pearl
RL algorithms (e.g. Actor Critic)	✔️
EC algorithms (e.g. Genetic Algorithm)	✔️
Hybrid algorithms (e.g. CEM-DDPG)	✔️
Multi-agent suppport	✔️
Tensorboard integration	✔️
Modular and extensible components	✔️
Opinionated module settings	✔️
Custom callbacks	✔️

User Guide

Installation

There are two options to install this package:

pip install pearll
git clone [email protected]:LondonNode/Pearl.git

Module Guide

agents: implementations of RL and EC agents where the other modular components are put together
buffers: these handle storing and sampling of trajectories
callbacks: inject logic for every step made in an environment (e.g. save model, early stopping)
common: common methods applicable to all other modules (e.g. enumerations) and a main utils.py file with some useful general logic
explorers: action explorers for enhanced exploration by adding noise to actions and random exploration for first n steps
models: neural network structures which are structured as encoder -> torso -> head
signal_processing: signal processing logic for extra modularity (e.g. TD returns, GAE)
updaters: update neural networks and adaptive/iterative algorithms
settings.py: settings objects for the above components, can be extended for custom components

Agent Templates

See pearll/agents/templates.py for the templates to create your own agents! For more examples, see specific agent implementations under pearll/agents.

Agent Performance

To see training performance, use the command tensorboard --logdir runs or tensorboard --logdir <tensorboard_log_path> defined in your algorithm class initialization.

Python Scripts

To run these you'll need to go to wherever the library is installed, cd pearll.

demo.py: script to run very basic demos of agents with pre-defined hyperparameters, run python3 -m pearll.demo -h for more info
plot.py: script to plot more complex plots that can't be obtained via Tensorboard (e.g. multiple subplots), run python3 -m pearll.plot -h for more info

Developer Guide

Scripts

Linux

scripts/setup_dev.sh: setup your virtual environment
scripts/run_tests.sh: run tests

Windows

scripts/windows_setup_dev.bat: setup your virtual environment
scripts/windows_run_tests.bat: run tests

Dependency Management

Pearl uses poetry for dependency management and build release instead of pip. As a quick guide:

Run poetry add [package] to add more package dependencies.
Poetry automatically handles the virtual environment used, check pyproject.toml for specifics on the virtual environment setup.
If you want to run something in the poetry virtual environment, add poetry run as a prefix to the command you want to execute. For example, to run a python file: poetry run python3 script.py.

Credit

Citing Pearl

@misc{tangri2022pearl,
      title={Pearl: Parallel Evolutionary and Reinforcement Learning Library}, 
      author={Rohan Tangri and Danilo P. Mandic and Anthony G. Constantinides},
      year={2022},
      eprint={2201.09568},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Acknowledgements

Pearl was inspired by Stable Baselines 3 and Tonic

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Balanced-Evolutionary-Semi-Stacking Code for the paper ''BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalan

0 Jan 16, 2022

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

SECSE SECSE: Systemic Evolutionary Chemical Space Explorer Chemical space exploration is a major task of the hit-finding process during the pursuit of

64 Dec 16, 2022

Deep learning with dynamic computation graphs in TensorFlow

TensorFlow Fold TensorFlow Fold is a library for creating TensorFlow models that consume structured data, where the structure of the computation graph

1.8k Dec 28, 2022

A toolkit for developing and comparing reinforcement learning algorithms.

Status: Maintenance (expect bug fixes and minor updates) OpenAI Gym OpenAI Gym is a toolkit for developing and comparing reinforcement learning algori

29.6k Jan 8, 2023

PyTorch implementations of deep reinforcement learning algorithms and environments

Deep Reinforcement Learning Algorithms with PyTorch This repository contains PyTorch implementations of deep reinforcement learning algorithms and env

4.7k Jan 4, 2023

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Off-Policy Multi-Agent Reinforcement Learning (MARL) Algorithms This repository contains implementations of various off-policy multi-agent reinforceme

183 Dec 28, 2022

Reinforcement learning framework and algorithms implemented in PyTorch.

2.1k Jan 4, 2023

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

PyTorch RL Minimal Implementations There are implementations of some reinforcement learning algorithms, whose characteristics are as follow: Less pack

4 Dec 31, 2022

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

4.7k Jan 1, 2023

Comments

Bump pillow from 9.0.0 to 9.0.1
Bumps pillow from 9.0.0 to 9.0.1.

Release notes

Sourced from pillow's releases.

9.0.1

https://pillow.readthedocs.io/en/stable/releasenotes/9.0.1.html

Changes

In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [@radarhere, @hugovk]

Restrict builtins within lambdas for ImageMath.eval. CVE-2022-22817 #6009 [radarhere]

Changelog

Sourced from pillow's changelog.

9.0.1 (2022-02-03)

In show_file, use os.remove to remove temporary images. CVE-2022-24303 #6010 [radarhere, hugovk]

Restrict builtins within lambdas for ImageMath.eval. CVE-2022-22817 #6009 [radarhere]

Commits

6deac9e 9.0.1 version bump

c04d812 Update CHANGES.rst [ci skip]

4fabec3 Added release notes for 9.0.1

02affaa Added delay after opening image with xdg-open

ca0b585 Updated formatting

427221e In show_file, use os.remove to remove temporary images

c930be0 Restrict builtins within lambdas for ImageMath.eval

75b69dd Dont need to pin for GHA

cd938a7 Autolink CWE numbers with sphinx-issues

2e9c461 Add CVE IDs

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR

@dependabot recreate will recreate this PR, overwriting any edits that have been made to it

@dependabot merge will merge this PR after your CI passes on it

@dependabot squash and merge will squash and merge this PR after your CI passes on it

@dependabot cancel merge will cancel a previously requested merge and block automerging

@dependabot reopen will reopen this PR if it is closed

@dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually

@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)

@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

@dependabot use these labels will set the current labels as the default for future PRs for this repo and language

@dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language

@dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language

@dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

You can disable automated security fix PRs for this repo from the Security Alerts page.

dependencies
opened by dependabot[bot] 1
Feature/hybrid

Overhaul models and base agent structure to accommodate RL, MARL, EC in optimizing static functions and RL environments and hybrid algorithms combining RL and EC.

opened by 09tangriro 1
MORE AGENTS

The more agents created the better proof that the tools underlying work as intended.

Agents should be tested on particular environments to ensure performance.
feature good first issue

opened by 09tangriro 0

Releases(v0.4.1)

v0.4.1(May 9, 2022)

Bug fixes and optimizations.

See PR #11
Source code(tar.gz)
Source code(zip)
v0.4.0(May 8, 2022)

Optimizations interfacing with GPU devices. See PR #10
Source code(tar.gz)
Source code(zip)
v0.3.1(Apr 5, 2022)
Bug fixes:

allow different size discrete space output for DiscreteHead.

Update docstrings for pearll/updaters/environment module.

Source code(tar.gz)
Source code(zip)
v0.3.0(Mar 28, 2022)
Introduce model-based RL tools.

Validate model-based RL tools with implementation of DynaQ algorithm.

Cleaner signal_processing module interface using functools.

Source code(tar.gz)
Source code(zip)
v0.2.2(Mar 4, 2022)

Fixed issue running multi-agent algorithms on cuda devices. Now full support for cuda.
Source code(tar.gz)
Source code(zip)
v0.2.1(Mar 2, 2022)
Various bug fixes:

to_numpy cuda support.

FlattenEncoder flattens inputs appropriately.

Callbacks more robust.

Also added a tutorial library.
Source code(tar.gz)
Source code(zip)
v0.2.0(Jan 25, 2022)

Various bug fixes and tweaks to the interface.
Source code(tar.gz)
Source code(zip)
v0.1.0(Jan 11, 2022)

Pre-release before paper submission.
Source code(tar.gz)
Source code(zip)

Owner

GitHub Repository

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

This is a playground for pytorch beginners, which contains predefined models on popular dataset. Currently we support mnist, svhn cifar10, cifar100 st

2.4k Dec 28, 2022

A trusty face recognition research platform developed by Tencent Youtu Lab

Introduction TFace: A trusty face recognition research platform developed by Tencent Youtu Lab. It provides a high-performance distributed training fr

956 Jan 01, 2023

Code for "Adversarial attack by dropping information." (ICCV 2021)

AdvDrop Code for "AdvDrop: Adversarial Attack to DNNs by Dropping Information(ICCV 2021)." Human can easily recognize visual objects with lost informa

52 Nov 10, 2022

YOLOv4-v3 Training Automation API for Linux

This repository allows you to get started with training a state-of-the-art Deep Learning model with little to no configuration needed! You provide your labeled dataset or label your dataset using our

626 Dec 31, 2022

DeepFaceLive - Live Deep Fake in python, Real-time face swap for PC streaming or video calls

8.3k Dec 31, 2022

UMPNet: Universal Manipulation Policy Network for Articulated Objects

UMPNet: Universal Manipulation Policy Network for Articulated Objects Zhenjia Xu, Zhanpeng He, Shuran Song Columbia University Robotics and Automation

33 Dec 03, 2022

Code for "ATISS: Autoregressive Transformers for Indoor Scene Synthesis", NeurIPS 2021

ATISS: Autoregressive Transformers for Indoor Scene Synthesis This repository contains the code that accompanies our paper ATISS: Autoregressive Trans

138 Dec 22, 2022

PSPNet in Chainer

PSPNet This is an unofficial implementation of Pyramid Scene Parsing Network (PSPNet) in Chainer. Training Requirement Python 3.4.4+ Chainer 3.0.0b1+

76 Dec 12, 2022

Image reconstruction done with untrained neural networks.

PyTorch Deep Image Prior An implementation of image reconstruction methods from Deep Image Prior (Ulyanov et al., 2017) in PyTorch. The point of the p

192 Nov 30, 2022

Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning"

Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning" This is the code for the paper Solving Graph-based Public Goo

3 Dec 05, 2022

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

PySlowFast PySlowFast is an open source video understanding codebase from FAIR that provides state-of-the-art video classification models with efficie

5.3k Jan 03, 2023

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

SECSE SECSE: Systemic Evolutionary Chemical Space Explorer Chemical space exploration is a major task of the hit-finding process during the pursuit of

64 Dec 16, 2022

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

AI-Bot 一个基于watermelon改造的OpenAI-GPT-2的智能机器人在Binder上直接运行测试目前有两种实现方式 TF2的GPT-2 TF

9 Nov 16, 2022

Multilingual Image Captioning

Multilingual Image Captioning Authors: Bhavitvya Malik, Gunjan Chhablani Demo Link: https://huggingface.co/spaces/flax-community/multilingual-image-ca

32 Nov 25, 2022

Pretrained Cost Model for Distributed Constraint Optimization Problems

Pretrained Cost Model for Distributed Constraint Optimization Problems Requirements PyTorch 1.9.0 PyTorch Geometric 1.7.1 Directory structure baseline

2 Aug 28, 2022

Fast, accurate and reliable software for algebraic CT reconstruction

KCT CBCT Fast, accurate and reliable software for algebraic CT reconstruction. This set of software tools includes OpenCL implementation of modern CT

4 Dec 14, 2022

Official code repository for "Exploring Neural Models for Query-Focused Summarization"

Query-Focused Summarization Official code repository for "Exploring Neural Models for Query-Focused Summarization" This is a work in progress. Expect

29 Dec 18, 2022

Federated_learning codes used for the the paper "Evaluation of Federated Learning Aggregation Algorithms" and "A Federated Learning Aggregation Algorithm for Pervasive Computing: Evaluation and Comparison"

Federated Distance (FedDist) This is the code accompanying the Percom2021 paper "A Federated Learning Aggregation Algorithm for Pervasive Computing: E

8 Jan 03, 2023

For visualizing the dair-v2x-i dataset

3D Detection & Tracking Viewer The project is based on hailanyi/3D-Detection-Tracking-Viewer and is modified, you can find the original version of the

34 Dec 29, 2022

Implemenets the Contourlet-CNN as described in C-CNN: Contourlet Convolutional Neural Networks, using PyTorch

C-CNN: Contourlet Convolutional Neural Networks This repo implemenets the Contourlet-CNN as described in C-CNN: Contourlet Convolutional Neural Networ

10 Nov 03, 2022

Adaptable tools to make reinforcement learning and evolutionary computation algorithms.

Related tags

Overview

Pearl

Main Features

User Guide

Installation

Module Guide

Agent Templates

Agent Performance

Python Scripts

Developer Guide

Scripts

Dependency Management

Credit

Citing Pearl

Acknowledgements

You might also like...

BESS: Balanced Evolutionary Semi-Stacking for Disease Detection via Partially Labeled Imbalanced Tongue Data

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

Deep learning with dynamic computation graphs in TensorFlow

A toolkit for developing and comparing reinforcement learning algorithms.

PyTorch implementations of deep reinforcement learning algorithms and environments

Pytorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Reinforcement learning framework and algorithms implemented in PyTorch.

Independent and minimal implementations of some reinforcement learning algorithms using PyTorch (including PPO, A3C, A2C, ...).

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Comments

Bump pillow from 9.0.0 to 9.0.1

9.0.1

Changes

9.0.1 (2022-02-03)

Feature/hybrid

MORE AGENTS

Releases(v0.4.1)

v0.4.1(May 9, 2022)

v0.4.0(May 8, 2022)

v0.3.1(Apr 5, 2022)

v0.3.0(Mar 28, 2022)

v0.2.2(Mar 4, 2022)

v0.2.1(Mar 2, 2022)

v0.2.0(Jan 25, 2022)

v0.1.0(Jan 11, 2022)

Owner

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

A trusty face recognition research platform developed by Tencent Youtu Lab

Code for "Adversarial attack by dropping information." (ICCV 2021)

YOLOv4-v3 Training Automation API for Linux

DeepFaceLive - Live Deep Fake in python, Real-time face swap for PC streaming or video calls

UMPNet: Universal Manipulation Policy Network for Articulated Objects

Code for "ATISS: Autoregressive Transformers for Indoor Scene Synthesis", NeurIPS 2021

PSPNet in Chainer

Image reconstruction done with untrained neural networks.

Code for "Solving Graph-based Public Good Games with Tree Search and Imitation Learning"

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Systemic Evolutionary Chemical Space Exploration for Drug Discovery

AI-Bot - 一个基于watermelon改造的OpenAI-GPT-2的智能机器人

Multilingual Image Captioning

Pretrained Cost Model for Distributed Constraint Optimization Problems

Fast, accurate and reliable software for algebraic CT reconstruction

Official code repository for "Exploring Neural Models for Query-Focused Summarization"

Federated_learning codes used for the the paper "Evaluation of Federated Learning Aggregation Algorithms" and "A Federated Learning Aggregation Algorithm for Pervasive Computing: Evaluation and Comparison"

For visualizing the dair-v2x-i dataset

Implemenets the Contourlet-CNN as described in C-CNN: Contourlet Convolutional Neural Networks, using PyTorch