Multimodal Reinforcement Learning

JAX implementations of the following multimodal reinforcement learning approaches.

Dual-coding Episodic Memory from "Grounded Language Learning Fast and Slow"

The goal in this setting is for the agent to be presented with multiple objects with made up names following "This is a _____" statements and to then carry out an instruction such as "Move the wazzle to the table." This task requires the agent to learn long-term language and vision representations for concepts like "This is a" and objects that carry over between episodes such as "table" while also being able to learn one-shot representations of novel objects and their names.

Usage

Start by setting up the environment locally by running

poetry install
poetry shell

The learning environment depends on Docker and requires that the Docker Desktop program is running (on Mac). Once that's done you can run the default environment (fast mapping with 3 objects from the paper).

python fast_slow_learning/main.py

Solving reinforcement learning tasks which require language and vision

Related tags

Overview

Multimodal Reinforcement Learning

Usage

Owner

Henry Prior

VR-Caps: A Virtual Environment for Active Capsule Endoscopy

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Hierarchical User Intent Graph Network for Multimedia Recommendation

To build a regression model to predict the concrete compressive strength based on the different features in the training data.

PyElastica is the Python implementation of Elastica, an open-source software for the simulation of assemblies of slender, one-dimensional structures using Cosserat Rod theory.

A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution

A fast Protein Chain / Ligand Extractor and organizer.

a practicable framework used in Deep Learning. So far UDL only provide DCFNet implementation for the ICCV paper (Dynamic Cross Feature Fusion for Remote Sensing Pansharpening)

A simple Rock-Paper-Scissors game using CV in python

MGFN: Multi-Graph Fusion Networks for Urban Region Embedding was accepted by IJCAI-2022.

Binary Stochastic Neurons in PyTorch

MoCoPnet - Deformable 3D Convolution for Video Super-Resolution

A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

Distinguishing Commercial from Editorial Content in News

Deep Learning Tutorial for Kaggle Ultrasound Nerve Segmentation competition, using Keras

Collect super-resolution related papers, data, repositories

Implementation of ConvMixer for "Patches Are All You Need? 🤷"

Deep Semisupervised Multiview Learning With Increasing Views (IEEE TCYB 2021, PyTorch Code)

Code for EMNLP 2021 paper: "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training"

Implementations of CNNs, RNNs, GANs, etc