PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Last update: Dec 12, 2022

Related tags

Overview

Maria: A Visual Experience Powered Conversational Agent

This repository is the Pytorch implementation of our paper "Maria: A Visual Experience Powered Conversational Agent" in ACL 2021.

In this paper, we present Maria, a neural conversation agent powered by the visual world experiences which are retrieved from a large-scale image index. Maria consists of three flexible components, i.e., text-to-image retriever, visual concept detector and visual-knowledge-grounded response generator.

Coming soon!

Summary

Maria: A Visual Experience Powered Conversational Agent

Dependencies

python 3.7
pytorch 1.4.0
Ubuntu 18.04

Usage

Citation

If you find this paper helps your research, please kindly consider citing our paper in your publications.

@inproceedings{liang2021maria,
   title={Maria: A Visual Experience Powered Conversational Agent},
   author={Liang, Zujie and Hu, Huang and Xu, Can and Chongyang, Tao and Geng, Xiubo and Chen, Danqi and Liang, Fan and Jiang, Daxin},
   booktitle={Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)},
   year={2021}
}

Acknowledgment

Special thanks to the authors of OSCAR, vokenization, and py-bottom-up-attention.

PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Related tags

Overview

Maria: A Visual Experience Powered Conversational Agent

Summary

Dependencies

Usage

Text-to-Image Retrieval Model

Bottom-up Detector Model

Dialog Generation Model

Citation

Acknowledgment

Owner

Jokie

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

Pytorch and Torch testing code of CartoonGAN

Code for testing various M1 Chip benchmarks with TensorFlow.

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

This is an official implementation for "Self-Supervised Learning with Swin Transformers".

Deep Dual Consecutive Network for Human Pose Estimation (CVPR2021)

These are the materials for the paper "Few-Shot Out-of-Domain Transfer Learning of Natural Language Explanations"

Single/multi view image(s) to voxel reconstruction using a recurrent neural network

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Learning Neural Network Subspaces

Prototype for Baby Action Detection and Classification

Official repository of my book: "Deep Learning with PyTorch Step-by-Step: A Beginner's Guide"

Implementation of accepted AAAI 2021 paper: Deep Unsupervised Image Hashing by Maximizing Bit Entropy

Pytorch version of VidLanKD: Improving Language Understanding viaVideo-Distilled Knowledge Transfer

ThunderGBM: Fast GBDTs and Random Forests on GPUs

Continuous Conditional Random Field Convolution for Point Cloud Segmentation

This repository contains implementations and illustrative code to accompany DeepMind publications

Code accompanying "Learning What To Do by Simulating the Past", ICLR 2021.

Pytorch Implementation of Various Point Transformers

PyTorch GPU implementation of the ES-RNN model for time series forecasting