BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Last update: Dec 28, 2022

Related tags

Overview

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Abhinanda R. Punnakkal*, Arjun Chandrasekaran*, Nikos Athanasiou, Alejandra Quiros-Ramirez, Michael J. Black. * denotes equal contribution

Project Website | Paper | Video | Poster

BABEL is a large dataset with language labels describing the actions being performed in mocap sequences. BABEL labels about 43 hours of mocap sequences from AMASS [1] with action labels. Sequences have action labels at two possible levels of abstraction:

Sequence labels which describe the overall action in the sequence
Frame labels which describe all actions in every frame of the sequence. Each frame label is precisely aligned with the duration of the corresponding action in the mocap sequence, and multiple actions can overlap.

To download the BABEL action labels, visit our 'Data' page. You can download the mocap sequences from AMASS.

Tutorials

We release some helper code in Jupyter notebooks to load the BABEL dataset, visualize mocap sequences and their action labels, search BABEL for sequences containing specific actions, etc.

See notebooks/ for more details.

Action Recognition

We provide features, training and inference code, and pre-trained checkpoints for 3D skeleton-based action recognition.

Please see action_recognition/ for more details.

Acknowledgements

The notebooks in this repo are inspired by the those provided by AMASS. The Action Recognition code is based on the 2s-AGCN implementation.

References

[1] Mahmood, Naureen, et al. "AMASS: Archive of motion capture as surface shapes." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2019.

License

Software Copyright License for non-commercial scientific research purposes. Please read carefully the terms and conditions and any accompanying documentation before you download and/or use the AMASS dataset, and software, (the "Model & Software"). By downloading and/or using the Model & Software (including downloading, cloning, installing, and any other use of this GitHub repository), you acknowledge that you have read these terms and conditions, understand them, and agree to be bound by them. If you do not agree with these terms and conditions, you must not download and/or use the Model & Software. Any infringement of the terms of this agreement will automatically terminate your rights under this License.

Contact

The code in this repository is developed by Abhinanda Punnakkal and Arjun Chandrasekaran.

If you have any questions you can contact us at [email protected].

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Related tags

Overview

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

Tutorials

Action Recognition

Acknowledgements

References

License

Contact

Owner

Vision-Language Transformer and Query Generation for Referring Segmentation (ICCV 2021)

Official code for Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset

This is Unofficial Repo. Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection (CVPR 2021)

Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021

Learning trajectory representations using self-supervision and programmatic supervision.

Res2Net for Instance segmentation and Object detection using MaskRCNN

Learning Efficient Online 3D Bin Packing on Packing Configuration Trees

Object Depth via Motion and Detection Dataset

The implementation of the algorithm in the paper "Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data" published in ICML 2020.

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Joint detection and tracking model named DEFT, or ``Detection Embeddings for Tracking.

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Deep Reinforcement Learning for Multiplayer Online Battle Arena

The goal of the exercises below is to evaluate the candidate knowledge and problem solving expertise regarding the main development focuses for the iFood ML Platform team: MLOps and Feature Store development.

Public implementation of the Convolutional Motif Kernel Network (CMKN) architecture

A medical imaging framework for Pytorch

A small tool to joint picture including gif

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images (ISBI 2021, MELBA 2021)