Learning Representational Invariances for Data-Efficient Action Recognition

Last update: Nov 22, 2022

Overview

Learning Representational Invariances for Data-Efficient Action Recognition

Official PyTorch implementation for Learning Representational Invariances for Data-Efficient Action Recognition. We follow the code structure of MMAction2.

See the project page for more details.

Installation

We use PyTorch-1.6.0 with CUDA-10.2 and Torchvision-0.7.0.

Please refer to install.md for installation.

Data Preparation

First, please download human detection results and put them in the corresponding folder under data: UCF-101, HMDB-51, Kinetics-100.

Second, please refer to data_preparation.md to prepare raw frames of UCF-101 and HMDB-51. (Instructions of extracting frames from Kinetics-100 will be available soon.)

(Optional) You can download the pre-extracted ImageNet scores: UCF-101, HMDB-51.

Training

We use 8 RTX2080 Ti GPUs to run our experiments. You would need to adjust your training schedule accordingly if you have less GPUs. Please refer to here.

Supervised learning

PORT=${PORT:-29500}

python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=$PORT \
tools/train.py \
$CONFIG \
--launcher pytorch ${@:3} \
--validate

You need to replace $CONFIG with the actual config file:

For supervised baseline, please use config files in configs/recognition/r2plus1d.
For strongly-augmented supervised learning, please use config files in configs/supervised_aug.

Semi-supervised learning

PORT=${PORT:-29500}

python -m torch.distributed.launch \
--nproc_per_node=8 \
--master_port=$PORT \
tools/train_semi.py \
$CONFIG \
--launcher pytorch ${@:3} \
--validate

You need to replace $CONFIG with the actual config file:

For single dataset semi-supervised learning, please use config files in configs/semi.
For cross-dataset semi-supervised learning, please use config files in configs/semi_both.

Testing

# Multi-GPU testing
./tools/dist_test.sh $CONFIG ${path_to_your_ckpt} ${num_of_gpus} --eval top_k_accuracy

# Single-GPU testing
python tools/test.py $CONFIG ${path_to_your_ckpt} --eval top_k_accuracy

NOTE: Do not use multi-GPU testing if you are currently using multi-GPU training.

Other details

Please see getting_started.md for the basic usage of MMAction2.

Acknowledgement

Codes are built upon MMAction2.

Learning Representational Invariances for Data-Efficient Action Recognition

Related tags

Overview

Learning Representational Invariances for Data-Efficient Action Recognition

Installation

Data Preparation

Training

Supervised learning

Semi-supervised learning

Testing

Other details

Acknowledgement

Owner

Virginia Tech Vision and Learning Lab

Python inverse kinematics for your robot model based on Pinocchio.

Rainbow: Combining Improvements in Deep Reinforcement Learning

A Fast and Accurate One-Stage Approach to Visual Grounding, ICCV 2019 (Oral)

Deep Learning and Reinforcement Learning Library for Scientists and Engineers 🔥

A disassembler for the RP2040 Programmable I/O State-machine!

AFL binary instrumentation

Self-describing JSON-RPC services made easy

Digital Twin Mobility Profiling: A Spatio-Temporal Graph Learning Approach

ncnn is a high-performance neural network inference framework optimized for the mobile platform

The codes I made while I practiced various TensorFlow examples

Single cell current best practices tutorial case study for the paper:Luecken and Theis, "Current best practices in single-cell RNA-seq analysis: a tutorial"

PyTorch-Multi-Style-Transfer - Neural Style and MSG-Net

Official PaddlePaddle implementation of Paint Transformer

A collection of educational notebooks on multi-view geometry and computer vision.

buildseg is a building extraction plugin of QGIS based on PaddlePaddle.

Subgraph Based Learning of Contextual Embedding

ProMP: Proximal Meta-Policy Search

This repository is the official implementation of the Hybrid Self-Attention NEAT algorithm.

A Unified Framework and Analysis for Structured Knowledge Grounding

Discovering Dynamic Salient Regions with Spatio-Temporal Graph Neural Networks