Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Last update: Dec 07, 2022

Related tags

Deep Learning SELFY

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

This is the official implementation of the paper "Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition" by H.Kwon, M.Kim, S.Kwak, and M.Cho. For more information, checkout the project website and the paper on arXiv.

Environment:

Cuda: 9.0
gcc: 7.3.0
Python 3.6.8
PyTorch 1.0.1
TorchVison: 0.2.2
Spatial Correlation Sampler
Others: environment.yml

Anaconda environment setting

git clone https://github.com/arunos728/SELFY.git
cd selfy
conda env create -f environment.yml
conda activate selfy

Installing Correlation sampler

cd Pytorch-Correlation-extension
python setup.py install

# check whether SpatialCorrelationSampler is installed correctly.
python check.py forward
python check.py backward
python checkCorrelationSampler.py

Please check this repo for the detailed instructions.

Dataset preparation

Please refer to TSM repo for the detailed data preparation instructions.

File lists (.txt files in ./data) specify configurations of each video clips (path, #frames, class). We upload our Something-Something-V1 & V2 video file lists in ./data. The path of the file lists should be added into the scripts for training (or testing).

Training & Testing

For training SELFYNet on Something-Something, use the following command:

    ./scripts/train_SELFY_Something.sh

For testing your trained model on Something-Something, use the following command:

    ./scripts/test_SELFY_Something.sh

Citation

If you use this code or ideas from the paper for your research, please cite our paper:

@inproceedings{kwon2021learning,
  title={Learning self-similarity in space and time as generalized motion for video action recognition},
  author={Kwon, Heeseung and Kim, Manjin and Kwak, Suha and Cho, Minsu},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={13065--13075},
  year={2021}
}

Contact

Heeseung Kwon([email protected]), Manjin Kim([email protected])

Questions can also be left as issues in the repository. We will be happy to answer them.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Related tags

Overview

Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition

Environment:

Anaconda environment setting

Installing Correlation sampler

Dataset preparation

Training & Testing

Citation

Contact

Owner

SSD: A Unified Framework for Self-Supervised Outlier Detection [ICLR 2021]

Deep-learning-roadmap - All You Need to Know About Deep Learning - A kick-starter

CROSS-LINGUAL ABILITY OF MULTILINGUAL BERT: AN EMPIRICAL STUDY

Implementation of ToeplitzLDA for spatiotemporal stationary time series data.

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

🦕 NanoSaur is a little tracked robot ROS2 enabled, made for an NVIDIA Jetson Nano

scikit-learn inspired API for CRFsuite

FishNet: One Stage to Detect, Segmentation and Pose Estimation

A curated list of long-tailed recognition resources.

Pytorch implementation of ProjectedGAN

Improving the robustness and performance of biomedical NLP models through adversarial training

[ICCV 2021] Our work presents a novel neural rendering approach that can efficiently reconstruct geometric and neural radiance fields for view synthesis.

Replication package for the manuscript "Using Personality Detection Tools for Software Engineering Research: How Far Can We Go?" submitted to TOSEM

Cards Against Humanity AI

PyTorch implementation of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition

Source Code for ICSE 2022 Paper - ``Can We Achieve Fairness Using Semi-Supervised Learning?''

Face Library is an open source package for accurate and real-time face detection and recognition

BookMyShowPC - Movie Ticket Reservation App made with Tkinter