Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Last update: Jul 08, 2021

This is a PyTorch implementation of the model described in our paper:

Z. Qi, S. Wang, C. Su, L. Su, W. Zhang, and Q. Huang. Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis. ACM MM 2020.

Dependencies

PyTorch 1.2.0
CUDA 9.2.148
cuDNN 7.6.2
opencv-python 4.2.0.34
Python 3.6.9

Data

Dataset Preparation

Download the pre-trained concept detector weights from Baidu (password 'wv0e') or Google Drive and put them in the weights/ folder.
Download the FCVID dataset from http://bigvid.fudan.edu.cn/FCVID/.
Annotation information for the dataset is provided in the data/FCVID/video_labels folder.
Extract the frames of each video and put them in the data/FCVID/frames/ folder.
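
The frame extraction step can be sketched with opencv-python, which is already listed under Dependencies. This is a minimal sketch, not the authors' preprocessing script: the input folder data/FCVID/videos, the .mp4 extension, the per-video subfolder layout, and the image_00001.jpg naming are all assumptions, and the data loader in this repository may expect a different layout. The helper names frame_path and extract_frames are hypothetical.

```python
from pathlib import Path


def frame_path(frames_root: Path, video_file: Path, index: int) -> Path:
    """Return the output path for frame `index` (1-based) of `video_file`.

    Assumed layout (not confirmed by the repository):
    <frames_root>/<video file stem>/image_00001.jpg
    """
    return frames_root / video_file.stem / f"image_{index:05d}.jpg"


def extract_frames(video_file: Path, frames_root: Path, every_n: int = 1) -> int:
    """Decode `video_file` and save every `every_n`-th frame as a JPEG.

    Returns the number of frames written.
    """
    import cv2  # opencv-python; imported here so frame_path works without it

    capture = cv2.VideoCapture(str(video_file))
    if not capture.isOpened():
        raise IOError(f"cannot open {video_file}")

    frames_read = 0
    frames_written = 0
    try:
        while True:
            ok, frame = capture.read()
            if not ok:  # end of stream or decode failure
                break
            if frames_read % every_n == 0:
                frames_written += 1
                out = frame_path(frames_root, video_file, frames_written)
                out.parent.mkdir(parents=True, exist_ok=True)
                cv2.imwrite(str(out), frame)
            frames_read += 1
    finally:
        capture.release()
    return frames_written


if __name__ == "__main__":
    videos_dir = Path("data/FCVID/videos")   # assumed location of the downloaded videos
    frames_root = Path("data/FCVID/frames")  # folder named in the steps above
    for video in sorted(videos_dir.glob("*.mp4")):  # assumed file extension
        print(video.name, extract_frames(video, frames_root))
```

Setting every_n above 1 keeps only a subset of frames, which reduces disk usage; whether the model expects every frame or a sampled subset should be checked against opts.py.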

For the ActivityNet dataset (http://activity-net.org/), we use the latest released version (v1.3).

Train

python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --no_test

For other hyperparameters, refer to opts.py.
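
Before launching a long training run, it may help to confirm that the folders named in the dataset preparation steps are in place. The following is a small pre-flight sketch, not part of the repository: the function name missing_dirs is hypothetical, and the check only looks for the three folders mentioned in this README (weights, data/FCVID/video_labels, data/FCVID/frames), relative to the repository root.

```python
from pathlib import Path
from typing import List

# Folders named in the dataset preparation steps, relative to the repository root.
EXPECTED_DIRS = (
    "weights",
    "data/FCVID/video_labels",
    "data/FCVID/frames",
)


def missing_dirs(repo_root: Path) -> List[str]:
    """Return the expected folders that are absent or empty under `repo_root`."""
    missing = []
    for rel in EXPECTED_DIRS:
        folder = repo_root / rel
        # A folder that exists but contains nothing is treated as missing too.
        if not folder.is_dir() or not any(folder.iterdir()):
            missing.append(rel)
    return missing


if __name__ == "__main__":
    problems = missing_dirs(Path("."))
    if problems:
        print("Missing or empty folders:", ", ".join(problems))
    else:
        print("Dataset layout looks complete.")
```

Run it from the repository root. It does not validate file contents, only that each folder exists and is non-empty.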

Test

Pre-trained model weights are available from Baidu (password 'szlk') or Google Drive.
Download the pre-trained weights and put them in the results/ folder.
python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --resume_path pretrained_model/tdcmn_si_soa.pth --no_train --test_crop_number 1

Citation

Please cite our paper if you use this code in your own work:

@inproceedings{qi2020modeling,
  title={Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis},
  author={Qi, Zhaobo and Wang, Shuhui and Su, Chi and Su, Li and Zhang, Weigang and Huang, Qingming},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={3798--3806},
  year={2020}
}

Contact

If you have any problems with our code, feel free to contact:

[email protected]

Owner

qzhb
