A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Last update: Nov 29, 2022

Related tags

Deep Learning CVPR2021_VSPW_Implement

Overview

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Preparation

Download VSPW dataset

The VSPW dataset with extracted frames and masks is available here. Now you can directly download VSPW_480P dataset.

Dependencies

Python 3.7
Pytorch 1.3.1
Numpy

Download the ImageNet-pretrained models at this link. Put it in the root folder and decompress it.

Train and Test

Resize the frames and masks of the VSPW dataset to 480p.

python change2_480p.py

Edit the .sh files in scripts/ and change the $DATAROOT to your path to VSPW_480p.

Image-based methods

PSPNet

sh scripts/run_psp.sh

OCRNet

sh scripts/run_ocr.sh

Video-based methods

TCB-PSP

sh run_temporal_psp.sh

TCB-OCR

sh run_temporal_ocr.sh

Evaluation on TC and VC

Change dataroot and prediction root in TC_cal.py and VC_perclip.py.

python TC_cal.py

python VC_perclip.py

This implementation utilized this code and RAFT.

Citation

@inproceedings{miao2021vspw,

  title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},

  author={Miao, Jiaxu and Wei, Yunchao and  Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},

  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},

  year={2021}

}

A pytorch implementation of the CVPR2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Related tags

Overview

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Preparation

Download VSPW dataset

Dependencies

Train and Test

Image-based methods

Video-based methods

Evaluation on TC and VC

Citation

Owner

UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.

The official repo for CVPR2021——ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search.

RID-Noise: Towards Robust Inverse Design under Noisy Environments

Official implementation of the paper Label-Efficient Semantic Segmentation with Diffusion Models

The implementation of ICASSP 2020 paper "Pixel-level self-paced learning for super-resolution"

An adaptive hierarchical energy management strategy for hybrid electric vehicles

BBScan py3 - BBScan py3 With Python

MLP-Like Vision Permutator for Visual Recognition (PyTorch)

TensorFlow-based neural network library

Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

An Abstract Cyber Security Simulation and Markov Game for OpenAI Gym

RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving

Transformer in Computer Vision

PFENet: Prior Guided Feature Enrichment Network for Few-shot Segmentation (TPAMI).

Numenta published papers code and data

A code generator from ONNX to PyTorch code

🔮 Execution time predictions for deep neural network training iterations across different GPUs.

Code for CVPR2021 paper "Learning Salient Boundary Feature for Anchor-free Temporal Action Localization"

CDGAN: Cyclic Discriminative Generative Adversarial Networks for Image-to-Image Transformation