This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Last update: Jan 07, 2023

Related tags

Deep Learning StridedTransformer-Pose3D

Overview

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

This repo is the official implementation of Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation in Pytorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_gt.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M with 351-frames:

python main.py --frames 351 --refine --reload 1  --refine_reload 1 --previous_dir 'checkpoint/351'

Train the model

To train on Human3.6M with 351-frame:

python main.py --frames 351 --train 1 \

After training for several epoches, add refine module

python main.py --frames 351 --train 1 --refine --lr 1e-5 --reload 1 --previous_dir [your model saved path] \

Citation

If you find our work useful in your research, please consider citing:

@article{li2021exploiting,
  title={Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Ding, Runwei and Liu, Mengyuan and Wang, Pichao and Yang, Wenming},
  journal={arXiv preprint arXiv:2103.14304},
  year={2021}
}

Acknowledgement

Our code is built on top of ST-GCN and is extended from the following repositories. We thank the authors for releasing the codes.

This is an official implementation for "Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation".

Related tags

Overview

Exploiting Temporal Contexts with Strided Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

User-friendly bulk RNAseq deconvolution using simulated annealing

Human annotated noisy labels for CIFAR-10 and CIFAR-100.

68 keypoint annotations for COFW test data

BOVText: A Large-Scale, Multidimensional Multilingual Dataset for Video Text Spotting

RLHive: a framework designed to facilitate research in reinforcement learning.

MWPToolkit is a PyTorch-based toolkit for Math Word Problem (MWP) solving.

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Unofficial implementation of "TTNet: Real-time temporal and spatial video analysis of table tennis" (CVPR 2020)

A python3 tool to take a 360 degree survey of the RF spectrum (hamlib + rotctld + RTL-SDR/HackRF)

Lowest memory consumption and second shortest runtime in NTIRE 2022 challenge on Efficient Super-Resolution

Contrastively Disentangled Sequential Variational Audoencoder

Learning Energy-Based Models by Diffusion Recovery Likelihood

WHENet: Real-time Fine-Grained Estimation for Wide Range Head Pose

Multivariate Time Series Forecasting with efficient Transformers. Code for the paper "Long-Range Transformers for Dynamic Spatiotemporal Forecasting."

Security evaluation module with onnx, pytorch, and SecML.

3D Multi-Person Pose Estimation by Integrating Top-Down and Bottom-Up Networks

Code for the paper "Zero-shot Natural Language Video Localization" (ICCV2021, Oral).

A higher performance pytorch implementation of DeepLab V3 Plus(DeepLab v3+)

PyTorch implementation of PP-LCNet: A Lightweight CPU Convolutional Neural Network