3D HourGlass Networks for Human Pose Estimation Through Videos

Last update: Jan 02, 2023

Overview

3D-HourGlass-Network

A 3D-CNN-based hourglass network for 3D human pose estimation from videos. This was my summer 2018 research project.

Discussion

In this work I try to extend the idea of Carreira et al. (CVPR'17), 3D CNN inflation for action recognition from videos, to human pose estimation from videos. We use a pretrained hourglass network with a fully connected depth regressor, inflate its 2D convolutions to 3D convolutions, and perform temporal 3D human pose estimation. Inflation lets the network learn features from nearby frames and refine its predictions. A similar idea was used in Girdhar et al. (CVPR'18) (at about the same time!), where they perform multi-person human pose estimation from videos using an inflated Mask R-CNN.
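The inflation step can be sketched as follows. This is a minimal illustration, not the code from this repository: it assumes the I3D-style bootstrapping in which each 2D kernel is repeated along a new time axis and divided by the temporal kernel size, so that a video of identical frames gives the same response as the original 2D layer. The names `inflate_conv2d` and `time_dim` are hypothetical, and the sketch ignores dilation and grouped convolutions.

```python
import torch
import torch.nn as nn


def inflate_conv2d(conv2d: nn.Conv2d, time_dim: int = 3) -> nn.Conv3d:
    """Build a Conv3d initialised from a Conv2d (hypothetical helper).

    Assumes an odd time_dim, integer/tuple padding, dilation 1 and groups 1.
    """
    conv3d = nn.Conv3d(
        conv2d.in_channels,
        conv2d.out_channels,
        kernel_size=(time_dim, *conv2d.kernel_size),
        stride=(1, *conv2d.stride),
        # Zero-pad in time so the number of frames is preserved.
        padding=(time_dim // 2, *conv2d.padding),
        bias=conv2d.bias is not None,
    )
    with torch.no_grad():
        # (out, in, kH, kW) -> (out, in, T, kH, kW), rescaled by 1/T so the
        # temporal sum over identical frames equals the 2D response.
        weight = conv2d.weight.unsqueeze(2).repeat(1, 1, time_dim, 1, 1)
        conv3d.weight.copy_(weight / time_dim)
        if conv2d.bias is not None:
            conv3d.bias.copy_(conv2d.bias)
    return conv3d


torch.manual_seed(0)
conv2d = nn.Conv2d(3, 16, kernel_size=3, padding=1)
conv3d = inflate_conv2d(conv2d, time_dim=3)

# A "static video": the same frame repeated 5 times along the time axis.
frame = torch.randn(1, 3, 8, 8)
video = frame.unsqueeze(2).repeat(1, 1, 5, 1, 1)

out2d = conv2d(frame)   # shape (1, 16, 8, 8)
out3d = conv3d(video)   # shape (1, 16, 5, 8, 8)
```

For the middle frames of the static video, the 3D output matches the 2D output up to floating-point error. The first and last frames differ because the temporal zero padding contributes empty frames at the boundaries.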

Requirements

python 3.6
pytorch 0.4
torchvision
progress

Datasets

We used the Human3.6M dataset for this project.

Instructions to run

python main.py -expID [EXP-NAME] -nFramesReg [NUM-FRAMES]

Results

We improved the baseline performance of the hourglass network from an MPJPE (mean per-joint position error, lower is better) of 64 to 62.8, which shows the value of temporal features in real-world problems. The idea could also be extended to other tasks such as semantic segmentation and object detection.
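For reference, MPJPE is the Euclidean distance between each predicted joint and its ground-truth position, averaged over all joints. The sketch below is a plain-Python illustration of that definition, not the evaluation code used in this project; the function name `mpjpe` and the toy joint coordinates are assumptions made for the example.

```python
import math


def mpjpe(pred, target):
    """Mean per-joint position error between two lists of (x, y, z) joints.

    The result is in the same units as the input coordinates.
    """
    if len(pred) != len(target) or len(pred) == 0:
        raise ValueError("pred and target must be non-empty and equally long")
    total = 0.0
    for p, t in zip(pred, target):
        # Euclidean distance between one predicted joint and its ground truth.
        total += math.sqrt(sum((a - b) ** 2 for a, b in zip(p, t)))
    return total / len(pred)


# Toy example with two joints: one exact, one off by a 3-4-5 triangle.
pred = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0)]
target = [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)]
error = mpjpe(pred, target)  # (0 + 5) / 2 = 2.5
```

Over a dataset, the per-pose values are typically averaged again across all frames.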

Owner

Naman Jain
