Model-based 3D Hand Reconstruction via Self-Supervised Learning, CVPR2021

Last update: Dec 12, 2022

Overview

S²HAND: Model-based 3D Hand Reconstruction via Self-Supervised Learning

S²HAND presents a self-supervised 3D hand reconstruction network that can jointly estimate pose, shape, texture, and the camera viewpoint. Specifically, we obtain geometric cues from the input image through easily accessible 2D detected keypoints. To learn an accurate hand reconstruction model from these noisy geometric cues, we utilize the consistency between 2D and 3D representations and propose a set of novel losses to rationalize outputs of the neural network. For the first time, we demonstrate the feasibility of training an accurate 3D hand reconstruction network without relying on manual annotations. For more details, please see our paper, video, and project page.
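To make the idea of 2D-3D consistency concrete, here is a minimal sketch of a confidence-weighted keypoint reprojection loss. It is an illustration only, not the loss implemented in this repository: the weak-perspective camera, the function names, and the normalization by total confidence are all assumptions chosen for simplicity, and the paper's exact formulation may differ.

```python
from typing import List, Sequence, Tuple

Point2D = Tuple[float, float]
Point3D = Tuple[float, float, float]


def project_weak_perspective(
    joints_3d: Sequence[Point3D], scale: float, trans: Point2D
) -> List[Point2D]:
    """Project 3D joints to the image plane: (u, v) = scale * (x, y) + trans."""
    return [(scale * x + trans[0], scale * y + trans[1]) for x, y, _z in joints_3d]


def keypoint_consistency_loss(
    joints_3d: Sequence[Point3D],
    detected_2d: Sequence[Point2D],
    confidence: Sequence[float],
    scale: float,
    trans: Point2D,
) -> float:
    """Confidence-weighted mean squared distance between projected and detected keypoints."""
    projected = project_weak_perspective(joints_3d, scale, trans)
    total_weight = sum(confidence)
    if total_weight == 0:
        # No reliable detections, so this sample contributes no 2D supervision.
        return 0.0
    weighted_error = sum(
        c * ((pu - du) ** 2 + (pv - dv) ** 2)
        for (pu, pv), (du, dv), c in zip(projected, detected_2d, confidence)
    )
    return weighted_error / total_weight
```

Weighting by detector confidence lets noisy detections contribute less to the gradient, which is one simple way to learn from noisy 2D cues.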

Code

Environment

Training is implemented with PyTorch. This code was developed under Python 3.6 and PyTorch 1.1.

Please install the Python dependencies and compile the neural_renderer extension module by running:

pip install tqdm tensorboardX transforms3d chumpy scikit-image

git clone https://github.com/TerenceCYJ/neural_renderer.git
cd neural_renderer
python setup.py install
cd ..
rm -rf neural_renderer

Note that neural_renderer/lighting.py in this fork has been modified relative to daniilidis-group/neural_renderer.
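Before training, it can help to confirm that the packages listed above are importable. The following is a minimal sketch and not part of the repository; the helper name `missing_modules` is hypothetical, and the import names (for example `skimage` for scikit-image) are the usual ones for these packages.

```python
import importlib.util
from typing import Iterable, List

# Import names for the packages installed above.
# Assumption: scikit-image is imported as "skimage" and PyTorch as "torch".
REQUIRED_MODULES = [
    "torch",
    "tqdm",
    "tensorboardX",
    "transforms3d",
    "chumpy",
    "skimage",
    "neural_renderer",
]


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level modules from `names` that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

The check only locates each module without importing it, so it will not catch a neural_renderer build that installs but fails to load its compiled extension.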

Data

For the 3D hand reconstruction task on the FreiHAND dataset:

Download the FreiHAND dataset from the website.
Modify the input and output directories accordingly in examples/config/FreiHAND/*.json.

For the HO3D dataset:

Download the HO3D dataset from the website.
Modify the input and output directories accordingly in examples/config/HO3D/*.json.
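Editing several JSON files by hand is error-prone, so the entries can also be overridden with a small script. This is a sketch under stated assumptions: the helper `override_config` is hypothetical, and the only key this README names is "pretrain_model"; look up the actual directory keys in the shipped *.json files before using it.

```python
import json
from pathlib import Path
from typing import Any, Dict


def override_config(src: str, dst: str, overrides: Dict[str, Any]) -> Dict[str, Any]:
    """Load a JSON config, apply top-level overrides, and write the result to `dst`."""
    config = json.loads(Path(src).read_text())
    unknown = [key for key in overrides if key not in config]
    if unknown:
        # Refuse to add keys the original config does not define; this catches typos.
        raise KeyError(f"keys not present in {src}: {unknown}")
    config.update(overrides)
    Path(dst).write_text(json.dumps(config, indent=4))
    return config
```

For example, `override_config("examples/config/HO3D/evaluation.json", "my_evaluation.json", {"pretrain_model": "/path/to/texturehand_ho3d.t7"})` writes a copy with the checkpoint path replaced and leaves the original file untouched.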

Offline 2D Detection

Offline 2D keypoint detection uses an off-the-shelf detector such as pytorch-openpose.
- We also provide detected 2D keypoints for the FreiHAND training set. You may download them and change self.open_2dj_lists in examples/data/dataset.py accordingly.
- Alternatively, download the hand_pose_model.pth provided by pytorch-openpose and put the file in examples/openpose_detector/src. Then modify the input and output directories in the following script and run it:
  
  python examples/openpose_detector/hand_dectect.py

Training and Evaluation

HO3D

Evaluation: download the pretrained model [texturehand_ho3d.t7] and set "pretrain_model" in examples/config/HO3D/evaluation.json to its local path.

cd S2HAND
python3 ./examples/train.py --config_json examples/config/HO3D/evaluation.json

Training:

Stage-wise training:

python3 ./examples/train.py --config_json examples/config/HO3D/SSL-shape.json
python3 ./examples/train.py --config_json examples/config/HO3D/SSL-kp.json
python3 ./examples/train.py --config_json examples/config/HO3D/SSL-finetune.json

Or end-to-end training:

python3 ./examples/train.py --config_json examples/config/HO3D/SSL-e2e.json

Note: remember to check and replace the directories and file paths in the *.json files.
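The three stage-wise commands above can also be chained from a small driver so that a failure in one stage stops the run. This is a sketch, not part of the repository: the names `stage_commands` and `run_stages` are hypothetical, the script is assumed to be started from the S2HAND directory, and using it with a dataset other than HO3D assumes that dataset's config folder contains files with the same names.

```python
import subprocess
import sys
from typing import List

# Stage order as listed in the stage-wise training commands above.
STAGES = ["SSL-shape", "SSL-kp", "SSL-finetune"]


def stage_commands(dataset: str = "HO3D", python: str = sys.executable) -> List[List[str]]:
    """Build one train.py command per stage, in training order."""
    return [
        [python, "./examples/train.py", "--config_json",
         f"examples/config/{dataset}/{stage}.json"]
        for stage in STAGES
    ]


def run_stages(dataset: str = "HO3D") -> None:
    """Run the stages in order; check=True raises on the first failing stage."""
    for command in stage_commands(dataset):
        print("Running:", " ".join(command))
        subprocess.run(command, check=True)
```

Calling `run_stages("HO3D")` runs the three stages back to back. Each stage's *.json should still be checked beforehand, for instance for the checkpoint path it expects from the previous stage.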

FreiHAND

Evaluation: download the pretrained model [texturehand_freihand.t7] and set "pretrain_model" in examples/config/FreiHAND/evaluation.json to its local path.

cd S2HAND
python3 ./examples/train.py --config_json examples/config/FreiHAND/evaluation.json

Training: refer to the HO3D training scripts.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{chen2021s2hand,
    title={Model-based 3D Hand Reconstruction via Self-Supervised Learning}, 
    author={Chen, Yujin and Tu, Zhigang and Kang, Di and Bao, Linchao and Zhang, Ying and Zhe, Xuefei and Chen, Ruizhi and Yuan, Junsong},
    booktitle={Conference on Computer Vision and Pattern Recognition},
    year={2021}
}

Owner

Yujin Chen
