The code for the paper "Learning Implicit Fields for Generative Shape Modeling".

Last update: Dec 30, 2022

implicit-decoder

The TensorFlow code for the paper "Learning Implicit Fields for Generative Shape Modeling", Zhiqin Chen, Hao (Richard) Zhang.

Project page | Paper

Improved TensorFlow1 implementation

Improved PyTorch implementation

Update

We have an improved implementation here, where we trained one model on the 13 ShapeNet categories.

We have a PyTorch implementation here.

Introduction

We advocate the use of implicit fields for learning generative models of shapes and introduce an implicit field decoder, called IM-NET, for shape generation, aimed at improving the visual quality of the generated shapes. An implicit field assigns a value to each point in 3D space, so that a shape can be extracted as an iso-surface. IM-NET is trained to perform this assignment by means of a binary classifier. Specifically, it takes a point coordinate, along with a feature vector encoding a shape, and outputs a value which indicates whether the point is outside the shape or not. By replacing conventional decoders by our implicit decoder for representation learning (via IM-AE) and shape generation (via IM-GAN), we demonstrate superior results for tasks such as generative shape modeling, interpolation, and single-view 3D reconstruction, particularly in terms of visual quality.
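The decoder interface described above can be sketched in a few lines of NumPy. This is only an illustration of the input/output contract (point coordinate plus shape feature in, one classifier value out); the layer sizes, the ReLU activation, the random weights, and the names `init_decoder` and `decode` are assumptions made for this sketch, not the network used in the paper.

```python
import numpy as np

def init_decoder(feature_dim=128, hidden=(64, 32), seed=0):
    """Random weights for a toy MLP; layer sizes are illustrative only."""
    rng = np.random.default_rng(seed)
    sizes = [3 + feature_dim, *hidden, 1]
    return [(rng.normal(0.0, 0.1, (m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def decode(params, points, feature):
    """Return one value in (0, 1) per query point.

    points:  (N, 3) array of 3D coordinates
    feature: (feature_dim,) shape code shared by all points
    """
    # Pair every point with the same shape feature vector.
    x = np.concatenate([points, np.tile(feature, (len(points), 1))], axis=1)
    for w, b in params[:-1]:
        x = np.maximum(x @ w + b, 0.0)  # ReLU is an assumption of this sketch
    w, b = params[-1]
    logits = x @ w + b
    return (1.0 / (1.0 + np.exp(-logits)))[:, 0]  # sigmoid: binary classifier output

params = init_decoder()
points = np.random.default_rng(1).uniform(-0.5, 0.5, size=(5, 3))
feature = np.zeros(128)
values = decode(params, points, feature)
print(values.shape)  # (5,)
```

Because the decoder is queried point by point, the same feature vector can be evaluated at any set of coordinates, which is what allows a surface to be extracted at a resolution chosen after training.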

Citation

If you find our work useful in your research, please consider citing:

@article{chen2018implicit_decoder,
  title={Learning Implicit Fields for Generative Shape Modeling},
  author={Chen, Zhiqin and Zhang, Hao},
  journal={Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2019}
}

Dependencies

Requirements:

Python 3.5 with numpy, scipy and h5py
TensorFlow 1
PyMCubes (for marching cubes)
pycollada (for writing .dae files)
OpenCV-Python (for reading and writing images)

Our code has been tested with Python 3.5, TensorFlow 1.8.0, CUDA 9.1 and cuDNN 7.0 on Ubuntu 16.04 and Windows 10.
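Before running anything, it can help to confirm that the packages listed above are importable. The following is a small sketch using only the standard library; the helper name `check_modules` is made up for this example. The import names reflect how the packages are normally imported (`mcubes` for PyMCubes, `collada` for pycollada, `cv2` for OpenCV-Python).

```python
import importlib

# Import names for the packages listed above. PyMCubes installs as `mcubes`,
# pycollada as `collada`, and OpenCV-Python as `cv2`.
MODULES = ["numpy", "scipy", "h5py", "tensorflow", "mcubes", "collada", "cv2"]

def check_modules(names=MODULES):
    """Map each module name to its version string, or None if the import fails."""
    report = {}
    for name in names:
        try:
            module = importlib.import_module(name)
        except ImportError:
            report[name] = None
        else:
            report[name] = getattr(module, "__version__", "unknown")
    return report

if __name__ == "__main__":
    for name, version in check_modules().items():
        print("{:12s} {}".format(name, version or "MISSING"))
```

The script only reports what is installed; it does not check that the installed TensorFlow is a 1.x release, so compare the printed version against the tested configuration above.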

Datasets and Pre-trained weights

The original voxel models and rendered views are from HSP. Since our network takes point-value pairs, the voxel models require further sampling. The sampling method can be found on our project page.
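To make "point-value pairs" concrete, here is a deliberately simple sketch that draws random voxel centers from an occupancy grid and records whether each one is occupied. It is not the sampling method from the project page; uniform random sampling, the coordinate range, the convention that 1 marks an occupied voxel, and the function name `sample_point_value_pairs` are all assumptions of this example.

```python
import numpy as np

def sample_point_value_pairs(voxels, num_points, seed=0):
    """Draw random voxel centers and look up their occupancy.

    voxels: (R, R, R) array of 0/1 occupancy
    Returns points of shape (num_points, 3) inside (-0.5, 0.5)
    and values of shape (num_points, 1).
    """
    rng = np.random.default_rng(seed)
    res = voxels.shape[0]
    idx = rng.integers(0, res, size=(num_points, 3))
    values = voxels[idx[:, 0], idx[:, 1], idx[:, 2]].astype(np.float32)
    # Map integer indices to voxel-center coordinates in a unit cube around the origin.
    points = ((idx + 0.5) / res - 0.5).astype(np.float32)
    return points, values[:, None]

# Toy input: a solid 8^3 block centered in a 16^3 grid.
voxels = np.zeros((16, 16, 16), dtype=np.uint8)
voxels[4:12, 4:12, 4:12] = 1
points, values = sample_point_value_pairs(voxels, num_points=4096)
print(points.shape, values.shape)  # (4096, 3) (4096, 1)
```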

We provide the ready-to-use datasets in hdf5 format, together with our pre-trained network weights. The weights for IM-GAN are the ones we used in our demo video. The weights for IM-SVR are the ones we used in the experiments in our paper.

Backup links:

Usage

For data preparation, please see directory point_sampling.

To train an autoencoder, go to IMGAN and use the following commands for progressive training. You may want to copy the commands into a .bat or .sh file.

python main.py --ae --train --epoch 50 --real_size 16 --batch_size_input 4096
python main.py --ae --train --epoch 100 --real_size 32 --batch_size_input 8192
python main.py --ae --train --epoch 200 --real_size 64 --batch_size_input 32768

The commands above train the AE model progressively: 50 epochs at 16³ resolution (each shape has 4096 sampled points), then 50 more epochs at 32³ resolution, and finally 100 more epochs at 64³ resolution. The --epoch values (50, 100, 200) are therefore cumulative totals.
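As an alternative to a .bat or .sh file, the three stages can be driven from a short Python script. This is a sketch, not part of the repository: the helper names are invented here, the flags are copied from the commands above, and the per-stage epoch count assumes that `--epoch` is a cumulative total, as the description of the schedule implies.

```python
import subprocess
import sys

# (cumulative --epoch, --real_size, --batch_size_input), copied from the commands above.
STAGES = [(50, 16, 4096), (100, 32, 8192), (200, 64, 32768)]

def build_commands(stages=STAGES, python=sys.executable):
    """Return one argument list per training stage."""
    return [
        [python, "main.py", "--ae", "--train",
         "--epoch", str(epoch),
         "--real_size", str(size),
         "--batch_size_input", str(batch)]
        for epoch, size, batch in stages
    ]

def epochs_per_stage(stages=STAGES):
    """Epochs run in each stage, assuming --epoch is a cumulative total."""
    totals = [epoch for epoch, _, _ in stages]
    return [b - a for a, b in zip([0] + totals[:-1], totals)]

def run_all(dry_run=True):
    """Print each stage's command and, unless dry_run, execute it in order."""
    for cmd in build_commands():
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # stops at the first failing stage

if __name__ == "__main__":
    print(epochs_per_stage())  # [50, 50, 100]
    run_all(dry_run=True)      # set dry_run=False and run from inside IMGAN
```

With `check=True`, a failed stage raises an exception, so a later stage never starts from a checkpoint that an earlier stage did not finish writing.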

To train a latent-gan, after training the autoencoder, use the following command to extract the latent codes:

python main.py --ae

Then train the latent-gan and get some samples:

python main.py --train --epoch 10000
python main.py

You can change some lines in main.py to adjust the number of samples and the sampling resolution.
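To illustrate what the sampling resolution controls, the sketch below evaluates an implicit field on a regular grid of a chosen resolution, in chunks so that memory use stays bounded. It does not reproduce the code in main.py: the analytic `sphere_field` merely stands in for a trained decoder, and the function names, the chunk size, and the [-0.5, 0.5] coordinate range are assumptions of this example.

```python
import numpy as np

def grid_points(resolution):
    """Centers of a resolution^3 grid covering the cube [-0.5, 0.5]^3."""
    axis = (np.arange(resolution) + 0.5) / resolution - 0.5
    x, y, z = np.meshgrid(axis, axis, axis, indexing="ij")
    return np.stack([x, y, z], axis=-1).reshape(-1, 3)

def evaluate_on_grid(field, resolution, chunk=32768):
    """Evaluate field(points) -> values in chunks and return a cubic volume."""
    points = grid_points(resolution)
    values = [field(points[i:i + chunk]) for i in range(0, len(points), chunk)]
    return np.concatenate(values).reshape(resolution, resolution, resolution)

def sphere_field(points, radius=0.3):
    """Stand-in for a trained decoder: 1 inside a sphere, 0 outside."""
    return (np.linalg.norm(points, axis=1) < radius).astype(np.float32)

for resolution in (16, 32, 64):
    volume = evaluate_on_grid(sphere_field, resolution)
    occupied = (volume > 0.5).mean()  # fraction of grid cells classified as inside
    print(resolution, volume.shape, round(float(occupied), 3))
```

The number of field evaluations grows with the cube of the resolution, which is why a higher sampling resolution is noticeably slower. A volume like this could then be passed to a marching cubes implementation such as PyMCubes to extract the iso-surface as a mesh.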

To train the network for single-view reconstruction, after training the autoencoder, copy the weights and latent codes to the corresponding folders in IMSVR. Go to IMSVR and use the following commands to train IM-SVR and get some samples:

python main.py --train --epoch 1000
python main.py

License

This project is licensed under the terms of the MIT license (see LICENSE for details).

Owner

Zhiqin Chen
