VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

DISTIL: Deep dIverSified inTeractIve Learning.

Sentinel-1 vessel detection model used in the xView3 challenge

SEJE Pytorch implementation

Robot Servers and Server Manager software for robo-gym

OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.

Lab course materials for IEMBA 8/9 course "Coding and Artificial Intelligence"

A motion detection system with RaspberryPi, OpenCV, Python

💡 Type hints for Numpy

Ranger deep learning optimizer rewrite to use newest components

This library provides an abstraction to perform Model Versioning using Weight & Biases.

Multi-Joint dynamics with Contact. A general purpose physics simulator.

A machine learning library for spiking neural networks. Supports training with both torch and jax pipelines, and deployment to neuromorphic hardware.

Code and real data for the paper "Counterfactual Temporal Point Processes", available at arXiv.

PyTorch implementation of popular datasets and models in remote sensing

Evolving neural network parameters in JAX.

The second project in Python course on FCC

The easiest tool for extracting radiomics features and training ML models on them.

The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"

TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning

Optimizing Value-at-Risk and Conditional Value-at-Risk of Black Box Functions with Lacing Values (LV)