Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Last update: Oct 12, 2022

Related tags

Deep Learning deep-3dmask

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Kai-En Lin¹, Lei Xiao², Feng Liu², Guowei Yang¹, Ravi Ramamoorthi¹

¹University of California, San Diego, ²Facebook Reality Labs

Requirements

Install required packages

Make sure you have up-to-date NVIDIA drivers supporting CUDA 11.1 (10.2 could work but need to change cudatoolkit package accordingly)

Run

conda env create -f environment.yml
conda activate video_viewsynth

Usage

Rendering

Download our pretrained checkpoint and testing data. Extract the content to [path_to_data_directory]. It contains frames and background folders, as well as poses_bounds.npy.
In configs, setup data path by changing render_video.txt

root_dir should point to the frames folder mentioned in 1. and bg_dir should point to background folder.

out_dir can be your desired output folder.

ckpt_path should be the pretrained checkpoint path.
Run python render_llff_video.py --config [config_file_path]

e.g. python render_llff_video.py --config ../configs/render_video.txt

(Optional) For your own data, please run prepare_data.sh

sh render.sh [frame_folder] [starting_frame] [ending_frame] [output_folder_name]

Make sure your data is in this structure before running
```
[frame_folder] --- cam00 --- 00000.jpg
                |         |- 00001.jpg
                |         ...
                |- cam01
                |- cam02
                ...
                |- poses_bounds.npy
```
e.g. sh render.sh ~/deep_3d_data/frames 0 20 qual

Training

Train MPI

Download RealEstate10K dataset and extract the frames. There are scripts in preprocessing folder which can be used to generate the data.

The order should be download_data.py -> extract_frames.py -> compress_data.py.

Remember to change the path in compress_data.py.
Change the paths in config file train_realestate10k.txt

Run

cd train_mpi
python train.py --config ../configs/train_realestate10k.txt

Train Mask

Once MPI is trained, we can use the checkpoint to train 3D mask network.

Download dataset
Change the paths in config file train_mask.txt

Run

cd train_mask
python train.py --config ../configs/train_mask.txt

Citation

@inproceedings {lin2021deep,
    title = {Deep 3D Mask Volume for View Synthesis of Dynamic Scenes},
    author = {Kai-En Lin and Lei Xiao and Feng Liu and Guowei Yang and Ravi Ramamoorthi},
    booktitle = {ICCV},
    year = {2021},
}

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Related tags

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Requirements

Install required packages

Usage

Rendering

Training

Train MPI

Train Mask

Citation

Owner

Ken Lin

TorchFlare is a simple, beginner-friendly, and easy-to-use PyTorch Framework train your models effortlessly.

Learning-Augmented Dynamic Power Management

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

Nested cross-validation is necessary to avoid biased model performance in embedded feature selection in high-dimensional data with tiny sample sizes

This repo is official PyTorch implementation of MobileHumanPose: Toward real-time 3D human pose estimation in mobile devices(CVPRW 2021).

WiFi-based Multi-task Sensing

This solves the autonomous driving issue which is supported by deep learning technology. Given a video, it splits into images and predicts the angle of turning for each frame.

Pytorch library for fast transformer implementations

Automatic Idiomatic Expression Detection

An efficient implementation of GPNN

DecoupledNet is semantic segmentation system which using heterogeneous annotations

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

General Virtual Sketching Framework for Vector Line Art (SIGGRAPH 2021)

HackBMU-5.0-Team-Ctrl-Alt-Elite - HackBMU 5.0 Team Ctrl Alt Elite

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

Learning infinite-resolution image processing with GAN and RL from unpaired image datasets, using a differentiable photo editing model.

The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.

A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

Self-Supervised Image Denoising via Iterative Data Refinement