Source code for 2021 ICCV paper "In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces"

Last update: Dec 06, 2022

Overview

In-the-Wild Single Camera 3D Reconstruction
Through Moving Water Surfaces

This is the PyTorch implementation of the ICCV 2021 paper "In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces".

Project Page | Paper | Supplemental Material

In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces
Jinhui Xiong, Wolfgang Heidrich
KAUST
ICCV 2021 (Oral)

We propose a differentiable framework to estimate underwater scene geometry along with the time-varying water surface. The input to our model is a video sequence captured by a fixed camera. Dense correspondence from each frame to a world reference frame (selected from the input sequence) is pre-computed, ensuring that the reconstruction is performed in a unified coordinate system. We feed the flow fields, together with the initial water surfaces and scene geometry (all initialized as planar surfaces), into the framework, which incorporates ray casting, Snell’s law and multi-view triangulation. The gradients of the specially designed losses with respect to the water surfaces and scene geometry are back-propagated, and all parameters are optimized simultaneously. The final result is a high-quality reconstruction of the underwater scene, along with an estimate of the time-varying water-air interface. The data shown here was captured in a public fountain environment.
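The ray casting and Snell's law steps mentioned above can be illustrated with a minimal sketch. This is not the repository's code: it assumes a flat water surface and a flat scene bottom (matching the planar initialization described above), a camera at the origin with the z axis pointing down into the water, and a refractive index of 1.33 for water. The function names `refract` and `trace_to_bottom` are hypothetical, and plain Python is used instead of PyTorch to keep the example dependency-free.

```python
import math

N_AIR = 1.0
N_WATER = 1.33  # assumed approximate refractive index of water


def normalize(v):
    """Return v scaled to unit length."""
    norm = math.sqrt(sum(c * c for c in v))
    return tuple(c / norm for c in v)


def refract(d, n, eta):
    """Vector form of Snell's law.

    d:   incoming ray direction
    n:   surface normal pointing toward the side the ray comes from
    eta: ratio n1 / n2 of the refractive indices (incoming / outgoing medium)
    Returns the unit refracted direction, or None on total internal reflection.
    """
    d, n = normalize(d), normalize(n)
    cos_i = -sum(a * b for a, b in zip(d, n))
    sin2_t = eta * eta * (1.0 - cos_i * cos_i)
    if sin2_t > 1.0:
        return None  # total internal reflection, no transmitted ray
    cos_t = math.sqrt(1.0 - sin2_t)
    return tuple(eta * a + (eta * cos_i - cos_t) * b for a, b in zip(d, n))


def intersect_horizontal_plane(origin, d, z):
    """Intersect the ray origin + t * d with the horizontal plane at depth z."""
    t = (z - origin[2]) / d[2]
    return tuple(o + t * c for o, c in zip(origin, d))


def trace_to_bottom(ray_dir, water_z, bottom_z):
    """Cast a camera ray from the origin, refract it at a flat water surface
    at depth water_z, and return where it hits a flat bottom at depth bottom_z."""
    d = normalize(ray_dir)
    hit_surface = intersect_horizontal_plane((0.0, 0.0, 0.0), d, water_z)
    up_normal = (0.0, 0.0, -1.0)  # points back toward the camera
    d_water = refract(d, up_normal, N_AIR / N_WATER)
    return intersect_horizontal_plane(hit_surface, d_water, bottom_z)


if __name__ == "__main__":
    # A ray tilted 45 degrees from vertical bends toward the normal in water.
    print(trace_to_bottom((1.0, 0.0, 1.0), water_z=1.0, bottom_z=2.0))
```

In the framework described above, the surface normals and depths would be optimizable parameters rather than fixed constants, which is why the same operations need to be expressed with differentiable tensor operations.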

Prerequisite

The code was tested with Python >= 3.7, PyTorch >= 1.3 and CUDA >= 10.0 on an NVIDIA RTX 2080 Ti.
Minor changes to the code may be needed if you run into compatibility issues. Running it requires around 10 GB of GPU memory.
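Before running the code, it may help to check these requirements on your machine. The snippet below is a hedged sketch and not part of the repository: the helper name `check_environment` and the 10 GB threshold parameter are assumptions for illustration, and the PyTorch checks are skipped if PyTorch is not installed.

```python
import importlib.util
import sys


def check_environment(min_python=(3, 7), min_gpu_gb=10.0):
    """Report whether the local setup roughly matches the stated prerequisites."""
    report = {
        "python_ok": sys.version_info[:2] >= min_python,
        "torch_version": None,
        "cuda_available": False,
        "gpu_memory_gb": None,
        "gpu_memory_ok": False,
    }
    # Skip the GPU checks if PyTorch is not installed yet.
    if importlib.util.find_spec("torch") is None:
        return report
    try:
        import torch
    except ImportError:
        return report
    report["torch_version"] = torch.__version__
    if torch.cuda.is_available():
        report["cuda_available"] = True
        # Total memory of the first visible GPU, in GiB.
        total_bytes = torch.cuda.get_device_properties(0).total_memory
        report["gpu_memory_gb"] = total_bytes / 1024 ** 3
        report["gpu_memory_ok"] = report["gpu_memory_gb"] >= min_gpu_gb
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```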

Setup

conda create -n moving_water python=3.7
conda activate moving_water

conda install pytorch torchvision -c pytorch
conda install -c conda-forge opencv scikit-image
conda install -c anaconda scipy

Run the code

Go to the example folder, download the cached coefficient matrices (three matrices per example), and run:

python3 run.py

Citation

@inproceedings{xiong2021inthewild,
  title={In-the-Wild Single Camera 3D Reconstruction Through Moving Water Surfaces},
  author={Jinhui Xiong and Wolfgang Heidrich},
  year={2021},
  booktitle={ICCV}
}

Contact

Please contact Jinhui Xiong [email protected] if you have any questions or comments.
