Code and models for "Pano3D: A Holistic Benchmark and a Solid Baseline for 360 Depth Estimation", OmniCV Workshop @ CVPR21.

Last update: Dec 29, 2022

Overview

Pano3D

A Holistic Benchmark and a Solid Baseline for 360^o Depth Estimation

Pano3D is a new benchmark for depth estimation from spherical panoramas. We generate a dataset (using GibsonV2) and provide baselines for holistic performance assessment, offering:

Primary and secondary traits metrics:
- Direct depth performance:
  - (w)RMSE
  - (w)RMSLE
  - AbsRel
  - SqRel
  - (w)Relative accuracy (\delta) @ {1.05, 1.1, 1.25, 1.25², 1.25³ }
- Boundary discontinuity preservation:
  - Precision @ {0.25, 0.5, 1.0}m
  - Recall @ {0.25, 0.5, 1.0}m
  - Depth boundary errors of accuracy and completeness
- Surface smoothness:
  - RMSE^o
  - Relative accuracy (\alpha) @ {11.25^o, 22.5^o, 30^o}
Out-of-distribution & Zero-shot cross dataset transfer:
- Different depth distribution test set
- Varying scene context test set
- Shifted camera domain test set

By disentangling generalization and assessing all depth properties, Pano3D aspires to drive progress benchmarking for 360^o depth estimation.

Using Pano3D to search for a solid baseline results in an acknowledgement of exploiting complementary error terms, adding encoder-decoder skip connections and using photometric augmentations.

TODO

Demo

A publicly hosted demo of the baseline models can be found here. Using the web app, it is possible to upload a panorama and download a 3D reconstructed mesh of the scene using the derived depth map.

Note that due to the external host's caching issues, it might be necessary to refresh your browser's cache in between runs to update the 3D models.

Data

Download

To download the data, follow the instructions at vcl3d.github.io/Pano3D/download/.

Please note that getting access to the data download links is a two step process as the dataset is a derivative and compliance with the original dataset's terms and usage agreements is required. Therefore:

You first need to fill in this Google Form.
And, then, you need to perform an access request at each one of the Zenodo repositories (depending on which dataset partition you need):

After both these steps are completed, you will soon receive the download links for each dataset partition.

Code and models for "Pano3D: A Holistic Benchmark and a Solid Baseline for 360 Depth Estimation", OmniCV Workshop @ CVPR21.

Related tags

Overview

Pano3D

A Holistic Benchmark and a Solid Baseline for 360o Depth Estimation

TODO

Demo

Data

Download

Loader

Splits

Models

Download

Inference

Serve

Metrics

Direct

Boundary

Smoothness

Results

Owner

Visual Computing Lab, Information Technologies Institute, Centre for Reseach and Technology Hellas

This is a re-implementation of TransGAN: Two Pure Transformers Can Make One Strong GAN (CVPR 2021) in PyTorch.

Source code for EquiDock: Independent SE(3)-Equivariant Models for End-to-End Rigid Protein Docking (ICLR 2022)

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble

StyleTransfer - Open source style transfer project, based on VGG19

This project is a re-implementation of MASTER: Multi-Aspect Non-local Network for Scene Text Recognition by MMOCR

Autoencoder - Reducing the Dimensionality of Data with Neural Network

structured-generative-modeling

An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-Supervised Representations.

Code for Blind Image Decomposition (BID) and Blind Image Decomposition network (BIDeN).

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices, ACM Multimedia 2021

tf2-keras implement yolov5

Reinforcement-learning - Repository of the class assignment questions for the course on reinforcement learning

[ICLR 2022] Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Ludwig is a toolbox that allows to train and evaluate deep learning models without the need to write code.

MG-GCN: Scalable Multi-GPU GCN Training Framework

Mail classification with tensorflow and MS Exchange Server (ham or spam).

Animal Sound Classification (Cats Vrs Dogs Audio Sentiment Classification)

The Official PyTorch Implementation of "LSGM: Score-based Generative Modeling in Latent Space" (NeurIPS 2021)

Self-Supervised Contrastive Learning of Music Spectrograms

A Holistic Benchmark and a Solid Baseline for 360^o Depth Estimation