[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Last update: Jan 04, 2023

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Lukas Koestler^1* Nan Yang^1,2*,† Niclas Zeller^2,3 Daniel Cremers^1,2

^*equal contribution ^†corresponding author

¹Technical University of Munich ²Artisense
³Karlsruhe University of Applied Sciences

Conference on Robot Learning (CoRL) 2021, London, UK

3DV 2021 Best Demo Award

arXiv | Video | OpenReview | Project Page

Code and Data

📣 CVA-MVSNet released! Please check cva_mvsnet/.
📣 Replica training data released! Please check replica/.
C++ code realse before Christmas. Thank you for your patience!

Abstract

In this paper, we present TANDEM a real-time monocular tracking and dense mapping framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase the robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth predictions. To predict the dense depth maps, we propose Cascade View-Aggregation MVSNet (CVA-MVSNet) that utilizes the entire active keyframe window by hierarchically constructing 3D cost volumes with adaptive view aggregation to balance the different stereo baselines between the keyframes. Finally, the predicted depth maps are fused into a consistent global map represented as a truncated signed distance function (TSDF) voxel grid. Our experimental results show that TANDEM outperforms other state-of-the-art traditional and learning-based monocular visual odometry (VO) methods in terms of camera tracking. Moreover, TANDEM shows state-of-the-art real-time 3D reconstruction performance.

[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Related tags

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Code and Data

Abstract

Poster

Owner

TUM Computer Vision Group

A Machine Teaching Framework for Scalable Recognition

Reinforcement Learning for Portfolio Management

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

Improving Object Detection by Label Assignment Distillation

Implementation of the CVPR 2021 paper "Online Multiple Object Tracking with Cross-Task Synergy"

An example of Scatterbrain implementation (combining local attention and Performer)

Raindrop strategy for Irregular time series

[UNMAINTAINED] Automated machine learning for analytics & production

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AAAI'22)

VIsually-Pivoted Audio and(N) Text

Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

Synthetic structured data generators

Homepage of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.

Intrusion Test Tool with Python

HistoSeg : Quick attention with multi-loss function for multi-structure segmentation in digital histology images

One implementation of the paper "DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing".

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

Have you ever wondered how cool it would be to have your own A.I

A project studying the influence of communication in multi-objective normal-form games

Semantic Segmentation with Pytorch-Lightning

[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Related tags

Overview

TANDEM: Tracking and Dense Mappingin Real-time using Deep Multi-view Stereo

Code and Data

Abstract

Poster

Owner

TUM Computer Vision Group

A Machine Teaching Framework for Scalable Recognition

Reinforcement Learning for Portfolio Management

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

Improving Object Detection by Label Assignment Distillation

Implementation of the CVPR 2021 paper "Online Multiple Object Tracking with Cross-Task Synergy"

An example of Scatterbrain implementation (combining local attention and Performer)

Raindrop strategy for Irregular time series

[UNMAINTAINED] Automated machine learning for analytics & production

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AAAI'22)

VIsually-Pivoted Audio and(N) Text

Compartmental epidemic model to assess undocumented infections: applications to SARS-CoV-2 epidemics in Brazil - Datasets and Codes

Synthetic structured data generators

Homepage of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.

Intrusion Test Tool with Python

HistoSeg : Quick attention with multi-loss function for multi-structure segmentation in digital histology images

One implementation of the paper "DMRST: A Joint Framework for Document-Level Multilingual RST Discourse Segmentation and Parsing".

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

Have you ever wondered how cool it would be to have your own A.I

A project studying the influence of communication in multi-objective normal-form games

Semantic Segmentation with Pytorch-Lightning

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo