Learning Camera Localization via Dense Scene Matching, CVPR2021

Last update: Dec 01, 2022

Overview

This repository contains the code for our CVPR 2021 paper, "Learning Camera Localization via Dense Scene Matching", by Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu and Ping Tan.

The paper presents a new method for scene-agnostic camera localization using dense scene matching (DSM), in which a cost volume is constructed between a query image and a scene. The cost volume and the corresponding scene coordinates are processed by a CNN to predict dense coordinates for the query image. The camera pose can then be solved with a PnP algorithm.
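The matching step described above can be illustrated with a small NumPy sketch. This is not the implementation in this repository: the function names (`cost_volume`, `soft_coordinates`), the cosine-similarity matching, the temperature value, and the random inputs are all assumptions made for illustration, and the softmax-weighted average is only a simple stand-in for the CNN that the paper uses to regress coordinates.

```python
import numpy as np

def cost_volume(query_feat, scene_feat):
    """Cosine-similarity cost volume between query pixels and scene points.

    query_feat: (C, H, W) feature map of the query image.
    scene_feat: (N, C) features of N scene points with known 3D coordinates.
    Returns an array of shape (N, H, W).
    """
    C, H, W = query_feat.shape
    q = query_feat.reshape(C, H * W)
    q = q / (np.linalg.norm(q, axis=0, keepdims=True) + 1e-8)
    s = scene_feat / (np.linalg.norm(scene_feat, axis=1, keepdims=True) + 1e-8)
    return (s @ q).reshape(-1, H, W)

def soft_coordinates(volume, scene_xyz, temperature=0.05):
    """Softmax-weighted average of scene coordinates for every query pixel.

    Hypothetical stand-in for the CNN regressor: it only shows how a cost
    volume and scene coordinates can be combined into dense coordinates.
    """
    N, H, W = volume.shape
    logits = volume.reshape(N, H * W) / temperature
    logits -= logits.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=0, keepdims=True)
    return (scene_xyz.T @ weights).reshape(3, H, W)

# Toy inputs (random, for shape checking only).
rng = np.random.default_rng(0)
query_feat = rng.standard_normal((16, 4, 5))
scene_feat = rng.standard_normal((50, 16))
scene_xyz = rng.uniform(-1.0, 1.0, size=(50, 3))

volume = cost_volume(query_feat, scene_feat)
coords = soft_coordinates(volume, scene_xyz)
print(volume.shape, coords.shape)
```

The resulting per-pixel 3D coordinates, paired with their 2D pixel locations, are the kind of 2D-3D correspondences that a PnP solver takes as input.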

If you find this project useful, please cite:

@inproceedings{Tang2021Learning,
  title={Learning Camera Localization via Dense Scene Matching},
  author={Tang, Shitao and Tang, Chengzhou and Huang, Rui and Zhu, Siyu and Tan, Ping},
  booktitle={Computer Vision and Pattern Recognition (CVPR)},
  year={2021}
}

Usage

Environment

The code has been tested with:
- pytorch=1.4.0
- lmdb (optional)
- yaml
- skimage
- opencv
- numpy=1.17
- tensorboard
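Before building the extensions, it can help to confirm that the packages above are importable. The check below is a minimal sketch rather than part of this repository; the import names (`torch`, `yaml`, `skimage`, `cv2`) are the usual module names for these packages and are assumed here, and the helper `missing_modules` is hypothetical.

```python
import importlib.util

# Import names assumed for the packages listed above.
REQUIRED = ["torch", "yaml", "skimage", "cv2", "numpy", "tensorboard"]
OPTIONAL = ["lmdb"]

def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]

print("Missing required modules:", missing_modules(REQUIRED))
print("Missing optional modules:", missing_modules(OPTIONAL))
```

The check only reports whether each module can be located; it does not verify the pinned versions (pytorch 1.4.0, numpy 1.17).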

Installation

Build PyTorch operations

  cd libs/model/ops
  python setup.py install

Build PnP algorithm

  cd libs/utils/lm_pnp
  mkdir build
  cd build
  cmake ..
  make all

Train and Test

Download

You can download the trained models and label files for 7scenes, Cambridge, and ScanNet.

For 7scenes, you can use the prepared data for the following scenes:

Chess Fire Heads Office Pumpkin Kitchen Stairs

For Cambridge Landmarks, you can download the image files here and the depths here.

Test

Please refer to configs/7scenes.yaml for a detailed explanation of how to set the label file path and the image file path.
- 7scenes
```
python tools/video_test.py --config configs/7scenes.yaml
```
- Cambridge
```
python tools/video_test.py --config configs/cambridge.yaml
```
Train

We use a pretrained ResNet-FPN model.
```
python tools/train_net.py
```
