[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Last update: Dec 15, 2022

Related tags

Overview

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

This is an official PyTorch code repository of the paper "Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks " (ICCV, 2021).

Here, we present a versatile point cloud processing block that yields state-of-the-art results on many tasks.
The key idea is to process point clouds with many cheap low-dimensional different projections followed by standard convolutions. And we do so both in parallel and sequentially.

Datasets

We provide links to the datasets we used to train/evaluate. After unpacking and preparation, please edit the dataset path (data:path field) in configs/*.yaml

Pre-trained models

We provide our pre-trained models' weights in a single archive.

Building Dependencies

To install and build all the modules required, please run:

bash ./install_deps.sh

Code Structure

In layers/cloud_transform.py the core operations are implemented (rasterization Splat and de-rasterization Slice). While in layers\mutihead_ct_*.py we provide slightly different versions of Multi-Headed Cloud Transform (MHCT).

The model zoo is situated in model_zoo, where the models for corresponding tasks are constructed of Multi-Headed Cloud Transforms.

Run

We train our models in multi-GPU setting using DistributedDataParallel. To train on n GPUs, please run the following commands:

python train_${SCRIPT_NAME}.py ${EXP_NAME} -c configs/${CONFIG_NAME}.yaml --master localhost:3315 --rank 0 --num_nodes n
...
python train_${SCRIPT_NAME}.py ${EXP_NAME} -c configs/${CONFIG_NAME}.yaml --master localhost:3315 --rank  --num_nodes n

The semantics for evaluation scripts is almost the same:

python eval_${SCRIPT_NAME}.py ${EXP_NAME} -c configs/eval/${CONFIG_NAME}.yaml

Cite

If you find our work helpful, please do not hesitate to cite us.

@inproceedings{mazur2021cloudtransformers,
  title={Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks},
  author={Mazur, Kirill and Lempitsky, Victor},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2021}
}

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Related tags

Overview

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Datasets

Pre-trained models

Building Dependencies

Code Structure

Run

Cite

Owner

Visual Understanding Lab @ Samsung AI Center Moscow

OpenCVを用いたカメラキャリブレーションのサンプルです。2021/06/21時点でPython実装のある3種類(通常カメラ向け、魚眼レンズ向け(fisheyeモジュール)、全方位カメラ向け(omnidirモジュール))について用意しています。

Pixie - A full-featured 2D graphics library for Python

A simple demo program for using OpenCV on Android

Slice a single image into multiple pieces and create a dataset from them

This repository summarized computer vision theories.

Image processing in Python

[BMVC'21] Official PyTorch Implementation of Grounded Situation Recognition with Transformers

Use Convolutional Recurrent Neural Network to recognize the Handwritten line text image without pre segmentation into words or characters. Use CTC loss Function to train.

Localization of thoracic abnormalities model based on VinBigData (top 1%)

Motion detector, Full body detection, Upper body detection, Cat face detection, Smile detection, Face detection (haar cascade), Silverware detection, Face detection (lbp), and Sending email notifications

Fun program to overlay a mask to yourself using a webcam

An Implementation of the FOTS: Fast Oriented Text Spotting with a Unified Network

Dirty, ugly, and hopefully useful OCR of Facebook Papers docs released by Gizmodo

Face Detection with DLIB

Official code for :rocket: Unsupervised Change Detection of Extreme Events Using ML On-Board :rocket:

Application that instantly translates sign-language to letters.

PyQT5 app that colorize black & white pictures using CNN(use pre-trained model which was made with OpenCV)

An unofficial package help developers to implement ZATCA (Fatoora) QR code easily which required for e-invoicing

PAGE XML format collection for document image page content and more

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract