A general Python framework for single object tracking in LiDAR point clouds, based on PyTorch Lightning.

Last update: Dec 23, 2022

Open3DSOT

The official code release of BAT and MM-Track.

Features

Modular design. The model and the training/testing behaviors can be configured through a single .yaml file.
DDP support for both training and testing.
Supports all common tracking datasets (KITTI, NuScenes, Waymo Open Dataset).

📣 One tracking paper is accepted by CVPR2022 (Oral)! 👇

Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds | Code coming here soon...

Trackers

This repository includes the implementation of the following models:

MM-Track (CVPR2022 Oral)

[Paper] [Project Page]

MM-Track is the first motion-centric tracker in LiDAR SOT, which robustly handles distractors and drastic appearance changes in complex driving scenes. Unlike previous methods, MM-Track is a matching-free two-stage tracker which localizes the targets by explicitly modeling the "relative target motion" among frames.

BAT (ICCV2021)

[Paper] [Results]

Official implementation of BAT. BAT uses the BBox information to compensate for the information loss of incomplete scans. It augments the target template with box-aware features that efficiently and effectively improve appearance matching.

P2B (CVPR2020)

[Paper] [Official implementation]

Third-party implementation of P2B. Our implementation achieves better results than the official code release. P2B adapts SiamRPN to 3D point clouds by integrating a pointwise correlation operator with a point-based RPN (VoteNet).

Setup

Installation

Create the environment

git clone https://github.com/Ghostish/Open3DSOT.git
cd Open3DSOT
conda create -n Open3DSOT python=3.6
conda activate Open3DSOT

Install pytorch
```
conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch
```
Our code is well tested with PyTorch 1.4.0 and CUDA 10.1, but other platforms may also work. To install another version of PyTorch, follow the official PyTorch installation instructions. Note: to reproduce the reported results with the provided checkpoints, please use CUDA 10.x.

Install other dependencies:
```
pip install -r requirement.txt
```
Install the nuscenes-devkit if you want to use the NuScenes dataset:
```
pip install nuscenes-devkit
```

KITTI dataset

Download the data for velodyne, calib and label_02 from KITTI Tracking.
Unzip the downloaded files.

Put the unzipped files under the same folder as follows.

[Parent Folder]
--> [calib]
    --> {0000-0020}.txt
--> [label_02]
    --> {0000-0020}.txt
--> [velodyne]
    --> [0000-0020] folders with velodyne .bin files
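
The layout above can be checked before training. The following is a minimal sketch, not part of this repository: `check_kitti_layout` is a hypothetical helper, and it assumes the 21 sequences are named `0000` to `0020` exactly as shown in the tree.

```python
from pathlib import Path


def check_kitti_layout(root, sequences=range(21)):
    """Return the expected KITTI tracking entries that are missing under root."""
    root = Path(root)
    missing = []
    for seq in sequences:
        name = f"{seq:04d}"
        # calib and label_02 hold one .txt file per sequence.
        for sub in ("calib", "label_02"):
            txt = root / sub / f"{name}.txt"
            if not txt.is_file():
                missing.append(txt.relative_to(root).as_posix())
        # velodyne holds one folder of .bin scans per sequence.
        scans = root / "velodyne" / name
        if not scans.is_dir() or not any(scans.glob("*.bin")):
            missing.append(scans.relative_to(root).as_posix())
    return missing
```

Calling `check_kitti_layout("/path/to/kitti")` returns an empty list when every expected file and folder is present, and the relative paths of the missing entries otherwise.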

NuScenes dataset

Download the dataset from the download page

Extract the downloaded files and make sure you have the following structure:

[Parent Folder]
  samples  -  Sensor data for keyframes.
  sweeps   -  Sensor data for intermediate frames.
  maps     -  Folder for all map files: rasterized .png images and vectorized .json files.
  v1.0-*   -  JSON tables that include all the metadata and annotations. Each split (trainval, test, mini) is provided in a separate folder.

Note: We use the train_track split to train our model and test it with the val split. Both splits are officially provided by NuScenes. During testing, we ignore the sequences where there is no point in the first given bbox.
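
The filtering rule in the note above (skip a sequence when its first box contains no point) can be illustrated as follows. This is a minimal sketch under stated assumptions, not the code used in this repository: the helpers `count_points_in_box` and `keep_sequence` are hypothetical, the box is described by a center, a (length, width, height) size and a yaw angle around the z-axis, and the points are given in the same z-up frame as the box.

```python
import math


def count_points_in_box(points, center, size, yaw):
    """Count points inside a 3D box rotated by `yaw` around the z-axis.

    points: iterable of (x, y, z); center: (cx, cy, cz) of the box center;
    size: (length, width, height) along the box's own x, y, z axes.
    """
    cx, cy, cz = center
    length, width, height = size
    cos_y, sin_y = math.cos(yaw), math.sin(yaw)
    inside = 0
    for x, y, z in points:
        dx, dy, dz = x - cx, y - cy, z - cz
        # Rotate the offset by -yaw to express it in the box frame.
        local_x = cos_y * dx + sin_y * dy
        local_y = -sin_y * dx + cos_y * dy
        if (abs(local_x) <= length / 2 and abs(local_y) <= width / 2
                and abs(dz) <= height / 2):
            inside += 1
    return inside


def keep_sequence(first_frame_points, first_box):
    """Keep a sequence only if its first box contains at least one point."""
    center, size, yaw = first_box
    return count_points_in_box(first_frame_points, center, size, yaw) > 0
```

A sequence whose first frame yields `keep_sequence(...) == False` would be dropped from evaluation under this rule.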

Waymo dataset

Download and prepare the dataset following the instructions of CenterPoint.

[Parent Folder]
  tfrecord_training
  tfrecord_validation
  train  -  all training frames and annotations
  val    -  all validation frames and annotations
  infos_train_01sweeps_filter_zero_gt.pkl
  infos_val_01sweeps_filter_zero_gt.pkl

Prepare the SOT dataset. Data from each category and split will be merged into a single file (e.g., sot_infos_vehicle_train.pkl).

  python datasets/generate_waymo_sot.py

Quick Start

Training

To train a model, specify the .yaml file with the --cfg argument. The .yaml file contains all the configurations of the dataset and the model. Currently, we provide four .yaml files under the cfgs directory. Note: before running the code, edit the .yaml file and set the path argument to the root directory of the dataset.

python main.py --gpu 0 1 --cfg cfgs/BAT_Car.yaml --batch_size 50 --epoch 60 --preloading
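
Setting the path argument before launching the command above can also be scripted. The following is a minimal sketch, assuming the .yaml file stores the dataset root as an unindented, top-level `path:` entry (check your config file, as the actual layout may differ). `set_dataset_path` is a hypothetical helper, not part of this repository, and it uses only the standard library, so any trailing comment on the `path:` line is dropped.

```python
import re
from pathlib import Path


def set_dataset_path(cfg_file, dataset_root):
    """Rewrite the top-level `path:` entry of a .yaml config in place."""
    cfg_file = Path(cfg_file)
    text = cfg_file.read_text()
    # Match an unindented `path:` line; assumes the key sits at the top level.
    pattern = re.compile(r"^path:.*$", flags=re.MULTILINE)
    if not pattern.search(text):
        raise KeyError(f"no top-level 'path' entry in {cfg_file}")
    # A callable replacement avoids backslash escapes in the new path.
    new_text = pattern.sub(lambda m: f"path: {dataset_root}", text, count=1)
    cfg_file.write_text(new_text)
    return new_text
```

For example, `set_dataset_path("cfgs/BAT_Car.yaml", "/path/to/kitti")` would update the config used in the training command, provided the file follows the assumed layout.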

After you start training, you can start Tensorboard to monitor the training process:

tensorboard --logdir=./ --port=6006

By default, the trainer runs a full evaluation on the full test split after every training epoch. You can set --check_val_every_n_epoch to a larger number to speed up training. The --preloading flag preloads the training samples into memory to save training time. Remove this flag if you don't have enough memory.

Testing

To test a trained model, specify the checkpoint location with the --checkpoint argument and pass the --test flag to the command.

python main.py --gpu 0 1 --cfg cfgs/BAT_Car.yaml --checkpoint /path/to/checkpoint/xxx.ckpt --test

Reproduction

| Model | Category | Success | Precision | Checkpoint |
| --- | --- | --- | --- | --- |
| BAT-KITTI | Car | 65.37 | 78.88 | pretrained_models/bat_kitti_car.ckpt |
| BAT-NuScenes | Car | 40.73 | 43.29 | pretrained_models/bat_nuscenes_car.ckpt |
| BAT-KITTI | Pedestrian | 45.74 | 74.53 | pretrained_models/bat_kitti_pedestrian.ckpt |

Three trained BAT models for KITTI and NuScenes datasets are provided in the pretrained_models directory. To reproduce the results, simply run the code with the corresponding .yaml file and checkpoint. For example, to reproduce the tracking results on KITTI Car, just run:

python main.py --gpu 0 1 --cfg cfgs/BAT_Car.yaml --checkpoint ./pretrained_models/bat_kitti_car.ckpt --test
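
For readers unfamiliar with the Success and Precision columns in the table above, the sketch below shows one convention that is common in LiDAR single object tracking: Success is the area under the curve of the fraction of frames whose 3D IoU reaches each threshold in [0, 1], and Precision is the same area for center distance errors under thresholds from 0 to 2 meters. The number of thresholds (`n=21`), the 2 m limit, and the function names are assumptions made for illustration; the evaluation code in this repository remains the reference for the reported numbers.

```python
def _auc(thresholds, values):
    """Trapezoidal area under the curve, normalized by the threshold range."""
    area = 0.0
    for i in range(1, len(thresholds)):
        step = thresholds[i] - thresholds[i - 1]
        area += step * (values[i] + values[i - 1]) / 2
    return area / (thresholds[-1] - thresholds[0])


def success(overlaps, n=21):
    """AUC (in %) of the fraction of frames whose IoU meets each threshold."""
    thresholds = [i / (n - 1) for i in range(n)]
    curve = [sum(o >= t for o in overlaps) / len(overlaps) for t in thresholds]
    return 100 * _auc(thresholds, curve)


def precision(center_errors, max_error=2.0, n=21):
    """AUC (in %) of the fraction of frames whose center error is within each threshold."""
    thresholds = [max_error * i / (n - 1) for i in range(n)]
    curve = [sum(e <= t for e in center_errors) / len(center_errors)
             for t in thresholds]
    return 100 * _auc(thresholds, curve)
```

Both functions take one value per tracked frame (IoU with the ground-truth box, or distance between predicted and ground-truth centers in meters) and return a score between 0 and 100.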

Acknowledgment

This repo is built upon P2B and SC3D.
Thanks to Erik Wijmans for his PyTorch implementation of PointNet++.

License

This repository is released under the MIT License (see LICENSE file for details).

Owner

Kangel Zenn