The official implementation of ICCV paper "Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds".

Last update: Mar 26, 2022

Related tags

Deep Learning BAT

Overview

Box-Aware Tracker (BAT)

Pytorch-Lightning implementation of the Box-Aware Tracker.

Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds. ICCV 2021

Chaoda Zheng, Xu Yan, Jiaotao Gao, Weibing Zhao, Wei Zhang, Zhen Li*, Shuguang Cui

Citation

@InProceedings{zheng2021box,
  title={Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds},
  author={Chaoda Zheng, Xu Yan, Jiaotao Gao, Weibing Zhao, Wei Zhang, Zhen Li, Shuguang Cui},
  journal={ICCV},
  year={2021}
}

Features

Modular design. It is easy to config the model and trainng/testing behaviors through just a .yaml file.
DDP support for both training and testing.
Provide a 3rd party implementation of P2B.

Setup

Installation

create the environment

git clone https://github.com/Ghostish/BAT.git
cd BAT
conda create -n bat  python=3.6
conda activate bat

Install pytorch
```
conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit=10.1 -c pytorch
```
Our code is well tested with pytorch 1.4.0 and CUDA 10.1. But other platforms may also work. Follow this to install another version of pytorch.
Install other dependencies
```
pip install -r requirement.txt
```

KITTI dataset

Download the data for velodyne, calib and label_02 from KITTI Tracking.
Unzip the downloaded files.

Put the unzipped files under the same folder as following.

[Parent Folder]
--> [calib]
    --> {0000-0020}.txt
--> [label_02]
    --> {0000-0020}.txt
--> [velodyne]
    --> [0000-0020] folders with velodynes .bin files

Quick Start

Training

To train a model, you must specify the .yaml file with --cfg argument. The .yaml file contains all the configurations of the dataset and the model. Currently, we provide three .yaml files under the cfgs directory. Note: Before running the code, you will need to edit the .yaml file by setting the path argument as the correct root of the dataset.

python main.py --gpu 0 1 --cfg cfgs/BAT_Car.yaml  --batch_size 50 --epoch 60

After you start training, you can start Tensorboard to monitor the training process:

tensorboard --logdir=./ --port=6006

By default, the trainer runs a full evaluation on the full test split after training every epoch. You can set --check_val_every_n_epoch to a larger number to speed up the training.

Testing

To test a trained model, specify the checkpoint location with --checkpoint argument and send the --test flag to the command.

python main.py --gpu 0 1 --cfg cfgs/BAT_Car.yaml  --checkpoint /path/to/checkpoint/xxx.ckpt --test

Reproduction

This codebase produces better results than those we report in our original paper.

Model	Category	Success	Precision	Checkpoint
BAT	Car	65.37	78.88	pretrained_models/bat_kitti_car.ckpt
BAT	Pedestrian	45.74	74.53	pretrained_models/bat_kitti_pedestrian.ckpt

Two Trained BAT models for KITTI dataset are provided in the pretrained_models directory. To reproduce the results, simply run the code with the corresponding .yaml file and checkpoint. For example, to reproduce the tracking results on Car, just run:

python main.py --gpu 0 1 --cfg cfgs/BAT_Car.yaml  --checkpoint ./pretrained_models/bat_kitti_car.ckpt --test

To-dos

Acknowledgment

This repo is built upon P2B and SC3D.
Thank Erik Wijmans for his pytorch implementation of PointNet++

The official implementation of ICCV paper "Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds".

Related tags

Overview

Box-Aware Tracker (BAT)

Citation

Features

Setup

Quick Start

Training

Testing

Reproduction

To-dos

Acknowledgment

Owner

Kangel Zenn

Minimisation of a negative log likelihood fit to extract the lifetime of the D^0 meson (MNLL2ELDM)

Sequence to Sequence Models with PyTorch

EMNLP'2021: SimCSE: Simple Contrastive Learning of Sentence Embeddings

CS583: Deep Learning

Official PyTorch implementation of the paper "Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein GAN", accepted to ACM MM 2021 BNI Track.

Intrusion Detection System using ensemble learning (machine learning)

Learning Versatile Neural Architectures by Propagating Network Codes

This project provides the code and datasets for 'CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection', CVPR 2019.

Here we present the implementation in TensorFlow of our work about liver lesion segmentation accepted in the Machine Learning 4 Health Workshop

Create Own QR code with Python

The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".

🏅 Top 5% in 제2회 연구개발특구 인공지능 경진대회 AI SPARK 챌린지

Official repo for our 3DV 2021 paper "Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements".

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.

Experiments and examples converting Transformers to ONNX

Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps

Code image classification of MNIST dataset using different architectures: simple linear NN, autoencoder, and highway network

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

FANet - Real-time Semantic Segmentation with Fast Attention

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding