PolyTrack: Tracking with Bounding Polygons

Last update: Sep 15, 2022

Related tags

Overview

PolyTrack: Tracking with Bounding Polygons

Abstract

In this paper, we present a novel method called PolyTrack for fast multi-object tracking and segmentation using bounding polygons. Polytrack detects objects by producing heatmaps of their center keypoint. For each of them, a rough segmentation is done by computing a bounding polygon over each instance instead of the traditional bounding box. Tracking is done by taking two consecutive frames as input and computing a center offset for each object detected in the first frame to predict their location in the second frame. A Kalman filter is also applied to reduce the number of ID switches. Since our target application is automated driving systems, we apply our method on urban environment videos. We train and evaluate PolyTrack on the MOTS and KITTIMOTS dataset.

Example results

Video examples from the KITTI MOTS test set:

Model

An overview of the PolyTrack architecture. The network takes as input the image at time t, I(t), the image at time t-1, I(t-1), as well as the heatmap at time t-1, H(t-1). Features are produced by the backbone and then used by five different network heads. The center heatmaps head is used for detecting and classifying objects, the polygon head is used for the segmentation part, the depth head is used to produce a relative depth between objects, the tracking head is used to produce an offset between frames at time t-1 and time t and finally the offset head is used for correctly upsampling images.

a) Generated Heatmap	b) Generated Output

a): The center heatmap produced by the network to detect objects, b): the output of our method: a bounding polygon for each object, a class label, a track id as well as an offset from the previous frame.

Installation

Please refer to INSTALL.md for installation instructions.

Folder organization

/experiments: bash files to start repeat our experiments, you can also find an example of how to perform a demo.
/src/lib : contains the code needed to generate and train a model
/src/tools : contains tools relevant to different datasets, you can find the files we used to generate our ground truth here.
/data : not included in the git repo, but contains images from the dataset with the following structure:
/data/MOTS/test/ : contains test images
/data/MOTS/train/ : contains train images
/data/MOTS/seqmaps/ : contains seqmaps
/data/MOTS/json_gt/ : contains ground truth files generated by our tools

License

PolyTrack is released under the MIT License. PolyTrack is based upon CenterTrack and CenterPoly. Portions of the code are borrowed from CornerNet (hourglassnet, loss functions), dla (DLA network) and DCNv2(deformable convolutions). Please refer to the original License of these projects (See NOTICE).

PolyTrack: Tracking with Bounding Polygons

Related tags

Overview

PolyTrack: Tracking with Bounding Polygons

Abstract

Example results

Model

Installation

Folder organization

License

Owner

Gaspar Faure

[TIP2020] Adaptive Graph Representation Learning for Video Person Re-identification

Two types of Recommender System : Content-based Recommender System and Colaborating filtering based recommender system

Using PyTorch Perform intent classification using three different models to see which one is better for this task

Automatically replace ONNX's RandomNormal node with Constant node.

WSDM‘2022: Knowledge Enhanced Sports Game Summarization

Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation (CoRL 2021)

Simple improvement of VQVAE that allow to generate x2 sized images compared to baseline

Annotated, understandable, and visually interpretable PyTorch implementations of: VAE, BIRVAE, NSGAN, MMGAN, WGAN, WGANGP, LSGAN, DRAGAN, BEGAN, RaGAN, InfoGAN, fGAN, FisherGAN

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples

Language Used: Python . Made in Jupyter(Anaconda) notebook.

Code for the paper: Sketch Your Own GAN

Official implementation of Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models at NeurIPS 2021

This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described in the paper.

This is the official repository of XVFI (eXtreme Video Frame Interpolation)

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

Plug and play transformer you can find network structure and official complete code by clicking List

PFFDTD is an open-source FDTD simulator for 3D room acoustics

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

2021 credit card consuming recommendation

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)