WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

Last update: Jan 01, 2023

Related tags

Overview

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking [Paper Link]

Abstract

In this work, we contribute a new million-scale Unmanned Aerial Vehicle (UAV) tracking benchmark, called WebUAV-3M. Firstly, we collect 4,485 videos with more than 3M frames from the Internet. Then, an efficient and scalable Semi-Automatic Target Annotation (SATA) pipeline is devised to label the tremendous WebUAV-3M in every frame. To the best of our knowledge, the densely bounding box annotated WebUAV-3M is by far the largest public UAV tracking benchmark. We expect to pave the way for the follow-up study in the UAV tracking by establishing a million-scale annotated benchmark covering a wide range of target categories. Moreover, considering the close connections among visual appearance, natural language and audio, we enrich WebUAV-3M by providing natural language specification and audio description, encouraging the exploration of natural language features and audio cues for UAV tracking. Equipped with this benchmark, we delve into million-scale deep UAV tracking problems, aiming to provide the community with a dedicated large-scale benchmark for training deep UAV trackers and evaluating UAV tracking approaches. Extensive experiments on WebUAV-3M demonstrate that there is still a big room for robust deep UAV tracking improvements. The dataset, toolkits and baseline results will be available at this page.

WebUAV-3M dataset

Dataset coming here soon...

Evaluation toolkits

Toolkits coming here soon...

Baseline results

Results coming here soon...

Environment

The experiments are implemented using PyTorch or MATLAB with an Intel (R) Xeon (R) Gold 6230R CPU @ 2.10GHz and three NVIDIA RTX A5000 GPUs on an Ubuntu 18.04 server.

Citation

If you find the dataset and toolkits useful in your research, please consider citing:

@inproceedings{WebUAV_3M_2022,
    title={WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking},
    author = {Chunhui Zhang, and Guanjie Huang, and Li Liu, and Shan Huang, and Yinan Yang, and Yuxuan Zhang, and Xiang Wan, and Shiming Ge},
    journal = {arXiv:2201.07425},
    year = {2022}
  }

Acknowledgments

Thanks for the great [GOT-10k toolkit]

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

Related tags

Overview

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking [Paper Link]

Abstract

WebUAV-3M dataset

Evaluation toolkits

Baseline results

Environment

Citation

Acknowledgments

Owner

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"

Filtering variational quantum algorithms for combinatorial optimization

Principled Detection of Out-of-Distribution Examples in Neural Networks

SuperSDR: multiplatform KiwiSDR + CAT transceiver integrator

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Long Expressive Memory (LEM)

Implementation of the Paper: "Parameterized Hypercomplex Graph Neural Networks for Graph Classification" by Tuan Le, Marco Bertolini, Frank Noé and Djork-Arné Clevert

GPU Accelerated Non-rigid ICP for surface registration

Alphabetical Letter Recognition

PyTorch Implementation of Unsupervised Depth Completion with Calibrated Backprojection Layers (ORAL, ICCV 2021)

alfred-py: A deep learning utility library for human

the code for our CVPR 2021 paper Bilateral Grid Learning for Stereo Matching Network [BGNet]

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

基于AlphaPose的TensorRT加速

The Implicit Bias of Gradient Descent on Generalized Gated Linear Networks

Official code for "InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization" (ICLR 2020, spotlight)

Add gui for YoloV5 using PyQt5

The implementation of the lifelong infinite mixture model

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

Related tags

Overview

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking [Paper Link]

Abstract

WebUAV-3M dataset

Evaluation toolkits

Baseline results

Environment

Citation

Acknowledgments

Owner

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"

Filtering variational quantum algorithms for combinatorial optimization

Principled Detection of Out-of-Distribution Examples in Neural Networks

SuperSDR: multiplatform KiwiSDR + CAT transceiver integrator

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Long Expressive Memory (LEM)

Implementation of the Paper: "Parameterized Hypercomplex Graph Neural Networks for Graph Classification" by Tuan Le, Marco Bertolini, Frank Noé and Djork-Arné Clevert

GPU Accelerated Non-rigid ICP for surface registration

Alphabetical Letter Recognition

PyTorch Implementation of Unsupervised Depth Completion with Calibrated Backprojection Layers (ORAL, ICCV 2021)

alfred-py: A deep learning utility library for **human**

the code for our CVPR 2021 paper Bilateral Grid Learning for Stereo Matching Network [BGNet]

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of PyTorch tricks

PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-turn Response Selection"

pytorch implementation for Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network arXiv:1609.04802

基于AlphaPose的TensorRT加速

The Implicit Bias of Gradient Descent on Generalized Gated Linear Networks

Official code for "InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization" (ICLR 2020, spotlight)

Add gui for YoloV5 using PyQt5

The implementation of the lifelong infinite mixture model

alfred-py: A deep learning utility library for human