git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Last update: Sep 08, 2021

Related tags

Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

This is the pytorch implementation of our paper "[Beta R-CNN: Looking into Pedestrian Detection from Another Perspective]", published in Neurips 2020.

Our method aiming at detecting highly occluded and highly-overlapped instances in crowded scenes especially for pedestrian detection.

Codes are prepared to release here. Due to the experiments are conducted with internal framework, we need some time to rewrite and clean the code. We will release the complete code soon.

Abstract

Recently significant progress has been made in pedestrian detection, but it remains challenging to achieve high performance in occluded and crowded scenes. It could be mostly attributed to the widely used representation of pedestrians, i.e., 2Daxis-aligned bounding box, which just describes the approximate location and size of the object. Bounding box models the object as a uniform distribution within the boundary, making pedestrians indistinguishable in occluded and crowded scenes due to much noise. To eliminate the problem, we propose a novel representation based on 2D beta distribution, named Beta Representation. It pictures a pedestrian by explicitly constructing the relationship between full-body and visible boxes,and emphasizes the center of visual mass by assigning different probability values to pixels. As a result, Beta Representation is much better for distinguishing highly-overlapped instances in crowded scenes with a new NMS strategy named BetaNMS. What’s more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.

Method

The network structure and some visualization results are shown here:

Citation

@article{BetaRCNN,
  title={Beta R-CNN: Looking into Pedestrian Detection from Another Perspective},
  author={Xu, Zixuan and Li, Banghuai and Yuan, Ye and Dang, Anhong},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Contact

If you have any questions, please do not hesitate to contact Zixuan Xu ([email protected]).

git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Related tags

Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

Abstract

Method

Citation

Contact

Owner

Facestar dataset. High quality audio-visual recordings of human conversational speech.

Official repository of Semantic Image Matting

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Official implementation of "Membership Inference Attacks Against Self-supervised Speech Models"

GeDML is an easy-to-use generalized deep metric learning library

Self-Attention Between Datapoints: Going Beyond Individual Input-Output Pairs in Deep Learning

Supervised Contrastive Learning for Product Matching

Implementation of "GNNAutoScale: Scalable and Expressive Graph Neural Networks via Historical Embeddings" in PyTorch

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)

Machine learning, in numpy

Fuzzing the Kernel Using Unicornafl and AFL++

D-NeRF: Neural Radiance Fields for Dynamic Scenes

Deep Federated Learning for Autonomous Driving

LEAP: Learning Articulated Occupancy of People

Omnidirectional Scene Text Detection with Sequential-free Box Discretization (IJCAI 2019). Including competition model, online demo, etc.

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language Models for the legal domain in Spanish done @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Open-Set Recognition: A Good Closed-Set Classifier is All You Need

Implementation of Pix2Seq in PyTorch