Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Last update: Nov 11, 2022

Related tags

Overview

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

The offical implementation for the "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination" which is published in ACM MM 2020.

We propose Nearby Objects Hallucinator (NOH), which pinpoints the objects nearby each proposal with a Gaussian distribution, together with NOH-NMS, which dynamically eases the suppression for the space that might contain other objects with a high likelihood.

This work has won the first place at the CrowdHuman Challenge, 2020.

This repo is implemented based on detectron2.

Performance

Model	Backbone	AP	Recall	MR	Weights
Faster RCNN	ResNet-50	85.0	87.5	44.5	faster_rcnn_model_final.pth
NOH-NMS	ResNet-50	88.8	92.6	43.7	noh_nms_model_final.pth

Prepare Datasets

Download the CrowdHuman Datasets from http://www.crowdhuman.org/, and then move them under the directory like:

./data/crowdhuman
├── annotations
│   └── annotation_train.odgt
│   └── annotation_val.odgt
├── images
│   └── train
│   └── val

Installation

  cd detectron2
  pip install -e . 
  #or rebuild
  sh build.sh

Quick Start

See GETTING_STARTED.md in detectron2

Acknowledgement

detectron2

Citation

if you find this project useful for your research, please cite:

@inproceedings{zhou2020noh,
  title={NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination},
  author={Zhou, Penghao and Zhou, Chong and Peng, Pai and Du, Junlong and Sun, Xing and Guo, Xiaowei and Huang, Feiyue},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={1967--1975},
  year={2020}
}

Code for ACM MM 2020 paper "NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination"

Related tags

Overview

NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

Performance

Prepare Datasets

Installation

Quick Start

Acknowledgement

Citation

Owner

Tencent YouTu Research

End-To-End Crowdsourcing

Perspective: Julia for Biologists

[WACV 2022] Contextual Gradient Scaling for Few-Shot Learning

Official Pytorch implementation of 'GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network' (NeurIPS 2020)

This is an official implementation for "PlaneRecNet".

The tl;dr on a few notable transformer/language model papers + other papers (alignment, memorization, etc).

ESGD-M - A stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch

Virtual hand gesture mouse using a webcam

Official Implementation of VAT

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

Patch-Diffusion Code (AAAI2022)

The code for our paper CrossFormer: A Versatile Vision Transformer Based on Cross-scale Attention.

X-modaler is a versatile and high-performance codebase for cross-modal analytics.

OpenGAN: Open-Set Recognition via Open Data Generation

sssegmentation is a general framework for our research on strongly supervised semantic segmentation.

TOOD: Task-aligned One-stage Object Detection, ICCV2021 Oral

Mmdetection3d Noted - MMDetection3D is an open source object detection toolbox based on PyTorch

Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021

The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation