Codes_APN

Official code for the CVPR 2021 paper "Normal Learning in Videos with Attention Prototype Network" (https://arxiv.org/abs/2108.11055)

Figure: overview of our approach, based on the APU and CAU units.

Introduction

Frame reconstruction (of the current or a future frame) with an Auto-Encoder (AE) is a popular approach to video anomaly detection. When the model is trained only on normal data, the reconstruction errors of anomalous scenes are usually much larger than those of normal ones. Previous methods added a memory bank to the AE to encode the diverse normal patterns found across the training videos. However, these methods are memory-consuming and cannot cope with unseen scenarios in the test data. In this work, we propose a self-attention prototype unit (APU) that encodes the normal latent space as prototypes in real time, with no extra memory cost. In addition, we introduce a circulative attention mechanism into our backbone to form a novel feature-extracting learner, the Circulative Attention Unit (CAU). It allows fast adaptation to new scenes using only a few update iterations. Extensive experiments are conducted on various benchmarks, and the superior performance over the state of the art demonstrates the effectiveness of our method.
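To make the reconstruction-error idea concrete, here is a minimal sketch of frame-level anomaly scoring. It is not taken from this repository: it assumes frames are flat lists of pixel values in [0, 1], and it uses the PSNR-based, per-video min-max normalised score that is common in reconstruction-based anomaly detection. The helper names (mse, psnr, anomaly_scores) are hypothetical.

```python
import math

# Illustrative only: these helpers are assumptions, not code from this repo.

def mse(frame, recon):
    """Mean squared error between two equally sized flat pixel lists."""
    if len(frame) != len(recon):
        raise ValueError("frame and reconstruction must have the same length")
    return sum((a - b) ** 2 for a, b in zip(frame, recon)) / len(frame)

def psnr(frame, recon, max_val=1.0, eps=1e-8):
    """Peak signal-to-noise ratio in dB; higher means a better reconstruction."""
    return 10.0 * math.log10(max_val ** 2 / (mse(frame, recon) + eps))

def anomaly_scores(frames, recons):
    """Min-max normalise per-frame PSNR over one video and invert it,
    so 0 marks the best-reconstructed frame and 1 the worst."""
    values = [psnr(f, r) for f, r in zip(frames, recons)]
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [1.0 - (v - lo) / (hi - lo) for v in values]

# Toy video: three 4-pixel frames; the last reconstruction is deliberately poor.
frames = [[0.2, 0.4, 0.6, 0.8]] * 3
recons = [
    [0.2, 0.4, 0.6, 0.8],   # perfect reconstruction
    [0.25, 0.4, 0.6, 0.8],  # small error
    [0.9, 0.1, 0.1, 0.2],   # large error, as expected for an anomalous frame
]
scores = anomaly_scores(frames, recons)
```

In this toy case the poorly reconstructed frame receives the highest score. In practice the reconstructions would come from the trained model, and a threshold or AUC evaluation would be applied to the scores.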

Performance

We achieve state-of-the-art results on multiple video anomaly detection datasets.

Unsupervised Anomaly Detection Model Training

bash train.sh

Unsupervised Anomaly Detection Model Testing

bash test.sh

If you find this work helpful, please cite:

@inproceedings{Nv2021APN,
  author    = {Chao Hu and
               Fan Wu and
               Weijie Wu and
               Weibin Qiu and
               Shengxin Lai},
  title     = {Normal Learning in Videos with Attention Prototype Network},
  booktitle = {Computer Vision and Pattern Recognition},
  year      = {2021}
}
