An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

Last update: Jun 27, 2021

Related tags

Overview

Fast Face Classification (F²C)

This is the code of our paper An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

Training on ultra-large-scale datasets is time-consuming and takes up a lot of hardware resource. Therefore we design a dul-data loaders and dynamic class pool to deal with large-scale face classification.

Pipeline

Preparation

As FFC contains LRU module, so you may use lru_python_impl.py or instead compile the code under lru_c directory.

If you choose lru_python_impl.py, you should rename lru_python_impl.py to lru_utils.py. As lru is not the bottleneck of the training procedure, so feel free to use python implementation, though the C++ implementation is 5~10 times faster than python version.

Compile LRU (optional)

Command to build LRU

cd lru_c
mkdir build
cd build
cmake ..
make
cd ../../ && ln -s lru_c/build/lru_utils.so .

You can compare this two implementation using lru_c/python/compare_time.py

Database

Training dataset
- MS-Celeb-1M
- Deepglint-360K
Test dataset
- SLLFW
- CPLFW: Baidu or Google Drive
- CALFW: Baidu or Google Drive
- CFP: Baidu or Google Drive
- AgeDB: Baidu or Google Drive
- YTF
- IJBC
- MegaFace
Data preprocess

We use 5 landmarks(Left eye center, right eye center, nose, left mouth corner and right mouth corner) to crop face as what ArcFace does. You can find code here.

Training

In main.py, you should provide the path to your training db at line 152-153.

args.source_lmdb = ['/path to msceleb.lmdb']
args.source_file = ['/path to kv file']

We choose lmdb as the format of our training db. Each element in source_file is the path to a text file, each line of which represents lmdb_key label pairs. You may refer to LFS for more details.

Now you can modify train_ffc.sh. Before running the training, you should set the port number and queue_size. queue_size is a trade-off term that controls the performance and the speed. Larger queue_size means higher performance at the cost of time and GPU resource. It can be any positive integer. The common setting is 1%, 0.1%, 0.001 % of the total identities.

Notice

The difference between r50 and ir50 is that r50 requires 224 × 224 images as input while ir50 requires 112 × 112 as what does by ArcFace. The network ir50 comes from ArcFace.

Evaluation

We provide the whole test script under evaluation_code directory. Each script requires the directory to the images and test pair files.

Tips

Code in evaluation_code/test_megaface.py is much faster than official version. It's also applicable to extremely large-scale testing.

An Efficient Training Approach for Very Large Scale Face Recognition or F²C for simplicity.

Related tags

Overview

Fast Face Classification (F²C)

Preparation

Compile LRU (optional)

Database

Training

Notice

Evaluation

Owner

BBScan py3 - BBScan py3 With Python

Implementation of Squeezenet in pytorch, pretrained models on Cifar 10 data to come

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

This is an official implementation for "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

DeLiGAN - This project is an implementation of the Generative Adversarial Network

This repository is for the preprint "A generative nonparametric Bayesian model for whole genomes"

[ICCV 2021] Official Tensorflow Implementation for "Single Image Defocus Deblurring Using Kernel-Sharing Parallel Atrous Convolutions"

[CVPR 2022] Official code for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration"

pq is a jq-like Pickle file viewer

Controlling Hill Climb Racing with Hand Tacking

FIGARO: Generating Symbolic Music with Fine-Grained Artistic Control

A general-purpose, flexible, and easy-to-use simulator alongside an OpenAI Gym trading environment for MetaTrader 5 trading platform (Approved by OpenAI Gym)

Imposter-detector-2022 - HackED 2022 Team 3IQ - 2022 Imposter Detector

Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

【steal piano】GitHub偷情分析工具！

Empower Sequence Labeling with Task-Aware Language Model

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

How to train a CNN to 99% accuracy on MNIST in less than a second on a laptop