Image Similarity Challenge

Goal of the Competition

Competitors built models to help detect whether a given query image is derived from any of the images in a large reference set.

Content tracing is a crucial component on all social media platforms today, used for such tasks as flagging misinformation and manipulative advertising, preventing uploads of graphic violence, and enforcing copyright protections. But when dealing with the billions of new images generated every day on sites like Facebook, manual content moderation just doesn't scale. They depend on algorithms to help automatically flag or remove bad content.

This competition allowed participants to test their skills in building a key part of that content tracing system, and in so doing contribute to making social media more trustworthy and safe for the people who use it.

A reference image is manipulated to produce new images.
In this challenge competitors built models to detect whether a given query image is derived from a reference set.

There were two tracks to this challenge:

For the Matching Track, competitors created models that directly detect whether a query image is derived from one of the images in a large corpus of reference images.
For the Descriptor Track, competitors generated useful vector representations of images (up to 256 dimensions). These descriptors are compared with Euclidean distance to detect whether a query image is derived from one of the images in a large corpus of reference images.

Winning Submissions

See below for links to winning submissions' arXiv papers and code.

Matching Track

Place	Team or User	Code	Paper	Score	Summary of Model
1	VisionForce	GitHub repository	D2LV: A Data-Driven and Local-Verification Approach for Image Copy Detection	0.8329	A "data-driven and local-verification (D^2LV)" approach using pre-training on a set of basic and advanced image augmentations, and a global-local and local-global matching strategy for testing.
2	separate	GitHub repository	2nd Place Solution to Facebook AI Image Similarity Challenge Matching Track	0.8291	A Vision Transformer approach that uses concatenated query and reference images to learn the relationship between query and reference images directly.
3	imgFp	GitHub repository	3rd Place: A Global and Local Dual Retrieval Solution to Facebook AI Image Similarity Challenge	0.7682	A global+local recall approach with EsViT for global recall and SIFT point features for local recall.

Descriptor Track

Place	Team or User	Code	Paper	Score	Summary of Model
1	lyakaap	GitHub repository	Contrastive Learning with Large Memory Bank and Negative Embedding Subtraction for Accurate Copy Detection	0.6354	Uses an EfficientNet backbone trained with contrastive loss and cross-batch memory, and a training neighbor subtraction step in post-processing.
2	S-square	GitHub repository	Producing augmentation-invariant embeddings from real-life imagery	0.5905	Ensembles EfficientNet and NFNet backbones using an ArcFace loss function, and applies a sample normalization step in post-processing.
3	VisionForce	GitHub repository	Bag of Tricks and A Strong baseline for Image Copy Detection	0.5788	Uses a pretrained Barlow Twins model, yolov5 model to detect overlays, and a descriptor stretching step in post-processing.

Winners of the Facebook Image Similarity Challenge

Related tags

Overview

Image Similarity Challenge

Goal of the Competition

Winning Submissions

Matching Track

Descriptor Track

Owner

DrivenData

Pytorch implementation of the paper "COAD: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking."

Sky Computing: Accelerating Geo-distributed Computing in Federated Learning

Official implementation of Deep Burst Super-Resolution

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

Angular & Electron desktop UI framework. Angular components for native looking and behaving macOS desktop UI (Electron/Web)

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

Sinkformers: Transformers with Doubly Stochastic Attention

Automatically measure the facial Width-To-Height ratio and get facial analysis results provided by Microsoft Azure

The missing CMake project initializer

Python suite to construct benchmark machine learning datasets from the MIMIC-III clinical database.

Explainable Medical ImageSegmentation via GenerativeAdversarial Networks andLayer-wise Relevance Propagation

Teaches a student network from the knowledge obtained via training of a larger teacher network

LSTM model trained on a small dataset of 3000 names written in PyTorch

Red Team tool for exfiltrating files from a target's Google Drive that you have access to, via Google's API.

Optimized Gillespie algorithm for simulating Stochastic sPAtial models of Cancer Evolution (OG-SPACE)

Using this you can control your PC/Laptop volume by Hand Gestures (pinch-in, pinch-out) created with Python.

Deploying PyTorch Model to Production with FastAPI in CUDA-supported Docker

The source code for 'Noisy-Labeled NER with Confidence Estimation' accepted by NAACL 2021

Implementation of "A MLP-like Architecture for Dense Prediction"

Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments