This repository contains pre-trained models and some evaluation code for our paper Towards Unsupervised Dense Information Retrieval with Contrastive Learning

Last update: Jan 08, 2023

Related tags

Deep Learning contriever

Overview

Contriever: Towards Unsupervised Dense Information Retrieval with Contrastive Learning

This repository contains pre-trained models and some evaluation code for our paper Towards Unsupervised Dense Information Retrieval with Contrastive Learning.

We use a simple contrastive learning framework to pre-train models for information retrieval. Contriever, trained without supervision, is competitive with BM25 for [email protected] on the BEIR benchmark. After finetuning on MSMARCO, Contriever obtains strong performance, especially for the recall at 100.

Getting Started

Pre-trained models can be loaded through the HuggingFace transformers library:

import transformers
from src.contriever import Contriever

model = Contriever.from_pretrained("facebook/contriever")
tokenizer = transformers.BertTokenizerFast.from_pretrained("facebook/contriever")

Embeddings for different sentences can be obtained by doing the following:

sentences = [
    "Where was Marie Curie born?",
    "Maria Sklodowska, later known as Marie Curie, was born on November 7, 1867.",
    "Born in Paris on 15 May 1859, Pierre Curie was the son of Eugène Curie, a doctor of French Catholic origin from Alsace."
]

inputs = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
embeddings = model(**inputs)

Then similarity scores between the different sentences can be obtained with a dot product between the embeddings:

score01 = embddings[0] @ embeddings[1] #1.0473
score02 = embddings[0] @ embeddings[2] #1.0095

BEIR evaluation

Scores on the BEIR benchmark can be reproduced using eval_beir.py.

python eval_beir.py --model_name_or_path facebook/contriever-msmarco --dataset scifact

Available models

Model	Description
facebook/contriever	Model pre-trained on Wikipedia and CC-net without any supervised data
facebook/contriever-msmarco	Pre-trained model fine-tuned on MS-MARCO

References

[1] G. Izacard, M. Caron, L. Hosseini, S. Riedel, P. Bojanowski, A. Joulin, E. Grave Towards Unsupervised Dense Information Retrieval with Contrastive Learning

@misc{izacard2021contriever,
      title={Towards Unsupervised Dense Information Retrieval with Contrastive Learning}, 
      author={Gautier Izacard and Mathilde Caron and Lucas Hosseini and Sebastian Riedel and Piotr Bojanowski and Armand Joulin and Edouard Grave},
      year={2021},
      eprint={2112.09118},
      archivePrefix={arXiv},
}

License

See the LICENSE file for more details.

This repository contains pre-trained models and some evaluation code for our paper Towards Unsupervised Dense Information Retrieval with Contrastive Learning

Related tags

Overview

Contriever: Towards Unsupervised Dense Information Retrieval with Contrastive Learning

Getting Started

BEIR evaluation

Available models

References

License

Owner

Meta Research

Code for Quantifying Ignorance in Individual-Level Causal-Effect Estimates under Hidden Confounding

💃 VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic Phenomena

You are AllSet: A Multiset Function Framework for Hypergraph Neural Networks.

Medical Image Segmentation using Squeeze-and-Expansion Transformers

RipsNet: a general architecture for fast and robust estimation of the persistent homology of point clouds

Audio2Face - Audio To Face With Python

Official code repository for the publication "Latent Equilibrium: A unified learning theory for arbitrarily fast computation with arbitrarily slow neurons"

(to be released) [NeurIPS'21] Transformers Generalize DeepSets and Can be Extended to Graphs and Hypergraphs

Submanifold sparse convolutional networks

The Official PyTorch Implementation of "LSGM: Score-based Generative Modeling in Latent Space" (NeurIPS 2021)

Official codebase for Legged Robots that Keep on Learning: Fine-Tuning Locomotion Policies in the Real World

LWCC: A LightWeight Crowd Counting library for Python that includes several pretrained state-of-the-art models.

Official implementation of Rich Semantics Improve Few-Shot Learning (BMVC, 2021)

Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)

Bulk2Space is a spatial deconvolution method based on deep learning frameworks

Code for reproducible experiments presented in KSD Aggregated Goodness-of-fit Test.

DLFlow is a deep learning framework.

DCGAN-tensorflow - A tensorflow implementation of Deep Convolutional Generative Adversarial Networks

Points2Surf: Learning Implicit Surfaces from Point Clouds (ECCV 2020 Spotlight)

Riemannian Geometry for Molecular Surface Approximation (RGMolSA)