(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’

Last update: Dec 01, 2022

Overview

Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback

About

This repository accompanies the real-world experiments conducted in the paper "Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback" by Yuta Saito, which has been accepted at SIGIR2020 as a full paper.

If you find this code useful in your research then please cite:

@inproceedings{saito2020asymmetric,
  title={Asymmetric tri-training for debiasing missing-not-at-random explicit feedback},
  author={Saito, Yuta},
  booktitle={Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval},
  year={2020}
}

Dependencies

numpy==1.17.2
pandas==0.25.1
scikit-learn==0.22.1
tensorflow==1.15.2
optuna==0.17.0
pyyaml==5.1.2

Running the code

To run the simulation with real-world datasets,

download the Coat dataset from https://www.cs.cornell.edu/~schnabts/mnar/ and put train.ascii and test.ascii files into ./data/coat/ directory.
download the Yahoo! R3 dataset from https://webscope.sandbox.yahoo.com/catalog.php?datatype=r and put train.txt and test.txt files into ./data/yahoo/ directory.

Then, run the following commands in the ./src/ directory:

for the MF-IPS models without asymmetric tri-training

for data in yahoo coat
do
  for model in uniform user item both nb nb_true
  do
    python main.py -d $data -m $model
  done
done

for the MF-IPS models with asymmetric tri-training (our proposal)

for data in coat yahoo
do
  for model in uniform-at user-at item-at both-at nb-at nb_true-at
  do
    python main.py -d $data -m $model
  done
done

where (uniform, user, item, both, nb, nb_true) correspond to (uniform propenisty, user propensity, item propensity, user-item propensity, NB (uniform), NB (true)), respectively.

These commands will run simulations with real-world datasets conducted in Section 5. The tuned hyperparameters for all models can be found in ./hyper_params.yaml.
(By adding the -t option to the above code, you can re-run the hyperparameter tuning procedure by Optuna.)

Once the simulations have finished running, the summarized results can be obtained by running the following command in the ./src/ directory:

python summarize_results -d coat yahoo

This creates ./paper_results/.

(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’

Related tags

Overview

Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback

About

Dependencies

Running the code

Owner

yuta-saito

A Pytorch Implementation for Compact Bilinear Pooling.

PyTorch Implement for Path Attention Graph Network

A transformer-based method for Healthcare Image Captioning in Vietnamese

A Real-Time-Strategy game for Deep Learning research

Categorical Depth Distribution Network for Monocular 3D Object Detection

ML models implementation practice

⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances paper.

An open software package to develop BCI based brain and cognitive computing technology for recognizing user's intention using deep learning

Labels4Free: Unsupervised Segmentation using StyleGAN

中文语音识别系列，读者可以借助它快速训练属于自己的中文语音识别模型，或直接使用预训练模型测试效果。

A unified 3D Transformer Pipeline for visual synthesis

Image segmentation with private İstanbul Dataset

BirdCLEF 2021 - Birdcall Identification 4th place solution

[MedIA2021]MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

PyTorch implementation of ARM-Net: Adaptive Relation Modeling Network for Structured Data.

The code from the paper Character Transformations for Non-Autoregressive GEC Tagging

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

A community run, 5-day PyTorch Deep Learning Bootcamp

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

Send text to girlfriend in the morning

(SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’

Related tags

Overview

Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback

About

Dependencies

Running the code

Owner

yuta-saito

A Pytorch Implementation for Compact Bilinear Pooling.

PyTorch Implement for Path Attention Graph Network

A transformer-based method for Healthcare Image Captioning in Vietnamese

A Real-Time-Strategy game for Deep Learning research

Categorical Depth Distribution Network for Monocular 3D Object Detection

ML models implementation practice

⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for *Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances* paper.

An open software package to develop BCI based brain and cognitive computing technology for recognizing user's intention using deep learning

Labels4Free: Unsupervised Segmentation using StyleGAN

中文语音识别系列，读者可以借助它快速训练属于自己的中文语音识别模型，或直接使用预训练模型测试效果。

A unified 3D Transformer Pipeline for visual synthesis

Image segmentation with private İstanbul Dataset

BirdCLEF 2021 - Birdcall Identification 4th place solution

[MedIA2021]MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

PyTorch implementation of ARM-Net: Adaptive Relation Modeling Network for Structured Data.

The code from the paper Character Transformations for Non-Autoregressive GEC Tagging

This repository allows you to anonymize sensitive information in images/videos. The solution is fully compatible with the DL-based training/inference solutions that we already published/will publish for Object Detection and Semantic Segmentation.

A community run, 5-day PyTorch Deep Learning Bootcamp

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

Send text to girlfriend in the morning

⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances paper.