Inferring Lexicographically-Ordered Rewards from Preferences

Code author: Alihan Hüyük ([email protected])

This repository contains the source code necessary to replicate the main experimental results in the AAAI 2022 paper "Inferring Lexicographically-Ordered Reward from Preferences." Our proposed method, LORI, is implemented in files src/main-lori.py and src/main-lori-liver.py for the problem settings considered in the paper: cancer treatment and organ transplantation respectively.

Usage

First, install the required python packages by running:

    python -m pip install -r requirements.txt

Then, the experiments in the paper can be replicated by running:

    ./src/run.sh        # generates the results in Tables 2 and 3
    ./src/run-liver.sh  # generates the reward functions in (10) and (11)

Note that, in order to run the experiments for the transplantation setting, you need to get access to the Organ Procurement and Transplantation Network (OPTN) dataset for liver transplantations as of December 4, 2020.

Citing

If you use this software please cite as follows:

@inproceedings{huyuk2022inferring,
  author={Alihan H\"uy\"uk and William R. Zame and Mihaela van der Schaar},
  title={Inferring lexicographically-ordered rewards from preferences},
  booktitle={Proceedings of the 36th AAAI Conference on Artificial Intelligence},
  year={2022}
}

Inferring Lexicographically-Ordered Rewards from Preferences

Related tags

Overview

Inferring Lexicographically-Ordered Rewards from Preferences

Usage

Citing

Owner

Alihan Hüyük

Mask-invariant Face Recognition through Template-level Knowledge Distillation

An implementation of the research paper "Retina Blood Vessel Segmentation Using A U-Net Based Convolutional Neural Network"

Author Disambiguation using Knowledge Graph Embeddings with Literals

Model Serving Made Easy

Real-time LIDAR-based Urban Road and Sidewalk detection for Autonomous Vehicles 🚗

FinEAS: Financial Embedding Analysis of Sentiment 📈

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

Stochastic Extragradient: General Analysis and Improved Rates

TransGAN: Two Transformers Can Make One Strong GAN

Pytorch implementation of COIN, a framework for compression with implicit neural representations 🌸

DGCNN - Dynamic Graph CNN for Learning on Point Clouds

Tensorflow port of a full NetVLAD network

Multi-atlas segmentation (MAS) is a promising framework for medical image segmentation

Sleep staging from ECG, assisted with EEG

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

Deep Learning agent of Starcraft2, similar to AlphaStar of DeepMind except size of network.

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

Open-source python package for the extraction of Radiomics features from 2D and 3D images and binary masks.

Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification