Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Last update: Dec 23, 2022

Related tags

Overview

Surface Form Competition

This is the official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right" We provide scripts for downloading/processing datasets and for reproducing our results on GPT-2 and GPT-3. We do not guarantee exact reproducibility, as library versions and GPUs may cause small differences, but these should be extremely minor.

Dependencies

We use python3 and pytorch 1.7.0, but we do not use cutting-edge features from either and expect to be largely forward and backward compatible. That is not a guarantee or promise.

You can use pip install -r requirements.txt to install the required libraries.

OpenAI Beta

To use GPT-3 you must use OpenAI Beta, which is limited access. You can apply for access here. Once you have access you will need to point the score.py to your API key with the --key argument or put your key in api.key which is the default path.

Downloading Datasets

DATA_README.md has thorough instructions for downloading and processing datasets. We provide automatic downloaders and processers for datasets where possible in data_downloaders/ but see DATA_README for full instructions.

Running Scorers

Once you have a dataset downloaded, running all the zero-shot scoring strategies at once is as simple as:

python score.py 
   
     --model

where is the abbreviation for a given dataset used for table rows in the paper. If there is any confusion, simply look in score.py to see how dataset selection works. is the name of either a GPT-2 or GPT-3 model e.g. xl, davinci, etc. To speed things up you can use a larger --batch if you have enough GPU memory.

Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Related tags

Overview

Surface Form Competition

Dependencies

OpenAI Beta

Downloading Datasets

Running Scorers

Owner

Peter West

Let's Git - Versionsverwaltung & Open Source Hausaufgabe

🤗 Paper Style Guide

Stratified Transformer for 3D Point Cloud Segmentation (CVPR 2022)

Implementation of Bottleneck Transformer in Pytorch

Robust, modular and efficient implementation of advanced Hamiltonian Monte Carlo algorithms

An implementation of the AdaOPS (Adaptive Online Packing-based Search), which is an online POMDP Solver used to solve problems defined with the POMDPs.jl generative interface.

Wordle-solver - Wordle answer generation program in python

Original Pytorch Implementation of FLAME: Facial Landmark Heatmap Activated Multimodal Gaze Estimation

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily

a basic code repository for basic task in CV(classification,detection,segmentation)

DeepCAD: A Deep Generative Network for Computer-Aided Design Models

Official repo of the paper "Surface Form Competition: Why the Highest Probability Answer Isn't Always Right"

Official Implementation for the "An Empirical Investigation of 3D Anomaly Detection and Segmentation" paper.

Automatic Video Captioning Evaluation Metric --- EMScore

CVPR2021 Workshop - HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization.

GANmouflage: 3D Object Nondetection with Texture Fields

The repo of Feedback Networks, CVPR17

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

A lightweight library designed to accelerate the process of training PyTorch models by providing a minimal