KNIGHT

The official repository holding the data for the ISBI 2022 KNIGHT Challenge

About

The KNIGHT Challenge asks teams to develop models that classify patients with kidney tumors by their "risk score", as defined by the recently released American Urological Association (AUA) Guidelines for Renal Masses. KNIGHT makes use of the imaging and clinical data from the MICCAI KiTS21 Challenge.

Accessing the Data

A JSON file with each patient's clinical data lives in this repository at knight/data/knight.json. The imaging associated with each of the 300 patients can be downloaded with the knight/scripts/get_imaging.py script (requires Python 3).
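As a minimal sketch of reading the clinical data, the snippet below loads knight/data/knight.json with the standard library. The helper name load_clinical_data is hypothetical, and the sketch assumes the file is a JSON array with one object per patient; check the file itself to confirm its layout.

```python
import json
from pathlib import Path


def load_clinical_data(json_path="knight/data/knight.json"):
    """Load the clinical data file and return the parsed JSON.

    Assumption: the top-level JSON value is a list of per-patient
    objects. This helper is illustrative and not part of the repository.
    """
    with Path(json_path).open("r", encoding="utf-8") as f:
        return json.load(f)


if __name__ == "__main__":
    # Run from the repository root so the relative path resolves.
    cases = load_clinical_data()
    print(f"Loaded {len(cases)} clinical records")
```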

If you wish to make use of the segmentations used for the KiTS21 challenge, you can access those by cloning the official KiTS21 repository.

The prediction target for the KNIGHT challenge is the "aua_risk_group" attribute in the knight.json file. The primary task is binary classification: the two higher-risk groups ("high_risk" and "very_high_risk") versus the three lower-risk groups ("benign", "low_risk", and "intermediate_risk"). A secondary task is five-way classification across the individual groups.
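The two tasks can be sketched as label mappings over the "aua_risk_group" value. The function names, the choice of 1 for the higher-risk class, and the ordering of the five classes are assumptions made for illustration; the challenge's official label encoding may differ.

```python
# Group names as listed in the task description.
HIGHER_RISK_GROUPS = {"high_risk", "very_high_risk"}
LOWER_RISK_GROUPS = {"benign", "low_risk", "intermediate_risk"}

# Assumed ordering (lowest to highest risk) for the five-way task.
FIVE_WAY_ORDER = [
    "benign",
    "low_risk",
    "intermediate_risk",
    "high_risk",
    "very_high_risk",
]


def binary_label(aua_risk_group):
    """Return 1 for the higher-risk groups and 0 for the lower-risk groups."""
    if aua_risk_group in HIGHER_RISK_GROUPS:
        return 1
    if aua_risk_group in LOWER_RISK_GROUPS:
        return 0
    raise ValueError(f"Unknown aua_risk_group: {aua_risk_group!r}")


def five_way_label(aua_risk_group):
    """Return the index of the group in FIVE_WAY_ORDER."""
    if aua_risk_group not in FIVE_WAY_ORDER:
        raise ValueError(f"Unknown aua_risk_group: {aua_risk_group!r}")
    return FIVE_WAY_ORDER.index(aua_risk_group)
```

Raising on an unknown value, rather than defaulting to a class, keeps a misspelled or missing group from silently becoming a training label.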

Participants are encouraged to make use of the clinical data as well as the imaging in order to make their predictions. The following clinical attributes will be made available at inference time for cases in the test set:

"age_at_nephrectomy"
"gender"
"body_mass_index"
"comorbidities"
"smoking_history"
"age_when_quit_smoking"
"pack_years"
"chewing_tobacco_use"
"alcohol_use"
"last_preop_egfr"
"radiographic_size"
"voxel_spacing"

All other attributes will NOT be made available and participants should not train models that take as inputs any clinical attributes not listed above.
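One way to honor this restriction during training is to drop every key that is not on the list before building model inputs. The sketch below assumes each patient record is a flat dictionary keyed by attribute name; the helper name restrict_to_allowed and the example record are hypothetical.

```python
# Clinical attributes that will be available at inference time.
ALLOWED_CLINICAL_ATTRIBUTES = (
    "age_at_nephrectomy",
    "gender",
    "body_mass_index",
    "comorbidities",
    "smoking_history",
    "age_when_quit_smoking",
    "pack_years",
    "chewing_tobacco_use",
    "alcohol_use",
    "last_preop_egfr",
    "radiographic_size",
    "voxel_spacing",
)


def restrict_to_allowed(case):
    """Return a copy of a patient record with only the allowed attributes.

    Keys that are absent from the record are skipped rather than filled in.
    """
    return {
        key: case[key]
        for key in ALLOWED_CLINICAL_ATTRIBUTES
        if key in case
    }


if __name__ == "__main__":
    # Made-up record for illustration only; not real challenge data.
    example = {"gender": "male", "body_mass_index": 27.5, "aua_risk_group": "low_risk"}
    print(restrict_to_allowed(example))
```

Note that the target attribute "aua_risk_group" is not on the list, so it is removed from the inputs as well and should be read separately when building labels.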

Owner

Nicholas Heller