General Multi-label Image Classification with Transformers

Last update: Dec 21, 2022

Overview

General Multi-label Image Classification with Transformers
Jack Lanchantin, Tianlu Wang, Vicente Ordóñez Román, Yanjun Qi
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
[paper] [poster] [slides]

Training and Running C-Tran

Python 3.7 is required. All major packages and their versions are listed in requirements.txt.
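Before installing the packages from requirements.txt, it can help to confirm the interpreter version. The snippet below is a minimal sketch of such a check; the helper name `is_supported` is hypothetical and not part of this repository.

```python
import sys

REQUIRED = (3, 7)  # taken from the requirement stated above


def is_supported(version_info=sys.version_info):
    """Return True when the major.minor version matches REQUIRED."""
    return tuple(version_info[:2]) == REQUIRED


if not is_supported():
    found = tuple(sys.version_info[:2])
    print("Warning: expected Python %d.%d, found %d.%d" % (REQUIRED + found))
```

The check compares only the major and minor numbers, so any 3.7.x patch release passes.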

C-Tran on COCO80 Dataset

Download COCO data (19G)

wget http://cs.virginia.edu/~jjl5sw/data/vision/coco.tar.gz
mkdir -p data/
tar -xvf coco.tar.gz -C data/
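On systems without `tar`, the extraction step can be approximated with Python's standard `tarfile` module. This is a sketch only: the helper name `extract_archive` is hypothetical, and it assumes the archive has already been downloaded to the working directory. The same helper should work for the VOC archive.

```python
import os
import tarfile


def extract_archive(archive_path, dest_dir="data/"):
    """Extract a .tar.gz archive into dest_dir, creating the directory if needed."""
    os.makedirs(dest_dir, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        archive.extractall(dest_dir)
    # Return the top-level entries so the result can be inspected.
    return sorted(os.listdir(dest_dir))


# Runs only when the archive is already present in the working directory.
if os.path.exists("coco.tar.gz"):
    print(extract_archive("coco.tar.gz"))
```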

Train New Model

python main.py --batch_size 16 --lr 0.00001 --optim 'adam' --layers 3 --dataset 'coco' --use_lmt --dataroot data/
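The `--use_lmt` flag presumably enables the label mask training (LMT) procedure described in the paper, in which a random subset of labels is hidden during training and the model predicts them from the image and the remaining known labels. The sketch below illustrates only the masking idea in plain Python. The state encoding (-1, 0, 1), the helper name `mask_labels`, and the way the number of hidden labels is drawn are assumptions made for illustration, not the repository's implementation.

```python
import random

# Assumed encoding of the three label states: unknown, negative, positive.
UNKNOWN, NEGATIVE, POSITIVE = -1, 0, 1


def mask_labels(labels, num_unknown, rng):
    """Hide num_unknown randomly chosen labels; expose the rest as known states."""
    hidden = set(rng.sample(range(len(labels)), num_unknown))
    return [
        UNKNOWN if i in hidden else (POSITIVE if y else NEGATIVE)
        for i, y in enumerate(labels)
    ]


rng = random.Random(0)
labels = [1, 0, 0, 1, 0]            # ground-truth multi-label vector
n = rng.randint(1, len(labels))     # illustrative choice of how many to hide
states = mask_labels(labels, n, rng)
print(states)
```

During training, the loss would then be computed only on the hidden positions, since the known ones are given to the model as input.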

C-Tran on VOC20 Dataset

Download VOC2007 data (1.7G)

wget http://cs.virginia.edu/~jjl5sw/data/vision/voc.tar.gz
mkdir -p data/
tar -xvf voc.tar.gz -C data/

Train New Model

python main.py --batch_size 16 --lr 0.00001 --optim 'adam' --layers 3 --dataset 'voc' --use_lmt --grad_ac_step 2 --dataroot data/
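Judging by its name, `--grad_ac_step 2` sets the number of gradient accumulation steps. If so, gradients from 2 batches of 16 are combined before each optimizer update, for an effective batch size of 32. The plain-Python sketch below shows why this matches a single larger batch for a mean loss. The toy model and the function names are hypothetical and unrelated to the repository's code.

```python
import math


def grad_mse(w, xs, ys):
    """Gradient of mean((w * x - y) ** 2) with respect to the scalar weight w."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)


def accumulated_grad(w, xs, ys, micro_batch, steps):
    """Average the gradients of `steps` consecutive micro-batches."""
    total = 0.0
    for step in range(steps):
        start = step * micro_batch
        chunk_x = xs[start:start + micro_batch]
        chunk_y = ys[start:start + micro_batch]
        # Dividing by steps keeps the scale equal to one large-batch gradient.
        total += grad_mse(w, chunk_x, chunk_y) / steps
    return total


xs = [float(i) for i in range(32)]
ys = [3.0 * x + 1.0 for x in xs]
full = grad_mse(0.5, xs, ys)                                   # one batch of 32
acc = accumulated_grad(0.5, xs, ys, micro_batch=16, steps=2)   # two batches of 16
print(math.isclose(full, acc, rel_tol=1e-9))
```

The equivalence holds here because the micro-batches are equal in size. Layers that depend on batch statistics, such as batch normalization, would still see batches of 16.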

Citing

@article{lanchantin2020general,
  title={General Multi-label Image Classification with Transformers},
  author={Lanchantin, Jack and Wang, Tianlu and Ordonez, Vicente and Qi, Yanjun},
  journal={arXiv preprint arXiv:2011.14027},
  year={2020}
}


