A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

Last update: Nov 09, 2022

Related tags

Deep Learning pytorch-SimSiam

Overview

Exploring simple siamese representation learning

This is a PyTorch re-implementation of the SimSiam paper on ImageNet dataset. The results match that reported in the paper. The implementation is based on the codes of MOCO.

Unsupervised pre-training

To run unsupervised pre-training on ImageNet,

sh train_simsiam.sh

This is to do the unsupervised pre-training for 100 epochs. Please modify the path to your ImageNet data folder.

Note 1: I try to follow the setting in the paper, which is bs=512 and lr=0.1 on 8-GPU, but somehow I can not fit it. So I used the max batch_size that I can fit (432) while kept the lr unchaged (0.1).

Note 2: In pre-training, I didn't fix the lr of prediction MLP. According to the paper (Table. 1), fixing the lr of prediction MLP can give slightly improvements (67.7% -> 68.1%). You can try it if interested.

Linear evaluation

To run linear evaluation,

sh train_lincls.sh

The linear evaluation is done using NVIDIA LARC optimizer by setting trus_coefficient=0.001 and clip=False. The batch size is 4096.

Note: I first followed the setting in the paper, which is Lr=0.32 (0.02*4096/256). But I can only got a result of 66.0%. Then I increased the learning rate to Lr=1.6 (0.1*4096.256) and achieved the result of 67.8%. The results and models are given below.

SimSiam	pretrained batchsize	lincls Lr	Top-1 Acc
Reported	512	0.32	67.7%
Reproduced	432 (Model)	1.6	67.8% (Model)
Reproduced	432	0.32	66.0%

Acknowledgement

Thank Xinlei for his help on some implementation details.

A PyTorch re-implementation of the paper 'Exploring Simple Siamese Representation Learning'. Reproduced the 67.8% Top1 Acc on ImageNet.

Related tags

Overview

Exploring simple siamese representation learning

Unsupervised pre-training

Linear evaluation

Acknowledgement

Owner

Taojiannan Yang

[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation

A PyTorch implementation of Radio Transformer Networks from the paper "An Introduction to Deep Learning for the Physical Layer".

Context Axial Reverse Attention Network for Small Medical Objects Segmentation

git《Commonsense Knowledge Base Completion with Structural and Semantic Context》(AAAI 2020) GitHub: [fig1]

Implementation of the federated dual coordinate descent (FedDCD) method.

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

zeus is a Python implementation of the Ensemble Slice Sampling method.

An unreferenced image captioning metric (ACL-21)

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

TensorFlow implementation of "Learning from Simulated and Unsupervised Images through Adversarial Training"

Graph Neural Networks with Keras and Tensorflow 2.

TabNet for fastai

Flybirds - BDD-driven natural language automated testing framework, present by Trip Flight

This repository is the code of the paper Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies

A map update dataset and benchmark

Awesome Transformers in Medical Imaging

[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

NAS-FCOS: Fast Neural Architecture Search for Object Detection (CVPR 2020)