RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

Last update: Sep 20, 2022

Related tags

Text Data & NLP ru-clip-tiny

Overview

RuCLIPtiny

Zero-shot image classification model for Russian language

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts). Our model is based on ConvNeXt-tiny and DistilRuBert-tiny, and is supported by extensive research zero-shot transfer, computer vision, natural language processing, and multimodal learning.

Result evaluation

Our model achieved 46.62% top1 and 73.18% top5 zero-shot accuracy on CIFAR100

Examples

Evaluate & Simple usage

Finetuning

ONNX conversion and speed testing

Model weights

Usage

Install rucliptiny module and requirements first. Use this trick

!gdown -O ru-clip-tiny.pkl https://drive.google.com/uc?id=1-3g3J90pZmHo9jbBzsEmr7ei5zm3VXOL
!pip install git+https://github.com/cene555/ru-clip-tiny.git

Example in 3 steps

Download CLIP image from repo

!wget -c -O CLIP.png https://github.com/openai/CLIP/blob/main/CLIP.png?raw=true

Import libraries

from rucliptiny.predictor import Predictor
from rucliptiny import RuCLIPtiny
import torch

torch.manual_seed(1)
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

Load model

model = RuCLIPtiny()
model.load_state_dict(torch.load('ru-clip-tiny.pkl'))
model = model.to(device).eval()

Use predictor to get probabilities

predictor = Predictor()

classes = ['диаграмма', 'собака', 'кошка']
text_probs = predictor(model=model, images_path=["CLIP.png"],
                       classes=classes, get_probs=True,
                       max_len=77, device=device)

Cosine similarity Visualization Example

Speed Testing

NVIDIA Tesla K80 (Google Colab session)

TORCH	batch	encode_image	encode_text	total
RuCLIPtiny	2	0.011	0.004	0.015
RuCLIPtiny	8	0.011	0.004	0.015
RuCLIPtiny	16	0.012	0.005	0.017
RuCLIPtiny	32	0.014	0.005	0.019
RuCLIPtiny	64	0.013	0.006	0.019

We would like to express my gratitude to Sber AI for the grants provided, for which research was carried out, as part of the Artificial Intelligence International Junior Contest (AIIJC)

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

Related tags

Overview

RuCLIPtiny

Result evaluation

Examples

Model weights

Usage

Example in 3 steps

Cosine similarity Visualization Example

Speed Testing

Owner

Shahmatov Arseniy

Training open neural machine translation models

✨Fast Coreference Resolution in spaCy with Neural Networks

Blue Brain text mining toolbox for semantic search and structured information extraction

a chinese segment base on crf

PRAnCER is a web platform that enables the rapid annotation of medical terms within clinical notes.

Contains the code and data for our #ICSE2022 paper titled as "CodeFill: Multi-token Code Completion by Jointly Learning from Structure and Naming Sequences"

A Facebook Messenger Chatbot using NLP

端到端的长本文摘要模型（法研杯2020司法摘要赛道）

FewCLUE: 为中文NLP定制的小样本学习测评基准

Stack based programming language that compiles to x86_64 assembly or can alternatively be interpreted in Python

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Just Another Telegram Ai Chat Bot Written In Python With Pyrogram.

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

Rhythm-Finder is a unsupervised ML driven python powered web-application that can find the songs that suits you.

🧪 Cutting-edge experimental spaCy components and features

Crie tokens de autenticação íntegros e seguros com UToken.

CredData is a set of files including credentials in open source projects

Train and use generative text models in a few lines of code.