A Joint Video and Image Encoder for End-to-End Retrieval

Last update: Dec 25, 2022

Overview

Frozen in Time ❄️⏳

A Joint Video and Image Encoder for End-to-End Retrieval

(arXiv)

This repository contains the code, models, and data for end-to-end retrieval.

Work in progress

Code is provided to train an end-to-end model on MSRVTT.

Set the path locations in configs/msrvtt_4f_i21k.json, then create the conda environment and start training:

conda env create -f requirements/frozen.yml

python train.py --config configs/msrvtt_4f_i21k.json
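Before launching training, it can help to confirm that the paths set in the config actually exist on disk. The following is a minimal sketch, not part of the repository: the helper names (iter_path_entries, missing_paths, load_config) are hypothetical, and because the config's key names are not documented here, the script simply guesses that any string value whose key contains "dir" or "path", or whose value contains "/", is a path.

```python
import json
from pathlib import Path


def iter_path_entries(node, prefix=""):
    """Yield (dotted_key, value) for string values that look like paths.

    Heuristic (an assumption, not the repository's schema): the key
    contains "dir" or "path", or the value contains a "/".
    """
    if isinstance(node, dict):
        for key, value in node.items():
            dotted = f"{prefix}.{key}" if prefix else key
            if isinstance(value, (dict, list)):
                yield from iter_path_entries(value, dotted)
            elif isinstance(value, str) and (
                "dir" in key.lower() or "path" in key.lower() or "/" in value
            ):
                yield dotted, value
    elif isinstance(node, list):
        for index, item in enumerate(node):
            yield from iter_path_entries(item, f"{prefix}[{index}]")


def missing_paths(config):
    """Return the path-like entries whose target does not exist on disk."""
    return [(key, value) for key, value in iter_path_entries(config)
            if not Path(value).exists()]


def load_config(config_file):
    """Read a JSON config file into a dictionary."""
    with open(config_file) as handle:
        return json.load(handle)
```

For example, missing_paths(load_config("configs/msrvtt_4f_i21k.json")) returns a list of (key, value) pairs that still need to be set. The heuristic can flag non-path strings that happen to contain a slash, so treat the result as a checklist rather than a hard error.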

TODO:

[x] conda env

[ ] msrvtt data zip

[ ] pretrained models

[ ] webvid data

[ ] Other benchmarks

Owner

PhD Student, VGG, Oxford

