TextBPN

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection； Accepted by ICCV2021.

Note: The complete code (including training and testing) will be released in TextBPN V2. Relevant work is advancing, and those who are interested in our work can pay more attention to the updates here.

1.Prerequisites t

python 3.9;
PyTorch 1.7.0;
Numpy >=1.2.0
CUDA 11.1;
GCC >=10.0;
NVIDIA GPU(with 11G or larger GPU memory for inference);

2.Dataset Links

3.Models

Total-Text model (pretrained on ICDAR2017-MLT)
CTW-1500 model (pretrained on ICDAR2017-MLT)
MSRA-TD500 model (pretrained on ICDAR2017-MLT)

4.Running Evaluation

run:

sh eval.sh

The details are as follows:

#!/bin/bash
##################### Total-Text ###################################
# test_size=[640,1024]--cfglib/option
CUDA_LAUNCH_BLOCKING=1 python eval_textBPN.py --exp_name Totaltext --checkepoch 390 --dis_threshold 0.3 --cls_threshold 0.825 --test_size 640 1024 --gpu 1

###################### CTW-1500 ####################################
# test_size=[640,1024]--cfglib/option
# CUDA_LAUNCH_BLOCKING=1 python eval_textBPN.py --exp_name Ctw1500 --checkepoch 560 --dis_threshold 0.3 --cls_threshold 0.8 --test_size 640 1024 --gpu 1

#################### MSRA-TD500 ######################################
# test_size=[640,1024]--cfglib/option
#CUDA_LAUNCH_BLOCKING=1 python eval_textBPN.py --exp_name TD500 --checkepoch 680 --dis_threshold 0.3 --cls_threshold 0.925 --test_size 640 1024 --gpu 1

TextBPN Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Related tags

Overview

TextBPN

1.Prerequisites t

2.Dataset Links

3.Models

4.Running Evaluation

5.Experiments results

Owner

S.X.Zhang

Selfplay In MultiPlayer Environments

PyTorch Code of "Memory In Memory: A Predictive Neural Network for Learning Higher-Order Non-Stationarity from Spatiotemporal Dynamics"

Official Pytorch implementation of the paper "Action-Conditioned 3D Human Motion Synthesis with Transformer VAE", ICCV 2021

Text-to-Music Retrieval using Pre-defined/Data-driven Emotion Embeddings

A deep learning model for style-specific music generation.

FaceAnon - Anonymize people in images and videos using yolov5-crowdhuman

AdaNet is a lightweight TensorFlow-based framework for automatically learning high-quality models with minimal expert intervention

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

Flaxformer: transformer architectures in JAX/Flax

Codes for TIM2021 paper "Anchor-Based Spatio-Temporal Attention 3-D Convolutional Networks for Dynamic 3-D Point Cloud Sequences"

Implementation of the HMAX model of vision in PyTorch

A modular, open and non-proprietary toolkit for core robotic functionalities by harnessing deep learning

Display, filter and search log messages in your terminal

Background Matting: The World is Your Green Screen

PromptDet: Expand Your Detector Vocabulary with Uncurated Images

FinEAS: Financial Embedding Analysis of Sentiment 📈

Official Repsoitory for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]

GenshinMapAutoMarkTools - Tools To add/delete/refresh resources mark in Genshin Impact Map

Pytorch implementation of Each Part Matters: Local Patterns Facilitate Cross-view Geo-localization https://arxiv.org/abs/2008.11646