SemTorch

Last update: Dec 07, 2022

This repository contains definitions of several deep learning architectures that can be applied to image segmentation.

All the architectures are implemented in PyTorch and can be trained easily with FastAI 2.

An example of how to apply it to a custom dataset can be found in the Deep-Tumour-Spheroid repository, which uses brain tumour images.

These architectures are classified as:

Semantic Segmentation: each pixel of an image is linked to a class label.
Instance Segmentation: similar to semantic segmentation, but it goes a step further: for each pixel, it also identifies the object instance the pixel belongs to.
Salient Object Detection (binary classes only): detection of the most noticeable/important object in an image.
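
To make the three categories concrete, here is a minimal sketch of how the target masks differ for a toy 4x4 image with two objects of the same class. The mask values and the `distinct_labels` helper are illustrative assumptions, not part of SemTorch.

```python
# Toy 4x4 image containing two separate objects of the same class.
# 0 means background in every mask below.

# Semantic segmentation: one class id per pixel; both objects share id 1.
semantic_mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Instance segmentation: one instance id per pixel; each object gets its own id.
instance_mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 2, 2],
    [0, 0, 2, 2],
]

# Salient object detection: a binary mask, here assuming the top-left
# object is the most noticeable one.
salient_mask = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]


def distinct_labels(mask):
    """Return the sorted non-background labels present in a mask."""
    return sorted({value for row in mask for value in row if value != 0})


print(distinct_labels(semantic_mask))  # [1]
print(distinct_labels(instance_mask))  # [1, 2]
print(distinct_labels(salient_mask))   # [1]
```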

🚀 Getting Started

To start using this package, install it with pip. For example, on Ubuntu:

pip3 install SemTorch
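
After installing, one way to confirm that the package is visible to your Python interpreter is the standard-library check sketched below. The `is_installed` helper is a hypothetical name introduced here; `semtorch` is the import name used in the usage example of this README.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None


# Prints True only if the pip install above succeeded in this environment.
print("semtorch importable:", is_installed("semtorch"))
```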

👩‍💻 Usage

This package provides an abstract API to access segmentation models of different architectures through a single function, get_segmentation_learner. The function returns a FastAI 2 learner that can be combined with all of fastai's functionality.

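# fastai metrics used below
from fastai.vision.all import Dice, JaccardCoeff
# SemTorch
from semtorch import get_segmentation_learner

# dls (fastai DataLoaders), tumour (a metric) and segmentron_splitter
# (a splitter function) are assumed to be defined or imported earlier.
learn = get_segmentation_learner(dls=dls, number_classes=2, segmentation_type="Semantic Segmentation",
                                 architecture_name="deeplabv3+", backbone_name="resnet50",
                                 metrics=[tumour, Dice(), JaccardCoeff()], wd=1e-2,
                                 splitter=segmentron_splitter).to_fp16()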

You can find a more detailed example in the Deep-Tumour-Spheroid repository, where the package is used to segment brain tumours.
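
Because the returned object is a fastai learner, the usual fastai training calls should apply to it. The sketch below shows one common pattern (train with earlier parameter groups frozen, then unfreeze). The `fine_tune_segmentation` helper, the epoch counts and the learning rates are assumptions for illustration, not values recommended by SemTorch, and freezing only has an effect if the splitter produces more than one parameter group.

```python
def fine_tune_segmentation(learn, frozen_epochs=1, unfrozen_epochs=1, lr=1e-3):
    """Train with earlier layers frozen first, then unfreeze and train everything.

    `learn` is expected to be a fastai Learner, such as the object returned
    by get_segmentation_learner. All numeric defaults are placeholders.
    """
    learn.freeze()  # only the last parameter group stays trainable
    learn.fit_one_cycle(frozen_epochs, lr_max=lr)

    learn.unfreeze()  # make every parameter group trainable
    # Discriminative learning rates: lower for early groups, higher for the last.
    learn.fit_one_cycle(unfrozen_epochs, lr_max=slice(lr / 100, lr / 10))
    return learn
```

With a learner built as in the usage example, this would be called as `fine_tune_segmentation(learn, frozen_epochs=5, unfrozen_epochs=10)`.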

def get_segmentation_learner(dls, number_classes, segmentation_type, architecture_name, backbone_name,
                             loss_func=None, opt_func=Adam, lr=defaults.lr, splitter=trainable_params, 
                             cbs=None, pretrained=True, normalize=True, image_size=None, metrics=None, 
                             path=None, model_dir='models', wd=None, wd_bn_bias=False, train_bn=True,
                             moms=(0.95,0.85,0.95)):

This function returns a learner for the provided architecture and backbone.

Parameters:

dls (DataLoader): the dataloader to use with the learner
number_classes (int): the number of classes in the project. It should be >= 2
segmentation_type (str): only "Semantic Segmentation" is accepted for now
architecture_name (str): name of the architecture. Supported values: unet, deeplabv3+, hrnet, maskrcnn and u2^net
backbone_name (str): name of the backbone
loss_func (): loss function
opt_func (): optimizer function
lr (): learning rate
splitter (): splitter function used for freezing the learner
cbs (List[cb]): list of callbacks
pretrained (bool): whether a pretrained backbone is used
normalize (bool): whether normalization is applied
image_size (int): REQUIRED for MaskRCNN. The desired size of the image
metrics (List[metric]): list of metrics
path (): path parameter
model_dir (str): the directory in which models are saved
wd (float): weight decay
wd_bn_bias (bool):
train_bn (bool):
moms (Tuple(float)): tuple of momentum values

Returns:

learner: value containing the learner object
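
The documented constraints on the parameters can be checked before building a learner. The sketch below is a hypothetical pre-flight check written from the parameter descriptions above; `check_learner_args` is not a SemTorch function, and the library may perform its own validation differently.

```python
# Architecture names listed in the parameter documentation above.
SUPPORTED_ARCHITECTURES = {"unet", "deeplabv3+", "hrnet", "maskrcnn", "u2^net"}


def check_learner_args(number_classes, segmentation_type, architecture_name,
                       image_size=None):
    """Raise ValueError if the arguments contradict the documented constraints."""
    if number_classes < 2:
        raise ValueError("number_classes should be >= 2")
    if segmentation_type != "Semantic Segmentation":
        raise ValueError("only 'Semantic Segmentation' is accepted for now")
    if architecture_name not in SUPPORTED_ARCHITECTURES:
        raise ValueError(f"unsupported architecture: {architecture_name}")
    if architecture_name == "maskrcnn" and image_size is None:
        raise ValueError("image_size is required for maskrcnn")


# Passes silently: matches the usage example in this README.
check_learner_args(2, "Semantic Segmentation", "deeplabv3+")
```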

Supported configs

Architecture	Supported config	Backbones
unet	`Semantic Segmentation`,`binary`; `Semantic Segmentation`,`multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `xresnet18`, `xresnet34`, `xresnet50`, `xresnet101`, `xresnet152`, `squeezenet1_0`, `squeezenet1_1`, `densenet121`, `densenet169`, `densenet201`, `densenet161`, `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, `vgg19_bn`, `alexnet`
deeplabv3+	`Semantic Segmentation`,`binary`; `Semantic Segmentation`,`multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `resnet50c`, `resnet101c`, `resnet152c`, `xception65`, `mobilenet_v2`
hrnet	`Semantic Segmentation`,`binary`; `Semantic Segmentation`,`multiple`	`hrnet_w18_small_model_v1`, `hrnet_w18_small_model_v2`, `hrnet_w18`, `hrnet_w30`, `hrnet_w32`, `hrnet_w48`
maskrcnn	`Semantic Segmentation`,`binary`	`resnet50`
u2^net	`Semantic Segmentation`,`binary`	`small`, `normal`
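
The table above can also be encoded as a lookup, which is handy for checking a combination before calling get_segmentation_learner. This is a sketch under assumptions: `SUPPORTED_BACKBONES`, `BINARY_ONLY` and `is_supported` are hypothetical names, the data is copied from the table rather than read from the library, and "binary" is assumed to mean number_classes == 2.

```python
# Backbones per architecture, copied from the table above.
SUPPORTED_BACKBONES = {
    "unet": {
        "resnet18", "resnet34", "resnet50", "resnet101", "resnet152",
        "xresnet18", "xresnet34", "xresnet50", "xresnet101", "xresnet152",
        "squeezenet1_0", "squeezenet1_1",
        "densenet121", "densenet169", "densenet201", "densenet161",
        "vgg11_bn", "vgg13_bn", "vgg16_bn", "vgg19_bn", "alexnet",
    },
    "deeplabv3+": {
        "resnet18", "resnet34", "resnet50", "resnet101", "resnet152",
        "resnet50c", "resnet101c", "resnet152c", "xception65", "mobilenet_v2",
    },
    "hrnet": {
        "hrnet_w18_small_model_v1", "hrnet_w18_small_model_v2",
        "hrnet_w18", "hrnet_w30", "hrnet_w32", "hrnet_w48",
    },
    "maskrcnn": {"resnet50"},
    "u2^net": {"small", "normal"},
}

# Architectures whose only listed config is the binary one.
BINARY_ONLY = {"maskrcnn", "u2^net"}


def is_supported(architecture_name, backbone_name, number_classes=2):
    """Return True if the combination appears in the supported-configs table."""
    backbones = SUPPORTED_BACKBONES.get(architecture_name)
    if backbones is None or backbone_name not in backbones:
        return False
    # Assumption: "binary" corresponds to exactly two classes.
    if architecture_name in BINARY_ONLY and number_classes != 2:
        return False
    return True


print(is_supported("deeplabv3+", "resnet50"))   # True
print(is_supported("hrnet", "resnet50"))        # False
print(is_supported("maskrcnn", "resnet50", 3))  # False
```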

📩 Contact

📧 [email protected]

💼 LinkedIn: David Lacalle Castillo

Owner

David Lacalle Castillo
