PyTorch GUI (demo) for iVOS (interactive VOS) and GIS (Guided iVOS)

Last update: Dec 09, 2022

Overview

GUI for iVOS (interactive VOS) and GIS (Guided iVOS)

GUI implementation of:

CVPR2021 paper "Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps"

ECCV2020 paper "Interactive Video Object Segmentation Using Global and Local Transfer Modules"

GitHub repositories:
CVPR2021 / ECCV2020

Project Pages:
CVPR2021 / ECCV2020

Code in this repository:

Real-world GUI evaluation on DAVIS2017 based on the DAVIS framework
GUI for other videos

Prerequisite

CUDA 11.0
Python 3.6
PyTorch 1.6.0
davisinteractive 1.0.4
numpy, cv2, PyQt5, and other general Python 3 libraries
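
As a quick sanity check before launching the GUI, the prerequisite list can be verified from Python. The following is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and the import names are assumed from the list above (`torch` for PyTorch, `cv2` for OpenCV).

```python
import importlib.util
import sys

# Import names assumed from the prerequisite list above.
REQUIRED_MODULES = ["torch", "davisinteractive", "numpy", "cv2", "PyQt5"]


def missing_modules(names):
    """Return the names that cannot be located by this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    missing = missing_modules(REQUIRED_MODULES)
    print("Missing modules:", ", ".join(missing) if missing else "none")
```

This only checks that each module can be found; it does not compare installed versions against the ones listed.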

Directory Structure

root/apps: QWidget apps.
root/checkpoints: place our checkpoints (.pth files) here.
root/dataset_torch: PyTorch datasets.
root/libs: library of utility files.
root/model_CVPR2021: networks and GUI models for CVPR2021
- detailed explanations on [Github:CVPR2021]
root/model_ECCV2020: networks and GUI models for ECCV2020
- detailed explanations (including building the correlation package) on [Github:ECCV2020]
root/eval_GIS_RS1.py: DAVIS2017 evaluation based on the DAVIS framework.
root/eval_GIS_RS4.py: DAVIS2017 evaluation based on the DAVIS framework.
root/eval_IVOS.py: DAVIS2017 evaluation based on the DAVIS framework.
root/IVOS_demo_customvideo.py: GUI for custom videos.
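
To confirm that a local checkout matches the layout listed above, a small check like the following may help. It is an illustrative sketch only: `missing_entries` is a hypothetical helper, and the expected names are copied from the listing rather than from the repository's code.

```python
from pathlib import Path

# Entries copied from the directory listing above, relative to the root.
EXPECTED_DIRS = ["apps", "checkpoints", "dataset_torch", "libs",
                 "model_CVPR2021", "model_ECCV2020"]
EXPECTED_FILES = ["eval_GIS_RS1.py", "eval_GIS_RS4.py", "eval_IVOS.py",
                  "IVOS_demo_customvideo.py"]


def missing_entries(root):
    """Return the expected directories and files that are absent under root."""
    root = Path(root)
    missing = [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
    missing += [f for f in EXPECTED_FILES if not (root / f).is_file()]
    return missing
```

Calling `missing_entries(".")` from the repository root should return an empty list when everything is in place.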

Instructions

To run

Edit eval_GIS_RS1.py, eval_GIS_RS4.py, eval_IVOS.py, and IVOS_demo_customvideo.py to set the directory of your DAVIS2017 dataset and other configurations.
Download our parameters and place the file at root/checkpoints/GIS-ckpt_standard.pth.
- For CVPR2021 evaluation [Google-Drive]
- For ECCV2020 evaluation [Google-Drive]
Run eval_GIS_RS1.py, eval_GIS_RS4.py, or eval_IVOS.py for real-world GUI evaluation on DAVIS2017, or
run IVOS_demo_customvideo.py to apply our method to other videos.
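
The run steps above can be wrapped in a small launcher that refuses to start when the checkpoint is missing. This is a sketch under stated assumptions: the mode keys (`gis_rs1`, `gis_rs4`, `ivos`, `custom`) and the helper `build_command` are hypothetical, and only the single checkpoint path named above is checked.

```python
import sys
from pathlib import Path

# Hypothetical mode names mapped to the scripts listed above.
SCRIPTS = {
    "gis_rs1": "eval_GIS_RS1.py",
    "gis_rs4": "eval_GIS_RS4.py",
    "ivos": "eval_IVOS.py",
    "custom": "IVOS_demo_customvideo.py",
}
# Checkpoint location taken from the instructions above.
CHECKPOINT = Path("checkpoints") / "GIS-ckpt_standard.pth"


def build_command(mode, root="."):
    """Return the command that runs the script for `mode` from `root`."""
    root = Path(root)
    if mode not in SCRIPTS:
        raise ValueError("unknown mode: " + mode)
    if not (root / CHECKPOINT).is_file():
        raise FileNotFoundError("checkpoint not found: " + str(root / CHECKPOINT))
    return [sys.executable, str(root / SCRIPTS[mode])]


# Example usage (launches the GUI, so it is left commented out here):
# import subprocess
# subprocess.run(build_command("gis_rs1"), check=True)
```

The scripts take their dataset directory from values edited inside each file, so the sketch passes no command-line arguments.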

To use

Left-click to mark the target object and right-click to mark the background.

Select any frame to interact with by dragging the slider under the main image.
Give an interaction.
Run VOS.
Find the worst frame (in GIS, a candidate frame (RS1) or candidate frames (RS4) are suggested) and interact again.
Iterate until you are satisfied with the VOS results.
When you select the satisfied button, your evaluation result (time consumed and frames) is recorded in root/results.
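
The candidate-frame step can be illustrated with a toy selection rule. This is only a sketch and not the authors' method: it assumes each frame has a scalar reliability score where lower means less reliable, that RS1 proposes one frame, and that RS4 proposes four (inferred from the name). The function `candidate_frames` is hypothetical.

```python
def candidate_frames(reliability, k=1):
    """Return the indices of the k frames with the lowest reliability score.

    `reliability` is a list of per-frame scores (assumed: lower is worse).
    The returned indices are sorted in frame order.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    order = sorted(range(len(reliability)), key=lambda i: reliability[i])
    return sorted(order[:k])


# Hypothetical scores for a five-frame clip.
scores = [0.9, 0.2, 0.7, 0.1, 0.8]
rs1 = candidate_frames(scores, k=1)  # single suggestion, as in RS1
rs4 = candidate_frames(scores, k=4)  # several suggestions, as in RS4
```

In the GUI the user still decides which suggested frame to annotate next; the sketch only shows how a ranked shortlist could be formed.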

Reference

Please cite our paper if the implementations are useful in your work:

@Inproceedings{
Yuk2021GIS,
title={Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps},
author={Yuk Heo and Yeong Jun Koh and Chang-Su Kim},
booktitle={CVPR},
year={2021},
url={https://openaccess.thecvf.com/content/CVPR2021/papers/Heo_Guided_Interactive_Video_Object_Segmentation_Using_Reliability-Based_Attention_Maps_CVPR_2021_paper.pdf}
}

@Inproceedings{
Yuk2020IVOS,
title={Interactive Video Object Segmentation Using Global and Local Transfer Modules},
author={Yuk Heo and Yeong Jun Koh and Chang-Su Kim},
booktitle={ECCV},
year={2020},
url={https://openreview.net/forum?id=bo_lWt_aA}
}

Our real-world evaluation demo is based on the GUI of IPNet:

@Inproceedings{
Oh2019IVOS,
title={Fast User-Guided Video Object Segmentation by Interaction-and-Propagation Networks},
author={Seoung Wug Oh and Joon-Young Lee and Seon Joo Kim},
booktitle={CVPR},
year={2019},
url={https://openaccess.thecvf.com/content_ICCV_2019/papers/Oh_Video_Object_Segmentation_Using_Space-Time_Memory_Networks_ICCV_2019_paper.pdf}
}

Owner

Yuk Heo
