Paper: De-rendering Stylized Texts

Last update: Dec 18, 2022

Related tags

Overview

Paper: De-rendering Stylized Texts

Wataru Shimoda¹, Daichi Haraguchi², Seiichi Uchida², Kota Yamaguchi¹
¹CyberAgent.Inc, ² Kyushu University
Accepted to ICCV2021. [Publication] [Arxiv] [project-page]

Introduction

This repository contains the codes for "De-rendering stylized texts".

Concept

We propose to parse rendering parameters of stylized texts utilizing a neural net.

Demo

The proposed model parses rendering parameters based on famous 2d graphic engine[Skia.org|python implementation], which has compatibility with CSS in the Web. We can export the estimated rendering parameters and edit texts by an off-the-shelf rendering engine.

Installation

Requirements

Python >= 3.7
Pytorch >= 1.8.1
torchvision >= 0.9.1

pip install -r requiements.txt

Font data

The proposed model is trained with google fonts.
Download google fonts and locate in data/fonts/ as gfonts.

cd data/fonts
git clone https://github.com/google/fonts.git gfonts

Pre-rendered alpha maps

The proposed model parses rendering parameters and refines them through the differentiable rendering model, which uses pre-rendered alpha maps.
Generate pre-rendered alpha maps.

python -m util_lib.gen_pams

Pre-rendered alpha maps would be generated in data/fonts/prerendered_alpha.

Usage

Test

Download the pre-trained weight from this link (weight).
Locate the weight file in weights/font100_unified.pth.

Example usage.

python test.py --imgfile=example/sample.jpg

Note

imgfile option: path of an input image
results would be generated in res/

Data generation

in progress

Train

in progress

Todo

Testing codes
Codes for the text image generator
Training codes
Add notebooks for the guide

Reference

@InProceedings{Shimoda_2021_ICCV,
    author    = {Shimoda, Wataru and Haraguchi, Daichi and Uchida, Seiichi and Yamaguchi, Kota},
    title     = {De-Rendering Stylized Texts},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {1076-1085}
}

Contact

This repository is maintained by Wataru shimoda(wataru_shimoda[at]cyberagent.co.jp).

Paper: De-rendering Stylized Texts

Related tags

Overview

Paper: De-rendering Stylized Texts

Introduction

Concept

Demo

Installation

Requirements

Font data

Pre-rendered alpha maps

Usage

Test

Data generation

Train

Todo

Reference

Contact

Owner

CyberAgent AI Lab

Source code for Transformer-based Multi-task Learning for Disaster Tweet Categorisation (UCD's participation in TREC-IS 2020A, 2020B and 2021A).

This code is 3d-CNN model that can predict environmental value

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

small collection of functions for neural networks

Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency[ECCV 2020]

Minimal diffusion models - Minimal code and simple experiments to play with Denoising Diffusion Probabilistic Models (DDPMs)

Speech recognition tool to convert audio to text transcripts, for Linux and Raspberry Pi.

My personal code and solution to the Synacor Challenge from 2012 OSCON.

Simple Dynamic Batching Inference

[CVPR'21] MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation

BabelCalib: A Universal Approach to Calibrating Central Cameras. In ICCV (2021)

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN.

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

Canonical Appearance Transformations

Neural Koopman Lyapunov Control

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Source code for paper "Document-Level Relation Extraction with Adaptive Thresholding and Localized Context Pooling", AAAI 2021

Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

pip install python-office

Deep Markov Factor Analysis (NeurIPS2021)