Colab notebook for openai/glide-text2im.

Last update: Oct 19, 2022

Overview

GLIDE text2im on Colab

This repository provides a Colab notebook to produce images conditioned on text prompts with GLIDE [1].

Usage

Run text2im.ipynb

Tip: press <Ctrl+F9> to run everything.

Results

The process is based on the small, filtered-data GLIDE model, with classifier-free guidance.
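For readers unfamiliar with classifier-free guidance, here is a minimal sketch of the combination step described in [1]. It is not the notebook's code. The model makes two noise predictions, one conditioned on the prompt and one unconditioned, and extrapolates from the unconditional prediction toward the conditional one by a guidance scale. The function name `classifier_free_guidance` and the toy values are hypothetical, and plain Python lists stand in for image tensors.

```python
def classifier_free_guidance(eps_cond, eps_uncond, guidance_scale):
    """Combine conditional and unconditional noise predictions.

    Computes eps_uncond + guidance_scale * (eps_cond - eps_uncond)
    element-wise. A scale of 1.0 returns the conditional prediction
    unchanged; larger scales push the sample further toward the prompt.
    """
    return [u + guidance_scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Toy noise predictions (hypothetical values, for illustration only).
eps_cond = [0.5, -1.0, 2.0]      # prediction given the text prompt
eps_uncond = [0.25, -0.5, 1.0]   # prediction given an empty prompt

guided = classifier_free_guidance(eps_cond, eps_uncond, guidance_scale=3.0)
print(guided)  # [1.0, -2.0, 4.0]
```

In an actual sampler, this combination would be applied to the model output at every diffusion step. Higher guidance scales typically trade sample diversity for closer adherence to the prompt.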

Results consist of 64x64 images, and the corresponding 256x256 upsampled versions.

Expected run-time: 2 min 30 s for the one-time set-up, then 1 min for 64x64 sampling and 30 s for 256x256 upsampling.

Several uncurated samples obtained with the same prompt: "a magnificent French rooster singing".

Safety considerations

The small model has 300 million parameters, compared with the unreleased 3.5-billion-parameter model.

As described in Appendix F.1 of the GLIDE paper [1], the training dataset was filtered so that it would not contain:

images of humans and human-like objects,
images of violent objects,
two prevalent hate symbols in America (the swastika and the Confederate flag).

References

[1] Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models. arXiv preprint 2112.10741. 2021.


Owner

Wok
