Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

Last update: Jan 02, 2023

Overview

Augmented Reality 101

The development of areas such as computer vision, image processing, and computer graphics, allow the introduction of technologies such as Augmented Reality.

Azuma defines Augmented Reality as "a technology that adds computer-generated virtual content to real-world views through devices".

Introduction

The purpose of these map is to give you an idea about Augmented Reality and to guide you through the main features that surround this technology.

Read complete post in AR 101 — Augmented Reality.

Definition and basic features

Read complete post in AR 101 — A brief summary (Part 1).

Horizontal and vertical trends

Read complete post in AR 101 — Augmented Reality Trends (Part 2).

Basic process and main components

Read complete post in AR 101 — Components of the Augmented Reality System (Part 3).

Augmented Reality Application

In this repository, I want to present a basic implementation that projects on the screen a 3D model aligned (orientation and translation) to a predefined flat surface.

However, currently the industry is investing in different frameworks as ARCore, ARKit, and Vuforia, among others, which provide the community more accessible technologies with more realistic results and experiences.

The repository has two parts:

Image is the implementation, step by step, with some basic definitions, to add a 3D model to a flat image.
Video is the implementation to have the experience in real-time through a camera.

Instalation

git clone [email protected]:mafda/augmented_reality_101.git

Environment

The tools we will use are Python 3 and OpenCV 4.2.

Create virtual environment:

python -m venv /path/to/new/virtual/environment

Activate environment:

source /path/to/new/virtual/environment/bin/activate

Install requirements.txt file:

pip install -r requirements.txt

For Image

python -m jupyter notebook

For Video

python ar_python3_opencv4.py

Model 3D

Chair from Clara.io

Results

Repository References

JE Solem, Programming Computer Vision with Python: Tools and algorithms for analyzing images. O'Reilly Media, Inc.
Programming Computer Vision with Python
Open source Python module for computer vision
Augmented reality with Python and OpenCV
augmented-reality
OBJFileLoader

Map References

Azuma, R. T. (1997). A survey of augmented reality. Presence: Teleoper. Virtual Environ., 6(4):355–385. Paper
Chatzopoulos, D., Bermejo, C., Huang, Z., and Hui, P. (2017). Mobile augmented reality survey: From where we are to where we go. IEEE Access, 5:6917–6950. Paper
Craig, A. (2013). Understanding Augmented Reality: Concepts and Applications. Elsevier Science, 1 edition. Book
Fleck, P., Arth, C., Pirchheim, C., and Schmalstieg, D. (2015). Tracking and mapping with a swarm of heterogeneous clients. In 2015 IEEE International Symposium on Mixed and Augmented Reality, pages 136–139. Paper
Huang, Z., Hui, P., Peylo, C., and Chatzopoulos, D. (2013). Mobile augmented reality survey: a bottom-up approach. CoRR. Paper
Lehiani, Y., Maidi, M., Preda, M., and Ghorbel, F. (2015). Object identification and tracking for steady registration in mobile augmented reality. In 2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA), pages 54–59. Paper
Ling, H. (2017). Augmented reality in reality. IEEE MultiMedia, 24(3):10–15. Paper
Papagiannis, H. (2017). Augmented Human: How Technology Is Shaping the New Reality. O’Reilly Media. Book
Peddie, J. (2017). Augmented Reality: Where We Will All Live. Springer International Publishing. Book
Roberto, R., Lima, J. P., and Teichrieb, V. (2016). Tracking for mobile devices: A systematic mapping study. Computers & Graphics, 56:20 – 30. Paper

made with 💙 by mafda

Introduction to Augmented Reality (AR) with Python 3 and OpenCV 4.2.

Related tags

Overview

Augmented Reality 101

Introduction

Definition and basic features

Horizontal and vertical trends

Basic process and main components

Augmented Reality Application

Instalation

Environment

Model 3D

Results

Repository References

Map References

Owner

fernanda rodríguez

Code for AAAI 2021 paper: Sequential End-to-end Network for Efficient Person Search

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.

The code for “Oriented RepPoints for Aerail Object Detection”

Generating .npy dataset and labels out of given image, containing numbers from 0 to 9, using opencv

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

The papers published in top-tier AI conferences in recent years.

QuanTaichi: A Compiler for Quantized Simulations (SIGGRAPH 2021)

A python program to block out your face

Localization of thoracic abnormalities model based on VinBigData (top 1%)

A small C++ implementation of LSTM networks, focused on OCR.

Generate text images for training deep learning ocr model

Image augmentation library in Python for machine learning.

Code for the paper "Controllable Video Captioning with an Exemplar Sentence"

Python bindings for JIGSAW: a Delaunay-based unstructured mesh generator.

YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)（Rotation Detection）（Rotated BBox）

Recognizing the text contents from a scanned visiting card

([email protected]) Boosting Co-teaching with Compression Regularization for Label Noise

Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.

a Deep Learning Framework for Text

An Implementation of the alogrithm in paper IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection