SRA's seminar on Introduction to Computer Vision Fundamentals

Overview

Introduction to Computer Vision

This repository includes basics to :

  1. Python
  2. Numpy: A python library
  3. Git
  4. Computer Vision.

The aim of this repository is to provide:

  • A brief idea of algorithms involved in Computer Vision .
  • Introduction to Version Control System: Git and GitHub.
  • Computer Vision and Image Processing basics, idea of implementation of various algorithms involved using numpy (instead of any dedicated image processing library like OpenCV.)
  • Introduction to a commonly used Image Processing Library: OpenCV

Demonstration

Table Of Contents

Comments
  • Add suboptimal 2D convolution

    Add suboptimal 2D convolution

    This pull request intends to add a suboptimal implementation of generic 2D convolution. This is done for the purpose of giving a rough idea to Fys about how to work with python arrays/loops, etc. Fys will be asked to improve this implementation and complete tasks related to convolution on top of it.

    opened by meshtag 5
  • Morphology notes updated.

    Morphology notes updated.

    I have added images for dilation and erosion, replaced the previous gif of dilation and erosion with new ones and added a few lines explaining morphology.

    opened by Aryaman22102002 2
  • Updated cv-basics/

    Updated cv-basics/

    Optimised code and flow as discussed in:

    1. cv-basics/5_opencv_overview.ipynb
    2. python-numpy-basics/7_classes_and_objects.ipynb

    Added an image :

    1. cv-basics/image/bcci.png
    opened by dhairyashah1 1
  • Port to C++ : Assignments related to PIXELS seminar

    Port to C++ : Assignments related to PIXELS seminar

    Is your feature request related to a problem? Please describe. This feature request is created to keep a record of porting and potential addition of new assignments related to the seminar in C++ as discussed in this thread.

    Describe the solution you'd like

    • Create a separate main folder for containing all assignments. Individual assignments related to specific topics might be grouped together inside the main parent folder of assignments.
    • You might chose to add reference links in individual questions, which may provide additional material on a related topic for that question (this is suggested solely for the purpose of providing more (potentially real world) info related to the topic asked in original question and hence, should not in any way lead to the solution).
    enhancement 
    opened by meshtag 0
  • Add Content: Interpolations.

    Add Content: Interpolations.

    Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of interpolation can also be added.

    Describe the solution you'd like

    • Create a implementations of interpolation from scratch using necessary OpenCV C++ API.
    • Add a Makefile to compile and build executables.
    • Add a .md file to explain the theory of interpolations and instructions to build and run the executables.

    Additional context Reference: Ancient Secrets of computer vision.

    Note: Content is not finalised and open for discussion

    enhancement 
    opened by amanchhaparia 0
  • Add Content: Image Storing Formats.

    Add Content: Image Storing Formats.

    Is your feature request related to a problem? Please describe. As discussed in the thread, It is important to have a familiarity of how images are store.

    Describe the solution you'd like

    • Add the theory of basic image storing formats such as .bmp, .tiff, .jpg, png etc.
    • Implement a .cpp file on how image can be read from the bmp format.
      • Consider only 8 bit grayscale BitMap image (Since they are easy to read and contains only 2D form of data).
      • Use simple posix read() api to read the image bitmap file.
      • Directly storing the values of various attributes of image in struct is suggested.
    • A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.
    • Add a Makefile to compile and build the executable.
    • Add a .md file explaining the theory and instructions to build and run the executables.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 2
  • Add Content: Build Systems

    Add Content: Build Systems

    Is your feature request related to a problem? Please describe. As discussed in the thread, Concepts of Build System should be added.

    Describe the solution you'd like

    • Content should be added for manual creating and linking the object files.
    • Importance of build systems.
    • Add the contents for Makefile.
    • Add contents for Cmake.

    Additional context Can refer from here: Embedded Study Group Week 2.

    Note: Content is not finalised and open for discussion.

    enhancement Build-Systems 
    opened by amanchhaparia 0
  • Add Content: C++ basic concepts for seminar.

    Add Content: C++ basic concepts for seminar.

    Is your feature request related to a problem? Please describe. Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some important C++ concepts.

    Describe the solution you'd like

    • Some advance concepts of C++ like handling 2D arrays/vector, pointer etc.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 1
Releases(v1.0)
  • v1.0(Sep 7, 2022)

    This release contains the 1st version of the PIXELS Seminar conducted in 2021. The content of this release is implemented in Python and uses numpy and OpenCV Python API.

    • This release can be used as a reference to basic Image Processing using Python.
    • Contains a tutorial for necessary numpy methods.
    • Tutorials on commonly used OpenCV functions in Python.
    • Implementation of blob detection a very commonly used algorithm in Python.
    Source code(tar.gz)
    Source code(zip)
Owner
Society of Robotics and Automation
The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation .
Society of Robotics and Automation
OCR, Scene-Text-Understanding, Text Recognition

Scene-Text-Understanding Survey [2015-PAMI] Text Detection and Recognition in Imagery: A Survey paper [2014-Front.Comput.Sci] Scene Text Detection and

Alan Tang 354 Dec 12, 2022
A Joint Video and Image Encoder for End-to-End Retrieval

Frozen️ in Time ❄️ ️️️️ ⏳ A Joint Video and Image Encoder for End-to-End Retrieval (arXiv) Repository to contain the code, models, data for end-to-end

225 Dec 25, 2022
Text to QR-CODE

QR CODE GENERATO USING PYTHON Author : RAFIK BOUDALIA. Installation Use the package manager pip to install foobar. pip install pyqrcode Usage from tki

Rafik Boudalia 2 Oct 13, 2021
ARU-Net - Deep Learning Chinese Word Segment

ARU-Net: A Neural Pixel Labeler for Layout Analysis of Historical Documents Contents Introduction Installation Demo Training Introduction This is the

128 Sep 12, 2022
computer vision, image processing and machine learning on the web browser or node.

Image processing and Machine learning labs   computer vision, image processing and machine learning on the web browser or node note Fast Fourier Trans

ryohei tanaka 487 Nov 11, 2022
Document manipulation detection with python

image manipulation detection task: -- tianchi function image segmentation salie

JiaKui Hu 3 Aug 22, 2022
零样本学习测评基准,中文版

ZeroCLUE 零样本学习测评基准,中文版 零样本学习是AI识别方法之一。 简单来说就是识别从未见过的数据类别,即训练的分类器不仅仅能够识别出训练集中已有的数据类别, 还可以对于来自未见过的类别的数据进行区分。 这是一个很有用的功能,使得计算机能够具有知识迁移的能力,并无需任何训练数据, 很符合现

CLUE benchmark 27 Dec 10, 2022
A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.

LAREX LAREX is a semi-automatic open-source tool for layout analysis on early printed books. It uses a rule based connected components approach which

162 Jan 05, 2023
Code related to "Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity" paper

DataTuner You have just found the DataTuner. This repository provides tools for fine-tuning language models for a task. See LICENSE.txt for license de

81 Jan 01, 2023
Repository for playing the computer vision apps: People analytics on Raspberry Pi.

play-with-torch Repository for playing the computer vision apps: People analytics on Raspberry Pi. Tools Tested Hardware RasberryPi 4 Model B here, RA

eMHa 1 Sep 23, 2021
learn how to use Gesture Control to change the volume of a computer

Volume-Control-using-gesture In this project we are going to learn how to use Gesture Control to change the volume of a computer. We first look into h

Diwas Pandey 49 Sep 22, 2022
Code for the ACL2021 paper "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction"

CSCBLI Code for our ACL Findings 2021 paper, "Combining Static Word Embedding and Contextual Representations for Bilingual Lexicon Induction". Require

Jinpeng Zhang 12 Oct 08, 2022
CellProfiler is a open-source application for biological image analysis

CellProfiler is a free open-source software designed to enable biologists without training in computer vision or programming to quantitatively measure phenotypes from thousands of images automaticall

CellProfiler 732 Dec 23, 2022
Random maze generator and solver

Maze Generator and Solver I wrote a maze generator that works with two commonly known algorithms: Depth First Search and Randomized Prims. Both of the

Daniel Pérez 10 Sep 23, 2022
The project is an official implementation of our paper "3D Human Pose Estimation with Spatial and Temporal Transformers".

3D Human Pose Estimation with Spatial and Temporal Transformers This repo is the official implementation for 3D Human Pose Estimation with Spatial and

Ce Zheng 363 Dec 28, 2022
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike

Quick Info this library tries to solve language detection of very short words and phrases, even shorter than tweets makes use of both statistical and

Peter M. Stahl 532 Dec 28, 2022
Document Image Dewarping

Document image dewarping using text-lines and line Segments Abstract Conventional text-line based document dewarping methods have problems when handli

Taeho Kil 268 Dec 23, 2022
An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.

Optical_Character_Recognition An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports. As an IOT/Compute

Ramsis Hammadi 1 Feb 12, 2022
一款基于Qt与OpenCV的仿真数字示波器

一款基于Qt与OpenCV的仿真数字示波器

郭赟 4 Nov 02, 2022
Generate text images for training deep learning ocr model

New version release:https://github.com/oh-my-ocr/text_renderer Text Renderer Generate text images for training deep learning OCR model (e.g. CRNN). Su

Qing 1.2k Jan 04, 2023