Mô tả

Computer vision is an interdisciplinary field that deals with how computers can be made to gain high-level understanding from digital images or videos. From the perspective of engineering, it seeks to automate tasks that the human visual system can do. Computer vision is concerned with the automatic extraction, analysis and understanding of useful information from a single image or a sequence of images. It involves the development of a theoretical and algorithmic basis to achieve automatic visual understanding. As a scientific discipline, computer vision is concerned with the theory behind artificial systems that extract information from images. The image data can take many forms, such as video sequences, views from multiple cameras, or multi-dimensional data from a medical scanner. As a technological discipline, computer vision seeks to apply its theories and models for the construction of computer vision systems.


Distinctions

The fields most closely related to computer vision are image processing, image analysis and machine vision. There is a significant overlap in the range of techniques and applications that these cover. This implies that the basic techniques that are used and developed in these fields are similar, something which can be interpreted as there is only one field with different names. On the other hand, it appears to be necessary for research groups, scientific journals, conferences and companies to present or market themselves as belonging specifically to one of these fields and, hence, various characterizations which distinguish each of the fields from the others have been presented.

Computer graphics produces image data from 3D models, computer vision often produces 3D models from image data. There is also a trend towards a combination of the two disciplines, e.g., as explored in augmented reality.

The following characterizations appear relevant but should not be taken as universally accepted:

  • Image processing and image analysis tend to focus on 2D images, how to transform one image to another, e.g., by pixel-wise operations such as contrast enhancement, local operations such as edge extraction or noise removal, or geometrical transformations such as rotating the image. This characterization implies that image processing/analysis neither require assumptions nor produce interpretations about the image content.

  • Computer vision includes 3D analysis from 2D images. This analyzes the 3D scene projected onto one or several images, e.g., how to reconstruct structure or other information about the 3D scene from one or several images. Computer vision often relies on more or less complex assumptions about the scene depicted in an image.

  • Machine vision is the process of applying a range of technologies & methods to provide imaging-based automatic inspection, process control and robot guidance in industrial applications. Machine vision tends to focus on applications, mainly in manufacturing, e.g., vision-based robots and systems for vision-based inspection, measurement, or picking (such as bin picking). This implies that image sensor technologies and control theory often are integrated with the processing of image data to control a robot and that real-time processing is emphasized by means of efficient implementations in hardware and software. It also implies that the external conditions such as lighting can be and are often more controlled in machine vision than they are in general computer vision, which can enable the use of different algorithms.

  • There is also a field called imaging which primarily focuses on the process of producing images, but sometimes also deals with processing and analysis of images. For example, medical imaging includes substantial work on the analysis of image data in medical applications.

  • Finally, pattern recognition is a field which uses various methods to extract information from signals in general, mainly based on statistical approaches and artificial neural networks. A significant part of this field is devoted to applying these methods to image data.

Applications

Applications range from tasks such as industrial machine vision systems which, say, inspect bottles speeding by on a production line, to research into artificial intelligence and computers or robots that can comprehend the world around them. The computer vision and machine vision fields have significant overlap. Computer vision covers the core technology of automated image analysis which is used in many fields. Machine vision usually refers to a process of combining automated image analysis with other methods and technologies to provide automated inspection and robot guidance in industrial applications. In many computer-vision applications, the computers are pre-programmed to solve a particular task, but methods based on learning are now becoming increasingly common. Examples of applications of computer vision include systems for:

  • Automatic inspection, e.g., in manufacturing applications;

  • Assisting humans in identification tasks, e.g., a species identification system

  • Controlling processes, e.g., an industrial robot;

  • Detecting events, e.g., for visual surveillance or people counting, e.g., in the restaurant industry;

  • Interaction, e.g., as the input to a device for computer-human interaction;

  • Modeling objects or environments, e.g., medical image analysis or topographical modeling;

  • Navigation, e.g., by an autonomous vehicle or mobile robot; and

  • Organizing information, e.g., for indexing databases of images and image sequences.


Medicine

One of the most prominent application fields is medical computer vision, or medical image processing, characterized by the extraction of information from image data to diagnose a patient. An example of this is detection of tumors, arteriosclerosis or other malign changes; measurements of organ dimensions, blood flow, etc. are another example. It also supports medical research by providing new information: e.g., about the structure of the brain, or about the quality of medical treatments. Applications of computer vision in the medical area also includes enhancement of images interpreted by humans—ultrasonic images or X-ray images for example—to reduce the influence of noise.


Machine Vision

A second application area in computer vision is in industry, sometimes called machine vision, where information is extracted for the purpose of supporting a manufacturing process. One example is quality control where details or final products are being automatically inspected in order to find defects. Another example is measurement of position and orientation of details to be picked up by a robot arm. Machine vision is also heavily used in agricultural process to remove undesirable food stuff from bulk material, a process called optical sorting.


Military

Military applications are probably one of the largest areas for computer vision. The obvious examples are detection of enemy soldiers or vehicles and missile guidance. More advanced systems for missile guidance send the missile to an area rather than a specific target, and target selection is made when the missile reaches the area based on locally acquired image data. Modern military concepts, such as "battlefield awareness", imply that various sensors, including image sensors, provide a rich set of information about a combat scene which can be used to support strategic decisions. In this case, automatic processing of the data is used to reduce complexity and to fuse information from multiple sensors to increase reliability.


Autonomous vehicles

One of the newer application areas is autonomous vehicles, which include submersibles, land-based vehicles (small robots with wheels, cars or trucks), aerial vehicles, and unmanned aerial vehicles (UAV). The level of autonomy ranges from fully autonomous (unmanned) vehicles to vehicles where computer-vision-based systems support a driver or a pilot in various situations. Fully autonomous vehicles typically use computer vision for navigation, e.g. for knowing where it is, or for producing a map of its environment (SLAM) and for detecting obstacles. It can also be used for detecting certain task specific events, e.g., a UAV looking for forest fires. Examples of supporting systems are obstacle warning systems in cars, and systems for autonomous landing of aircraft. Several car manufacturers have demonstrated systems for autonomous driving of cars, but this technology has still not reached a level where it can be put on the market. There are ample examples of military autonomous vehicles ranging from advanced missiles to UAVs for recon missions or missile guidance. Space exploration is already being made with autonomous vehicles using computer vision, e.g., NASA's Curiosity and CNSA's Yutu-2 rover.


Tactile Feedback

Materials such as rubber and silicon are being used to create sensors that allow for applications such as detecting micro undulations and calibrating robotic hands. Rubber can be used in order to create a mold that can be placed over a finger, inside of this mold would be multiple strain gauges. The finger mold and sensors could then be placed on top of a small sheet of rubber containing an array of rubber pins. A user can then wear the finger mold and trace a surface. A computer can then read the data from the strain gauges and measure if one or more of the pins is being pushed upward. If a pin is being pushed upward then the computer can recognize this as an imperfection in the surface. This sort of technology is useful in order to receive accurate data of the imperfections on a very large surface. Another variation of this finger mold sensor are sensors that contain a camera suspended in silicon. The silicon forms a dome around the outside of the camera and embedded in the silicon are point markers that are equally spaced. These cameras can then be placed on devices such as robotic hands in order to allow the computer to receive highly accurate tactile data.

Other application areas include:

  • Support of visual effects creation for cinema and broadcast, e.g., camera tracking (matchmoving).

  • Surveillance.

  • Driver drowsiness detection

  • Tracking and counting organisms in the biological sciences

[Reference: Wikipedia]

Bạn sẽ học được gì

Use OpenCV to work with image files

Understanding the fundamentals of computer vision & image processing

Use Python and OpenCV to draw shapes on images and videos

Get started with image manipulation with OpenCV, including smoothing, blurring, thresholding, and morphological operations.

OpenCV Image Manipulation Fundamentals using Python. Also includes a Python basics refresher session.

Open and Stream video with Python and OpenCV

Detect Objects, including corner, edge, and grid detection techniques with OpenCV and Python

Create Face Detection Software Using Haar Classifier

Have a toolbox of the most powerful Computer Vision models

Understand the theory behind Computer Vision

Create powerful Computer Vision applications

Yêu cầu

  • Basic Python programming knowledge

Nội dung khoá học

9 sections

Introduction To Computer Vision In Python! Face Detection & Image Processing

22 lectures
Getting Started With Computer Vision In Python! General Course Overview...
04:30
PyCharm, Numpy and OpenCV Setup For Beginners
07:29
How to Read an Image
10:20
How to Write an Image
04:49
Basic Operations In openCV
10:01
How to Read A Video
08:05
How to convert Videos to Gray mode
04:38
Some VideoCapture properties
07:45
How to save Video Captures
10:29
Face Detection Using Haar Cascade Classifier
14:37
Face Detection in a video
11:52
Eye Detection Using Haar Cascade Classifier
08:44
Eye Detection in a Video
10:55
Assignment and Solution on Face and Eye Detection In an Image
11:19
Assignment and Solution on Face and Eye Detection in a video
14:57
How to draw geometric shapes
10:08
How to draw a Rectangle
09:05
How to draw a Circle
08:28
How to add Text on image
07:24
Camera Setting Parameter
07:55
How To Add Text, Date and Time on a Video
13:41
Congratulations on your journey! Never stop learning...
00:10

Working with Numpy In Computer Vision

13 lectures
How to create image using Numpy Zeros Method
06:40
How Handle Mouse Click Event In OpenCV
14:45
How to Detect BGR Channels of an Image Using Mouse Click
13:43
How to draw and connect points
11:35
How to show image colour on a new window
10:55
How to Duplicate and Position an Image
11:41
How to Add Two Images
11:38
Bitwise Operations (AND, OR, XOR, NOT) in OpenCV
18:19
How to create a BGR Selector using a Trackbar
16:23
Create Trackbar position on Image
09:31
Image Detection In OpenCV
11:15
How to Track the HSV Values
23:22
How to Detect VideoCapture Object
05:07

Image Thresholding In Computer Vision

2 lectures
Introduction To Image Thresholding
14:24
Adaptive Thresholding
13:48

Working With Matplotlib In Computer Vision

2 lectures
Matplotlip with OpenCV
11:07
How to open Multiple Images on Matplotlib
10:16

Morphological Transformation In Computer Vision

6 lectures
Introduction To Morphological Transformation
08:52
Dilational Morphological Transformation
14:14
Erosional Morphological Transformation
11:01
Opening Morphological Transformation
10:01
Closing Morphological Transformation
09:22
Types of Morphological Transformation
04:10

Image Filtration Processes In Computer Vision

5 lectures
2D Convolution in OpenCV
11:49
Different Types of Image Filtration In OpenCV
16:11
Laplacian Gradient
10:35
Sobel Gradient
12:47
Canny Edge Detection
08:41

Image Pyramids In Computer Vision

4 lectures
Gaussian Pyramid
16:01
How to create Multiple Image Pyramid
06:55
Laplacian Pyramid
09:31
How to blend Images in OpenCV
13:17

Contours In Computer Vision

3 lectures
How to Draw Contours in OpenCV
10:29
Motion Detection In OpenCV
14:51
Motion Tracker In OpenCV
17:34

Geometric Shapes Analysis In Computer Vision

4 lectures
How to find Geometric Shapes
08:47
How to Add Text on Shapes
15:18
Template Matching in OpenCV
16:26
Congratulations on your journey. Never Stop Learning!
00:15

Đánh giá của học viên

Chưa có đánh giá
Course Rating
5
0%
4
0%
3
0%
2
0%
1
0%

Bình luận khách hàng

Viết Bình Luận

Bạn đánh giá khoá học này thế nào?

image

Đăng ký get khoá học Udemy - Unica - Gitiho giá chỉ 50k!

Get khoá học giá rẻ ngay trước khi bị fix.