Syllabus - AI for Computer Vision (AD-701)
Artificial Intelligence & Data Science
AI for Computer Vision (AD-701)
VII-Semester
Unit I
Introduction to Image Formation and Processing
Computer vision, Geometric primitives and transformations, Photometric image formation, The digital camera, Point operators, Linear filtering, More neighborhood operators, Fourier transforms, Pyramids and wavelets, Geometric transformations, Global optimization.
Unit II
Feature Detection, Matching and Segmentation
Points and patches, Edges, Lines, Segmentation, Active contours, Split and merge, Mean shift and mode finding, Normalized cuts, Graph cuts and energy-based methods.
Unit III
Feature-based Alignment & Motion Estimation
2D and 3D feature-based alignment, Pose estimation, Geometric intrinsic calibration, Triangulation, Two-frame structure from motion, Factorization, Bundle adjustment, Constrained structure and motion, Translational alignment, Parametric motion, Spline-based motion, Optical flow, Layered motion.
Unit IV
3D Reconstruction
Shape from X, Active range finding, Surface representations, Point-based representations, Volumetric representations, Model-based reconstruction, Recovering texture maps and albedos.
Unit V
Image-based Rendering and Recognition
View interpolation, Layered depth images, Light fields and Lumigraphs, Environment mattes, Video-based rendering, Object detection, Face recognition, Instance recognition, Category recognition, Context and scene understanding, Recognition databases and test sets.
Course Objective
To understand the fundamental concepts of image formation and processing. To learn feature detection, matching and segmentation. To become familiar with feature-based alignment and motion estimation. To develop skills in 3D reconstruction. To understand image-based rendering and recognition.
Course Outcome
After the completion of this course, the students will be able to:
1: Understand the basic knowledge, theories and methods of image processing and computer vision.
2: Implement basic and some advanced image processing techniques in OpenCV.
3: Apply 2D feature-based image alignment, segmentation and motion estimation.
4: Apply 3D image reconstruction techniques.
5: Design and develop innovative image processing and computer vision applications.
Practicals
- OpenCV Installation and working with Python
- Basic Image Processing: Loading images, Cropping, Resizing, Thresholding, Contour analysis, Blob detection
- Image Annotation – Drawing lines, text, circles, rectangles and ellipses on images
- Image Enhancement: Understanding color spaces, Color space conversion, Histogram equalization, Convolution, Image smoothing, Gradients, Edge detection
- Image Features and Image Alignment – Image transforms (Fourier, Hough), Extracting ORB image features, Feature matching and cloning
- Feature-matching-based image alignment
- Image segmentation using GraphCut / GrabCut
- Camera calibration with a circular grid
- Pose Estimation
- 3D Reconstruction – Creating a depth map from stereo images
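As an illustration of the kind of exercise the practicals involve, the histogram equalization item can be sketched in plain Python. This is only a minimal sketch under stated assumptions: the function name `equalize_histogram` and the list-of-rows image format are hypothetical choices made for this example, and the remapping is a standard CDF-based one. In the lab itself, OpenCV's `cv2.equalizeHist` would normally be used on an 8-bit grayscale image instead.

```python
def equalize_histogram(image, levels=256):
    """Equalize a grayscale image given as a list of rows of ints in [0, levels).

    Hypothetical helper for illustration; not part of OpenCV.
    """
    pixels = [p for row in image for p in row]

    # Count how many pixels fall on each intensity level.
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1

    # Cumulative distribution: number of pixels with intensity <= level.
    cdf = []
    running = 0
    for count in hist:
        running += count
        cdf.append(running)

    total = len(pixels)
    cdf_min = next(c for c in cdf if c > 0)  # CDF at the darkest level present
    if total == cdf_min:
        # Constant image: there is no contrast to stretch, so return a copy.
        return [row[:] for row in image]

    # Map the CDF range [cdf_min, total] linearly onto [0, levels - 1].
    scale = (levels - 1) / (total - cdf_min)
    lut = [max(0, round((c - cdf_min) * scale)) for c in cdf]
    return [[lut[p] for p in row] for row in image]


if __name__ == "__main__":
    low_contrast = [[10, 20], [30, 40]]
    print(equalize_histogram(low_contrast))  # [[0, 85], [170, 255]]
```

The four closely spaced intensities are spread over the full 0 to 255 range, which is the contrast-stretching effect the exercise is meant to demonstrate.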
Reference Books
- Richard Szeliski, “Computer Vision: Algorithms and Applications”, Second Edition, Springer, Texts in Computer Science, 2022.
- D. A. Forsyth and J. Ponce, “Computer Vision: A Modern Approach”, Second Edition, Pearson Education, 2015.
- Richard Hartley and Andrew Zisserman, “Multiple View Geometry in Computer Vision”, Second Edition, Cambridge University Press, March 2004.
- Christopher M. Bishop, “Pattern Recognition and Machine Learning”, Springer, 2006.
- E. R. Davies, “Computer and Machine Vision”, Fourth Edition, Academic Press, 2012.