Syllabus - AI for Computer Vision (AD-701)


Artificial Intelligence & Data Science

AI for Computer Vision (AD-701)

VII-Semester

Unit I

Introduction to Image Formation and Processing

Computer Vision, Geometric primitives and transformations, Photometric image formation, digital camera, Point operators, Linear filtering, More neighborhood operators, Fourier transforms, Pyramids and wavelets, Geometric transformations, Global optimization.

Unit II

Feature Detection, Matching and Segmentation

Points and patches, Edges, Lines, Segmentation, Active contours, Split and merge, Mean shift and mode finding, Normalized cuts, Graph cuts and energy-based methods.

Unit III

Feature-based Alignment & Motion Estimation

feature-based alignment, Pose estimation, Geometric 2D and 3D intrinsic calibration, Triangulation, Two-frame structure from motion, Factorization, Bundle adjustment, Constrained structure and motion, Translational alignment, Parametric motion, Spline-based motion, Optical flow, Layered motion.

Unit IV

3D Reconstruction

Shape from X, Active range finding, Surface representations, Point-based representations Volumetric representations, Model-based reconstruction, Recovering texture maps and albedos.

Unit V

Image-based Rendering and Recognition

View interpolation Layered depth images, Light fields and Lumigraphs, Environment mattes Video-based rendering, Object detection, Face recognition, Instance recognition, Category recognition, Context and scene understanding, Recognition databases and test sets.

Course Objective

To understand the fundamental concepts related to Image formation and processing. To learn feature detection, matching and detection. To become familiar with feature based alignment and motion estimation. To develop skills on 3D reconstruction. To understand image based rendering and recognition.

Course Outcome

After the completion of this course, the students will be able to: 1: Understand basic knowledge, theories and methods in image processing and computer vision. 2: Implement basic and some advanced image processing techniques in OpenCV. 3: Apply 2D a feature-based based image alignment, segmentation and motion estimations. 4: Apply 3D image reconstruction techniques. 5: Design and develop innovative image processing and computer vision applications.

Practicals

  • OpenCV Installation and working with Python

    Basic Image Processing , loading images, Cropping, Resizing, Thresholding, Contour analysis, Bolb detection

  • Image Annotation – Drawing lines, text circle, rectangle, ellipse on images

    Image Enhancement, Understanding Color spaces, color space conversion, Histogram equialization, Convolution, Image smoothing, Gradients, Edge Detection

  • Image Features and Image Alignment – Image transforms – Fourier, Hough, Extract ORB Image features, Feature matching and cloning

    Feature matching based image alignment

  • Image segmentation using Graphcut / Grabcut

    Camera Calibration with circular grid

  • Pose Estimation

    3D Reconstruction – Creating Depth map from stereo images

Reference Books

  • Richard Szeliski, “Computer Vision: Algorithms and Applications”, Springer- Texts in Computer Science, Second Edition, 2022.

  • D. A. Forsyth, J. Ponce, “Computer Vision: A Modern Approach”, Pearson Education, Second Edition, 2015.

  • Richard Hartley and Andrew Zisserman, “Multiple View Geometry in Computer Vision”, Second Edition, Cambridge University Press, March 2004.

  • Christopher M. Bishop, “Pattern Recognition and Machine Learning”, Springer, 2006

  • E. R. Davies, “Computer and Machine Vision”, Fourth Edition, Academic Press, 2012.