Syllabus - Artificial Intelligence and Machine Learning (AL 504 (B))
CSE
Artificial Intelligence and Machine Learning (AL 504 (B))
V-Semester
UNIT I
Introduction
Origins and challenges of NLP – Language Modeling: Grammar-based LM, Statistical LM – Regular Expressions, Finite-State Automata – English Morphology, Transducers for lexicon and rules, Tokenization, Detecting and Correcting Spelling Errors, Minimum Edit Distance.
UNIT II
Word Level Analysis
Unsmoothed N-grams, Evaluating N-grams, Smoothing, Interpolation and Backoff – Word Classes, Part-of-Speech Tagging, Rule-based, Stochastic and Transformation-based tagging, Issues in PoS tagging – Hidden Markov and Maximum Entropy models, Viterbi algorithms and EM training.
UNIT III
Syntactic Analysis
Context-Free Grammars, Grammar rules for English, Treebanks, Normal Forms for grammar – Dependency Grammar – Syntactic Parsing, Ambiguity, Dynamic Programming parsing – Shallow parsing – Probabilistic CFG, Probabilistic CYK, Probabilistic Lexicalized CFGs – Feature structures, Unification of feature structures.
UNIT IV
Semantics and Pragmatics
Requirements for representation, First-Order Logic, Description Logics – Syntax-Driven Semantic analysis, Semantic attachments – Word Senses, Relations between Senses, Thematic Roles, selectional restrictions – Word Sense Disambiguation, WSD using Supervised, Dictionary & Thesaurus, Bootstrapping methods – Word Similarity using Thesaurus and Distributional methods.Compositional semantics.
UNIT V
Application of NLP
intelligent work processors: Machine translation, user interfaces, Man-Machine interfaces, natural language querying, tutoring and authoring systems, speech recognition, and commercial use of NLP.
Course Objective
Students should develop a basic understanding in natural language processing methods and strategies and to evaluate the strengths and weaknesses of various Natural Language Processing (NLP) methods & technologies and gain an insight into the application areas of Natural language processing.
Course Outcome
["Define different data models used in Information Retrieval using NLP.", "Demonstrate current methods for statistical approaches to machine translation.", "Apply syntactic parsing and semantic analysis on text.", "Solve and implement real world problems using NLP."]
Practicals
Reference Books
-
Daniel Jurafsky, James H. Martin―Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech, Pearson Publication.
-
Steven Bird, Ewan Klein and Edward Loper, ―Natural Language Processing with Python, OReilly Media.
-
Manning and Schutze "Foundations of Statistical Natural Language Processing", MIT Press.
-
Breck Baldwin, Language Processing with Java and LingPipe Cookbook, Atlantic Publisher.
-
Richard M Reese, Natural Language Processing with Java, OReilly Media.
-
Nitin Indurkhya and Fred J. Damerau, Handbook of Natural Language Processing, Chapman and Hall/CRC Press.
-
Tanveer Siddiqui, U.S. Tiwary, Natural Language Processing and Information Retrieval, Oxford University Press.