Syllabus - Information Retrieval (AD 603 (C))


Artificial Intelligence and Data Science

Information Retrieval (AD 603 (C))

VI-Semester

Unit-I

Introduction

History of IR- Components of IR - Issues - Open source Search engine. Frameworks - The Impact of the web on IR - The role of artificial intelligence (AI) in IR – IR Versus Web Search - Components of a search engine, characterizing the web.

Unit-II

Boolean and Vector space retrieval models

Term weighting - TF-IDF weighting- cosinesimilarity - Pre-processing - Inverted indices - efficient processing with sparse vectors LanguageModel based IR - Probabilistic IR -Latent Semantic indexing - Relevance feedback and queryexpansion.

Unit-III

Web search overview

web structure the user paid placement search engine optimization, Web Search Architectures - crawling - meta-crawlers, Focused Crawling - web indexes - Near duplicate detection - Index Compression - XML retrieval.

Unit-IV

Link Analysis

hubs and authorities - Page Rank and HITS algorithms - Searching and Ranking Relevance Scoring and ranking for Web - Similarity - Hadoop & Map Reduce - Evaluation - Personalized search - Collaborative filtering and content-based recommendation of documents - handling And products QuestionAnswering, Cross-Lingual Retrieval. - Snippet generation, Summarization. invisible Web

Unit-V

Information filtering

organization and relevance feedback - Text Mining- Text classification and clustering - Categorization algorithms, naive Bayes, decision trees and nearest neighbor - Clustering algorithms: agglomerative clustering, k-means, expectation maximization (EM).

Practicals

Reference Books

  • C. Manning, P. Raghvan and H Schutze: Introduction to Information Retrieval, Cambridge University Press, 2008.

  • Ricardo Baeza -Yates and Berthier Ribeiro –Neto, Modern Information Retrieval The Concepts and Technology behind Search 2nd Edition, ACM Press Books 2011.

  • Bruce Croft, Donald Metzler and Trevor Strohman Search Engines Information Retrieval in Practice 1st Edition Addison Wesley, 2009

  • MarkLevene, An Introduction to Search Engines and Web Navigation, 2nd Edition Wiley 2010.