Syllabus - Big Data Analytics (CD603 (A))


CSE-Data Science/Data Science

Big Data Analytics (CD603 (A))

VI

Unit1

Introduction to Big data, Big data characteristics, Types of big data, Traditional versusBig data, Evolution of Big data, challenges with Big Data, Technologies available for BigData, Infrastructure for Big data, Use of Data Analytics, Desired properties of Big Datasystem.

Unit2

Introduction to Hadoop, Core Hadoop components, Hadoop Eco system, HivePhysical Architecture, Hadoop limitations, RDBMS Versus Hadoop, Hadoop Distributed Filesystem, Processing Data with Hadoop, Managing Resources and Application with HadoopYARN, MapReduce programming.

Unit3

Introduction to Hive Hive Architecture, Hive Data types, Hive Query Language,Introduction to Pig, Anatomy of Pig, Pig on Hadoop, Use Case for Pig, ETL Processing, Datatypes in Pig running Pig, Execution model of Pig, Operators, functions,Data types of Pig.

Unit4

Introduction to NoSQL, NoSQL Business Drivers, NoSQL Data architectural patterns,Variations of NOSQL architectural patterns using NoSQL to Manage Big Data, Introductionto MangoDB

Unit5

Mining social Network Graphs: Introduction Applications of social Network mining,Social Networks as a Graph, Types of social Networks, Clustering of social Graphs DirectDiscovery of communities in a social graph, Introduction to recommender system.

Practicals

Reference Books

  • RadhaShankarmani, M. Vijaylakshmi, " Big Data Analytics", Wiley, Secondedition

  • Seema Acharya, SubhashiniChellappan, " Big Data and Analytics", Wiley, Firstedition