Syllabus - PYTHON FOR DATA SCIENCE (DS406)


CSE-Data Science/Data Science

PYTHON FOR DATA SCIENCE (DS406)

IV

Unit – I

Python Concepts, Data Structures and OOPs in Python

Interpreter – Program Execution – Statements – Expressions – Flow Controls – Functions – Numeric Data Types – Sequences – Strings – Tuples – Lists – Dictionaries – Class Definition – Constructors – Object Creation – Inheritance.

Unit – II

Numpy and Pandas Libraries of Python

Numerical operations with Numpy– Pandas Series and Dataframes– Data Manipulation with Pandas – Overloading – Text Filesand Binary Files – Reading and Writing.

Unit – III

Data Wrangling

Combining and Merging Data Sets – Reshaping and Pivoting – Data Transformation – String manipulations – Regular Expressions.

Unit – IV

Data Aggregation and Group Operations

GroupBy Mechanics – Data Aggregation – GroupWise Operations – Transformations – Pivot Tables – Cross Tabulations – Date and Time data types.

Unit – V

Visualization in Python

Matplotlib Package – Plotting Graph - Controlling Graphs – Adding Text – More Graph Types – Getting and Setting Values – Patches.

Practicals

  • Write a python program to reverse a string.

  • Write a python program to perform following operation using lists: a. append element in the list b. compare two lists c. convert list to dictionary

  • Write a Program to transpose a table/pandas data frame.

  • Write a NumPy program to create a 3x3 matrix with values ranging from 2 to 10.

  • Write a python program to perform following operation on Data Frame: a. Create two different Data Frames and perform the merging operations on it. b. Create two different Data Frames and perform the grouping operations on it. c. Create two different Data Frames and perform the concatenating operations on it

  • Program to check regular expression pattern is matching with string or not in Python

  • Create a sample dataset and apply the following aggregation function on it: mean(), median()Mean and median min(), max() Minimum and maximum std(), var() Standard deviation and variance sum() Sum of all items

  • Write a python program to get row wise proportion using crosstab () function.

  • Write a python program to display a bar chart of the popularity of programming languages.

  • Write a python program to create bar plot of scores by group and gender. Use multiple X values on the same chart for men and women.

Reference Books

  • Mark Lutz, “Programming Python”, O'Reilly Media, 4th edition, 2010.

  • Joel Grus, “Data Science from scratch”, O'Reilly, 2015.

  • Tim Hall and J-P Stacey, “Python 3 for Absolute Beginners”, Apress, 1st edition, 2009.

  • Magnus Lie Hetland, “Beginning Python: From Novice to Professional”, Apress, Second Edition, 2005.

  • Shai Vaingast, “Beginning Python Visualization Crafting Visual Transformation Scripts”, Apress, 2nd edition, 2014.

  • Wes Mc Kinney, “Python for Data Analysis”, O'Reilly Media, 2012.