Syllabus - Data Science using R Programming (AB-703)


Automation and Robotics Engineering

Data Science using R Programming (AB-703)

VII-Semester

R Introduction

Overview of R Programming, Downloading and installing, Help of Function, Viewing documentation, General issues in R, Package Management

Data Inputting in R and Data Visualization

Data Types, Subsetting, Writing data, Reading from csv files, Creating a vector and vector operation, Initializing data frame, Control structure, Re-directing R Output Data Visualization; Creating bar chart and dot plot, Creating histogram and box plot, Plotting with base graphics, Plotting and coloring in R

Basic Statistics, Functions and Programming in R

Computing Basic Statistics, Measures of central tendency, Measures of variability, Skewness and kurtosis, Summary functions, describe functions, and descriptive statistics by group, Correlations, Comparing means of two samples, Testing a proportion, Data Munging Basics Functions and Programming in R; Flow control: For loop, If condition, Debugging tools

Data manipulation in R

List Management, Data Transformation, Merging Data Frames, Outlier Detection, Combining multiple vectors

R Database and Statistical Modelling in R

Performing queries, RODBC and DBI Package, Advanced Data handling, Combined and restructuring data frames Statistical Modelling in R; Logical Regression, Hierarchical Clustering PCA for Dimensionality Reduction

Course Objective

In this course, students will learn how to program in R and how to use R for effective data analysis and visualization.

Course Outcome

By the end of the course students shall be confident and equipped with all the knowledge required to perform analytical activities in R. Specifically, understand the fundamental syntax of R, apply critical programming language concepts, import a variety of data formats into R, prepare or tidy datas for analysis, query data using SQL and R, analyze a data set in R and present findings using the appropriate R packages, visualize data attributes using ggplot2 and other R packages.

Practicals

Reference Books

  • Wickham, H. & Grolemund, G. (2018). R for Data Science. O’Reilly: New York. Available for free at http://r4ds.had.co.nz