Back close

Course Detail

Course Name Introduction to Data Analytics with R Programming
Course Code 23DLS505
Program
Semester 1
Credits 4

Syllabus

Unit I

Introduction, Causality and Experiments, Data Preprocessing: Data cleaning, Data reduction, Data transformation, Data discretization. Visualization and Graphing: Visualizing Categorical Distributions, Visualizing Numerical Distributions, Overlaid Graphs, plots, and summary statistics of exploratory data analysis, Randomness, Probability, Introduction to Statistics, Sampling, Sample Means and Sample Sizes.

Unit II

Descriptive statistics – Central tendency, dispersion, variance, covariance, kurtosis, five point summary, Distributions, Bayes Theorem, Error Probabilities; Permutation Testing, Statistical Inference; Hypothesis Testing, Assessing Models, Decisions and Uncertainty, Comparing Samples, A/B Testing, P-Values, Causality.

Unit III

Estimation, Prediction, Confidence Intervals, Inference for Regression, Classification , Graphical Models, Updating Predictions.

Text Book

  1. Adi Adhikari and John DeNero, “Computational and Inferential Thinking: The Foundations of Data Science”, e-book.

Reference Books

  1. Data Mining for Business Analytics: Concepts, Techniques and Applications in R, by Galit Shmueli, Peter C. Bruce, Inbal Yahav, Nitin R. Patel, Kenneth C. Lichtendahl Jr., Wiley India, 2018.
  2. Rachel Schutt & Cathy O’Neil, “Doing Data Science” O’ Reilly, First Edition, 2013.

DISCLAIMER: The appearance of external links on this web site does not constitute endorsement by the School of Biotechnology/Amrita Vishwa Vidyapeetham or the information, products or services contained therein. For other than authorized activities, the Amrita Vishwa Vidyapeetham does not exercise any editorial control over the information you may find at these locations. These links are provided consistent with the stated purpose of this web site.

Admissions Apply Now