DURATION: 5 Days (40 hours).
Time Division (Breaks: 15 + 45 + 15 mins).

Course Outcomes:

Important Note:

Courseware – Reference material (PPT) along with lab files and exercises will be provided.

Module 1: Introduction to Data Science & Machine Learning

Module 2: Python for Data Analysis & Pre-processing

Introduction to Python

Python Libraries – NumPy, Pandas, Matplotlib, Seaborn, scikit-learn, TensorFlow, Keras, PyTorch.
Exploratory Data Analysis (EDA).
Data Cleaning Techniques, Handling Missing Data, Handling Categorical Data.
Introduction to EDA, 2D Scatter-plot, 3D Scatter-plot, Pair plots.
Univariate, Bivariate, and Multivariate Analysis, Box-plot.

Data Pre-Processing

Data Transformation

Module 3: Supervised Machine Learning – Regression

Simple Linear Regression

Concept of Linear Regression.
Ordinary Least Squares and Regression Errors.
Data Processing, Training and Testing of the Model.
Model Evaluation Parameters like R-squared, Score, RMSE and their Interpretations.
Prediction Plot & its Interpretation.
Hands-on Problem.
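The simple linear regression topics above (least-squares fit, train/test split, R-squared, RMSE) can be sketched in plain Python as follows. This is only an illustrative sketch, not the course's lab material: the function names and the toy data are hypothetical, and the "split" simply holds out the last two points rather than sampling at random.

```python
from math import sqrt

def fit_simple_ols(x, y):
    """Return (slope, intercept) that minimise the sum of squared residuals."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def predict(x, slope, intercept):
    """Apply the fitted line to each input value."""
    return [intercept + slope * xi for xi in x]

def r_squared(y_true, y_pred):
    """1 - SS_res / SS_tot: share of variance explained by the model."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((yt - yp) ** 2 for yt, yp in zip(y_true, y_pred))
    ss_tot = sum((yt - mean_y) ** 2 for yt in y_true)
    return 1.0 - ss_res / ss_tot

def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the target."""
    n = len(y_true)
    return sqrt(sum((yt - yp) ** 2 for yt, yp in zip(y_true, y_pred)) / n)

# Hypothetical toy data: hours studied vs. exam score.
hours = [1, 2, 3, 4, 5, 6, 7, 8]
scores = [52, 55, 61, 64, 70, 72, 79, 81]

# Hold out the last two points as a (very small) test set.
x_train, y_train = hours[:6], scores[:6]
x_test, y_test = hours[6:], scores[6:]

slope, intercept = fit_simple_ols(x_train, y_train)
y_pred = predict(x_test, slope, intercept)
print("slope:", slope, "intercept:", intercept)
print("test RMSE:", rmse(y_test, y_pred))
```

A lower RMSE means predictions sit closer to the observed values; an R-squared near 1 means the line explains most of the variation in the target.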

Multiple Linear Regression

Concept of Multiple Linear Regression.
Degrees of Freedom.
Adjusted R-Squared.
Assumptions of Multiple Linear Regression – Linearity, Multicollinearity, Autocorrelation, Endogeneity, Normality of Residuals, Homoscedasticity, etc.
Concept of time-lag data in Autocorrelation.
Concept of Dummy variable trap.
Hands-on Problem.
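Two of the ideas above lend themselves to a short sketch: adjusted R-squared, which penalises R-squared for the number of predictors, and the dummy variable trap, which is commonly avoided by dropping one category when one-hot encoding. The helper names below are hypothetical and the code is a minimal illustration in plain Python, not a replacement for a library encoder.

```python
def adjusted_r_squared(r2, n_samples, n_predictors):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1)."""
    return 1.0 - (1.0 - r2) * (n_samples - 1) / (n_samples - n_predictors - 1)

def one_hot_drop_first(values):
    """One-hot encode a categorical column, dropping the first category.

    Keeping every category would make the dummy columns sum to 1 in each
    row, which is perfectly collinear with the intercept (the dummy
    variable trap). Dropping one category removes that redundancy.
    """
    categories = sorted(set(values))
    kept = categories[1:]  # the first category becomes the baseline
    rows = [[1 if v == c else 0 for c in kept] for v in values]
    return kept, rows

# Hypothetical example values.
print(adjusted_r_squared(0.9, n_samples=11, n_predictors=2))
columns, encoded = one_hot_drop_first(["red", "green", "blue", "green"])
print(columns)
print(encoded)
```

With three categories only two dummy columns remain; a row of all zeros represents the dropped baseline category.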

Module 4: Supervised Machine Learning – Classification

Logistic Regression

Support Vector Machine (SVM)

Decision Tree Classifier

Random Forest Classifier

Evaluation Metrics for Classification Models

Need for Evaluation and Accuracy Paradox.
Different Measures for Classification Models – Accuracy, Precision, Recall, F1 Score, etc.
Threshold and Adjusting Thresholds.
ROC Curve and AUC.
Hands-on Problem.
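The evaluation topics above can be illustrated with a small plain-Python sketch that turns predicted scores into labels at a chosen threshold and then computes accuracy, precision, recall and F1. The function names and the data are hypothetical; the last example shows the accuracy paradox on an imbalanced set, where a model that never predicts the positive class still scores high accuracy.

```python
def confusion_counts(y_true, scores, threshold=0.5):
    """Return (tp, fp, tn, fn) after thresholding the scores."""
    tp = fp = tn = fn = 0
    for actual, score in zip(y_true, scores):
        predicted = 1 if score >= threshold else 0
        if predicted == 1 and actual == 1:
            tp += 1
        elif predicted == 1 and actual == 0:
            fp += 1
        elif predicted == 0 and actual == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn

def summarise_metrics(y_true, scores, threshold=0.5):
    """Compute accuracy, precision, recall and F1 at a given threshold."""
    tp, fp, tn, fn = confusion_counts(y_true, scores, threshold)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}

# Hypothetical labels and predicted probabilities.
y_true = [1, 0, 1, 1, 0, 0]
scores = [0.9, 0.6, 0.4, 0.8, 0.2, 0.1]

# Lowering the threshold trades precision for recall.
print(summarise_metrics(y_true, scores, threshold=0.5))
print(summarise_metrics(y_true, scores, threshold=0.3))

# Accuracy paradox: 95 negatives, 5 positives, model always says "negative".
imbalanced_true = [0] * 95 + [1] * 5
always_negative = [0.0] * 100
print(summarise_metrics(imbalanced_true, always_negative))
```

Sweeping the threshold from 1 down to 0 and recording the true-positive and false-positive rates at each step is what traces out the ROC curve.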

Module 5: Feature Selection and Dimensionality Reduction

Univariate Feature Selection

Feature Selection Importance.
Concept of Univariate Feature Selection.
F-Test for Regression and Classification.
Hands-on F-test (p-value analysis).
Chi-Squared for Classification.
Feature Selection Techniques – SelectKBest, SelectPercentile & GenericUnivariateSelect.
Hands-on Chi-squared (p-value analysis).
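As a rough illustration of the chi-squared p-value analysis listed above, the sketch below runs the classical chi-squared test of independence on a 2x2 table of a binary feature against a binary class. It is an assumption-laden sketch: the function name and counts are hypothetical, no continuity correction is applied, and the p-value formula used here holds only for one degree of freedom (a 2x2 table). Library feature selectors may compute their chi-squared score differently.

```python
from math import erfc, sqrt

def chi_squared_2x2(table):
    """Chi-squared statistic and p-value for a 2x2 contingency table.

    table[i][j] is the count of samples with feature value i and class j.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if feature and class were independent.
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected

    # Survival function of the chi-squared distribution with 1 degree
    # of freedom: P(X > x) = erfc(sqrt(x / 2)).
    p_value = erfc(sqrt(statistic / 2.0))
    return statistic, p_value

# Hypothetical counts: rows = feature absent/present, columns = class 0/1.
stat, p = chi_squared_2x2([[10, 20], [20, 10]])
print("chi2:", stat, "p-value:", p)
```

A small p-value (commonly below 0.05) suggests the feature and the class are not independent, so the feature is a candidate to keep.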

Recursive Feature Elimination (RFE)

Principal Component Analysis (PCA)

Module 6: Cross-validation & Hyperparameter Tuning

Hyperparameter Tuning

Module 7: Supervised Machine Learning – Natural Language Processing

Module 8: Unsupervised Machine Learning – Clustering

Module 9: Introduction to Deep Learning

Machine Learning Specialty