Curriculum
OUTLINE

Course Outline for Beginners - 20 months
Module name Module type Duration (months)
Advanced SQL & AI for Data Professionals Core 1
Excel & Dashboarding with AI Storytelling Core 1
Python Foundations + AI Coding Assistants Core 1
Data to Decisions: Product Analytics with AI Core 1
Generative AI for Data Analytics & Automation Core 1
Interview Readiness Core 1
Advanced Analytics with Python, Pandas & AI Workflows Core 1
Statistics, Experimentation & AI for Data-Driven Decisions Core 1
Advanced Product Analytics Core 1
Cloud & Data Analytics on AWS Elective 1
AI For everyone Elective 1
Mathematical Foundations for AI Core 1
Machine Learning with AI Workflows Core 1
Supervised Machine Learning with AI Core 1
Un-supervised Machine Learning with AI Core 1
Time Series and Recommender Systems with AI Workflows Core 1
MLOps & AI Deployment Core 1
Deep Learning & Neural Networks Core 1
AI for Computer Vision Core 1
AI for Natural Language Processing Core 1
AI engineering and Agentic AI Core 1
Course Outline for Intermediate - 14.5 months
Module name Module type Duration (months)
Advanced Analytics with Python, Pandas & AI Workflows Core 1
Statistics, Experimentation & AI for Data-Driven Decisions Core 1
Advanced Product Analytics Core 1
Cloud & Data Analytics on AWS Elective 1
AI For everyone Elective
Mathematical Foundations for AI Core 1
Machine Learning with AI Workflows Core 1
Supervised Machine Learning with AI Core 1
Un-supervised Machine Learning with AI Core 1
Time Series and Recommender Systems with AI Workflows Core 1
MLOps & AI Deployment Core 1
Deep Learning & Neural Networks Core 1
AI for Computer Vision Core 1
AI for Natural Language Processing Core 1
AI engineering and Agentic AI Core 1

1 Placement assistance for Data Analyst/Product Analyst roles via mastery-based evaluation starts after completion of this module

2 Pick one case study across industries like E-commerce, Healthcare, Social Media Marketing and FinTech.

3 You can pursue the Deep Learning specialisation after completing the Machine Learning specialisation or vice versa

4 Must have Data Science certification (cleared till MLOps module) and enrolled in the Data Learning track. Must meet attendance and PSP requirements in past modules.

5 After completion of this module, placement assistance for Data Scientist (ML/DS) roles via mastery-based evaluation will start.

* Not an entry point for intermediate learners. An intermediate learner can choose/skip any of the above

A Deep Dive into
the
Curriculum

Note: Listed below is the detailed insight into the Data Science and Machine Learning curriculum

Beginner Module - The Basics (20 months)

Topics Covered:
01 Advanced SQL & AI for Data Professionals
  • Data Foundations + AI
  • Schema & SQL Basics
  • Data Filtering with SQL
  • NULLs & Aggregation
  • GROUP BY Metrics
  • Applied SQL Lab 1: GroupBy Problems
  • CASE WHEN and Conditional Aggregation
  • JOINs 1: INNER JOIN
  • JOINs 2: LEFT + FULL OUTER + CROSS
  • JOINs 3: SELF JOIN + NON-EQUI JOINS
  • Applied SQL Lab 2: JOIN-heavy Problems
  • Subqueries 1: Non-Correlated
  • Subqueries 2: Correlated + EXISTS
  • CTEs + UNION / UNION ALL
  • Window Functions 1: ROW_NUMBER & Aggregates
  • Window Functions 2: Advanced Ranking & Frames
  • Window Functions 3: Navigation & Trend Analysis
  • Date & Time SQL 1: Core
  • Applied SQL Lab 3: Windows + Dates
  • Date & Time SQL 2: Growth Metrics
  • Funnel Analytics in SQL
  • Cohorts & Retention Analysis
  • Views, Indexing & Query Performance
  • Functions & Stored Procedures
  • Applied SQL Lab 4: Timed Interview Drill
02 Excel & Dashboarding with AI Storytelling
  • Excel 1: Cleanup + Structure
  • Excel 2: Core Formulas
  • Excel 3: Lookups + Reporting
  • Excel 4: Pivot Tables + Mini Dashboard
  • Live Business Case: Excel Reporting Case
  • Tableau 1: Viz Fundamentals + KPIs
  • Tableau 2: Filters + Calculations
  • Tableau 3: Dashboard Lab 1
  • Tableau 4: LOD (Practical)
  • Tableau 5: Table Calcs + Analytics
  • Tableau 6: Dashboard Creation
  • Live Business Case 4: Dashboard Storytelling
03 Python Foundations + AI Coding Assistants
  • Python 1: Data Types + Variables + I/O
  • Python 2: Control Flow + Loops
  • Python 3: Functions + Lists + Strings
  • Python 4: Data Structures
  • Python 5
  • Python 6
  • Python Problem Solving & Debugging Masterclass
  • Pandas 1: Read + Filter + Select
  • Pandas 2: Groupby KPIs
  • Pandas 3: Merge + Cleaning
  • Pandas 4: EDA & Data Quality Checks
  • Pandas 5: Complete EDA
04 Data to Decisions: Product Analytics with AI
  • Product Analytics 1: Metrics Foundations (DAU, MAU, Retention, Funnels)
  • Product Analytics 2: Event Tracking & Data Instrumentation Basics
  • Product Analytics 3: User Segmentation & Behavioral Analysis
  • Product Analytics 4: Funnel Analysis (Conversion & Drop-offs)
  • Product Analytics 5: Cohort Analysis & Retention Deep Dive
  • Product Analytics 6: Growth Metrics (Activation, Engagement, Retention)
  • Product Analytics 7: A/B Testing Fundamentals for Product Decisions
  • Product Analytics 8: Experiment Design & Common Pitfalls
  • Product Analytics 9: Revenue Metrics (AOV, LTV, CAC Basics)
  • Product Analytics 10: Feature Adoption & Product Usage Analysis
  • Product Analytics 11: AI for Product Analytics (Insight Generation & Hypothesis)
  • Applied Product Lab: End-to-End Growth Case + AI-Assisted Insights
05 Generative AI for Data Analytics & Automation
  • GenAI 1: Introduction to AI, LLMs & Analyst Workflows
  • GenAI 2: Prompt Engineering for Data Tasks (SQL, Python, Insights)
  • GenAI 3: Using AI for SQL Query Generation & Debugging
  • GenAI 4: Using AI for Python & Pandas Automation
  • GenAI 5: AI for Data Cleaning & EDA
  • GenAI 6: AI for Dashboard Insights & Storytelling
  • GenAI 7: Building AI-Assisted Reports & Business Narratives
  • GenAI 8: Automating Repetitive Analysis using AI
  • GenAI 9: Evaluating AI Outputs (Hallucination, Bias, Errors)
  • GenAI 10: Using AI APIs (OpenAI) for Data Workflows
  • GenAI 11: Building Mini AI Tools for Analysts (End-to-End)
  • Applied GenAI Lab: Solve a Business Case using AI + Analytics
06 Interview Readiness
  • Interview Prep 1: SQL Basics & Pattern Recognition
  • Interview Prep 2: SQL Intermediate (JOINs, Aggregations)
  • Interview Prep 3: SQL Advanced (Window Functions, CTEs)
  • Interview Prep 4: SQL Case-Based Problem Solving
  • Interview Prep 5: Python Basics for Interviews
  • Interview Prep 6: Pandas-Based Interview Questions
  • Interview Prep 7: Product Case Interviews (Metrics & Business Thinking)
  • Interview Prep 8: Estimation & Guesstimates for Analysts
  • Interview Prep 9: Resume Building (Projects, Impact, Storytelling)
  • Interview Prep 10: LinkedIn & Portfolio Optimization
07 Advanced Analytics with Python, Pandas & AI Workflows
  • Advanced Pandas 1: Efficient Data Manipulation & Chaining
  • Advanced Pandas 2: Groupby Advanced Patterns & Transformations
  • Advanced Pandas 3: Merge, Join & Multi-Table Data Handling
  • Advanced Pandas 4: Handling Large Datasets (Performance & Memory)
  • Advanced Pandas 5: Time Series Analysis Basics
  • Advanced Pandas 6: Data Cleaning at Scale (Edge Cases)
  • NumPy Foundations for Analytics
  • Feature Engineering for Analytical Problems
  • Building Reusable Data Pipelines in Python
  • Automation for Analysts (Scripts & Scheduling)
  • AI Workflows 1: Using AI for Analysis & Code Generation
  • AI Workflows 2: Building End-to-End AI-Assisted Data Solutions
08 Statistics, Experimentation & AI for Data-Driven Decisions
  • Statistics 1: Probability Foundations for Analytics
  • Statistics 2: Distributions (Normal, Binomial, Real-world Context)
  • Statistics 3: Sampling Techniques & Bias
  • Statistics 4: Central Limit Theorem (Intuition + Applications)
  • Statistics 5: Hypothesis Testing Fundamentals
  • Statistics 6: p-values, Confidence Intervals & Errors
  • Statistics 7: A/B Testing (Design & Setup)
  • Statistics 8: A/B Testing (Analysis & Interpretation)
  • Statistics 9: Experimentation Pitfalls (Bias, Peeking, Simpson’s Paradox)
  • Statistics 10: Metric Design for Experiments
  • Statistics 11: AI-Assisted Statistical Interpretation & Validation
  • Applied Experiment Lab: End-to-End A/B Test Case
09 Advanced Product Analytics
  • 1: Advanced Funnel Analysis (Multi-step, Multi-channel)
  • 2: Retention Models & Cohort Deep Dive
  • 3: Customer Lifetime Value (LTV) Modeling
  • 4: Customer Acquisition Cost (CAC) & Payback
  • 5: Growth Loops & Product-Led Growth
  • 6: Segmentation Strategies (Behavioral & Value-based)
  • 7: Churn Analysis & Retention Strategies
  • 8: Predictive Analytics (Intro for Product)
  • 9: Recommendation Systems (Conceptual)
  • 10: Decision Frameworks for Product Teams
  • 11: AI for Product Strategy & Insights
  • Applied Product Capstone: End-to-End Product Analytics Case
10 Cloud & Data Analytics on AWS
  • Cloud 1: Introduction to Cloud & Modern Data Stack
  • Cloud 2: AWS Fundamentals (S3, IAM Basics)
  • Cloud 3: Data Storage with S3 (Data Lakes Concept)
  • Cloud 4: Querying Data with Amazon Athena
  • Cloud 5: Data Warehousing with Amazon Redshift
  • Cloud 6: ETL Basics with AWS Glue
  • Cloud 7: Data Pipelines (End-to-End Flow)
  • Cloud 8: Connecting Tableau to Cloud Data Sources
  • Cloud 9: Performance & Cost Optimization Basics
  • Cloud 10: Intro to Databricks & Lakehouse Concept
  • Cloud 11: Real-world Architecture for Data Analytics
  • Applied Cloud Lab: Build End-to-End Data Pipeline on AWS
11 AI For everyone
12 Mathematical Foundations for AI
  • Linear Algebra 1 - The ML context
  • Linear Algebra 2 - Dot products and hyperplanes
  • Linear Algebra 3 - Halfspaces and distance
  • Linear Algebra 4 - Loss minimization in classification
  • Linear Algebra 5 - Problem solving
  • Optimization 1 - The need for calculus in ML
  • Optimization 2 - Towards gradient descent
  • Optimization 3 - Gradient descent in action
  • Optimization 4 - Constrained optimization
  • Optimization 5 - Principal component analysis
13 Machine Learning with AI Workflows
  • ML: Intro to Machine Learning
  • ML: Linear Regression-1
  • ML: Linear Regression-2
  • ML: Linear Regression-3
  • ML: Linear Regression-4
  • ML: Polynomial Regression, Bias-Variance
  • ML: Regularisation, Cross Validation
  • ML: Logistic Regression-1
  • ML: Logistic Regression-2
  • ML: Classification Metrics (CM)
  • ML: CM contd. + Imbalanced Data
14 Supervised Machine Learning with AI
  • ML: k Nearest Neighbors-1
  • ML: k Nearest Neighbors-2
  • ML: Decision Trees-1
  • ML: Decision Trees-2
  • ML: Bagging and Random Forest
  • ML: Boosting-1
  • ML: Boosting-2
  • ML: Other Ensemble Techniques
  • ML: Naive Bayes 1
  • ML: Naive Bayes 2
  • ML: SVM-1
  • ML: SVM-2
  • ML: Supervised ML Wrap-up (Optional)
15 Un-supervised Machine Learning with AI
  • Dimensionality Reduction using PCA
  • Dimensionality Reduction using tSNE
  • Dimensionality Reduction using tSNE contd + UMAP
  • Clustering using KMeans
  • Clustering using KMeans++
  • Clustering using GMM
  • Clustering using Hierarchical Clustering
  • Clustering with DBSCAN
  • Anomaly Detection
  • ML: End to End Business Case - 1
  • ML: End to End Business Case - 2
16 Time Series and Recommender Systems with AI Workflows
  • ML: Intro to Time Series and Forecasting
  • ML: Time Series Analysis - 1
  • ML: Time Series Analysis - 2
  • ML: Time Series Analysis - 3
  • ML: Time Series Analysis - 4
  • ML: Time Series Analysis - 5
  • ML: Recommender Systems-1
  • ML: Recommender Systems-2
  • ML: Recommender Systems-3
  • ML: Recommender Systems-4
  • ML: Recommender Systems-5
  • ML: End-to-end Practical Session
17 MLOps & AI Deployment
  • Git & GitHub: Setup for MLOps
  • Building Cars24 ML tool using Streamlit
  • Develop Web APIs using Flask
  • Containerization - Docker & DockerHub
  • Deploying APIs on AWS using ECS
  • GitHub Actions - Setting up CI pipelines
  • GitHub Actions - Setting up CD pipelines
  • Experiment Tracking & Data Management using MLFlow
  • ML System Design - 1
  • ML System Design - 2
  • Building ML pipelines with AWS Sagemaker
  • Processing large scale data using Apache Spark
18 Deep Learning & Neural Networks
  • NN: Intro to Neural Networks
  • NN: Forward and Back Propagation
  • N-Layer Neural Network - 1
  • N-Layer Neural Network - 2
  • N-Layer BackPropagation
  • Tensorflow and Keras - 1
  • Tensorflow and Keras - 2
  • Optimizers for NNs
  • Hyper Parameter Tuning for NNs
  • Autoencoders
  • Practical aspects of designing MLPs and debugging
  • Model interpretability: LIME
19 AI for Computer Vision
  • Introduction to Computer Vision (CNN)
  • Revisiting CNN: Deal with Overfitting
  • CNN under the hood
  • Introduction to Transfer Learning
  • Image similarity: Understanding Embeddings
  • CNN for medical diagnosis
  • Object Localisation and Detection - 1
  • Object Localisation and Detection - 2
  • Object Segmentation
  • Siamese networks
  • Generative Models & GANs Introduction
20 AI for Natural Language Processing
  • NLP: Intro to NLP
  • NLP: Text Representation
  • NLP: Word Embedding - Word2Vec
  • NLP: Language Modeling
  • NLP: Topic Modeling
  • NLP: RNNs
  • NLP: LSTM
  • NLP: NER
  • NLP: Attention
  • NLP: Transformers
  • NLP: BERT
21 AI engineering and Agentic AI
  • Introduction to AI Engineering
  • Refresher: Modern Text & Image AI Model Architectures
  • Getting started with Text Generation LLM APIs (OpenAI API)
  • Designing Eval Pipelines for AI Apps and Observability
  • Prompt Engineering: Introduction
  • Retrieval-Augmented Generation (RAG) Introduction
  • Embeddings Deep Dive
  • Multimodal & Tabular RAG
  • Prompt Engineering: Security
  • Agents: Foundations & Planning
  • Advanced Agent Concepts
  • Agent: Frameworks and Protocols
  • Fine Tuning Existing Models
  • Audio, Image, Video, Expressions: AI Modalities
  • Scaling AI Applications
  • Model Quantization Techniques
  • Advanced Fine Tuning Techniques
  • Dataset Engineering && Inference Optimization

Intermediate Module - The Basics (14.5 months)

Topics Covered:
01 Advanced Analytics with Python, Pandas & AI Workflows
  • Advanced Pandas 1: Efficient Data Manipulation & Chaining
  • Advanced Pandas 2: Groupby Advanced Patterns & Transformations
  • Advanced Pandas 3: Merge, Join & Multi-Table Data Handling
  • Advanced Pandas 4: Handling Large Datasets (Performance & Memory)
  • Advanced Pandas 5: Time Series Analysis Basics
  • Advanced Pandas 6: Data Cleaning at Scale (Edge Cases)
  • NumPy Foundations for Analytics
  • Feature Engineering for Analytical Problems
  • Building Reusable Data Pipelines in Python
  • Automation for Analysts (Scripts & Scheduling)
  • AI Workflows 1: Using AI for Analysis & Code Generation
  • AI Workflows 2: Building End-to-End AI-Assisted Data Solutions
02 Statistics, Experimentation & AI for Data-Driven Decisions
  • Statistics 1: Probability Foundations for Analytics
  • Statistics 2: Distributions (Normal, Binomial, Real-world Context)
  • Statistics 3: Sampling Techniques & Bias
  • Statistics 4: Central Limit Theorem (Intuition + Applications)
  • Statistics 5: Hypothesis Testing Fundamentals
  • Statistics 6: p-values, Confidence Intervals & Errors
  • Statistics 7: A/B Testing (Design & Setup)
  • Statistics 8: A/B Testing (Analysis & Interpretation)
  • Statistics 9: Experimentation Pitfalls (Bias, Peeking, Simpson’s Paradox)
  • Statistics 10: Metric Design for Experiments
  • Statistics 11: AI-Assisted Statistical Interpretation & Validation
  • Applied Experiment Lab: End-to-End A/B Test Case
03 Advanced Product Analytics
  • 1: Advanced Funnel Analysis (Multi-step, Multi-channel)
  • 2: Retention Models & Cohort Deep Dive
  • 3: Customer Lifetime Value (LTV) Modeling
  • 4: Customer Acquisition Cost (CAC) & Payback
  • 5: Growth Loops & Product-Led Growth
  • 6: Segmentation Strategies (Behavioral & Value-based)
  • 7: Churn Analysis & Retention Strategies
  • 8: Predictive Analytics (Intro for Product)
  • 9: Recommendation Systems (Conceptual)
  • 10: Decision Frameworks for Product Teams
  • 11: AI for Product Strategy & Insights
  • Applied Product Capstone: End-to-End Product Analytics Case
04 Cloud & Data Analytics on AWS
  • Cloud 1: Introduction to Cloud & Modern Data Stack
  • Cloud 2: AWS Fundamentals (S3, IAM Basics)
  • Cloud 3: Data Storage with S3 (Data Lakes Concept)
  • Cloud 4: Querying Data with Amazon Athena
  • Cloud 5: Data Warehousing with Amazon Redshift
  • Cloud 6: ETL Basics with AWS Glue
  • Cloud 7: Data Pipelines (End-to-End Flow)
  • Cloud 8: Connecting Tableau to Cloud Data Sources
  • Cloud 9: Performance & Cost Optimization Basics
  • Cloud 10: Intro to Databricks & Lakehouse Concept
  • Cloud 11: Real-world Architecture for Data Analytics
  • Applied Cloud Lab: Build End-to-End Data Pipeline on AWS
05 AI For everyone
06 Mathematical Foundations for AI
  • Linear Algebra 1 - The ML context
  • Linear Algebra 2 - Dot products and hyperplanes
  • Linear Algebra 3 - Halfspaces and distance
  • Linear Algebra 4 - Loss minimization in classification
  • Linear Algebra 5 - Problem solving
  • Optimization 1 - The need for calculus in ML
  • Optimization 2 - Towards gradient descent
  • Optimization 3 - Gradient descent in action
  • Optimization 4 - Constrained optimization
  • Optimization 5 - Principal component analysis
07 Machine Learning with AI Workflows
  • ML: Intro to Machine Learning
  • ML: Linear Regression-1
  • ML: Linear Regression-2
  • ML: Linear Regression-3
  • ML: Linear Regression-4
  • ML: Polynomial Regression, Bias-Variance
  • ML: Regularisation, Cross Validation
  • ML: Logistic Regression-1
  • ML: Logistic Regression-2
  • ML: Classification Metrics (CM)
  • ML: CM contd. + Imbalanced Data
08 Supervised Machine Learning with AI
  • ML: k Nearest Neighbors-1
  • ML: k Nearest Neighbors-2
  • ML: Decision Trees-1
  • ML: Decision Trees-2
  • ML: Bagging and Random Forest
  • ML: Boosting-1
  • ML: Boosting-2
  • ML: Other Ensemble Techniques
  • ML: Naive Bayes 1
  • ML: Naive Bayes 2
  • ML: SVM-1
  • ML: SVM-2
  • ML: Supervised ML Wrap-up (Optional)
09 Un-supervised Machine Learning with AI
  • Dimensionality Reduction using PCA
  • Dimensionality Reduction using tSNE
  • Dimensionality Reduction using tSNE contd + UMAP
  • Clustering using KMeans
  • Clustering using KMeans++
  • Clustering using GMM
  • Clustering using Hierarchical Clustering
  • Clustering with DBSCAN
  • Anomaly Detection
  • ML: End to End Business Case - 1
  • ML: End to End Business Case - 2
10 Time Series and Recommender Systems with AI Workflows
  • ML: Intro to Time Series and Forecasting
  • ML: Time Series Analysis - 1
  • ML: Time Series Analysis - 2
  • ML: Time Series Analysis - 3
  • ML: Time Series Analysis - 4
  • ML: Time Series Analysis - 5
  • ML: Recommender Systems-1
  • ML: Recommender Systems-2
  • ML: Recommender Systems-3
  • ML: Recommender Systems-4
  • ML: Recommender Systems-5
  • ML: End-to-end Practical Session
11 MLOps & AI Deployment
  • Git & GitHub: Setup for MLOps
  • Building Cars24 ML tool using Streamlit
  • Develop Web APIs using Flask
  • Containerization - Docker & DockerHub
  • Deploying APIs on AWS using ECS
  • GitHub Actions - Setting up CI pipelines
  • GitHub Actions - Setting up CD pipelines
  • Experiment Tracking & Data Management using MLFlow
  • ML System Design - 1
  • ML System Design - 2
  • Building ML pipelines with AWS Sagemaker
  • Processing large scale data using Apache Spark

12 Deep Learning & Neural Networks
  • NN: Intro to Neural Networks
  • NN: Forward and Back Propagation
  • N-Layer Neural Network - 1
  • N-Layer Neural Network - 2
  • N-Layer BackPropagation
  • Tensorflow and Keras - 1
  • Tensorflow and Keras - 2
  • Optimizers for NNs
  • Hyper Parameter Tuning for NNs
  • Autoencoders
  • Practical aspects of designing MLPs and debugging
  • Model interpretability: LIME
13 AI for Computer Vision
  • Introduction to Computer Vision (CNN)
  • Revisiting CNN: Deal with Overfitting
  • CNN under the hood
  • Introduction to Transfer Learning
  • Image similarity: Understanding Embeddings
  • CNN for medical diagnosis
  • Object Localisation and Detection - 1
  • Object Localisation and Detection - 2
  • Object Segmentation
  • Siamese networks
  • Generative Models & GANs Introduction
14 AI for Natural Language Processing
  • NLP: Intro to NLP
  • NLP: Text Representation
  • NLP: Word Embedding - Word2Vec
  • NLP: Language Modeling
  • NLP: Topic Modeling
  • NLP: RNNs
  • NLP: LSTM
  • NLP: NER
  • NLP: Attention
  • NLP: Transformers
  • NLP: BERT
15 AI engineering and Agentic AI
  • Introduction to AI Engineering
  • Refresher: Modern Text & Image AI Model Architectures
  • Getting started with Text Generation LLM APIs (OpenAI API)
  • Designing Eval Pipelines for AI Apps and Observability
  • Prompt Engineering: Introduction
  • Retrieval-Augmented Generation (RAG) Introduction
  • Embeddings Deep Dive
  • Multimodal & Tabular RAG
  • Prompt Engineering: Security
  • Agents: Foundations & Planning
  • Advanced Agent Concepts
  • Agent: Frameworks and Protocols
  • Fine Tuning Existing Models
  • Audio, Image, Video, Expressions: AI Modalities
  • Scaling AI Applications
  • Model Quantization Techniques
  • Advanced Fine Tuning Techniques
  • Dataset Engineering && Inference Optimization