Data Scientist Learning Path

Your comprehensive resource for building expertise in data science.

01

Foundations of Data Science and Machine Learning

Section 1.1: What is Data Science? Overview and Roles

Video: What is Data Science? Overview and Roles (10min)

Section 1.2: Introduction to Machine Learning

Video: Introduction to Machine Learning (8min)

Section 1.3: Applications of ML

Video: Applications of ML (5min)

Section 1.4: Overview of the ML Workflow

Video: Overview of the ML Workflow (6min)

Section 1.5: Ethics in AI

Video: Ethics in AI (5min)

02

Python for Data Science

Section 2.1: Python Basics

Video: Python Basics (10min)

Section 2.2: Data Manipulation with Pandas and NumPy

Video: Data Manipulation with Pandas and NumPy (short videos, about 5-10min each)

Section 2.3: Data Visualization with Matplotlib, Seaborn, and Plotly

Video: Data Visualization with Matplotlib, Seaborn, and Plotly (15min)

Section 2.4: Working with Jupyter Notebook and Google Colab

Video: Working with Jupyter Notebook and Google Colab (10min)

Section 2.5: Overview of Statistical Learning

Course: Overview of Statistical Learning (1 hour)

03

Data Wrangling and Preparation

Section 3.1: Handling Missing Data and Outliers

Video: Handling Missing Data and Outliers (19min)

Section 3.2: Feature Engineering

Video: Feature Engineering (29min)

Section 3.3: Exploratory Data Analysis (EDA) Techniques

Video: Exploratory Data Analysis (EDA) Techniques (8min)

Section 3.4: Automating Data Pipelines with Python

Video: Automating Data Pipelines with Python (31min)

04

Machine Learning Essentials

Section 4.1: Linear Regression and Logistic Regression

Video: Linear Regression and Logistic Regression (5min)
Course: Linear Regression (Ch.3) (1 hour)
Course: Classification (Ch.4) (1 hour)

Section 4.2: Decision Trees, Random Forests

Video: Decision Trees (10min)
Video: Random Forests (8min)
Course: Tree-Based Methods (Ch.8) (1 hour)

Section 4.3: Support Vector Machines (SVM)

Video: Support Vector Machines (SVM) (2min)
Course: Support Vector Machines (Ch.9) (1 hour)

Section 4.4: K-Nearest Neighbors (KNN)

Video: K-Nearest Neighbors (KNN) (2min)

Section 4.5: Clustering and Dimensionality Reduction

Section 4.6: Model Evaluation

Video: Model Evaluation (Metrics: Accuracy, Precision, Recall, AUC) (10min)

Supplemental: Machine Learning Specialization by Andrew Ng

Coursera Course: Machine Learning Specialization (Free to Audit) by Andrew Ng

05

Advanced Machine Learning

Section 5.1: Hyperparameter Tuning

Video: Hyperparameter Tuning (Grid Search, Random Search, Bayesian Optimization) (8min)

Section 5.2: Ensemble Learning

Video: Ensemble Learning (Bagging, Boosting, Stacking) (8min)

Section 5.3: Time Series Analysis

Video: Time Series Analysis (ARIMA) (16min)

Section 5.4: Advanced Clustering Techniques

Video: Advanced Clustering Techniques: K-Means Clustering (12min)

Section 5.5: Feature Selection and Engineering Techniques

Video: Feature Selection and Engineering Techniques (22min)

06

Mathematics for Machine Learning

Section 6.1: Linear Algebra

Video: Linear Algebra (Vectors, Matrices, Eigenvalues) (17min)

Section 6.2: Calculus for Optimization

Video: Calculus for Optimization (Gradients, Chain Rule) (21min)

Section 6.3: Probability and Statistics

Video: Probability and Statistics (Distributions, Bayes' Theorem) (16min)
More Videos: Probability and Statistics (5-10min each)

Section 6.4: Optimization Techniques

Video: Optimization Techniques (Gradient Descent, Regularization) (15min)

Supplemental: Mathematics for Machine Learning

07

Deep Learning and Neural Networks

Section 7.1: Basics of Neural Networks

Video: Basics of Neural Networks (Perceptrons, Activation Functions) (4min)

Section 7.2: Feedforward Neural Networks (FNNs) with TensorFlow/Keras

Video: Feedforward Neural Networks (FNNs) with TensorFlow/Keras (20min)

Section 7.3: Convolutional Neural Networks (CNNs) for Image Processing

Video: Convolutional Neural Networks (CNNs) for Image Processing (10min)

Section 7.4: Recurrent Neural Networks (RNNs) and LSTMs for Sequence Data

Video: Recurrent Neural Networks (RNNs) and LSTMs for Sequence Data (16min)

Section 7.5: Transformers and Attention Mechanisms (BERT, GPT Models)

Video: Transformers and Attention Mechanisms (BERT, GPT Models) (24min)

Section 7.6: Transfer Learning (Pretrained Models: ResNet, VGG, etc.)

Video: Transfer Learning (Pretrained Models: ResNet, VGG, etc.) (9min)

Section 7.7: Deep Learning

Course: Deep Learning (Ch.10) (1 hour)

08

Natural Language Processing (NLP)

Section 8.1: Text Preprocessing

Video: Text Preprocessing (Tokenization, Stemming, Lemmatization) (15min)

Section 8.2: Vectorization Techniques

Video: Vectorization Techniques (TF-IDF, Word2Vec, GloVe) (8min)

Section 8.3: Building NLP Models

Video: Building NLP Models (Sentiment Analysis, Text Classification) (more videos, about 5min each)

Section 8.4: Sequence Models and Transformers

Video: Sequence Models and Transformers (BERT, GPT) (9min)

09

Model Deployment and MLOps

Section 9.1: Building APIs with Flask and FastAPI

Video: Building APIs with Flask and FastAPI (16min)

Section 9.2: Model Deployment on Cloud Platforms

Video: Model Deployment on Cloud Platforms (AWS, GCP, Azure) (13min)

Section 9.3: MLOps Best Practices

Video: MLOps Best Practices (CI/CD, Docker, Monitoring) (12min)

Section 9.4: Managing Data and Model Drift

Video: Managing Data and Model Drift (15min)

10

Big Data and AI Strategy

Section 10.1: Big Data Technologies

Video: Big Data Technologies: Hadoop, Spark and Beyond (10min)

Section 10.2: Data Strategy

Video: Data Strategy: Building Scalable Systems (8min)

Section 10.3: AI in Business

Video: AI in Business: Developing AI Strategies (7min)

Section 10.4: Data Security and Privacy

Video: Data Security and Privacy in the Age of AI (5min)

11

Capstone Project

Step 1: Define a Business Problem and Select a Dataset

Define a business problem and select a relevant dataset for analysis.

Step 2: Perform EDA and Preprocessing

Conduct exploratory data analysis and preprocess the data for modeling.
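
As one possible starting point for this step, the sketch below shows two of the preprocessing ideas from Section 3.1 (missing data and outliers) using only the Python standard library. The column name `ages`, its values, and the helper names `impute_median` and `iqr_outlier_flags` are hypothetical and chosen for illustration; a real capstone would more likely use Pandas on the selected dataset, and median imputation with a 1.5 x IQR rule is only one of several reasonable choices.

```python
from statistics import median, quantiles


def impute_median(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]


def iqr_outlier_flags(values, k=1.5):
    """Return True for each value outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)  # quartile cut points
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v < low or v > high for v in values]


# Hypothetical column with one missing value and one extreme value.
ages = [23, 25, None, 27, 24, 26, 95]

filled = impute_median(ages)       # the None entry becomes 25.5
flags = iqr_outlier_flags(filled)  # only the value 95 is flagged

print(filled)
print(flags)
```

Whether a flagged value should be dropped, capped, or kept depends on the business problem defined in Step 1, so the flags are best treated as prompts for inspection rather than automatic deletions.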

Step 3: Build and Evaluate Machine Learning and Deep Learning Models

Build and evaluate machine learning and deep learning models.
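
For the evaluation part of this step, the following minimal sketch computes three of the metrics named in Section 4.6 (accuracy, precision, recall) for a binary classifier. The label lists `y_true` and `y_pred` are made-up illustration data, and the function name `classification_metrics` is hypothetical; in practice a library such as scikit-learn would typically be used instead of hand-written counts.

```python
def classification_metrics(y_true, y_pred):
    """Compute accuracy, precision, and recall for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall}


# Made-up ground-truth labels and model predictions for illustration.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Reporting precision and recall alongside accuracy matters most when the classes are imbalanced, since accuracy alone can look high for a model that rarely predicts the minority class.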

Step 4: Deploy the Model and Present Findings with a Dashboard

Deploy the model and present findings using a dashboard.