
Expires in:
Expires in:
Expires in:



Self-paced course
Self-paced course
Data Science in Python: Unsupervised Learning
Data Science in Python: Unsupervised Learning
Rating 4.7
40 reviews
40 reviews
40 reviews
Course Description
This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.
We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.
From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.
We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.
Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.
Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.
Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.
If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.
Course Description
This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.
We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.
From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.
We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.
Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.
Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.
Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.
If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.
Course Content
42.0 video hours
Skills you'll learn in this course
Prepare datasets & engineer features for clustering & anomaly detection
Segment customers with k‑means, hierarchical & DBSCAN algorithms
Reduce dimensions using PCA & t‑SNE for clearer visual storytelling
Translate unsupervised patterns into actionable business strategies
Meet your instructors


Alice Zhao
Lead Data Science Instructor
Alice Zhao is a seasoned data scientist and author of the book, SQL Pocket Guide, 4th Edition (O'Reilly). She has taught numerous courses in Python, SQL, and R as a data science instructor at Maven Analytics and Metis, and as a co-founder of Best Fit Analytics.
Student reviews
I didn’t think Maven's courses could get any better, but this one has been the most enjoyable by far. I am completely hooked on UL! Alice is an outstanding tutor, and her enthusiasm is absolutely contagious. I am already applying what I have learned to my day-to-day work, and the insights I am now able to provide are truly game-changing. I can’t wait to dive into more courses!
Ewa
I recently completed the "Data Science in Python: Unsupervised Learning" course, and I am thoroughly impressed. Alice Zhao, the instructor, did a fantastic job guiding us through the complexities of unsupervised learning. Her teaching style is engaging and clear, making even the most challenging concepts accessible. One of the standout features of this course was the organisation of the project solution videos. Instead of one long solution video, Alice split them into manageable sections, which made it much easier to check my project to see if I was on the right track. This approach enhanced my learning experience to a significant extent and provided a convenient means for checking things before proceeding in the projects. I particularly enjoyed the projects included in the course. They were well-designed and relevant, providing a practical experience that reinforced the theoretical knowledge. By the end of the course, I found that I understood unsupervised learning concepts much more deeply than I ever did with classification. Overall, I highly recommend this course to anyone interested in data science and looking to deepen their understanding of unsupervised learning. Alice Zhao's expertise and the course's structure make it a valuable learning experience.
Andrew Hubbard
Included learning paths
Course credential
You’ll earn the course certification by completing this course and passing the assessment requirements

Data Science in Python: Unsupervised Learning

Data Science in Python: Unsupervised Learning
CPE Accreditation

CPE Credits:
0
Field of Study:
Information Technology
Delivery Method:
QAS Self Study
Maven Analytics LLC is registered with the National Association of State Boards of Accountancy (NASBA) as a sponsor of continuing professional education on the National Registry of CPE Sponsors. State boards of accountancy have the final authority on the acceptance of individual courses for CPE credit. Complaints regarding registered sponsors may be submitted to the National Registry of CPE Sponsors through its website: www.nasbaregistry.org.
For more information regarding administrative policies such as complaints or refunds, please contact us at admin@mavenanalytics.io or (857) 256-1765.
*Last Updated: May 25, 2023
Course Description
This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.
We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.
From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.
We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.
Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.
Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.
Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.
If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.
Course Description
This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.
We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.
From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.
We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.
Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.
Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.
Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.
If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.
Curriculum
1
Orientation & Benchmark Assessment
1
Orientation & Benchmark Assessment
1
Orientation & Benchmark Assessment
2
Getting Started
2
Getting Started
2
Getting Started
3
Intro to Data Science
6 MIN
6 MIN
3
Intro to Data Science
6 MIN
6 MIN
3
Intro to Data Science
6 MIN
6 MIN
4
Unsupervised Learning 101
4
Unsupervised Learning 101
4
Unsupervised Learning 101
5
Pre-Modeling Data Prep
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
5
Pre-Modeling Data Prep
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
5
Pre-Modeling Data Prep
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6
Clustering
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6
Clustering
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6
Clustering
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
7
PROJECT: Clustering Clients
7
PROJECT: Clustering Clients
7
PROJECT: Clustering Clients
8
Anomaly Detection
6 MIN
6 MIN
6 MIN
8
Anomaly Detection
6 MIN
6 MIN
6 MIN
8
Anomaly Detection
6 MIN
6 MIN
6 MIN
9
Dimensionality Reduction
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
9
Dimensionality Reduction
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
9
Dimensionality Reduction
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
10
Recommenders
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
10
Recommenders
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
10
Recommenders
6 MIN
6 MIN
6 MIN
6 MIN
6 MIN
11
PROJECT: Recommending Restaurants
11
PROJECT: Recommending Restaurants
11
PROJECT: Recommending Restaurants
12
Unsupervised Learning Review
12
Unsupervised Learning Review
12
Unsupervised Learning Review
13
Final Project
13
Final Project
13
Final Project
14
Final Assessment
6 MIN
14
Final Assessment
6 MIN
14
Final Assessment
6 MIN
15
Course Feedback & Next Steps
15
Course Feedback & Next Steps
15
Course Feedback & Next Steps
Meet your instructors

Alice Zhao
Lead Data Science Instructor
Alice Zhao is a seasoned data scientist and author of the book, SQL Pocket Guide, 4th Edition (O'Reilly). She has taught numerous courses in Python, SQL, and R as a data science instructor at Maven Analytics and Metis, and as a co-founder of Best Fit Analytics.
Student reviews
I didn’t think Maven's courses could get any better, but this one has been the most enjoyable by far. I am completely hooked on UL! Alice is an outstanding tutor, and her enthusiasm is absolutely contagious. I am already applying what I have learned to my day-to-day work, and the insights I am now able to provide are truly game-changing. I can’t wait to dive into more courses!

Ewa
I recently completed the "Data Science in Python: Unsupervised Learning" course, and I am thoroughly impressed. Alice Zhao, the instructor, did a fantastic job guiding us through the complexities of unsupervised learning. Her teaching style is engaging and clear, making even the most challenging concepts accessible. One of the standout features of this course was the organisation of the project solution videos. Instead of one long solution video, Alice split them into manageable sections, which made it much easier to check my project to see if I was on the right track. This approach enhanced my learning experience to a significant extent and provided a convenient means for checking things before proceeding in the projects. I particularly enjoyed the projects included in the course. They were well-designed and relevant, providing a practical experience that reinforced the theoretical knowledge. By the end of the course, I found that I understood unsupervised learning concepts much more deeply than I ever did with classification. Overall, I highly recommend this course to anyone interested in data science and looking to deepen their understanding of unsupervised learning. Alice Zhao's expertise and the course's structure make it a valuable learning experience.

Andrew Hubbard
Included learning paths
Course credential
You’ll earn the course certification by completing this course and passing the assessment requirements

Data Science in Python: Unsupervised Learning

Data Science in Python: Unsupervised Learning
CPE Accreditation

CPE Credits:
0
Field of Study:
Information Technology
Delivery Method:
QAS Self Study
Maven Analytics LLC is registered with the National Association of State Boards of Accountancy (NASBA) as a sponsor of continuing professional education on the National Registry of CPE Sponsors. State boards of accountancy have the final authority on the acceptance of individual courses for CPE credit. Complaints regarding registered sponsors may be submitted to the National Registry of CPE Sponsors through its website: www.nasbaregistry.org.
For more information regarding administrative policies such as complaints or refunds, please contact us at admin@mavenanalytics.io or (857) 256-1765.
*Last Updated: May 25, 2023
More courses you may like


