Self-paced course

Self-paced course

Data Science in Python: Unsupervised Learning

Data Science in Python: Unsupervised Learning

Rating 4.7

40 reviews

40 reviews

40 reviews

Course Description

This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.

We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.

From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.

We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.

Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.

Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.

Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.

If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.

Course Description

This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.

We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.

From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.

We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.

Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.

Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.

Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.

If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.

Course Content

42.0 video hours

Skills you'll learn in this course

Prepare datasets & engineer features for clustering & anomaly detection

Segment customers with k‑means, hierarchical & DBSCAN algorithms

Reduce dimensions using PCA & t‑SNE for clearer visual storytelling

Translate unsupervised patterns into actionable business strategies

Meet your instructors

Alice Zhao

Lead Data Science Instructor

Alice Zhao is a seasoned data scientist and author of the book, SQL Pocket Guide, 4th Edition (O'Reilly). She has taught numerous courses in Python, SQL, and R as a data science instructor at Maven Analytics and Metis, and as a co-founder of Best Fit Analytics.

Student reviews

I didn’t think Maven's courses could get any better, but this one has been the most enjoyable by far. I am completely hooked on UL! Alice is an outstanding tutor, and her enthusiasm is absolutely contagious. I am already applying what I have learned to my day-to-day work, and the insights I am now able to provide are truly game-changing. I can’t wait to dive into more courses!

Ewa

I recently completed the "Data Science in Python: Unsupervised Learning" course, and I am thoroughly impressed. Alice Zhao, the instructor, did a fantastic job guiding us through the complexities of unsupervised learning. Her teaching style is engaging and clear, making even the most challenging concepts accessible. One of the standout features of this course was the organisation of the project solution videos. Instead of one long solution video, Alice split them into manageable sections, which made it much easier to check my project to see if I was on the right track. This approach enhanced my learning experience to a significant extent and provided a convenient means for checking things before proceeding in the projects. I particularly enjoyed the projects included in the course. They were well-designed and relevant, providing a practical experience that reinforced the theoretical knowledge. By the end of the course, I found that I understood unsupervised learning concepts much more deeply than I ever did with classification. Overall, I highly recommend this course to anyone interested in data science and looking to deepen their understanding of unsupervised learning. Alice Zhao's expertise and the course's structure make it a valuable learning experience.

Andrew Hubbard

Included learning paths

Course credential

You’ll earn the course certification by completing this course and passing the assessment requirements

Data Science in Python: Unsupervised Learning

Data Science in Python: Unsupervised Learning

CPE Accreditation

CPE Credits:

0

Field of Study:

Information Technology

Delivery Method:

QAS Self Study

Maven Analytics LLC is registered with the National Association of State Boards of Accountancy (NASBA) as a sponsor of continuing professional education on the National Registry of CPE Sponsors. State boards of accountancy have the final authority on the acceptance of individual courses for CPE credit. Complaints regarding registered sponsors may be submitted to the National Registry of CPE Sponsors through its website: www.nasbaregistry.org.

For more information regarding administrative policies such as complaints or refunds, please contact us at admin@mavenanalytics.io or (857) 256-1765.

*Last Updated: May 25, 2023

Course Description

This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.

We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.

From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.

We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.

Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.

Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.

Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.

If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.

Course Description

This is a hands-on, project-based course designed to help you master the foundations for unsupervised learning in Python.

We’ll start by reviewing the data science workflow, discussing the techniques & applications of unsupervised learning, and walking through the data prep steps required for modeling. You’ll learn how to set the correct row granularity for modeling, apply feature engineering techniques, select relevant features, and scale your data using normalization and standardization.

From there we’ll fit, tune, and interpret 3 popular clustering models using scikit-learn. We’ll start with K-Means Clustering, learn to interpret the output’s cluster centers, and use inertia plots to select the right number of clusters. Next, we’ll cover Hierarchical Clustering, where we’ll use dendrograms to identify clusters and cluster maps to interpret them. Finally, we’ll use DBSCAN to detect clusters and noise points and evaluate the models using their silhouette score.

We’ll also use DBSCAN and Isolation Forests for anomaly detection, a common application of unsupervised learning models for identifying outliers and anomalous patterns. You’ll learn to tune and interpret the results of each model and visualize the anomalies using pair plots.

Next, we’ll introduce the concept of dimensionality reduction, discuss its benefits for data science, and explore the stages in the data science workflow in which it can be applied. We’ll then cover two popular techniques: Principal Component Analysis, which is great for both feature extraction and data visualization, and t-SNE, which is ideal for data visualization.

Last but not least, we’ll introduce recommendation engines, and you’ll practice creating both content-based and collaborative filtering recommenders using techniques such as Cosine Similarity and Singular Value Decomposition.

Throughout the course you’ll play the role of an Associate Data Scientist for the HR Analytics team at a software company trying to increase employee retention. Using the skills you learn throughout the course, you’ll use Python to segment the employees, visualize the clusters, and recommend next steps to increase retention.

If you’re an aspiring or seasoned data scientist looking for a practical overview of unsupervised learning techniques in Python with a focus on interpretation, this is the course for you.

Curriculum

1

Orientation & Benchmark Assessment

1

Orientation & Benchmark Assessment

1

Orientation & Benchmark Assessment

14

Final Assessment

14

Final Assessment

14

Final Assessment

15

Course Feedback & Next Steps

15

Course Feedback & Next Steps

15

Course Feedback & Next Steps

Meet your instructors

Alice Zhao

Lead Data Science Instructor

Alice Zhao is a seasoned data scientist and author of the book, SQL Pocket Guide, 4th Edition (O'Reilly). She has taught numerous courses in Python, SQL, and R as a data science instructor at Maven Analytics and Metis, and as a co-founder of Best Fit Analytics.

Student reviews

I didn’t think Maven's courses could get any better, but this one has been the most enjoyable by far. I am completely hooked on UL! Alice is an outstanding tutor, and her enthusiasm is absolutely contagious. I am already applying what I have learned to my day-to-day work, and the insights I am now able to provide are truly game-changing. I can’t wait to dive into more courses!

Ewa

I recently completed the "Data Science in Python: Unsupervised Learning" course, and I am thoroughly impressed. Alice Zhao, the instructor, did a fantastic job guiding us through the complexities of unsupervised learning. Her teaching style is engaging and clear, making even the most challenging concepts accessible. One of the standout features of this course was the organisation of the project solution videos. Instead of one long solution video, Alice split them into manageable sections, which made it much easier to check my project to see if I was on the right track. This approach enhanced my learning experience to a significant extent and provided a convenient means for checking things before proceeding in the projects. I particularly enjoyed the projects included in the course. They were well-designed and relevant, providing a practical experience that reinforced the theoretical knowledge. By the end of the course, I found that I understood unsupervised learning concepts much more deeply than I ever did with classification. Overall, I highly recommend this course to anyone interested in data science and looking to deepen their understanding of unsupervised learning. Alice Zhao's expertise and the course's structure make it a valuable learning experience.

Andrew Hubbard

Included learning paths

Course credential

You’ll earn the course certification by completing this course and passing the assessment requirements

Data Science in Python: Unsupervised Learning

Data Science in Python: Unsupervised Learning

CPE Accreditation

CPE Credits:

0

Field of Study:

Information Technology

Delivery Method:

QAS Self Study

Maven Analytics LLC is registered with the National Association of State Boards of Accountancy (NASBA) as a sponsor of continuing professional education on the National Registry of CPE Sponsors. State boards of accountancy have the final authority on the acceptance of individual courses for CPE credit. Complaints regarding registered sponsors may be submitted to the National Registry of CPE Sponsors through its website: www.nasbaregistry.org.

For more information regarding administrative policies such as complaints or refunds, please contact us at admin@mavenanalytics.io or (857) 256-1765.

*Last Updated: May 25, 2023

More courses you may like

READY TO GET STARTED

Sign Up Today and Start Learning For Free

READY TO GET STARTED

Sign Up Today and Start Learning For Free

READY TO GET STARTED

Sign Up Today and Start Learning For Free