Machine Learning in Python

Sharpening your predictive modelling skills to differentiate you as a data scientist instead of a data analyst; covering regressions, classifications, and clustering.








Course description

Machine Learning in Python builds upon the statistical knowledge you have gained earlier in the program. This course focuses on predictive modelling and enters multidimensional spaces which require an understanding of mathematical methods, transformations, and distributions. The course introduces these concepts as well as complex means of analysis such as clustering, factoring, Bayesian inference, and decision theory while also allowing you to exercise your Python programming skills.

Linear Regression

In this part of the course, we will discuss what the course covers, why you need to learn advanced statistics, what’s the differences are with machine learning, and how to get the most out of this training. In this section, you will also expand on what you learned in our statistics training with additional concepts and will apply all the theory in Python. This section serves two purposes: 1) a useful refresher of regression, and 2) a great way to reinforce what you have learned, applying it in practice while coding.


Linear Regression Practical Example

An all in one practical example, which will test your understanding of each of the concepts that we have discussed so far. We will focus on a used cars dataset and create a linear regression model to predict the prices of cars. At the end you will have a big assignment where you can dive deep into the optimization of a machine learning model.


Logistic Regression

Data scientists use logistic regressions when the dependent variable is binary (0 and 1, true and false, etc.). This type of data is encountered on a daily basis when working as a data scientist, and here, you will learn how to build a logistic regression, understand tables, interpret the coefficients of a logistic regression, calculate the accuracy of the model, as well as how to test. We will introduce under and overfitting, and will teach you how to test your models.


Cluster Analysis (Basics and Prerequisites)

Cluster analysis is the most intuitive and important example of unsupervised learning. However, to be able to understand cluster analysis, we must first explore the mathematics behind it.


K-Means Clustering

In this section, you will learn how to do Cluster analysis. Cluster analysis consists in dividing your data into separate groups based on an algorithm. Clustering is an amazing technique often employed in data science. But what’s more, it makes much more sense to study patterns observed in a particular group rather than trying to find patterns in the entire dataset. We will provide several practical examples that will help you understand how to carry out cluster analysis and the difference between classification and clustering.


Other Types of Clustering

In previous sections, we focus extensively on k-means clustering, as it is the fastest and most efficient method for clustering. In this section, we explore other approaches that are less common.