In the PCA section, there may be a typo: the text says the first two principal components account for almost 25% of the variation in the entire dataset, but it looks like it is actually the first three.
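
One way to double-check a claim like this is to look at the cumulative explained-variance ratios and count how many components it takes to reach the stated share. Here is a minimal sketch using NumPy. The function names, the `threshold` parameter, and the toy matrix are my own illustrations, not taken from the article or its dataset.

```python
import numpy as np

def cumulative_explained_variance(X):
    """Cumulative fraction of total variance explained by the principal components."""
    # Center each feature; PCA is defined on mean-centered data.
    Xc = X - X.mean(axis=0)
    # Singular values of the centered data, sorted in descending order.
    s = np.linalg.svd(Xc, compute_uv=False)
    # Squared singular values are proportional to the variance per component.
    var = s ** 2
    return np.cumsum(var / var.sum())

def components_needed(X, threshold):
    """Smallest number of leading components whose cumulative share reaches threshold."""
    cum = cumulative_explained_variance(X)
    # First index where the cumulative share is >= threshold, as a 1-based count.
    k = int(np.searchsorted(cum, threshold)) + 1
    # Guard against floating-point rounding when threshold is close to 1.0.
    return min(k, len(cum))

# Hypothetical toy data, for illustration only (not the article's dataset).
X_toy = np.array([
    [ 2.0,  0.0,  0.0],
    [-2.0,  0.0,  0.0],
    [ 0.0,  1.0,  0.0],
    [ 0.0, -1.0,  0.0],
    [ 0.0,  0.0,  1.0],
    [ 0.0,  0.0, -1.0],
])
print(cumulative_explained_variance(X_toy))
print(components_needed(X_toy, 0.25))
```

Running `components_needed` on the article's feature matrix with `threshold=0.25` would show whether two or three components are needed to reach roughly 25%.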

Data scientist & computer science PhD student. I write about my fun projects, in addition to how-to guides that help you get data for your own fun projects!
