This course introduces clustering, a common technique used widely in unsupervised machine learning. The course begins by defining what clustering means through graphical explanations, and describes the common applications of clustering. Next, it explores k-means clustering in detail, including the concepts of distance functions and k-modes; illustrates hierarchical clustering through visual examples of dendrograms, and discusses different types of clustering algorithms. The course ends with a comparison of the performance of different algorithms. An understanding of basic algebra is required and some knowledge of linear algebra will be helpful.
Angie Ma, Gary Willis, and Alessandra Stagliano are data scientists with ASI Data Science, a London based AI/machine learning solutions firm. Angie co-founded ASI and is also the founder of Data Science Lab London, one of the biggest communities of data scientists and data engineers in Europe, with over 2,500 members. Angie holds a PhD in physics from London's University College, Gary Willis holds a PhD in statistical physics from London's Imperial College, and Alessandra Stagliano holds a PhD in computer science from the University of Genoa. Collectively, the group has worked on over 150 commercial AI/machine learning projects.