Summary

In this chapter, we studied how to solve the clustering problem for large-scale graphs. To do this, we introduced the Power Iteration Clustering method, and showed how to apply it to the clustering of songs using social tags. Using the song clustering example, we also reviewed the main graph building and processing techniques that we learned throughout this book. You should now be well-acquainted with using Spark's graph processing power to solve more interesting problems.

Get Apache Spark Graph Processing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.