CHAPTER 9

Graph Processing with Spark

Data is generally stored and processed as a collection of records or rows, typically represented as a two-dimensional table divided into rows and columns. However, collections and tables are not the only way to represent data. Sometimes a graph represents the data better than a collection does.
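As a minimal sketch of that contrast, the following Python snippet holds the same information first as a collection of rows and then as a graph. The page names, the `link_rows` data, and the helper functions `build_adjacency` and `reachable` are all hypothetical and chosen only for illustration; this is plain Python, not Spark code.

```python
from collections import defaultdict

# Hypothetical link records stored as rows: (source_page, target_page).
link_rows = [
    ("home", "about"),
    ("home", "blog"),
    ("blog", "about"),
    ("about", "home"),
]


def build_adjacency(rows):
    """Group edge rows into a mapping from each vertex to its out-neighbors."""
    adjacency = defaultdict(list)
    for source, target in rows:
        adjacency[source].append(target)
        adjacency[target]  # touch the key so vertices without out-edges still appear
    return dict(adjacency)


def reachable(adjacency, start):
    """Return the set of vertices reachable from start by following edges."""
    seen = {start}
    stack = [start]
    while stack:
        vertex = stack.pop()
        for neighbor in adjacency.get(vertex, []):
            if neighbor not in seen:
                seen.add(neighbor)
                stack.append(neighbor)
    return seen


graph = build_adjacency(link_rows)
```

With the row form, a question such as "which pages can be reached from `blog`?" requires repeatedly scanning or joining the rows. With the graph form, `reachable(graph, "blog")` answers it by following edges directly.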

Graphs are ubiquitous. For example, the Internet is a large graph of interconnected computers, routers, and switches. The World Wide Web is also a large graph, formed by web pages connected by hypertext links. Social networks on sites such as Facebook, LinkedIn, and Twitter ...

Source: Big Data Analytics with Spark: A Practitioner’s Guide to Using Spark for Large-Scale Data Processing, Machine Learning, and Graph Analytics, and High-Velocity Data Stream Processing.
