
Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark by Zubair Nabi

© Zubair Nabi 2016

Zubair Nabi, Pro Spark Streaming, 10.1007/978-1-4842-1479-4_1

1. The Hitchhiker’s Guide to Big Data

Zubair Nabi

(1) Lahore, Pakistan

Electronic supplementary material

The online version of this chapter (doi:10.1007/978-1-4842-1479-4_1) contains supplementary material, which is available to authorized users.

From a little spark may burst a flame.

—Dante

By the time you get to the end of this paragraph, you will have processed 1,700 bytes of data. This number will grow to 500,000 bytes by the end of this book. Taking that as the average size of a book and multiplying it by the total number of books in the world (according to a Google estimate, there were 130 million books in the world in 2010¹) gives 65 TB. That is a staggering ...
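The 65 TB figure can be checked with a few lines of arithmetic. The sketch below is only an illustration: the constant and function names are hypothetical, and it assumes decimal units (1 TB = 10^12 bytes), which is the reading that reproduces the number quoted above.

```python
# Figures quoted in the paragraph above.
BYTES_PER_BOOK = 500_000        # approximate size of this book, in bytes
BOOKS_IN_WORLD = 130_000_000    # Google's 2010 estimate of books in the world

# Assumption: decimal (SI) terabytes, i.e. 1 TB = 10**12 bytes.
BYTES_PER_TB = 10**12


def total_terabytes(bytes_per_book: int, books: int) -> float:
    """Return the combined size of all books, in decimal terabytes."""
    return bytes_per_book * books / BYTES_PER_TB


total_tb = total_terabytes(BYTES_PER_BOOK, BOOKS_IN_WORLD)
print(f"{total_tb:.0f} TB")
```

Under the binary convention (1 TiB = 2^40 bytes) the same byte count comes to roughly 59 TiB, so the quoted figure is presumably a decimal one.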
