Understanding Big Data Using Hadoop and Spark
Massive amounts of data are being generated everyday, everywhere. As a result, a number of organizations are focusing on big data processing. In this course we’ll help you understand how Hadoop, as an ecosystem, helps us store, process, and analyze data. We will then smoothly move to developing large-scale distributed data processing applications using Apache Spark 2.
Prerequisites: Data scientists or big data architects interested in combining the data processing power of Hadoop and Apache Spark should be having prior knowledge of these technologies.
Resources: Code downloads and errata:
This path navigates across the following products (in sequential order):
Learning Hadoop 2 (1h 30m)
Apache Spark 2 for Beginners (5h 38m)