Chapter 10

The Hadoop Foundation and Ecosystem

In This Chapter

Why the Hadoop ecosystem is foundational for big data

Managing resources and applications with Hadoop YARN

Storing big data with HBase

Mining big data with Hive

Interacting with the Hadoop ecosystem

As Chapter 9 explains, Hadoop MapReduce and the Hadoop Distributed File System (HDFS) are powerful technologies designed to address big data challenges. That’s the good news. The bad news is that you really need to be a programmer or data scientist to get the most out of these elemental components. Enter the Hadoop ecosystem. For several years now, and likely for the foreseeable future, open source and commercial developers all over the world have been building and testing tools to increase the adoption and usability of Hadoop. Many work on pieces of the ecosystem and contribute their enhancements back to the Apache project. This constant flow of fixes and improvements helps to drive the entire ecosystem forward in a ...

