Other interesting projects

Whether you use a bundled distribution or stick with the base Apache Hadoop download, you will encounter many references to other related projects. We've covered several of these, such as Hive, Samza, and Crunch, in this book; we'll now highlight some of the others.

Note that this coverage points out the highlights (from the authors' perspective) and gives a taste of the breadth of project types available. As mentioned earlier, keep watching this space, as new projects launch all the time.

HBase

Perhaps the most popular Apache Hadoop-related project that we didn't cover in this book is HBase (http://hbase.apache.org). Based on the BigTable model of data storage publicized by Google in an academic paper ...
