Chapter 4. The Modern Data Platform

In the long history of humankind (and animal kind, too) those who learned to collaborate and improvise most effectively have prevailed.

—William Arthur Ward

This chapter looks at key frameworks that help define the form and functionality of the modern data platform.

Designing a Hadoop Cluster

A Hadoop cluster is a high-performance supercomputing platform. It must be designed from the ground up with the following key considerations in mind:

Select a configuration from recommended hardware vendor lists. Determine which components of the hardware recommendation are not necessary, such as ...
