CHAPTER 16

image

Hadoop in the Cloud

Hadoop requires commodity cluster hardware to operate. One solution is to design a cluster, procure hardware, select a distribution, install Hadoop, and administer the cluster in-house. Some vendors deliver a completely configured cluster based on customer specifications, but the jobs of administration, maintenance, and upgrading remain. Installing and maintaining a cluster can be an effective solution, but it requires significant initial investment; an experienced administration and maintenance staff; and data archival, backup, and restore facilities.

An alternative is to use a cloud solution. Numerous vendors ...

Get Pro Apache Hadoop, Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.