Recommendations for CDH cluster configuration

The hardware specification might vary according to the amount of data to be stored and the type of processing power required. It is recommended to use the following configurations:

  • 1 to 4 TB hard disks
  • Two (8 to 24 core) processors, running at least 2 to 2.5 GHz
  • 64 to 512 GB of memory
  • Bonded Gigabit Ethernet or 10 Gigabit Ethernets

Now, let's explain these hardware components in more detail:

  • CPU: The workload depends on this hardware component. It is recommended that we have a medium-clock-speed CPU with two slots for DataNodes. Why medium? This is because the high-end processor cost of a setup rises quickly, so we can have a comparatively cheaper CPU with more machines than use fewer machines with high-end ...

Get Learning HBase now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.