Designing tables

In HBase, when modeling the schema for any table, a designer should also keep in mind the following, among other things:

  • The number of column families and which data goes to which column family
  • The maximum number of columns in each column family
  • The type of data to be stored in the column
  • The number of historical values that need to be maintained for each column
  • The structure of a rowkey

Once we have answers, certain practices are followed to ensure optimal table design. Some of the design practices are as follows:

  • Data for a given column family goes into a single store on HDFS. This store might consist of multiple HFiles, which eventually get converted to a single HFile using compaction techniques.
  • Columns in a column family are also ...

Get HBase Essentials now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.