The Write pipeline

Write pipeline in HBase is carried out by the following steps:

  1. Client requests data to be written in HTable, the request comes to a RegionServer.
  2. The RegionServer writes the data first in WAL.
  3. The RegionServer identifies the Region which will store the data and the data will be saved in MemStore of that Region.
  4. MemStore holds the data in memory and does not persist it. When the threshold value reaches in the MemStore, then the data is flushed as a HFile in that region.

Get Hadoop Essentials now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.