Chapter 9. Performing Advanced Tasks on HBase

In this chapter, we will consider some advanced topics where Hbase is used extensively in the industry. We will walk through the following topics:

  • Machine learning using Hbase
  • Real-time data analysis using Hbase and Mahout
  • Full text indexing using Hbase

Machine learning using Hbase

Before we dive deep into the details of Hbase/Hadoop, Mahout, and machine learning, it's vital to discuss and highlight some important concepts, which will be used in this chapter.

Data science—in software engineering terms—is an operation of a set of programs that churns a large quantity of data to evaluate supervised or unsupervised learning models and provides a valuable tool to data scientists or systems through which decisions ...

Get HBase High Performance Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.