Apache Mahout

Apache Mahout is a Machine learning library that comes packaged with Apache Hadoop and forms an important part of the Hadoop ecosystem.

Mahout came into existence in 2008 as a subproject of Apache Lucene (an open source search engine). Lucene is an API that has an implementation of search, text mining, and information-retrieval techniques. Most of these search and text analytics internally apply Machine learning techniques. The recommendation engines that were built for the search engines started off under a new subproject called Mahout. Mahout means the rider of an elephant, signifying the running of Machine learning algorithms over Hadoop. It is a scalable Machine learning implementation that can run in a standalone mode (does not ...

Get Practical Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.