
Learning Apache Mahout by Chandramani Tiwary

Chapter 5. Frequent Pattern Mining and Topic Modeling

In this chapter, we discuss two important application areas of machine learning: frequent pattern mining and topic modeling. Frequent pattern mining identifies sets of items that frequently occur together across transactions. The technique is widely used in market basket analysis and in the upselling and cross-selling of products. Many algorithms exist for mining frequent patterns from databases, including Apriori, Tree Projection, and FP-Growth; we will restrict our discussion to FP-Growth, which is implemented in Mahout. Topic modeling represents the documents under consideration as a set of topics. Each topic is a bag of words, and those words can be used to label the topic. We will also discuss the Mahout ...
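To make the idea of a "frequent pattern" concrete, here is a minimal Python sketch that counts how many transactions contain each item and each pair of items, then keeps those that meet a minimum support count. It is a brute-force illustration, not FP-Growth and not Mahout's implementation; the function name `frequent_itemsets`, the `min_support` and `max_size` parameters, and the basket data are all assumptions invented for this example.

```python
from collections import Counter
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_size=2):
    """Return {itemset: count} for itemsets found in at least min_support transactions.

    Brute-force enumeration for illustration only; FP-Growth avoids
    generating every candidate combination like this.
    """
    counts = Counter()
    for basket in transactions:
        items = sorted(set(basket))  # de-duplicate and fix a canonical order
        for size in range(1, max_size + 1):
            for combo in combinations(items, size):
                counts[combo] += 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Hypothetical market-basket data, invented for this sketch.
baskets = [
    ["bread", "milk"],
    ["bread", "diapers", "beer", "eggs"],
    ["milk", "diapers", "beer", "cola"],
    ["bread", "milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "cola"],
]

result = frequent_itemsets(baskets, min_support=3)
for itemset, count in sorted(result.items()):
    print(itemset, count)
```

With a minimum support of 3, the pair `("beer", "diapers")` is kept because it appears in three of the five baskets, while `("beer", "bread")` appears in only two and is dropped. The cost of enumerating every combination grows quickly with basket size, which is the kind of overhead that tree-based methods such as FP-Growth are designed to reduce.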
