Naïve Bayes and text mining

The multinomial Naïve Bayes classifier is particularly well suited to text mining. It is effective at classifying the following kinds of documents:

  • Spam e-mail
  • Business news stories
  • Movie reviews
  • Technical papers as per field of expertise

This third use case consists of predicting the direction of a stock given the financial news. There are two types of news that affect the stock of a particular company:

  • Macro trends: Economic or social news such as conflicts, economic trends, or labor market statistics
  • Micro updates: Financial or market news related to a specific company such as earnings, change in ownership, or press releases
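As a rough illustration of the idea, the following minimal sketch trains a multinomial Naïve Bayes classifier with add-one (Laplace) smoothing on a handful of tokenized headlines and predicts a direction label. It is written in plain Python for brevity and is not the book's implementation. The function names `train` and `classify`, the labels `"up"` and `"down"`, and the toy headlines are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict


def train(docs):
    """Fit a multinomial Naive Bayes model.

    docs is a list of (tokens, label) pairs. Returns the log-priors,
    the per-class term counts, and the vocabulary.
    """
    class_docs = Counter()               # number of documents per class
    term_counts = defaultdict(Counter)   # term frequencies per class
    vocab = set()
    for tokens, label in docs:
        class_docs[label] += 1
        term_counts[label].update(tokens)
        vocab.update(tokens)
    total = sum(class_docs.values())
    log_prior = {c: math.log(n / total) for c, n in class_docs.items()}
    return log_prior, term_counts, vocab


def classify(tokens, model):
    """Return the class with the highest log-posterior score."""
    log_prior, term_counts, vocab = model
    scores = {}
    for c, prior in log_prior.items():
        # Add-one smoothing: every vocabulary term gets one extra count.
        denom = sum(term_counts[c].values()) + len(vocab)
        score = prior
        for t in tokens:
            if t in vocab:  # terms never seen in training are ignored
                score += math.log((term_counts[c][t] + 1) / denom)
        scores[c] = score
    return max(scores, key=scores.get)


# Hypothetical toy data: tokenized headlines labeled with a direction.
training = [
    ("earnings beat forecast".split(), "up"),
    ("record profit growth".split(), "up"),
    ("earnings miss forecast".split(), "down"),
    ("profit warning layoffs".split(), "down"),
]
model = train(training)
prediction = classify("profit growth beat".split(), model)  # "up"
```

Working in log space avoids numerical underflow when many small term probabilities are multiplied, and the smoothing keeps a term that is absent from one class from driving that class's probability to zero.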

Macroeconomic news related to a specific company have the potential to affect the sentiments ...
