O'Reilly logo

Natural Language Processing with Java and LingPipe Cookbook by Krishna Dayanidhi, Breck Baldwin

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Chapter 1. Simple Classifiers

In this chapter, we will cover the following recipes:

  • Deserializing and running a classifier
  • Getting confidence estimates from a classifier
  • Getting data from the Twitter API
  • Applying a classifier to a .csv file
  • Evaluation of classifiers – the confusion matrix
  • Training your own language model classifier
  • How to train and evaluate with cross validation
  • Viewing error categories – false positives
  • Understanding precision and recall
  • How to serialize a LingPipe object – classifier example
  • Eliminate near duplicates with the Jaccard distance
  • How to classify sentiment – simple version

Introduction

This chapter introduces the LingPipe toolkit in the context of its competition and then dives straight into text classifiers. Text classifiers ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required