Introduction to RTextTools

RTextTools is an R package for automatic text classification via supervised learning. It is designed to make it simple for novice users to get started with machine learning, while allowing experienced users to easily experiment with different settings and algorithm combinations. The package includes nine algorithms for ensemble classification (svm, slda, boosting, bagging, random forests, glmnet, decision trees, neural networks, and maximum entropy), comprehensive analytics, and thorough documentation. Here, we use the New York Times article data (NYTimes) that ships with the package as an example. First, let's look at the data:

> library(RTextTools)
> data(NYTimes)
> set.seed(123) # guarantees the same ...
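To give a feel for how the pieces named above fit together, the following is a minimal sketch of one possible workflow with the package, not necessarily the steps this section goes on to use. The subset size of 200, the 150/50 train/test split, the choice of the Title and Subject columns as text, and the choice of SVM as the single algorithm are all assumptions made for illustration; the variable names idx, nyt, dtm, container, model, and results are hypothetical.

```r
library(RTextTools)

data(NYTimes)
set.seed(123)  # fix the random number generator so the sample below is repeatable

# Assumption: work on a small random subset so the sketch runs quickly.
idx <- sample(seq_len(nrow(NYTimes)), size = 200)
nyt <- NYTimes[idx, ]

# Build a document-term matrix from two text columns (assumed: Title and Subject).
dtm <- create_matrix(cbind(nyt["Title"], nyt["Subject"]),
                     language = "english",
                     removeNumbers = TRUE,
                     stemWords = FALSE)

# Pair the matrix with the labels; rows 1-150 train, rows 151-200 test (assumed split).
container <- create_container(dtm, nyt$Topic.Code,
                              trainSize = 1:150,
                              testSize = 151:200,
                              virgin = FALSE)

# Train one of the nine algorithms (SVM chosen arbitrarily) and classify the test rows.
model <- train_model(container, "SVM")
results <- classify_model(container, model)

head(results)  # predicted label and probability for the first few test documents
```

Swapping "SVM" for another algorithm name accepted by train_model is the simplest way to start experimenting with different settings.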
