This R package is about Automatic Text Classification via Supervised Learning. It is a machine learning package for automatic text classification that makes it simple for novice users to get started with machine learning, while allowing experienced users to easily experiment with different settings and algorithm combinations. The package includes nine algorithms for ensemble classification (svm, slda, boosting, bagging, random forests, glmnet, decision trees, neural networks, and maximum entropy), comprehensive analytics, and thorough documentation. Here, we use the New York Times Times article as an example. First, let's look at the data:
>library(RTextTools) >data(NYTimes) >set.seed(123) # guarantees the same ...