GridSearch

We will now run GridSearch for our selected parameters. Here, we are choosing to include bigrams and trigrams while running GridSearch over the C parameter of LogisticRegression.

Our intention here is to automate as much as possible. Instead of trying varying values in C during our RandomizedSearch, we are trading off human learning time (a few hours) with compute time (a few extra minutes). This mindset saves us both time and effort.

from sklearn.model_selection import GridSearchCVparam_grid = dict(clf__C=[85, 100, 125, 150])grid_search = GridSearchCV(lr_clf, param_grid=param_grid, scoring='accuracy', n_jobs=-1, cv=3)grid_search.fit(X_train, y_train)grid_search.best_estimator_.steps

In the preceding lines of code, we have ran ...

Get Natural Language Processing with Python Quick Start Guide now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.