Conditional random fields (CRF) are an extension of the Logistic regression recipe in Chapter 3, Advanced Classifiers, but are applied to word tagging. At the end of Chapter 1, Simple Classifiers, we discussed various ways to encode a problem into a classification problem. CRFs treat the sequence tagging problem as finding the best category where each category (C) is one of the C*T tag (T) assignments to tokens.
For example, if we have the tokens
rain and tag
d for determiner and
n for noun, then the set of categories for the CRF classifier are:
Various optimizations are applied to keep this combinatoric nightmare computable, ...