Index
A
- accumulators, From Scikit-Learn to MLLib, Local fit, global evaluation
- acyclic data flows, Distributing the Corpus
- agglomerative clustering, Agglomerative clustering, Glossary
- application programming interface (API), defined, Glossary
B
- backoff, Unknown Words: Back-off and Smoothing
- backpropagation, Artificial Neural Networks
- bag-of-keyphrases, Predicting sentiment with a bag-of-keyphrases
- bag-of-words (BOW), Contextual Features
  - defined, Text Vectorization and Transformation Pipelines, Glossary
  - text vectorization with, Words in Space
- Baleen ingestion engine, The Baleen Ingestion Engine
  - defined, Glossary
  - disk structure, The Baleen disk structure
- ball tree algorithm, Being Neighborly
- BaseEstimator interface (Scikit-Learn API), The BaseEstimator Interface
- betweenness centrality, Centrality, Glossary
- bias, defined, Glossary
- bias–variance trade-off, Cross-Validation
- bisecting k-means clustering, Text clustering with MLLib
- blocking
  - defined, Blocking with Structure
  - fuzzy, Fuzzy Blocking
  - with structure, Blocking with Structure
C
- canonicalization, Entity Resolution, Glossary
- centrality, Centrality, Glossary
- chatbots, Language-Aware Data Products, Chatbots-Conclusion