Index

A

A Christmas Carol

abbreviations

adverbs

anagrams dictionary

anasquares

apostrophes

arguments

B

backquote

bag-of-words model

Bayesian inference

Bayesian model

bias

bigrams

bioinformatics

block of code

C

Caesar cipher

caret

centroids

classification

cluster means

clustering

clustering vector

coin tossing

collocations

commas

concordances

A Christmas Carol

Die Leiden des jungen Werthers

Enronsent

The Call of the Wild

context

array

scalar

string

corcordances

The Call of the Wild

corpora

corpus

corpus linguistics

corpus linguistics and sampling

corpus

EnronSent

correlation matrix

correlations

correlations and cosines

correlations and covariances

counting

covariance

CPAN

CRAN

crossword puzzles

crwth

cryptanalysis

D

dashes

dendrogram

Dickens

A Christmas Carol

dimension

dimensionless

DNA

dot product

doublets

E

eigenvalues

eigenvectors

end punctuation

Eszett

ETAOIN SHRDLU

events

exclamation points

F

factor analysis

false positives

filehandle

files comma-separated variables

flat file

frequencies

bigram

letter

letters

word

word lengths

words

G

Goethe

Die Leiden des jungen Werthers

H

hangman

hapax legomena

histogram

histograms

hyphens

I

independence

inner product

interpolation

array

inverse document frequency (IDF)

isograms

K

k-means clustering

key word in context (KWIC)

L

lemma

linear algebra

lipograms

logarithms

London

The Call of the Wild

M

Mahalanobis distance

main diagonal

main program

matrices

commuting

matrix factorization

matrix multiplication

matrix diagonal

mean word frequency ...

Get Practical Text Mining with Perl now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.