Comparison of Data Mining Algorithms

Classification: Predicting a Categorical Target Variable

AlgorithmDescriptionModelInputOutputProsConsUse Cases
Decision TreesPartitions the data into smaller subsets where each subset contains (mostly) responses of one class (either “yes” or “no”)A set of rules to partition a data set based on the values of the different predictors.No restrictions on variable type for predictors.The label cannot be numeric. It must be categorical.Intuitive to explain to nontechnical business users. Normalizing predictors is not necessary.Tends to overfit the data. Small changes in input data can yield substantially different trees. Selecting the right parameters can be challenging.Marketing segmentation, fraud detection.
Rule ...

Get Predictive Analytics and Data Mining now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.