Step 1

The raw data is scanned to determine the frequency of each product category bought during a visit. The results satisfying the support threshold are shown in Table F.1

Table TABLE F.1 A-priori algorithm: Step 1 results
Product Category Count
Branded bread 286
Own-label bread 238
Own-label breakfast cereals 225
Branded breakfast cereals 192
Eggs 178
Potatoes 160

Get Principles of Data Management now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.