Data Architecture: A Primer for the Data Scientist by Dan Linstedt, W.H. Inmon

2.3

Parallel Processing

Abstract

The first approach to managing a growing workload is to get a bigger computer. At some point the cost and size of that computer become prohibitive, and the workload must instead be spread across multiple processors that run in parallel. One approach to parallel processing is massively parallel processing (MPP). Note that parallel processing reduces the elapsed time of processing, not the total amount of processing that occurs. In Big Data, data must be parsed before it can be used. Parsing repetitive data is usually simple and straightforward, whereas parsing nonrepetitive data is anything but.
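The distinction between elapsed time and total processing can be made concrete with a small sketch. It is only a model under stated assumptions, not the book's method: each record is assumed to cost one unit of work, records are dealt round-robin to nodes that share nothing, and elapsed time is approximated by the busiest node's unit count. The names `partition`, `process_shard`, and `run` are hypothetical, and the thread pool merely stands in for separate processors.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(records, n_nodes):
    """Deal records round-robin into n_nodes shards, one shard per node."""
    return [records[i::n_nodes] for i in range(n_nodes)]


def process_shard(shard):
    """Hypothetical per-node job: return (partial result, units of work).

    Assumption: every record costs exactly one unit of work.
    """
    return sum(shard), len(shard)


def run(records, n_nodes):
    """Process all records on n_nodes nodes and report the cost model."""
    shards = partition(records, n_nodes)
    # The pool stands in for independent processors; each handles one shard.
    with ThreadPoolExecutor(max_workers=n_nodes) as pool:
        outcomes = list(pool.map(process_shard, shards))
    result = sum(partial for partial, _ in outcomes)
    total_work = sum(units for _, units in outcomes)     # unchanged by n_nodes
    elapsed_units = max(units for _, units in outcomes)  # busiest node
    return result, total_work, elapsed_units
```

With 1,000 records, `run(list(range(1000)), 1)` reports 1,000 units of total work and 1,000 units elapsed, while `run(list(range(1000)), 4)` reports the same 1,000 units of total work but only 250 units elapsed. The answer and the amount of processing are identical; only the waiting time shrinks.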

Keywords

MPP
repetitive unstructured data
nonrepetitive unstructured ...