Using MADlib with Greenplum

MAD stands for Magnetic, Agile, and Deep, and lib denotes a library of scalable, parallel, advanced in-database functions. The following figure shows the architecture of MADlib. The MADlib version used in the following example is v1.1:

[Figure: MADlib architecture]

The Greenplum Database extensions for MADlib must be installed on the segment servers of the DCA.

$ pgxn install madlib
$ gppkg -i MADlib

The gppkg utility installs the MADlib extensions on all the Greenplum segment servers in parallel.
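
Once the package is installed, it is worth confirming that MADlib is actually visible to the database. The following is a minimal sketch, not a procedure from the book: it assumes that the Greenplum client tools are on the PATH, that the MADlib schema has been deployed into the target database under the default schema name madlib, and that the database is called analytics (a hypothetical name used only for illustration).

```shell
#!/bin/sh
# Hypothetical database name (an assumption); override with DB_NAME=... as needed.
DB_NAME="${DB_NAME:-analytics}"

# SQL that asks MADlib to report its own version.
# Assumes MADlib lives in the default "madlib" schema.
madlib_version_sql() {
    printf '%s\n' 'SELECT madlib.version();'
}

# List the packages that gppkg has installed; MADlib should appear in the output.
# Skipped when gppkg is not available on this host.
if command -v gppkg >/dev/null 2>&1; then
    gppkg -q --all || echo "gppkg query failed" >&2
fi

# Ask the database for the MADlib version string.
# Skipped when psql is not available on this host.
if command -v psql >/dev/null 2>&1; then
    psql -d "$DB_NAME" -c "$(madlib_version_sql)" \
        || echo "could not query MADlib in database $DB_NAME" >&2
fi
```

If the second check fails while the first one lists MADlib, the package is probably present on the hosts but its schema has not yet been created in that particular database.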

MADlib-based in-database analytics has been benchmarked against PL/R and found to be superior in terms of scalability and performance, and MADlib is a truly ...
