This flood of data is coming from many sources. Consider the following:
The New York Stock Exchange generates about one terabyte of new trade data per day.
Facebook hosts approximately 10 billion photos, taking up one petabyte of storage.
Ancestry.com, the genealogy site, stores around 2.5 petabytes of data.
The Internet Archive stores around 2 petabytes of data and is growing at a rate of 20 terabytes per month.