About the Reviewers

Aaron Binns spent over five years at the Internet Archive, where he designed and built a petabyte-scale Hadoop cluster supporting full-text search and Big Data analytics, the majority of which was implemented in Pig. He was responsible for building and deploying full-text search for domain-scale web archives containing hundreds of millions of archived web pages, as well as for the more than two billion web pages indexed in the Archive-It service. He also developed custom software, built on Lucene, to provide the special functionality required for full-text search of archival web documents.

He currently works at TaskRabbit as a data scientist. He holds a Bachelor of Science degree in Computer Science from Case Western ...
