Appendix

Useful Reading

No matter how much the authors have tried, it is virtually impossible to cover the Hadoop ecosystem in a single book. This appendix provides additional reading recommendations that you might find useful. They are organized by the main topics covered in the book.

STORING AND ACCESSING HADOOP DATA

“Apache HBase Book.” http://hbase.apache.org/book.html.

“Bloom Filter.” http://en.wikipedia.org/wiki/Bloom_filter.

“BloomMapFile — Fail-Fast Version of MapFile for Sparsely Populated Key Space.” https://issues.apache.org/jira/browse/HADOOP-3063.

Borthakur, Dhruba. “Hadoop AvatarNode High Availability.” http://hadoopblog.blogspot.com/2010/02/hadoop-namenode-high-availability.html.

Chang, Fay; Dean, Jeffrey; Ghemawat, Sanjay; Hsieh, Wilson C.; Wallach, Deborah A.; Burrows, Mike; Chandra, Tushar; Fikes, Andrew; and Gruber, Robert E. “BigTable: A Distributed Storage System for Structured Data.” http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/archive/bigtable-osdi06.pdf.

Chen, Yanpei; Ganapathi, Archana Sulochana; and Katz, Randy H. “To Compress or not to Compress — Compute vs. I/O Tradeoffs for MapReduce Energy Efficiency.” http://www.eecs.berkeley.edu/Pubs/TechRpts/2010/EECS-2010-36.pdf.

Dikant, Peter. “Storing Log Messages in Hadoop.” http://blog.mgm-tp.com/2010/04/hadoop-log-management-part2/.

Dimiduk, Nick, and Khurana, Amandeep. HBase in Action (Shelter Island, NY: Manning Publications, 2012). http://www.amazon.com/HBase-Action-Nick-Dimiduk/dp/1617290521/ ...

Get Professional Hadoop Solutions now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.