Cover image for The Data Sessions: The Best of OSCON 2011

Book description

Looking for solutions to gather, store, and analyze the flood of Big Data? This video compilation gives you the best seat in the house for the inaugural sessions at OSCON Data. In more than a dozen segments, you'll learn valuable techniques, tools, and advice from leaders in the field. Discover open source technologies that make it possible to use new data sources and do new things with existing data.

Table of Contents

  1. Introduction to Hadoop - Tom Hanlon
    00:42:58
  2. Architectural Anti-patterns for Data Handling - Gleicon Moraes
    00:37:03
  3. Distributed Data Analysis with Hadoop and R - Jonathan Seidman and Ramesh Venkataramaiah
    00:41:35
  4. Facebook Messages and HBase - Nicolas Spiegelberg
    00:42:16
  5. Optimizing MySQL to Let People Argue - Jeremy Bingham
    00:25:24
  6. Big Data For Less Dealing with Large Data Sets on a Startups Budget - Kate Matsudaira
    00:35:08
  7. Designing and Implementing Asynchronous Distributed Systems: Challenges, Strategies, and a Million Things That Go Wrong - Scott
    00:41:10
  8. Lumberyard: Time Series Indexing at Scale - Josh Patterson
    00:37:58
  9. Part 01 - Consistency or Bust - Breaking a Riak Cluster - Jeffrey Kirkell
    00:39:47
  10. Part 02 - Consistency or Busy - Breaking a Riak Cluster - Jeffrey Kirkell
    00:38:51
  11. Esperwhispering: Get Your Real-Time Data Game On - Theo Schlossnagle
    00:43:27
  12. Part 01 - The Hitchhikers Guide to A Kaggle Competition - Krishna Sankar
    00:42:52
  13. Part 02 - The Hitchhikers Guide to A Kaggle Competition - Krishna Sankar
    00:46:42
  14. What Every Programmer Needs to Know About Disks - Ted Dziuba
    00:36:59
  15. Part 01 - Hands On Mahout - Mammoth Scale Machine Learning - Robin Anil and Ted Dunning
    00:45:54
  16. Part 02 - Hands on Mahout - Mammoth Scale Machine Learning - Robin Anil and Ted Dunning
    00:35:05