You are previewing Strata Conference New York + Hadoop World 2012: Complete Video Compilation.
O'Reilly logo
Strata Conference New York + Hadoop World 2012: Complete Video Compilation

Video Description

Explore the changes brought to technology and business by big data, data science, and pervasive computing with this complete video compilation of workshops and sessions from Strata Conference New York and Hadoop World 2012. With well over 100 hours of content, this video package includes the latest information on the skills, tools, and technologies you need to make data work—and build a data-driven business.

Table of Contents

  1. Thinking Big Together: Driving the Future of Data Science - Annika Jimenez and Anthony Goldbloom
  2. The End of the Data Warehouse - Ben Werther
  3. Beyond Batch - Doug Cutting
  4. Finance vs. Machine Learning - Cathy O'Neil
  5. From Traditional Database to Big Data Platform - Irfan Khan
  6. Moneyball for New York City - Michael Flowers
  7. The Democratization of Big Data: Bringing Hadoop to the Masses - James Markarian
  8. Hadoop: Thinking Big - John Schroeder
  9. Big Answers - Mike Olson
  10. The Composite Database - Rich Hickey
  11. The Human Face of Big Data - Rick Smolan
  12. Are We Really Winning the Information Revolution? - Samantha Ravich
  13. Big Data Direct The Era of Self-driven Big Data Exploration - Sharmila Shahani-Mulligan
  14. Bringing the 'So What' to Big Data - Tim Estes
  15. A Hands-on Introduction to Cross-disciplinary Analytics With Python - Part 1 - Roy Hyunjin Han
  16. A Hands-on Introduction to Cross-disciplinary Analytics With Python - Part 2 - Roy Hyunjin Han
  17. A Hands-on Introduction to Cross-disciplinary Analytics With Python - Part 3 - Roy Hyunjin Han
  18. A Hands-on Introduction to Cross-disciplinary Analytics With Python - Part 4 - Roy Hyunjin Han
  19. An Introduction to Hadoop - Part 1 - Mark Fei
  20. An Introduction to Hadoop - Part 2 - Mark Fei
  21. An Introduction to Hadoop - Part 3 - Mark Fei
  22. An Introduction to Hadoop - Part 4 - Mark Fei
  23. Testing Hadoop Applications - Part 1 - Tom Wheeler
  24. Testing Hadoop Applications - Part 2 - Tom Wheeler
  25. Testing Hadoop Applications - Part 3 - Tom Wheeler
  26. Testing Hadoop Applications - Part 4 - Tom Wheeler
  27. Using HBase - Part 1 - Amandeep Khurana and Matteo Bertozzi
  28. Using HBase - Part 2 - Amandeep Khurana and Matteo Bertozzi
  29. Eating at the Trough of Disillusionment - Alistair Croll
  30. What Do We Need to Teach Our Organizations About Big Data? - Robert
  31. Analytics for the Real World - Marshall Sponder
  32. Case Study: Big Data, Small(er) Company - Camille Fournier
  33. Every Visualization You've Seen is Worthless - Noah Iliinsky
  34. What Can Enterprises Learn from Startups? - Bjrn Herrmann
  35. Case Study: Changing the Culture of the Music Industry - David Boyle
  36. Case Study: Augmenting Humans to Make Better Policy Decisions - Sean Gourley
  37. Case Study: What's a Customer Worth? - Roberto Medri
  38. The Disappearing Interface: Case Studies in Augmented Humanity - JD Vogt
  39. Stuck in the Eighties: Why Marketers Still Don't Get Big Data - Tom Phillips
  40. Case Study: Data-Driven Door-to-Door Sales - Dirk Van den Poel and Dauwe Vercamer
  41. What Business People Need to Know About Data Governance - Micheline Casey
  42. How Much Privacy Can We Really Expect? - Mary Ludloff and Terence Craig
  43. Given Enough Monkeys - Some Thoughts on Randomness - Jesse Anderson
  44. Mainstream Big Data Through Storytelling - Kristian Hammond
  45. Linking Census and Enterprise Data Sets - Deborah Cooper
  46. Dealing with Dirty Data - Finding the Right Tool for the Job - Part 1 - Susan E. McGregor, Alice Brennan, and Michael Sullivan
  47. Dealing with Dirty Data - Finding the Right Tool for the Job - Part 2 - Susan E. McGregor, Alice Brennan, and Michael Sullivan
  48. Dealing with Dirty Data - Finding the Right Tool for the Job - Part 3 - Susan E. McGregor, Alice Brennan, and Michael Sullivan
  49. Building a Large-scale Data Collection System Using Flume NG - Part 1 - Hari Shreedharan, Will McQueen, Arvind Prabhakar, Prasad
  50. Building a Large-scale Data Collection System Using Flume NG - Part 2 - Hari Shreedharan, Will McQueen, Arvind Prabhakar, Prasad
  51. Building a Large-scale Data Collection System Using Flume NG - Part 3 - Hari Shreedharan, Will McQueen, Arvind Prabhakar, Prasad
  52. A Most Excellent Big Data Strategy - Bill Schmarzo
  53. Big Data Is Not Yet Another IT Project - Krish Krishnan
  54. Moving to Big Data: Strategies and Tactics for Setting Your Organization up for Success - Sheridan Hitchens
  55. Data Wrangling: Making People Productive with Data - Joe Hellerstein
  56. How To Plan a Successful Big Data Pilot - Michael Gold and Ryan McClarren
  57. Hadoop's Role in a Big Data Architecture - Jim Walker
  58. Not Just Hadoop: NoSQL in the Enterprise - Steve Francia
  59. Designing Data Visualizations Workshop - Part 1 - Noah Iliinsky
  60. Designing Data Visualizations Workshop - Part 2 - Noah Iliinsky
  61. Designing Data Visualizations Workshop - Part 3 - Noah Iliinsky
  62. Designing Data Visualizations Workshop - Part 4 - Noah Iliinsky
  63. Search and Real-time Analytics on Big Data - Part 1 - Sewook Wee, Ryan Tabora, and Jason Rutherglen
  64. Search and Real-time Analytics on Big Data - Part 2 - Sewook Wee, Ryan Tabora, and Jason Rutherglen
  65. Search and Real-time Analytics on Big Data - Part 3 - Sewook Wee, Ryan Tabora, and Jason Rutherglen
  66. Search and Real-time Analytics on Big Data - Part 4 - Sewook Wee, Ryan Tabora, and Jason Rutherglen
  67. Hadoop Data Warehousing with Hive - Part 1 - Dean Wampler
  68. Hadoop Data Warehousing with Hive - Part 2 - Dean Wampler
  69. Hadoop Data Warehousing with Hive - Part 3 - Dean Wampler
  70. Hadoop Data Warehousing with Hive - Part 4 - Dean Wampler
  71. Best Practices for Building and Deploying Predictive Models over Big Data - Part 1 - Robert Grossman and Collin Bennett
  72. Best Practices for Building and Deploying Predictive Models over Big Data - Part 2 - Robert Grossman and Collin Bennett
  73. Best Practices for Building and Deploying Predictive Models over Big Data - Part 3 - Robert Grossman and Collin Bennett
  74. Best Practices for Building and Deploying Predictive Models over Big Data - Part 4 - Robert Grossman and Collin Bennett
  75. 'Data Exponential' - K-12 Learning Analytics for Personalized Learning at Scale: Opportunities and Challenges - Roy Pea, Stephen
  76. Analyzing Millions of GitHub Commits: What Makes Developers Happy, Angry, and Everything in Between? - Ilya Grigorik and Brian D
  77. Best Practices for Publishing Data - Hjalmar Gislason
  78. Best Practices for Reproducible Research: A Case Study in Quantitative Finance - Chang She
  79. Beyond Hadoop: Fast Ad-Hoc Queries on Big Data - Mike Driscoll and Eric Tschetter
  80. Beyond Targeted Ads: Big Data for a Better World - Robert Kirkpatrick
  81. Big Data Analytics Platform at Nokia Selecting the Right Tool for the Right Workload - Yekesa Kosuru and Jim Tommaney
  82. Big Data for the Masses: How We Opened Up the Doors to Google's Dremel - Michael Manoochehri and Jim Caputo
  83. Big Data is a Hotbed of Thoughtcrime. So What? - Jim Adler
  84. Big Data: Turning the Information Overload into an Information Advantage - Chris Selland and Jerome Levadoux
  85. Big Data Wonderland: Two Views on the Big Data Revolution - Mark Madsen and Marc Demarest
  86. BizData Monetization: Turn Your Data into Dollars - Thomas Strachan
  87. Breeding Data Scientists - Amy O'Connor and Danielle Dean
  88. Building Rich, High Performance Tools for Practical Data Analysis - Wes McKinney
  89. Building the Next Platform for Analytic Apps in the Cloud - George Mathew
  90. Commercial Graph: A Map of Financial Relationships - Michael Radwin
  91. Creative Thinking and Data Science - Michael Stringer
  92. Continuous Experimentation with Continuous Deployment - Steve Mardenfeld
  93. Data Analysis for Explorers - Jesper Andersen
  94. Data Science on Hadoop: How Cloudera Impala Unlocks New Productivity and Insights - Justin Erickson and Marcel Kornacker
  95. Data Science with Hadoop at Opower - Erik Shilts
  96. Deconstructing the Database - Rich Hickey
  97. Demonstrating The Future of Data Science - Mike Maxey
  98. Deploy a Highly Available, Elastic, Multi-tenant Hadoop Cluster in 10 Minutes - Richard McDougall
  99. Designing Hadoop for the Enterprise Data Center - Jacob Rapp and Eric Sammer
  100. Designing for Data-driven Organizations - Bitsy Bentley
  101. Drive Smarter Decisions with Microsoft Big Data - Shawn Bice
  102. Explore/Exploit: Driving Business Value with Big Data - Raymie Stata
  103. GraphBuilder Scalable Graph Construction using Hadoop - Nilesh Jain
  104. HDFS - What is New and Future - Sanjay Radia and Todd Lipcon
  105. Hadoop as a Complementary Data Platform at PayPal - Moises Nascimento and Nagaraju Chayapathi
  106. Hadoop Analytics Without a Ph.D - Richard Daley
  107. Helping the World's Farmers Adapt to Climate Change - Siraj Khaliq
  108. hGraph: An Open System for Visualizing Personal Health Metrics - Juhan Sonin
  109. High Availability for the HDFS NameNode: Phase 2 - Aaron Myers and Todd Lipcon
  110. How Draw Something Absorbed 50 Million New Users, in 50 Days, With Zero App Downtime - Frank Weigel
  111. How to See Data - Kim Rees
  112. How a Traditional Media Company Embraced Big Data - Oscar Padilla, Franklin Rios, and Vineet Tyagi
  113. Is Your Cluster a Leaning Tower of Pisa? - Michael Segel
  114. Knitting Boar - Josh Patterson and Michael Katzenellenbogen
  115. Large Scale ETL with Hadoop - Eric Sammer
  116. Letting More Developers Dance with Elephants: What We Learned - Matt Winkler
  117. MapReduce Design Patterns - Donald Miner
  118. Maximizing ROI by Sharing your Hadoop Big Data Center - Rohit Valia
  119. Making Major League Data Work: Carving Up Big Data into Useful Applications for Specific Audiences - Richard Brath and Noah Schw
  120. Making Pig Fly: Optimizing Data Processing on Hadoop - Thejas Madhavan Nair and Jianyong Dai
  121. Moneyballing Criminal Justice: Using Data to Reduce Crime - Anne Milgram
  122. Monitoring Cloud Data - Gary Dusbabek
  123. Netflix's Evolving Data Science Architecture - Kurt Brown
  124. Of Rocket Ships and Washing Machines: Data Technology for People - Joe Hellerstein
  125. Performing Data Science with HBase - Aaron Kimball and Kiyan Ahmadizadeh
  126. Predictive Modeling and Operational Analytics over Streaming Data - Roger Barga
  127. Real-time Big Data Without Streaming - Ron Bodkin
  128. Real-time Learning with Bayesian Bandits - Ted Dunning
  129. Realtime Processing with Storm - Gabriel Eisbruch, Luis Daro Simonassi, and Jonathan Leibiusky
  130. Scala + Cascading = Scalding - Avi Bryant
  131. Scalable, Accessible, Predictive Analytics on Hadoop - Steven Hillion
  132. Searching for the Genetic Causes of Disease with Hadoop - Charles Schmitt
  133. Simple, Flexible Distributed Computing in Julia - Stefan Karpinski and Jeff Bezanson
  134. Start Small Before Going Big - Steve Yun and Joseph Rickert
  135. Storytelling with Data - Romy Misra
  136. Taming the Object Graph - Justin Moore
  137. The Art of Analytical Decomposition - Claudia Perlich
  138. The Death of the Enterprise Data Warehouse - Paul Groom
  139. The Language of Discovery: A Toolkit for Designing Big Data Interfaces and Interactions - Joe Lamantia
  140. They Don't Teach You That In School - Cathy O'Neil and Julie Steele
  141. This Message Will Self Destruct: The Implications of Self-Destructing Digital Data - Susan E. McGregor and Kathleen Duff
  142. Top 10 Things We Learned About Hadoop (since we started focusing on it) - Val Bercovici
  143. Trecul : Data Flow Processing Using LLVM-based JIT Compilation on Top of Hadoop - David Blair
  144. Turning Raw Data in Hadoop into Interactive BI (Capital One Labs Case Study) - Peter Schlampp
  145. Tying the Knot Between Hadoop and EDW - David Jonker
  146. Ubiquity, Interfaces, and Data: A Look Ahead to the Internet of Things - Rob Coneybeer
  147. UGD (User Generated Data), Product Development, and Privacy - Adrian Woodhead
  148. Using Data to Tune A Software Team - Jonathan Alexander
  149. Using Hadoop to do Agile Iterative ETL - Ben Werther and Kevin Beyer
  150. Visualizing Networks - Lynn Cherny
  151. Visualization An Emerging Collaboration Opportunity - Lee Feinberg
  152. Web Data Visualization: What's Becoming Easy, What's Becoming Possible - Kevin Lynagh, Kim Rees, Hadley Wickham, and David Nolen
  153. What Can We Learn from Billions of Foursquare Check-ins? - Blake Shaw
  154. Zillow: Disrupting the Real Estate Marketplace with Data - Stan Humphries