Strata Conference New York + Hadoop World 2013: Complete Video Compilation

Video Description

With this complete video compilation, you’ll get a front-row seat to every keynote, workshop, and session at O’Reilly’s Strata Conference New York and Hadoop World 2013. Explore the changes big data, data science, and pervasive computing have brought to technology and business.

Table of Contents

  1. Tutorials
    1. How to Build a Hadoop Data Application - Tom White, Eric Sammer, and Joey Echeverria - Part 1 00:28:36
    2. How to Build a Hadoop Data Application - Tom White, Eric Sammer, and Joey Echeverria - Part 2 00:13:18
    3. How to Build a Hadoop Data Application - Tom White, Eric Sammer, and Joey Echeverria - Part 3 00:25:08
    4. How to Build a Hadoop Data Application - Tom White, Eric Sammer, and Joey Echeverria - Part 4 00:32:15
    5. Data is Beautiful - Julie Rodriguez - Part 1 00:27:49
    6. Data is Beautiful - Julie Rodriguez - Part 2 00:28:29
    7. Using R and Hadoop for Statistical Computation at Scale - Antonio Piccolboni and Joseph Rickert - Part 1 01:20:27
    8. Using R and Hadoop for Statistical Computation at Scale - Antonio Piccolboni and Joseph Rickert - Part 2 01:24:36
    9. Building a Data Platform - John Akred, Richard Williamson, Stephen O'Sullivan - Part 1 00:50:23
    10. Building a Data Platform - John Akred, Richard Williamson, Stephen O'Sullivan - Part 2 00:26:10
    11. Building a Data Platform - John Akred, Richard Williamson, Stephen O'Sullivan - Part 3 00:40:00
    12. An Introduction to Real-Time Analytics with Cassandra and Hadoop - Patricia Gorla - Part 1 00:44:54
    13. An Introduction to Real-Time Analytics with Cassandra and Hadoop - Patricia Gorla - Part 2 00:37:48
    14. An Introduction to Real-Time Analytics with Cassandra and Hadoop - Patricia Gorla - Part 3 00:29:46
    15. An Introduction to Real-Time Analytics with Cassandra and Hadoop - Patricia Gorla - Part 4 00:20:34
    16. Teaching the Elephant to Read: Hadoop + Python + NLP - Sean Murphy and Benjamin Bengfort - Part 1 00:33:15
    17. Teaching the Elephant to Read: Hadoop + Python + NLP - Sean Murphy and Benjamin Bengfort - Part 2 00:38:21
    18. Teaching the Elephant to Read: Hadoop + Python + NLP - Sean Murphy and Benjamin Bengfort - Part 3 00:50:57
    19. Teaching the Elephant to Read: Hadoop + Python + NLP - Sean Murphy and Benjamin Bengfort - Part 4 00:30:54
    20. Getting Started with Julia - Leah Hanson - Part 1 00:45:06
    21. Getting Started with Julia - Leah Hanson - Part 2 00:45:10
    22. Getting Started with Julia - Leah Hanson - Part 3 00:39:20
    23. Getting Started with Julia - Leah Hanson - Part 4 00:39:41
    24. Mining Social Web APIs with IPython Notebook - Matthew Russell - Part 1 00:52:10
    25. Mining Social Web APIs with IPython Notebook - Matthew Russell - Part 2 00:29:42
    26. Mining Social Web APIs with IPython Notebook - Matthew Russell - Part 3 00:43:44
    27. Mining Social Web APIs with IPython Notebook - Matthew Russell - Part 4 00:14:08
  2. Hardcore Data Science
    1. Hardcore Data Science Intro - Ben Lorica 00:01:17
    2. Deep Learning and the Dream of AI - Brandon Ballinger 00:31:51
    3. Machine Learning Applications: Recommendation Engines Using Multiple Behavior Sources - Ted Dunning 00:44:29
    4. Algorithm Design Meets Big Data - Bahman Bahmani 00:42:33
    5. Quantitative Insights from Qualitative Data - Jacqueline Kazil 00:22:10
    6. Transfer Learning - Getting the Most Out of the Data You Have, Not the Data You Want - Brian Dalessandro 00:36:42
    7. Not Exactly! Fast Queries via Approximation Algorithms - Fangjin Yang and Nelson Ray 00:31:58
    8. Driving Data Decisions with Real-time Analytics - Dr. Vijay Srinivas Agneeswaran 00:33:30
  3. Data-Driven Business Day
    1. Data Driven Business Day Opening Remarks - Alistair Croll 00:02:59
    2. Hypercompetition and the New Rules of Strategic Management - Mona Vernon 00:15:41
    3. The Evolving Corporation - Jim Stogdill 00:16:49
    4. Putting the Puzzle Together: Exploiting Big Data to Piece Together the Customer Picture - Anjul Bhambhri 00:18:31
    5. Making Big Data Small - Baron Schwartz 00:20:23
    6. Social Data Intelligence: Integrating Social and Enterprise Data for Competitive Advantage - Susan Etlinger 00:25:58
    7. Using Graphs of Data to Understand your Customers, Users, and Employees - Ravi Iyer 00:22:46
    8. The Web That Time Forgot - Alex Wright 00:31:02
    9. Improving Retail Customer Experience With the 'Right Message at the Right Time' - Michael Cote 00:21:41
    10. Waging Peace with Big Data and New Technologies - Chris Perry and Marie O'Reilly 00:16:16
    11. How You Can Benefit from Thinking Like a Librarian - Bonnie Tijerina 00:15:35
    12. See Beyond Reality: 3D Transformations of Analytics - Gabe Batstone 00:21:10
    13. Under the Hood at MetLife - Gary Hoberman 00:18:35
    14. Closing Remarks - Alistair Croll 00:02:14
  4. Keynotes
    1. Hadoop's Impact on the Future of Data Management - Mike Olson 00:15:51
    2. Separating Hadoop Myths from Reality - Jack Norris 00:09:59
    3. Big Impact from Big Data - Ken Rudin 00:11:57
    4. Can Big Data Reach One Billion People? - Quentin Clark 00:05:05
    5. What Makes Us Human? A Tale of Advertising Fraud - Claudia Perlich 00:08:21
    6. From Fiction to Facts with Big Data Analytics - Ben Werther 00:05:09
    7. Towards Strata 2014 - Roger Magoulas 00:05:26
    8. The Economic Potential of Open Data - Michael Chui 00:13:58
    9. The Future of Hadoop: What Happened and What's Possible? - Doug Cutting 00:14:40
    10. Designing Your Data-Centric Organization - Josh Klahr 00:11:59
    11. Encouraging You to Change the World with Big Data - David Parker 00:05:40
    12. The Value of Social (for) TV - Shawndra Hill 00:10:57
    13. Ubiquitous Satellite Imagery of our Planet - Will Marshall 00:10:26
    14. The Big Data Journey: Taking a holistic approach - John Choi 00:06:06
    15. Can Big Data Save Them? - Jim Kaskade 00:09:02
    16. Changing the Face of Technology - Black Girls CODE - Peta Clarke and Donna Knutt 00:04:44
    17. Beyond R and Ph.D.s: The Mythology of Data Science Debunked - Douglas Merrill 00:08:09
  5. Sessions
    1. Morse: Realtime ETL in Facebook Analytics Platform - Jun Fang 00:39:03
    2. Defining your Big Data Arsenal: NoSQL, Hadoop, and RDBMS - Matt Asay 00:37:37
    3. Scalable, Flexible Data Privacy in the Cloud - Ahmed Radwan 00:36:31
    4. Disruptive Data Science Case Study: Visa's Big Data Response to Cyber Threats - Ravi Devireddy and Annika Jimenez 00:48:05
    5. Big Data in the Real World - Eron Kelly and Albert Isern 00:37:36
    6. Viva la Revolution: How MailChimp is using Big Data to Help Users Help Themselves - John Foreman 00:41:06
    7. From Promise to a Platform: Next Steps in Bringing Workload Diversity to Hadoop - Henry Robinson 00:35:51
    8. Drought Prediction and Ecological Monitoring with the Internet of Things - Adam Wolf and Kelly Caylor 00:34:26
    9. Hadoop Adventures At Spotify - Adam Kawa 00:37:32
    10. Real-Time Analytical Processing (RTAP) using Spark and Shark - Jason (Jinquan) Dai 00:30:18
    11. Shift into High Gear: Dramatically Improve Hadoop and NoSQL Performance - M.C. Srivas 00:45:21
    12. Real-time Recommendations for Retail: Architecture, Algorithms, and Design - Jonathan Natkins and Juliet Hougland 00:39:49
    13. Apache HBase for Architects - Nick Dimiduk 00:40:04
    14. The Big Data Doctor Is In - Bill Schmarzo, John Akred, Anand Raman, and Scott Rose 00:32:49
    15. Big Data to Enable Connected Products - Ron Bodkin 00:36:19
    16. Bringing Video Game Super Powers To Life with Hadoop BI - Barry Livingston and Ben Werther 00:27:18
    17. Non-linear Storytelling: Towards New Methods and Aesthetics for Data Narrative - Giorgia Lupi 00:36:13
    18. Parquet: An Open Columnar Storage for Hadoop - Julien Le Dem and Nong Li 00:31:25
    19. Predictable Performance at Scale is the Key to Shorter Time to Results - Jeff Denworth 00:29:38
    20. What's Next for Apache HBase: Multi-tenancy, Predictability, and Extensions - Jonathan Hsieh 00:39:38
    21. Ensuring 100% Database Uptime for Real-Time Big Data - Srini Srinivasan 00:39:58
    22. Hadoop and Data Science for the Enterprise - Mark Slusar 00:26:22
    23. How to Avoid Some Different Graphical Mistakes - Naomi Robbins 00:42:15
    24. Lessons Learned From A Decade's Worth of Big Data At The U.S. National Security Agency (NSA) - Adam Fuchs 00:30:20
    25. Hardening Hadoop for the Enterprise: Managing Diverse Workloads, Securing and Governing your Big Data Platform - Paul Kent 00:38:41
    26. The Hidden Data Science Pipeline - Mark Mims 00:36:07
    27. Introducing a New Way to Interact with Insight - Stephanie McReynolds, Vaibhav Nivargi, Brian Zotter, and Stephen McDaniel 00:41:05
    28. Securing the Apache Hadoop Ecosystem - Aaron Myers and Shreepadma Venugopalan 00:40:20
    29. Is Your Cloud Ready for Big Data? - Richard McDougall 00:42:08
    30. Data Governance for Regulated Industries Using Hadoop - Justin Makeig 00:41:07
    31. Every Soldier is a Sensor: Countering Corruption in Afghanistan - Amy Gaskins 00:43:25
    32. AtlasDB: ACID Transactions for Your Favorite Key-value Store - Ari Gesher and Danielle Kramer 00:29:51
    33. Inject Big Data into your Corporate DNA: Enable Every Employee to Make Data Driven Decisions - Anurag Tandon 00:36:35
    34. Achieving Real Success with Hadoop - Amir Halfon 00:36:14
    35. Making Big Data Small - Baron Schwartz 00:37:11
    36. Hadoop Appliances: Engineered for the Enterprise - Dan McClary 00:34:23
    37. HDFS Snapshots and Beyond - Jing Zhao and Tsz-Wo Sze 00:43:52
    38. Running On-premise Hadoop as a Business - Sumeet Singh 00:47:00
    39. Information Revolution In Government - James Stewart and James Abley 00:30:16
    40. REEF - Retainable Evaluator Execution Framework - Russell Sears 00:28:17
    41. How to Stop Worrying and Start Modeling Big Data with Better Algorithms and H2O - Srisatish Ambati and Cliff Click 00:29:23
    42. Running Non-MapReduce Big Data applications on Apache Hadoop - Siddharth Seth and Hitesh Shah 00:40:39
    43. Instant Results and Infinite Storage with SAP and Hadoop - David Parker 00:42:41
    44. Managing a Rapidly Evolving Analytics Pipeline - Feng Peng 00:41:17
    45. The Evolution of Hadoop at Stripe: Replicating MongoDB into HBase in Realtime, and How We Bolted Analytics onto an Existing Syst 00:26:54
    46. The Big Data Journey: Identifying roads to success and transforming your organization - John Choi 00:42:58
    47. Turn Hadoop Data into Business Insights: A New Approach for Rapid Exploration and Analysis - Brett Sheppard, Clint Sharp, and Ni 00:42:23
    48. How to do Predictive Analytics with Limited Data - Ulrich Rueckert 00:40:22
    49. Unifying Your Data Management Platform with Hadoop: Batch and Real-time Machine Data Ingest, Alerts, and Analytics - Jayant Shek 00:40:11
    50. How is a rational (big) data deployment approach like optimizing the generation mix of a power company? - John Akred and Stephen 00:31:43
    51. How to Leverage Mainframe Data with Hadoop: Bridging the Gap Between Big Iron and Big Data - Jorge A Lopez and Matt Brandwein 00:41:33
    52. Real-time Stream Processing Architecture for Comcast IP Video - Chris Lintz and Gabriel Commeau 00:35:24
    53. GraphLab: Large-Scale Machine Learning on Graphs - Carlos Guestrin 00:42:41
    54. Visualizing Big Graphs and Social Networks - Richard Brath and David Jonker 00:45:00
    55. Building More Productive Data Science and Analytics Workflows - Wes McKinney 00:35:22
    56. SAS on Your Cluster, Serving your Data (Analysts) - Paul Kent 00:42:42
    57. How Nordstrom Utilizes Human Intelligence to Blend Brick-and-Mortar with Online Commerce - Erin Shellman and David Von Lehman 00:41:38
    58. Data Science Without a Scientist - Matt Schumpert 00:41:48
    59. Ancestry.com: Managing Big Data Reaching Back to the 11th Century with Hadoop - Scott Sorensen 00:35:54
    60. MySQL's NoSQL Interface - Dave Stokes 00:14:37
    61. Turkers Mapping Africa - Lyndon Estes 00:37:23
    62. Deeper Insight into Operational BigData Cluster - Samuel Kommu 00:41:30
    63. Testing Riak for Multiple Data-Center Support: A Case Study - Jim Englert 00:37:11
    64. RAM (not disk) for SQL on Hadoop: The Secret - Paul Groom 00:37:08
    65. Information Security for the Data Management Professional - Micheline Casey 00:22:55
    66. Practical Performance Analysis and Tuning for Cloudera Impala - Greg Rahn 00:40:10
    67. Addressing Legacy Risks with Hadoop - Ravi Hubbly 00:45:29
    68. How to Get Statistics Right in AB Testing: The Short Answer (With Proof from Four Years of Fundraising Data from Wikipedia) - Za 00:40:33
    69. Big Data Architectural Patterns - Eddie Satterly 00:36:47
    70. Hadoop Internals for Oracle Developers and DBAs - Tanel Poder 00:40:12
    71. Apache Hadoop on the Open Cloud - Nirmal Ranganathan and David Dobbins 00:35:35
    72. Trickery and Tooling for Distributed System Diagnosis and Debugging - Philip Zeyliger 00:39:16
    73. Data Science of Love - Vaclav Petricek 00:37:00