O'Reilly logo

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Strata + Hadoop World San Jose 2015: Complete Video Compilation

Video Description

Go right to the heart of big data

Find out what happens when cutting-edge data science and new business fundamentals intersect. With this complete video compilation, you’ll be on hand for every presentation—whether it’s a keynote, a tutorial, or a workshop—held at the Strata Conference + Hadoop World Conference in San Jose, California during February, 2015.

In ten tracks, this year’s conference captured the most challenging problems and compelling opportunities in data today, including:

  • Business & Industry: How organizations of all sizes use data to make better decisions
  • Connected World: Navigating in an always-connected, always-on world
  • Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
  • Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
  • Hadoop & Beyond: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
  • The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks
  • Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
  • Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
  • Machine Data: Extracting meaningful insights from data collected and generated by things
  • Security: Fighting fraud, detecting threats, increasing trust—and securing data

You also have complete access to other conference events, such as Data-Driven Business Day, Hardcore Data Science Day, and Spark Camp.

Download these videos or stream them through our HD player, and gain a clear perspective on data, including all the analytics, architectures, techniques, tools, and technologies you need to use it successfully.

Table of Contents

  1. Business & Industry
    1. Hiding the Elephant - How Big Data Apps Make Magic While Hiding Hadoop - Ross Fubini, Ari Gesher, Wei Zheng, Omer Trajman, and Sylvain Le Borgne 00:39:26
    2. Pumping Up Retail Profits with Predictive Analytics - Adam Jorgensen 00:18:30
    3. If You Don't Have Anything Nice to Say, Please Say Something: Increasing Honesty in Airbnb Reviews - Dave Holtz 00:21:38
    4. Making Big Data Usable in Market Regulation - Scott Donaldson 00:38:57
    5. WANTED: Women in Data, Tech, and STEM - Moderated by: Cornelia Lévy-Bencheton, Panelists: Michele Chambers, Alice Zheng and Neha Narkhede 00:47:29
    6. Helping the Republican Party Use Data and Engineering to Win the US Senate - Azarias Reda 00:35:10
    7. Using Big Data to Identify the World's Top Experts - Nima Sarshar 00:19:32
    8. The New Data Organization: What do Successful Data-Driven Companies Look Like? - John Haddad 00:25:38
    9. Architecting for the Cloud - Chris Neumann 00:26:32
    10. Solving Customer Problems with Big Data across Thomson Reuters - Brian Ulicny 00:37:22
  2. Connected World
    1. Improving Business Operations with Predictive Maintenance and Service - Oliver Mainka 00:39:50
    2. Forget the Valley: Middle America Is Where Data Is Having Its Biggest Impact - Matt Asay 00:18:37
    3. Robot Reporters: How The Associated Press Embraced Data Automation - Adam Smith 00:21:10
    4. Which is More Interesting - Millions of Thermostats, or Millions of Minds in the Internet of Things? - Doug Stein 00:20:25
    5. Economic Insights from LinkedIn's Professional Network - June Andrews 00:19:42
    6. Using Data to Help Farmers Feed Growing Populations in a Changing Climate - Stewart Collis 00:35:08
  3. Data Science
    1. Bots Don't Drink Soda: Using Big Data to Find Real People - Michael Brown 00:18:56
    2. How to Detect Anomalies in High Cardinality Dimensions and Make Them Actionable - Shankar Vedaraman and Christopher Colburn 00:39:26
    3. Big Data and Design Working Together – When the Magic Happens - George Roumeliotis 00:32:32
    4. HOWTO Make Your Future Data Scientists Love You - Sasha Laundy 00:16:10
    5. From Academia to Data Science: Lessons Learned Founding the Insight Data Science Fellows Program - Jake Klamka and Kathy Copic 00:21:13
    6. The Two Cultures of People Science - Michelangelo D'Agostino 00:19:31
    7. Pro Bono Data Science in Action - Helping Teens in Crisis - Noelle Sio 00:21:29
    8. Data Applications: Speed vs Accuracy - Danielle Ben-Gera 00:35:02
    9. Behavior-driven Machine Translation - Irina Borisova and Asim Mathur 00:42:11
    10. Playing Nice in the Product Playground: Data Scientists, Engineers, and Product Managers Working Together to Create Innovative Data Products - Anu Tewary, Lucian Lita and Jonathan Goldman 00:47:16
    11. Machine Learning Building Blocks and the Workload Optimization Framework - Shai Fine 00:30:50
    12. Robust Event Detection Using Diverse Data Types - Harrison Mebane 00:16:38
    13. Purposeful Education with Job Market Data for Students, Educators, and Institutions - Jike Chong 00:26:26
    14. Real-Time Relevance for Mobile at LinkedIn - Michael Conover 00:37:53
  4. Design & Interfaces
    1. Building Interactive Data Visualizations - Jonathan Dinu - Part 1 00:31:35
    2. Building Interactive Data Visualizations - Jonathan Dinu - Part 2 00:28:55
    3. Building Interactive Data Visualizations - Jonathan Dinu - Part 3 00:49:04
    4. Building Interactive Data Visualizations - Jonathan Dinu - Part 4 00:32:04
    5. The Human-Data Interface: How to Design for “Irrational” Data Consumers - Cathy Tanimura 00:40:37
    6. Designing Delightful Data Products - Alonzo Canada 00:30:33
    7. Designing for Data - Etan Lightstone 00:31:03
    8. Humanizing Data - Building Systems and Interfaces for Domain Experts - Ari Gesher and James Thompson 00:41:09
    9. Architecting Interfaces that Learn - Tye Rattenbury and Jeffrey Heer 00:36:44
    10. What Designers and Data Scientists Can Learn from Each Other - Danyel Fisher and Miriah Meyer 00:30:24
    11. Data (Art &) Science - Eric Colson 00:40:36
    12. Designing with Data: A Human-centered Approach to Data-driven Design - Arianna McClain and Coe Leta Stafford 00:36:12
  5. Hadoop & Beyond
    1. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 1 00:57:21
    2. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 2 00:56:00
    3. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 3 00:39:16
    4. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 4 00:30:41
    5. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 5 00:31:02
    6. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Krishna Sankar - Part 6 00:41:55
    7. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Christopher Fregly - Part 7 00:36:06
    8. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 8 00:40:58
    9. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 1 00:56:12
    10. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 2 00:27:42
    11. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 3 00:47:55
    12. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 4 00:36:35
    13. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 1 00:43:38
    14. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 2 00:48:22
    15. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 3 00:50:39
    16. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 4 00:25:52
    17. Going Real-time: Data Collection and Stream Processing with Apache Kafka - Jay Kreps 00:39:29
    18. Stream Processing Everywhere - What to Use? - Jim Scott 00:39:06
    19. Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Engine - Richard Williamson 00:29:17
    20. From MapReduce to Programming Frameworks: Making Sense of Cloud Dataflow, Spark and New Tools for Big Data - Eric Schmidt 00:40:56
    21. Drill into Drill: How Providing Flexibility and Performance is Possible - Jacques Nadeau 00:43:17
    22. Three Approaches to Scalable Data Curation - Michael Stonebraker 00:38:20
    23. One Billion Objects in 2GB: Big Data Analytics on Small Clusters with Doradus OLAP - Randy Guck 00:48:21
    24. Big Data at Netflix: Faster and Easier - Kurt Brown 00:40:27
    25. Search Evolved: Unraveling Your Data - Costin Leau 00:40:39
    26. The Year in Review - Key Changes in the Hadoop Platform in the Past 12 Months - Jairam Ranganathan 00:42:01
    27. Building Interactive Data Applications at Scale - Fangjin Yang and Vadim Ogievetsky 00:42:56
    28. YARN vs. MESOS: Can’t We All Just Get Along? - Ted Dunning 00:40:03
  6. Hadoop Platform
    1. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 1 00:50:37
    2. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 2 00:38:39
    3. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 3 00:57:12
    4. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 4 00:44:54
    5. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 1 00:45:38
    6. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 2 00:28:39
    7. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 3 00:45:59
    8. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 1 00:46:36
    9. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 2 00:39:00
    10. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 3 00:42:54
    11. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 4 00:40:41
    12. Hadoop Puzzlers Reloaded - Aaron Myers and Daniel Templeton 00:44:02
    13. The Future of Apache Hadoop Security - Joey Echeverria 00:35:30
    14. Making HBase Accessible to Scientists - Spencer Herath and Aaron Benz 00:40:24
    15. Data Discovery on Hadoop - Sumeet Singh and Thiruvel Thirumoolan 00:39:40
    16. Yarns about YARN: Migrating to MapReduce v2 - Kathleen Ting and Miklos Christine 00:32:48
    17. Maintaining Low Latency while Maximizing Throughput on a Single Cluster - Yuliya Feldman 00:39:47
    18. Running Production Hadoop Clusters in Docker Containers - Nasser Manesh 00:45:44
    19. How to use Parquet as a Basis for ETL and Analytics - Julien Le Dem 00:40:08
    20. Adding Insert, Update, and Delete to Hive - Alan Gates 00:37:52
    21. Top Ten Pitfalls to Avoid in a SQL-on-Hadoop Implementation - Monte Zweben 00:35:05
  7. Hadoop in Action
    1. The Evolution of Hadoop at Spotify - Through Failures and Pain - Josh Baer and Rafal Wojdyla 00:40:03
    2. From Source to Solution: Building A System for Machine and Event-Oriented Data - Eric Sammer 00:41:59
    3. Design Patterns for Real Time Streaming Data Analytics - Sheetal Dolas 00:40:31
    4. Stock Market Order Flow Reconstruction in HBase on AWS - Tigran Khrimian 00:39:09
    5. Ticketmaster: Marketing and Selling the World's Tickets - John Carnahan 00:39:35
    6. Designing Data Architectures for Robust Decision Making - Gwen Shapira 00:38:35
    7. Friction-Free ETL: Automating Data Transformation with Impala - Marcel Kornacker 00:28:47
    8. The Truth About MapReduce Performance on SSDs - Yanpei Chen and Karthik Kambatla 00:37:13
    9. Hadoop as a Platform for Genomics - Allen Day and Sungwook Yoon 00:40:26
  8. Law, Ethics & Open Data
    1. Data Scientists and Lawyers - a Marriage made in Silicon Valley - Laura Fennell and Bill Loconzolo 00:39:07
    2. Big Data Ethics and a Future for Privacy - Jonathan King 00:38:05
    3. How Minority Becomes Majority - A Study of Gerrymandering - Tatsiana Maskalevich 00:39:48
  9. Machine Data / IoT
    1. Transformational Case Studies in Machine Data & Telemetry - Chad Meley and John Kreisa 00:42:18
    2. TSAR (the TimeSeries AggregatoR) - How to Count Tens of Billions of Daily Events in Real Time Using Open Source Technologies - Anirudh Todi 00:41:28
    3. An Open Source Approach to Gathering and Analyzing Device Sourced Health Data - Ian Eslick 00:41:41
    4. Building Adaptive Apps with APIs and Data - Anant Jhingran 00:38:36
    5. Dynamic Events in Massive Data Streams, from Astrophysics to Marketing Automation - Kirk Borne 00:40:06
    6. Forecasting Space-time Events - Jeremy Heffner 00:42:02
    7. The IoT P2P Backbone - Bruno Fernandez-Ruiz 00:27:05
    8. The Sushi Principle: Raw Data Is Better - Joseph Adler and Robert Johnson 00:38:14
    9. Practical Methods for Identifying Anomalies That Matter in Large Datasets - Robert Grossman 00:36:43
    10. Streaming Analytics: It’s Not The Same Game - Subutai Ahmad 00:38:46
    11. Machine Learning For Oil Exploration - Ben Hamner 00:35:30
  10. Security
    1. Data Science vs. The Bad Guys: Using Data to Defend LinkedIn Against Fraud and Abuse - David Freeman 00:29:49
    2. How to Ensure Your Hadoop Installation is Not the Next Big Data Breach - Terence Spies 00:34:17
    3. Securing the New Wearable World - Gary Davis 00:46:04
    4. The Physics of Apache Hadoop: Choosing the Right Hardware and OS Configuration Mix for Your Workloads - Woody Christy, Steve Anderson, Patrick Schots and Floris Grandvarlet 00:49:44
  11. Enterprise Adoption
    1. Database History from Codd to Brewer and Beyond - Douglas Turnbull 00:42:39
    2. Ideal Platform for Managing Log Data: Search or SQL? - Vinayak Borkar 00:43:29
    3. Getting Started with Data Governance: Paths Converge from Multiple Starting Points - Paula Wiles Sigmon 00:40:32
    4. Don’t Let Today’s Demands Kill Tomorrow’s Workforce! - Martin Waterhouse 00:29:29
  12. Spark in Action
    1. Lessons from Running Large Scale Spark Workloads - Reynold Xin and Matei Zaharia 00:38:58
    2. Introducing Hive's New Execution Engine - Spark - Xuefu Zhang and Chengxiang Li 00:40:33
    3. Machine Learning with H2O and Spark - Cliff Click and Michal Malohlava 00:38:56
    4. Spark Streaming - The State of the Union, and Beyond - Tathagata Das 00:36:46
    5. Why Spark Is the Next Top (Compute) Model - Dean Wampler 00:40:38
    6. Tuning and Debugging in Apache Spark - Patrick Wendell 00:45:15
    7. Everyday I’m Shuffling - Tips for Writing Better Spark Programs - Vida Ha and Holden Karau 00:36:24
  13. Hardcore Data Science
    1. Beyond DNNs towards New Architectures for Deep Learning, with Applications to Large Vocabulary Continuous Speech Recognition - Tara Sainath 00:34:05
    2. On the Computational and Statistical Interface and "Big Data" - Michael Jordan 00:48:48
    3. Interpretable Machine Learning in Practice - Maya Gupta 00:26:24
    4. Visual Understanding Beyond Naming - Alyosha Efros 00:36:31
    5. Finding Repeated Structure in Time Series Data: Commercial and Scientific Opportunities - Eamonn Keogh 00:21:25
    6. Tensor Methods for Large-scale Unsupervised Learning: Applications to Topic and Community Modeling - Anima Anandkumar 00:31:09
    7. A Quest for Visual Intelligence in Computers - Fei-Fei Li 00:29:39
    8. Graph Mining for Log Data - David Andrzejewski 00:27:59
    9. Why Julia's Important for Data Science - John Myles White 00:24:37
    10. Drugs, DNA, and Dinosaurs: Building High Quality Knowledge Bases with DeepDive - Chris Re 00:30:20
  14. Data-Driven Business Day
    1. Don't Let Data Get in the Way of a Good Story - Mark Madsen 00:26:42
    2. Big Data Stories: Decisions That Drive Successful Projects - Ellen Friedman 00:18:42
    3. Making Business Model Innovation More of a (Data) Science - Jerry Overton 00:17:58
    4. Data "Driven" is Really Data "Accessible” - Ann Johnson 00:14:28
    5. When Ones and Zeros Can Put Billions at Risk... - Anne Johnson 00:16:48
    6. Find the Business in Your Data - Arnab Chakraborty, Dr. Alexander Prinz, Reena Tiwari and Anne Johnson 00:28:00
    7. Tech Magic: 10 Disruptors Shaping the Sensed World - Leah Hunter 00:17:19
    8. Leveraging Big Data and Data Science in Upstream Oil and Gas Industry - Satyam Priyadarshy 00:23:43
    9. Using Data from Many Streams to Drive Social Impact - India Swearingen 00:20:08
    10. Smartphone Data: Tell the Story of People's Lives - Joerg Blumtritt 00:18:47
    11. Big Data Impacts Marketing Productivity at Cisco - Reena Tiwari 00:07:14
    12. National Drug Index: Revealing Prescription Inflation in the US - AJ Loiacono 00:20:48
    13. Shazam - Cait O'Riordan 00:13:40
    14. Digital Business Era: Stretch Your Boundaries - Prith Banerjee 00:13:26
    15. Data Products and the Wearables Revolution - Emi Nomura 00:12:50
    16. Unlocking the Data in Paper: A Case Study of New York Life - Kuang Chen 00:20:46
  15. R Day
    1. An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 1 00:46:48
    2. An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 2 00:24:43
    3. A Reactive Grammar of Graphics with ggvis - Winston Chang 00:52:54
    4. Reproducible R Reports with R Markdown - Garrett Grolemund - Part 1 00:33:58
    5. Reproducible R Reports with R Markdown - Garrett Grolemund - Part 2 00:30:40
    6. Analytic Web Applications with Shiny - Winston Chang - Part 1 00:31:13
    7. Analytic Web Applications with Shiny - Winston Chang - Part 2 00:30:05
  16. PyData
    1. Machine Learning with scikit-learn - Andreas Mueller - Part 1 00:44:30
    2. Machine Learning with scikit-learn - Andreas Mueller - Part 2 00:40:17
    3. Slicing Through Data with NumPy - Jennifer Klay - Part 1 00:46:14
    4. Slicing Through Data with NumPy - Jennifer Klay - Part 2 00:31:41
    5. Intro to Numba and Performance Python - Travis Oliphant - Part 1 00:44:32
    6. Intro to Numba and Performance Python - Travis Oliphant - Part 2 00:41:55
    7. Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 1 00:47:41
    8. Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 2 00:38:21
    9. Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 1 00:42:15
    10. Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 2 00:39:19
  17. Large-scale Machine Learning Day
    1. Large-scale Machine Learning Day - Yucheng Low - Part 2 00:34:00
    2. Large-scale Machine Learning Day - Yucheng Low - Part 3 00:26:23
    3. Large-scale Machine Learning Day - Alice Zheng - Part 4 00:30:17
    4. Large-scale Machine Learning Day - Chris DuBois - Part 5 00:38:33
    5. Large-scale Machine Learning Day - Alice Zheng - Part 6 00:42:19
    6. Large-scale Machine Learning Day - Shawn Scully - Part 7 00:54:18
  18. Sponsored
    1. Bringing OLAP Fully Online: Analyze Changing Datasets in MemSQL and Spark with Pinterest Demo - Eric Frenkiel 00:41:13
    2. From Domain-specific Solutions to an Open Platform Architecture for Big Data Analytics Based on Hadoop and Spark - Vin Sharma and Jason (Jinquan) Dai 00:36:56
    3. SAS Analytic Solutions Running on a Hadoop Cluster using YARN - James Kochuba 00:35:37
    4. Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar 00:45:13
    5. SQL in Hadoop: To Boldly Go where No Data Warehouse has Gone Before - Emma McGrattan 00:38:38
    6. A Simple, Fast Approach to Analytics for Big Data/IoT with kdb+ - Fintan Quill and Doug Talbott 00:34:58
    7. Scalable Realtime Analytics with declarative SQL like Complex Event Processing Scripts - Srinath Perera 00:43:37
    8. The Data Unification Imperative - Andy Palmer 00:41:12
    9. From Monitoring To Monetization With The Data Lake - Bill Schmarzo 00:42:41
    10. Breaking Through the Top 5 Enterprise Data Quality Roadblocks Inside Hadoop - George Corugedo 00:36:16
    11. Data Dexterity: Immediate Visibility Into All Information - Greg Goldsmith 00:30:32
    12. Extreme Sports and Beyond: Exploring a New Frontier in Data - Josh Byrd and Darren Chinen 00:37:57
    13. Cloud Machine Learning - Joseph Sirosh 00:40:30
    14. Credit Suisse Puts Vendors in the Hot Seat on Data Quality and Governance - Nitesh Ambastha, David Brewster and Nenshad Bardoliwalla 00:44:07
    15. Hive on Spark is Blazing Fast... Or Is It? - Carter Shanklin and Mostafa Mokhtar 00:41:34
    16. Tackling the World’s Biggest Data: Human Data - Richard Caudle 00:31:38
    17. Case Study: Data Warehousing in the Cloud with Snowflake at Kixeye - Jon Bock 00:34:30
    18. PostgreSQL Rising: The Other Elephant in the Room - Ozgun Erdogan 00:39:42
    19. Your First Big Data Application on AWS - Rahul Pathak 00:33:20
    20. Smart Enterprise Big Data Bus for the Modern Responsive Enterprise - Anand Venugopal 00:31:53
    21. Driving Better Business Results at Allstate with Machine Learning on Hadoop - Ryan Michaluk and Alexander Gray 00:40:13
    22. Big Data Architectural Pattern - Clint Sharp 00:27:09
    23. Perform Fast Analytics on Hadoop Data & Scalable Predictive Analytics with Open Innovations from HP Vertica - Steve Sarsfield and Sunil Venkayala 00:37:35
    24. Running Hadoop-as-a-Service in the Cloud - Lance Olson 00:42:14
    25. Real World Use Cases: Hadoop and NoSQL in Production - Ted Dunning and Ellen Friedman 00:39:58
  19. Keynotes
    1. Hadoop's Impact on the Future of Data Management - Amr Awadallah 00:15:05
    2. Close Encounters with the Third Kind of Database - Eric Frenkiel 00:05:22
    3. Impacting Business as it Happens - Anil Gadre 00:10:23
    4. A Bigger Lens Through which to View the World- the IBM Twitter Alliance - Adam Kocoloski 00:05:23
    5. Data Science: Where are We Going? - DJ Patil 00:12:59
    6. The Emerging Age of Data-Driven Policy Design: Examples from Trying to Manage the Global Climate - Solomon Hsiang 00:08:35
    7. Data: Open for Good and Secure by Default - Eddie Garcia 00:09:07
    8. Year Zero: How We’ll Run Our Lives in Ten Years’ Time - Alistair Croll 00:05:25
    9. Intel and the Role of Open Source in Delivering on the Promise of Big Data - Michael Greene 00:05:13
    10. Big Data Lessons from Our Cybernetic Past - Eden Medina 00:15:03
    11. New Directions for Spark in 2015 - Matei Zaharia 00:09:44
    12. A New Approach to Big Data - Roman Shaposhnik 00:05:12
    13. Charting a Path Forward: The Future of Data Visualization - Jeffrey Heer 00:10:10
    14. Connected Cows? - Joseph Sirosh 00:08:37
    15. Startup Showcase Winner Announcement 00:01:05
  20. Solutions Showcase Theater
    1. The Briefcase Cluster - Enabling Big Data Everywhere - Jim Scott 00:08:37
    2. Why Event Analytics Matter - Rohit Shrivastava 00:10:41
    3. Cracking the Data Conundrum - Steffin Harris 00:11:42
    4. Smart Data for Smarter Utilities - Irshad Raihan 00:08:33
    5. The Value of Churn Analytics at Cisco - Ivan Chen and Phil Hodsdon 00:12:57
    6. Big Data Governance - Felix Van de Maele 00:10:02
    7. Early Warnings for Customer Churn at a Leading Cloud Technology Firm! - Umair Rauf 00:11:27
    8. Harnessing Big Social Data to Deliver Human Data Intelligence - Jason Rose 00:09:32
    9. Operationalizing Hadoop – Are You Ready? - Valerie Fowler 00:10:20
    10. Multimedia Giant Turns Big Data into Real-Time Customer Insights - Brian Garrett 00:10:27
    11. Data Wrangling in the Wild - Sean Ma 00:10:02
    12. StreamAnalytix-Developing Enterprise Class, Real-time Streaming Applications on Apache Storm - Anand Venugopal 00:12:44
    13. Gaining Value From Data Where It's Born - Ryan Peterson 00:07:21
    14. Build a Foundation for Self-Service Data Prep, Analytics, and Governance - Oliver Claude 00:09:05
    15. Connecting the Big-Data Driven Enterprise in Online Retail - Ashley Stirrup 00:07:24
    16. Leading Telecommunications Company Uses BlueData to Spin Up Local, On Demand Hadoop and Spark Clusters to Enable Agile Deployment of Big Data Tools and Technologies - Nanda Vijaydev 00:10:07
    17. Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing - Alan Wagner 00:08:18
    18. Everything You Need To Know About HBase in 10 Minutes or Less - Alex Newman 00:10:09
    19. The Emergence of the Data Refinery - Chuck Yarbrough 00:12:28
    20. Big Data Cluster Planning and Optimization Using Wolf Island Simulation Technology - Laurent Isenegger 00:10:37
    21. Prosthetic Implant Surgery - Where Big Data Means Big Savings - Rola Shaar 00:08:07
    22. Close the Skills Gap and Deliver Rapid Business Value with Big Data Apps - Manan Goel 00:10:46
    23. Distributed R - Scaling the R Language for Even Bigger Data - Sunil Venkayala 00:10:51
    24. Transforming Big Data Landscape with Apache Spark - Rishi Yadav 00:10:11
    25. Data Warehousing in the Cloud - Jon Bock 00:10:12
    26. Proactive Product Intelligence for Electronics - Rami Lokas 00:12:29
    27. Massive-Scale Security Incident Response Leveraging a Hadoop Architecture - Michael A. Davis 00:13:05
    28. Don’t be a Hadoop Breach Headline - Discovery and Sensitive Data in Hadoop - Jeremy Stieglitz 00:11:08
    29. Big Data vs. Climate Change - Srivatsan Ramanujam and John Cardente 00:11:13
    30. ZEAS – Enabling anyone to create Hadoop Enterprise applications fast using a GUI - Aditya Agrawal 00:10:48
    31. Power Tools for Big Data Analytics - Dan Steinberg 00:10:53
    32. Big Data on OpenStack - Kirk Lewis and Frank Rego 00:13:00
    33. Fighting ATM Fraud in Real Time with Hadoop Analytics - Christy Maver 00:08:30
    34. Scale Big Data cost down, while scaling performance out. An NTT mobile personalization retrospective, re-thinking the Big Data solution stack. - Robert Greene 00:11:18
    35. Dato Enables Large-Scale Deduplication at Zillow using GraphLab Create - Rajat Arya 00:08:04
    36. To Catch a Thief with Big Data - Kevin Petrie 00:12:24
    37. Jump into the Data Lake with Hadoop-Scale Data Integration - Greg Benson 00:10:15
    38. Predicting The Future To Improve Customer Satisfaction - Joe Rossi 00:08:38
    39. The Practical, Profitable Magic of Prescriptive Analytics - Andy Flint 00:09:23
    40. Changing the Culture Around Data: Empowering More People with Analytics - Gary Cottrell 00:09:53
    41. How Havas Media Found New Revenue Streams with UNIFi Software - Sean Keenan 00:06:41
    42. What Enterprises Can Learn From Real-Time Bidding - Peter Corless 00:10:49
    43. Big Data and the Data Quality Imperative - Ed Wrazen 00:11:53
    44. Tapjoy Scales and Saves Costs with Riak - Tom Sigler 00:09:34
    45. Smart Execution: How to Optimize Performance by Intelligently Leveraging Multiple Hadoop Analytics Engines - Matt Schumpert 00:10:45
    46. Jagex Game Studio Case Study - Gregory McPhee 00:09:04
    47. Supercharge Sqoop with magical JDBC drivers - Sumit Sarkar 00:09:59
    48. Big Data Analytics: Diverse Use Cases, Diverse Architectures - Ben Conners 00:08:45
    49. Accelerate your data with SequoiaDB - Tao Wang 00:07:36
    50. Building reliable Hadoop clusters with two copies - Iyer Venkatesan 00:10:32