Strata Conference New York + Hadoop World 2014: Video Compilation

Video Description

Use the power of big data to drive business strategy

What happens when cutting-edge data science and new business fundamentals intersect? Find out with this complete video compilation of Strata + Hadoop World 2014 in New York, where you’ll get a front-row seat to every keynote, workshop, and session.

Ten conference tracks were required to capture the most challenging problems and compelling opportunities in data today, with presentations from Mike Olson (Cloudera), Kim Rees (Periscopic), Roger Magoulas (O'Reilly), Douglas Merrill (ZestFinance), Amanda Cox (The New York Times), and scores of other experienced data practitioners from finance, media, government, and education.

Download these videos or stream them through our HD player, and gain a clear perspective on the future of big data, including all the analytics, architectures, techniques, tools, and technologies you need to use data successfully.

Tracks include:

  • Business & Industry: How organizations of all sizes use data to make better decisions
  • Connected World: Navigating in an always-connected, always-on world
  • Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
  • Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
  • Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
  • Machine Data: Extracting meaningful insights from data collected and generated by things
  • Security: Fighting fraud, detecting threats, increasing trust—and securing data
  • Beyond Hadoop: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
  • Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
  • The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks

Table of Contents

  1. Keynotes
    1. Open Standards and the Modern Data Center - Mike Olson 00:13:56
    2. What Would Google Do? Understanding the Future of Big Data - M. C. Srivas 00:09:07
    3. Keynote with Miriah Meyer 00:08:06
    4. Accelerating Parkinson’s Research with Big Data Technologies - Ron Kasabian 00:05:20
    5. Data & The New Era of Interactive Storytelling - Sharmila Shahani-Mulligan 00:05:12
    6. Spark Needs a Business Analyst Workflow - Ben Werther 00:05:01
    7. Statistics Without the Agonizing Pain - John Rauser 00:11:47
    8. Pax Data - Eli Collins 00:10:14
    9. The Power of Emotions: When Big Data meets Emotion Data - Rana El Kaliouby 00:10:51
    10. A New Data Science Economy - Joseph Sirosh 00:09:36
    11. Style Stalking: The Stochastic Patterns that Drive Fashion Trends - Karen Moon 00:10:03
    12. Pasta Mathematica - George Legendre 00:07:04
    13. Big Data - 2020 vision - John Schitka 00:05:21
    14. Turning Data into Decisions in a Big Data World - Rachel Hawley 00:05:12
    15. The Hidden Brain - Shankar Vedantam 00:09:13
    16. A Word Too Much Repeated Falls Out of Being - So Why is Big Data Being Talked About so Much? - Paul Zikopoulos 00:05:42
    17. Is Privacy Becoming a Luxury Good? - Julia Angwin 00:15:10
  2. Business & Industry
    1. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 1 00:49:28
    2. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 2 00:56:04
    3. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 3 00:39:44
    4. Building Privacy Protected Data Systems - Ari Gesher, John Grant, and Courtney Bowman - Part 4 00:46:44
    5. Just Enough Math - Paco Nathan and Allen Day - Part 1 00:41:36
    6. Just Enough Math - Paco Nathan and Allen Day - Part 2 00:44:25
    7. Just Enough Math - Paco Nathan and Allen Day - Part 3 00:51:45
    8. Just Enough Math - Paco Nathan and Allen Day - Part 4 00:52:30
    9. Solving the Right Problem - Max Shron and Sasha Laundy 00:43:22
    10. Transforming to a Data Driven Operations Model - Denise Asplund 00:37:49
    11. From Experiments to Insights at Pinterest - Andrea Burbank 00:38:05
    12. Case Study: A Forensic Look at Success and Failure of Predictive Analytics in Healthcare - Eugene Kolker 00:31:52
    13. The Open Data 500: Building Businesses on Free Government Data - Joel Gurin and Laura Manley 00:34:51
    14. Decided by Data: Case Studies from a Data Driven Product Culture - Nellwyn Thomas 00:44:03
    15. Preemptive Shipping: How Gilt Predicts Which Customers Will Buy Products It Has Never Sold Before - Igor Elbert 00:45:11
    16. What are VCs Really Looking For? - Michael Dauber, Renee DiResta, Matt Turck, James Cham, and Jake Flomenberg 00:42:27
    17. PDF Prison Break: Freeing Data, Empowering Experts at Edmunds.com - John Akred and Karim Qazi 00:44:37
    18. Fashioning Fit: Determining Fit Through Data - Liza Kindred, David Whittemore, Gina Mancuso, and Rasmus Thofte 00:41:29
    19. From Runway to Database, the Season's Hottest Fashion: Data - Rachel Kalmar 00:41:18
    20. How Public Data Creates Revenue for a Scandinavian Retailer - Majken Sander 00:43:05
  3. Connected World
    1. Generating Possible A/B Tests for Uber Via a City Simulation Framework - Bradley Voytek 00:40:41
    2. The State of GeoSpatial BigData - Mansour Raad 00:45:36
    3. Architecting World's Largest Biometric Identity System - Aadhaar Experience - Pramod Varma 00:52:42
    4. Pairing EMR Data with an Open Commons to Engage Communities, Provide Work Force Development and Predict Community Health Futures - Brigitte Piniewski 00:43:08
    5. Nanocubes: Interactive Visual Exploration of Large, Geospatial, Temporal Datasets - Lauro Lins 00:21:44
  4. Data Science
    1. Data Science at the Command Line - Jeroen Janssens - Part 1 00:44:54
    2. Data Science at the Command Line - Jeroen Janssens - Part 2 00:36:16
    3. Data Science at the Command Line - Jeroen Janssens - Part 3 00:41:05
    4. Data Science at the Command Line - Jeroen Janssens - Part 4 00:46:05
    5. Becoming a Scalable Data Scientist - Alice Zheng 01:06:30
    6. All the Data and Still Not Enough! - Claudia Perlich 00:41:37
    7. The Great Debate: If You Can't Code, You Can't Be a Data Scientist - Joseph Adler, Hilary Mason, Scott Nicholson, Lucian Lita, and Roger Magoulas 00:37:58
    8. Data Science Bootcamp - Laurie Skelly 00:41:24
    9. The Day Zach Galifianakis Saved Healthcare - Chris Harland 00:33:33
    10. Computing Professional Identity for the Economic Graph - Vitaly Gordon 00:42:52
    11. Multi-language Data Science with IPython, IJulia, IR, and Friends - Brian Granger and Fernando Pérez 00:40:58
    12. Using Data Science on Internet Search Behavior as a Proxy for Human Behavior - Juan Miguel Lavista 00:26:21
    13. AI in 2014: Progress and Problems - Beau Cronin 00:40:32
    14. Big Data Anti-Patterns - Douglas Moore 00:39:24
    15. Machine Learning system architecture – Microsoft Translator, a Case Study - Vishal Chowdhary 00:37:28
    16. Secure Machine Learning - Bahman Bahmani 00:40:10
    17. Fashioning Data: The Balance Between Creativity and Data-Driven Decisions - Karen Moon, Vijay Subramanian, and Liza Kindred 00:40:03
    18. Distributed Gradient Boosting Machine - Cliff Click 00:38:10
    19. Deploying and Evaluating Data Products - Josh Levy 00:28:11
  5. Design & Interfaces
    1. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 1 00:46:35
    2. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 2 00:52:20
    3. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 3 00:45:47
    4. D3.js Tutorial - D3 For Everyone! - Sebastian Gutierrez - Part 4 00:50:11
    5. Visual Change: The Power of Scaled Data Visualization in Action - Nathan Shetterley, Joshua Patterson, Allan Enemark, and Kathleen Moynahan 00:38:26
    6. The Future of Storytelling in Data Communication - Andrew Hill 00:35:54
    7. Graphistry: Scaling Visual Exploration with GPUs and Design - Leo Meyerovich 00:21:00
    8. Design and Data, A Human Centered Approach to Analysis, Experiment Design, and Visualization - Arianna McClain and Alisa Lemberg 00:37:29
    9. Visualization Typography: Designing Legends, Labels, Titles, and Text - Trina Chiasson 00:32:12
  6. Hadoop & Beyond
    1. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 1 00:37:45
    2. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 2 00:43:02
    3. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 3 00:50:56
    4. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin and Helena Edelson - Part 4 00:43:11
    5. Tackling Data Curation in Three Generations - Michael Stonebraker 00:40:20
    6. Advantages of a Domain-Specific Language Approach to Data Transformation - Joe Hellerstein and Sean Kandel 00:43:41
    7. Stories from the Trenches: The Challenges of Building an Analytics Stack - Fangjin Yang and Xavier Léauté 00:36:57
    8. Tachyon: A Memory Centric Storage System for Big Data Computing - Haoyuan Li 00:40:08
    9. Anomaly Detection with Apache Spark - Sean Owen 00:32:56
    10. Mixing Structured Data and Analytics with Spark SQL - Michael Armbrust 00:51:54
    11. Interactive Visual Data Exploration with Spark - Hossein Falaki 00:40:12
    12. Open Source Real Time BI using Storm, Hadoop, Titan, Druid & D3 - Anil Madan 00:50:36
    13. Highly Scalable Tile-Based Visualization for Exploratory Data Analysis - David Jonker and Rob Harper 00:37:35
  7. Hadoop Platform
    1. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 1 00:49:32
    2. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 2 00:38:49
    3. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 3 00:46:22
    4. Building A Data Platform - Stephen O'Sullivan, John Akred, and Richard Williamson - Part 4 00:43:41
    5. From Raw Data to Analytics with No ETL - Marcel Kornacker and Lenni Kuff 00:40:27
    6. SQL on Everything, in Memory - Julian Hyde 00:40:06
    7. From Oracle to Hadoop - Guy Harrison, David Robson, and Kathleen Ting 00:37:59
    8. Hive on Apache Tez: Benchmarked at Yahoo! Scale - Mithun Radhakrishnan 00:45:52
    9. Scaling Storm: Cluster Sizing and Performance Optimization - P. Taylor Goetz 00:39:46
    10. Building Real-time Data Products at LinkedIn with Apache Samza - Martin Kleppmann 00:49:42
    11. HBase: Where Online Meets Low Latency - Nick Dimiduk and Nicolas Liochon 00:36:03
    12. Apache HBase Application Archetypes - Jonathan Hsieh and Lars George 00:48:29
    13. Hadoop Operations - Best Practices from the Field - Chris Nauroth and Suresh Srinivas 00:40:32
    14. Resource Management with YARN - Anubhav Dhoot 00:40:02
    15. Bulk Loading Your Big Data into Apache HBase, a Full Walkthrough - Jean-Daniel Cryans 00:35:57
    16. An Independent Comparison of Open Source SQL-on-Hadoop - Greg Rahn 00:41:42
    17. Bringing PyData to Impala - Uri Laserson 00:28:45
  8. Hadoop in Action
    1. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 1 00:36:06
    2. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 2 00:50:19
    3. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 3 00:42:30
    4. Architectural Considerations for Hadoop Applications - Mark Grover, Jonathan Seidman, Gwen Shapira, and Ted Malaska - Part 4 00:42:38
    5. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 1 00:39:35
    6. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 2 00:35:04
    7. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 3 00:51:47
    8. Getting Started with HBase Application Development - Sridhar Reddy and Carol McDonald - Part 4 00:56:50
    9. How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns 00:16:54
    10. Customer Intelligence: Harnessing Elephants at Transamerica - Stephen Lloyd, Vishal Bamba, and David Beaudoin 00:42:36
    11. Transitioning from Original Big Data to the New Big Data: L.L.Bean’s Journey - Chris Wilson and Doug Bryan 00:42:36
    12. Unlocking Big Data at CERN - Matthias Braeger and Manish Devgan 00:41:13
    13. Big Data Modeling: How FICO is Turning DBAs and into Data Engineers - Lelanie Moll, Deb Brooks, and Silaphet Mounkhaty 00:39:34
    14. How LinkedIn Democratizes Big Data Visualization - Praveen Neppalli Naga, Chi-Yi Kuan, and Jonathan Wu 00:40:21
    15. Better Care with Big Data: A Panel Discussion - Ryan Goldman, Ryan Brush, Sabrina Dahlgren, Aashima Gupta, and Michael Thompson 00:38:28
    16. Renaissance in Medicine: Next-Generation Big Data Workloads - Allen Day 00:40:04
    17. Image Processing on Hadoop - Ailey Crow 00:39:18
    18. The Next Generation of Big Data in the Cloud - Daniel Weeks 00:41:17
    19. Building an Enterprise Data Hub to Bridge the Gap Between Business and IT - Sabrina Dahlgren and Rajiv Synghal 00:37:42
  9. Law, Ethics & Open Data
    1. Better Accountability Through Open Data - Merici Vinton and Micheál Keane 00:38:18
    2. Wonk, Meet Geek - Jim Adler 00:36:56
    3. You Have Zero Privacy, You Own Your Data, and Other Myths - Gilad Rosner 00:37:44
    4. Homelessness Prevention by the Numbers - Stefan Heeke and Adeen Flinker 00:44:24
    5. Why Big Data Needs Thick Data - Tricia Wang and Matt LeMay 00:38:55
  10. Machine Data
    1. Connectivity, Real-Time Data, and Edge Analytics to Enable Intelligent Machines for the Industrial Internet - Alisher Maksumov and Jean Lau 00:48:12
    2. Data is a Local Problem - Alasdair Allan 00:39:29
    3. Super Simple Internet of Things Backend: Persistence Post Hadoop with Crate Data - Jodok Batlogg 00:26:32
    4. SmartCity StreamApp: An Internet of Things Service for Real-time Traffic Management - Damian Black 00:54:21
  11. Security
    1. Resolving Data Inaccuracy - Mike Armstrong 00:35:42
    2. Big Data vs Zombies: Using Algorithms, Big Data, and Large Scale Distributed Processing to Combat Identity Fraud - Jesse Shaw 00:40:24
    3. Why Should Anyone Care at All about Privacy, Privacy Engineering, or Data? - Michelle Dennedy 00:44:13
    4. Real-Time Cyber Threat Detection with Sqrrl and Spark - Adam Fuchs 00:43:41
    5. Big Data Framework for Anomaly Detection & Root Cause Analysis on Streaming Time Series Data - Roy Singh 00:39:41
  12. Enterprise Adoption
    1. In the Data Lake - Barry Devlin 00:39:57
    2. Unseating the Giants - Monte Zweben 00:38:30
    3. What’s Holding Up Your Hadoop? - Eddie Garcia 00:29:51
  13. Spark Camp
    1. Spark Camp - Paco Nathan and Patrick Wendell - Part 1 00:49:25
    2. Spark Camp - Michael Armbrust - Part 2 00:38:10
    3. Spark Camp - Joseph Bradley - Part 3 00:45:58
    4. Spark Camp - Tathagata Das - Part 4 00:36:30
    5. Spark Camp - Sameer Farooqui and Holden Karau - Part 5 00:44:45
    6. Spark Camp - Sameer Farooqui and Holden Karau - Part 6 00:37:16
    7. Spark Camp - Sameer Farooqui and Holden Karau - Part 7 00:56:29
    8. Spark Camp - Sameer Farooqui and Holden Karau - Part 8 00:34:54
  14. Hardcore Data Science
    1. Doing the Impossible (Almost) - Ted Dunning 00:24:22
    2. Tupleware: Redefining Modern Analytics - Tim Kraska 00:29:05
    3. Data Science for Humans, Not Robots - Alice Zheng 00:22:39
    4. Big Data: Efficient Collection and Processing - Anna Gilbert 00:42:22
    5. Computational Problems in Managing Social Information - Jon Kleinberg 00:51:26
    6. Small Data Problems - Kira Radinsky 00:23:26
    7. Building and Deploying Large-scale Machine Learning Pipelines Using the Berkeley Data Analytics Stack - Ben Recht 00:28:04
    8. Learning About Music and Listeners - Brian Whitman 00:29:59
    9. Statistical Topic Modeling - Hanna Wallach 00:28:48
    10. The Aha! Moment: From Data to Insight - Dafna Shahaf 00:26:32
  15. Data-Driven Business Day
    1. Designing for Interruption - Alistair Croll 00:13:04
    2. Check Your Bias, Feed Your Empathy - Farrah Bostic 00:27:21
    3. The Data Lake Dream - Edd Dumbill 00:19:35
    4. Why Marketing’s Approach to Big Data is All Wrong - Jennifer Zeszut 00:17:51
    5. Bigger is Better, but at What Cost? Towards Understanding the Economic Value of Data - Brian d'Alessandro 00:22:08
    6. The Sounds of (Data) Silence - Jana Eggers 00:21:38
    7. Panel: Deciding Better - Joe Caserta, Farrah Bostic, and Halle Tecco 00:34:04
    8. Making Strategic Decisions: Business Requirements for Analytics Projects - Joy Beatty 00:17:14
    9. The Future of Data - Kim Rees 00:25:47
    10. How Goldman Sachs is Using Knowledge to Create an Information Edge - Peter Ferns 00:16:54
    11. The Big (Data) Picture - Rohit Jain 00:13:53
    12. Improving Healthcare Business Strategies through Lean Data Partnerships - Brigitte Piniewski 00:12:54
    13. Building with Data: Lessons from Etsy - Nellwyn Thomas 00:18:03
    14. Reducing Employee Turnover by 75%: Applying Data and Predictive Analytics to Hiring and Team Assembly - Michael Rosenbaum 00:19:34
    15. Better Accountability Through Open Data - Merici Vinton 00:16:57
    16. The Unit: Building Data Science Teams the Special Operations Way - Amy Gaskins 00:17:58
    17. MapReduce ETL Processing for Healthcare Process Improvement Dashboards - Mary Ann Wayer 00:12:49
  16. Industrial Internet
    1. Industrial Internet Day Opening Remarks - Jon Bruner 00:07:43
    2. Taking the Industrial Internet to the Ends of the Earth - Daniel Koffler 00:52:56
    3. Oceans 2.0: The Last Remaining Wild West - Ami Daniel 00:34:40
    4. Big Data Analytics: Enabling Innovation while Reducing Risk - David Simchi-Levi 00:46:17
    5. Video Analytics in the Big & Fast Streaming Data Era - Victor Fang and Yu Cao 00:36:12
    6. The Industrial Internet and the Data Revolution - Nathan Oostendorp 00:38:44
    7. Bring Your Own Internet (of Things) - Alasdair Allan 00:35:51
    8. IIOT Applied: 10 Things I Learned While Deploying an IIoT Machine Learning System - Cameron Turner 00:45:50
    9. Industrial Internet Day Closing Panel - Jon Bruner, Leo Spiegel, Edy Liongosari, and Mark Grabb 00:48:41
  17. PyData at Strata
    1. IPython - Brian Granger and Fernando Pérez 01:10:39
    2. Collaborative Data Science with coLaboratory - Kayur Patel and Kester Tong 00:17:20
    3. Intro to NumPy and matplotlib - Jake Vanderplas - Part 1 00:40:33
    4. Intro to NumPy and matplotlib - Jake Vanderplas - Part 2 00:35:20
    5. Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 1 00:46:39
    6. Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 2 00:36:33
    7. Visualizing Data with Blaze and Bokeh - Andy Terrel 00:34:30
    8. Interactive Visualization with Bokeh - Peter Wang 00:53:58
    9. SciPy – An Exploration of the Most Useful Bits - Travis Oliphant - Part 1 00:46:01
    10. SciPy – An Exploration of the Most Useful Bits - Travis Oliphant - Part 2 00:45:06
    11. New and Upcoming Features in Pandas - Wes McKinney 00:43:07
    12. High Performance Python - Trent Nelson 00:43:20
  18. Sponsored
    1. Got the T-shirt: Real Experiences from a Hadoop Veteran - Jim Scott 00:43:44
    2. See the Fastest Spark-Powered Disparate Data Blending & Analysis Solution - Vaibhav Nivargi 00:35:12
    3. Disrupting the Traditional Analyst Workflow with Platfora and Spark - Peter Schlampp and Ed Smith 00:40:15
    4. Big Data Architectural Patterns - Todd Papaioannou 00:39:38
    5. An End-to-End Approach to Offloading the Data Warehouse with Hadoop - Jorge A Lopez 00:35:10
    6. Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar and Brett Rudenstein 00:39:59
    7. Using Graph to Discover Unseen Relationships in Big Data - Mike Hoskins 00:42:43
    8. Hadoop Effortlessly: A Data Inventory is Key to Data Self-service - Moderated by: Alex Gorelik - Panelists: Suresh Srinivas, Mike Sutten, John Mount, Clark Farrey, and Sunil Soares 00:46:35
    9. Building Real-Time Platforms with MemSQL and Apache Spark - Eric Frenkiel 00:31:58
    10. Unlocking Hadoop’s Potential with YARN - Sanjay Radia 00:41:21
    11. Real-time streaming and analytics with Amazon Elastic MapReduce and Amazon Kinesis - Steve McPherson 00:33:00
    12. NoSQL Solutions for Big Data Problems - Don Pinto 00:38:24
    13. Big Data SQL and Query Franchising: An Architecture for SQL Beyond Hadoop - Dan McClary 00:38:39
    14. Drive Data Quality at Your Company: Create a Data Lake - George Corugedo 00:37:14
    15. Important Advances in Hadoop: A Panel Discussion - Joey Jablonski, Armando Costa, Jim Burmingham, and Rob Johnson 00:46:17
    16. Cloud Machine Learning - Joseph Sirosh 00:38:35
    17. Embracing Diversity - Sid Sipes 00:33:16
    18. The Art of Prediction: Seamless Visualization and Modeling With Hadoop - Adam Pilz 00:31:37
    19. Extending "Variety" of Data to "Variety" of Users - Tina Groves 00:36:38
    20. How to Architect Big Data Apps with the Lambda Architecture - with Real Work Examples on Merging Batch and Real-Time Processing - Altan Khendup and Ron Bodkin 00:42:30
    21. What do Al Capone & Hadoop Have in Common? Visualizing Data at Scale – Making Sense Out of Big Data - James Dixon 00:41:19
    22. Distributed R - A Scalable and High-performance Platform for R - Sunil Venkayala and Indrajit Roy 00:39:06
    23. Getting Big Data to Work: Agile Data Transformation in Hadoop - Stephanie McReynolds, Xavier Quintuna, Shirshanka Das, Charlie Crocker, and Anna Dorofiyenko 00:40:25
    24. Now Playing at Netflix: Advanced Decision-Making with Hadoop, Starring MicroStrategy - Michael Hiskey 00:25:30
    25. Analytics the Way Nature Intended - Donald Farmer 00:40:23
    26. Western Union: Implementing a Hadoop-based Enterprise Data Hub with Informatica - Pravin Darbare and Sumeet Agrawal 00:41:48
    27. For Red Hat, it's 1994 All Over Again - Sarangan Rangachari 00:37:10
    28. Hadoop Responsibly with Big Data Governance - Moderated by: Barry Devlin - Panelists: Sunil Soares, Joseph Dossantos, and Jay Zaidi 00:43:08
    29. Big Content: Finding the Why Behind the What - Sid Probstein 00:34:08
  19. Solutions Showcase Theater
    1. Innovative Healthcare, Tech & Retail Companies Mix CRM Info with Big Data to Make Reps 10x More Productive, 40x More Useful and 30% More Profitable - Michael Hiskey 00:10:29
    2. Real-time Classification and Sentiment Analysis of Multi-lingual Content Using Advanced Analytics on Apache Storm - Anand Venugopal 00:10:28
    3. Hadoop at Bloomberg - Sudarshan Kadambi 00:08:14
    4. EVP Data Lake: Store Everything, Analyze Anything, Build What You Need - Ryan Peterson 00:10:17
    5. 10 Amazing Things to do With A Hadoop-based Data Lake - Greg Chase 00:10:01
    6. Solve Data Ingest Limitation with High Performance Networks Offloads - Asaf Wachtel 00:08:43
    7. Real-Time Big Data Architecture @ LivePerson - Shane K. Johnson 00:10:13
    8. From Infrastructure to Data Applications - Jonathan Gray 00:09:52
    9. From Big Iron to Big Data: Offloading Data & Workloads to Hadoop at a Major US Bank - Jorge A. Lopez 00:11:06
    10. Managing Data in Regulated Industries - Jim Clark 00:08:52
    11. The Pain Curve - Lack of Automation Leads to Failure - Greg Bruno 00:09:17
    12. Building the Enterprise Data Hub - Joe Caserta 00:12:22
    13. QlikView and Big Data Analytics at King - Donald Farmer 00:10:21
    14. Driving Growth in Transportation Using Big Data and Data Science - Marie Goodell 00:08:14
    15. Competitiveness in the Age of Big Data - Satyendra Rana 00:12:49
    16. Unraveling Hadoop's Meltdown Mysteries - Sean Suchter 00:09:44
    17. Let's Stop Pretending that One Size Fits All When it Comes to the Challenges of Working with Enterprise Data - Nenshad Bardoliwalla 00:10:15
    18. Waking Analysts from their Nightmare - George Corugedo 00:10:14
    19. "Mining" the IoT for Business Value: How WWT Helped One of the Largest Mining Companies Predict Engine Failures - Yoni Malchi 00:10:09
    20. All Hands on Deck: How to Get Non-technical Business Users to Tackle Big Data so you Can Focus on Complex Queries - Amit Bendov 00:12:33
    21. The Spark-Inspired Workflow - Kevin Beyer 00:09:12
    22. Do you Prefer to Hike up Machu Picchu or Take the Train? - Todd Goldman 00:07:02
    23. Using Big Data to Improve Patient Outcomes - John Armstrong 00:10:25
    24. Get Real with Hadoop - Jim Scott 00:11:58
    25. Big Data Analytics Heavyweight Sounds Off on Financial Services Use Cases - Matt Schumpert 00:11:26
    26. Real World Showcase of How a Retail Customer Uses and Can Use Microsoft Big Data and Business Analytics Technologies - Sanjay Soni 00:09:24
    27. Using Hadoop to Run Real-Time, Operational Applications - Rich Reimer 00:09:18
    28. Automated Data Inventory for Hadoop - Oliver Claude 00:10:20
    29. Keys to Optimizing Product Inventory and Pricing at One of the Largest Global Retailers - Julien Sauvage 00:10:11
    30. Consumer Behavior Analytics with Cubes on Hadoop - Ajay Anand 00:09:34
    31. Omneo’s Enterprise Data Hub: Helping Manufacturers Save Millions - Kathleen deValk 00:16:17
    32. Building an Enterprise Grade Big Data Risk Management Solution for Financial Services - Vamsi Chemitiganti 00:09:20
    33. Orange Silicon Valley spins up private Big Data as a Service with BlueData to create on-demand Spark and Hadoop Clusters - Tom Phelan 00:09:47
    34. Everything You Don't Know About HBase in 10 Minutes or Less - Alex Newman 00:11:36
    35. Big Data News Cases… What in the World are People Doing with Hadoop? - Gord Sissons 00:10:48
    36. Build Intelligent Applications with H2O's Open Source - Joel Horwitz 00:10:09
    37. NoSQL Key Value Stores - The Key to Velocity - Brian Bulkowski 00:10:00
    38. Java & Big Data in Real Time - Matt Schuetze 00:09:23
    39. Using Operational Intelligence to Track 10M Cable TV Viewers in Real Time - Dr. William Bain 00:10:13
    40. Unlock the Value of Big Data with Hunk for Hadoop - Adrish Sannyasi 00:07:15
    41. Big Cybersecurity Data for Insider Threat Analysis - Joe Travaglini 00:06:03
    42. Customer Spotlight: Big Data, The Elephant and the Bear - Lawrence Schwartz 00:09:35
    43. Case Study: Improving Customer Experience by Employing Big Data Technologies in the Banking Industry - Martin Triska 00:10:05
    44. Better Manufacturing with Data: Using 3D Visual Analytics on the Shop Floor - Carl Byers 00:10:29
    45. MemSQL & Shutterstock: Insights in Real Time - Eric Frenkiel and Chris Fischer 00:08:14
    46. Running In-Memory Jobs and Traditional Jobs on the Same Hadoop Cluster - David Chaiken 00:10:02
    47. Data Transformation on Hadoop: Balancing Technology and Human Needs to Boost Performance and Increase ROI - Ravi Hubbly 00:08:59
    48. Big Data Analytics / IoT: New Customer Insights Using Network Data - Ankur Gupta 00:10:00
    49. Extending Enterprise Data Security to Hadoop - Raul Ortega 00:11:09
    50. Industrialized Hadoop Analytics and SQL: Unleashing the Business User - John Santaferraro 00:11:24
    51. Connection Analytics: Extracting Value from Social Networks Data - Sri Raghavan 00:10:39
    52. The Emergence of the Streamlined Data Refinery - Chuck Yarbrough 00:08:32
    53. Hardware Still Matters: Manageable Infrastructure Platforms for Dynamic Big Data Environments - Robert Novak 00:09:57