Strata + Hadoop World San Jose 2015: Video Compilation

Video description

Go right to the heart of big data

Find out what happens when cutting-edge data science and new business fundamentals intersect. With this complete video compilation, you’ll be on hand for every presentation—whether it’s a keynote, a tutorial, or a workshop—held at the Strata Conference + Hadoop World Conference in San Jose, California during February, 2015.

In ten tracks, this year’s conference captured the most challenging problems and compelling opportunities in data today, including:

  • Business & Industry: How organizations of all sizes use data to make better decisions
  • Connected World: Navigating in an always-connected, always-on world
  • Data Science: Everything from the latest algorithms and advances in machine learning to cultural change and team-building
  • Design & Interfaces: Capturing user experience, design, new interfaces, and visualization
  • Hadoop & Beyond: How tools like Cassandra, Storm, Accumulo, Kafka and Spark fit in the data science toolkit
  • The Hadoop Platform: A deep dive into the dominant big data stack, with practical lessons and integration tricks
  • Hadoop in Action: Real-world case studies of the Hadoop ecosystem in action
  • Law, Ethics & Open Data: Issues on governance, ethics, and compliance in the era of open data
  • Machine Data: Extracting meaningful insights from data collected and generated by things
  • Security: Fighting fraud, detecting threats, increasing trust—and securing data

You also have complete access to other conference events, such as Data-Driven Business Day, Hardcore Data Science Day, and Spark Camp.

Download these videos or stream them through our HD player, and gain a clear perspective on data, including all the analytics, architectures, techniques, tools, and technologies you need to use it successfully.

Publisher resources

View/Submit Errata

Table of contents

  1. Business Industry
    1. Hiding the Elephant - How Big Data Apps Make Magic While Hiding Hadoop - Ross Fubini, Ari Gesher, Wei Zheng, Omer Trajman, and Sylvain Le Borgne
    2. Pumping Up Retail Profits with Predictive Analytics - Adam Jorgensen
    3. If You Don't Have Anything Nice to Say, Please Say Something: Increasing Honesty in Airbnb Reviews - Dave Holtz
    4. Making Big Data Usable in Market Regulation - Scott Donaldson
    5. WANTED: Women in Data, Tech, and STEM - Moderated by: Cornelia Lévy-Bencheton, Panelists: Michele Chambers, Alice Zheng and Neha Narkhede
    6. Helping the Republican Party Use Data and Engineering to Win the US Senate - Azarias Reda
    7. Using Big Data to Identify the World's Top Experts - Nima Sarshar
    8. The New Data Organization: What do Successful Data-Driven Companies Look Like? - John Haddad
    9. Architecting for the Cloud - Chris Neumann
    10. Solving Customer Problems with Big Data across Thomson Reuters - Brian Ulicny
  2. Connected World
    1. Improving Business Operations with Predictive Maintenance and Service - Oliver Mainka
    2. Forget the Valley: Middle America Is Where Data Is Having Its Biggest Impact - Matt Asay
    3. Robot Reporters: How The Associated Press Embraced Data Automation - Adam Smith
    4. Which is More Interesting - Millions of Thermostats, or Millions of Minds in the Internet of Things? - Doug Stein
    5. Economic Insights from LinkedIn's Professional Network - June Andrews
    6. Using Data to Help Farmers Feed Growing Populations in a Changing Climate - Stewart Collis
  3. Data Science
    1. Bots Don't Drink Soda: Using Big Data to Find Real People - Michael Brown
    2. How to Detect Anomalies in High Cardinality Dimensions and Make Them Actionable - Shankar Vedaraman and Christopher Colburn
    3. Big Data and Design Working Together – When the Magic Happens - George Roumeliotis
    4. HOWTO Make Your Future Data Scientists Love You - Sasha Laundy
    5. From Academia to Data Science: Lessons Learned Founding the Insight Data Science Fellows Program - Jake Klamka and Kathy Copic
    6. The Two Cultures of People Science - Michelangelo D'Agostino
    7. Pro Bono Data Science in Action - Helping Teens in Crisis - Noelle Sio
    8. Data Applications: Speed vs Accuracy - Danielle Ben-Gera
    9. Behavior-driven Machine Translation - Irina Borisova and Asim Mathur
    10. Playing Nice in the Product Playground: Data Scientists, Engineers, and Product Managers Working Together to Create Innovative Data Products - Anu Tewary, Lucian Lita and Jonathan Goldman
    11. Machine Learning Building Blocks and the Workload Optimization Framework - Shai Fine
    12. Robust Event Detection Using Diverse Data Types - Harrison Mebane
    13. Purposeful Education with Job Market Data for Students, Educators, and Institutions - Jike Chong
    14. Real-Time Relevance for Mobile at LinkedIn - Michael Conover
  4. Design Interfaces
    1. Building Interactive Data Visualizations - Jonathan Dinu - Part 1
    2. Building Interactive Data Visualizations - Jonathan Dinu - Part 2
    3. Building Interactive Data Visualizations - Jonathan Dinu - Part 3
    4. Building Interactive Data Visualizations - Jonathan Dinu - Part 4
    5. The Human-Data Interface: How to Design for “Irrational” Data Consumers - Cathy Tanimura
    6. Designing Delightful Data Products - Alonzo Canada
    7. Designing for Data - Etan Lightstone
    8. Humanizing Data - Building Systems and Interfaces for Domain Experts - Ari Gesher and James Thompson
    9. Architecting Interfaces that Learn - Tye Rattenbury and Jeffrey Heer
    10. What Designers and Data Scientists Can Learn from Each Other - Danyel Fisher and Miriah Meyer
    11. Data (Art ) Science - Eric Colson
    12. Designing with Data: A Human-centered Approach to Data-driven Design - Arianna McClain and Coe Leta Stafford
  5. Hadoop Beyond
    1. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 1
    2. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 2
    3. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 3
    4. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 4
    5. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Reza Zadeh - Part 5
    6. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Krishna Sankar - Part 6
    7. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan and Christopher Fregly - Part 7
    8. Spark Camp: An Introduction to Apache Spark with Hands-on Tutorials - Paco Nathan - Part 8
    9. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 1
    10. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 2
    11. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 3
    12. Getting Started with Interactive SQL-on-Hadoop - John Russell and Alan Choi - Part 4
    13. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 1
    14. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 2
    15. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 3
    16. Owning Time Series With Team Apache: Cassandra, Spark, Spark Streaming, and Kafka - Patrick McFadin - Part 4
    17. Going Real-time: Data Collection and Stream Processing with Apache Kafka - Jay Kreps
    18. Stream Processing Everywhere - What to Use? - Jim Scott
    19. Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Engine - Richard Williamson
    20. From MapReduce to Programming Frameworks: Making Sense of Cloud Dataflow, Spark and New Tools for Big Data - Eric Schmidt
    21. Drill into Drill: How Providing Flexibility and Performance is Possible - Jacques Nadeau
    22. Three Approaches to Scalable Data Curation - Michael Stonebraker
    23. One Billion Objects in 2GB: Big Data Analytics on Small Clusters with Doradus OLAP - Randy Guck
    24. Big Data at Netflix: Faster and Easier - Kurt Brown
    25. Search Evolved: Unraveling Your Data - Costin Leau
    26. The Year in Review - Key Changes in the Hadoop Platform in the Past 12 Months - Jairam Ranganathan
    27. Building Interactive Data Applications at Scale - Fangjin Yang and Vadim Ogievetsky
    28. YARN vs. MESOS: Can’t We All Just Get Along? - Ted Dunning
  6. Hadoop Platform
    1. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 1
    2. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 2
    3. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 3
    4. Apache Hadoop Operations for Production Systems - Kathleen Ting, Philip Zeyliger, Philip Langdale, and Miklos Christine - Part 4
    5. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 1
    6. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 2
    7. Building an Apache Hadoop Data Application - Tom White, Joey Echeverria, and Ryan Blue - Part 3
    8. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 1
    9. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 2
    10. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 3
    11. Building A Data Platform - Manu Mukerji, Stephen O'Sullivan, and John Akred - Part 4
    12. Hadoop Puzzlers Reloaded - Aaron Myers and Daniel Templeton
    13. The Future of Apache Hadoop Security - Joey Echeverria
    14. Making HBase Accessible to Scientists - Spencer Herath and Aaron Benz
    15. Data Discovery on Hadoop - Sumeet Singh and Thiruvel Thirumoolan
    16. Yarns about YARN: Migrating to MapReduce v2 - Kathleen Ting and Miklos Christine
    17. Maintaining Low Latency while Maximizing Throughput on a Single Cluster - Yuliya Feldman
    18. Running Production Hadoop Clusters in Docker Containers - Nasser Manesh
    19. How to use Parquet as a Basis for ETL and Analytics - Julien Le Dem
    20. Adding Insert, Update, and Delete to Hive - Alan Gates
    21. Top Ten Pitfalls to Avoid in a SQL-on-Hadoop Implementation - Monte Zweben
  7. Hadoop in Action
    1. The Evolution of Hadoop at Spotify - Through Failures and Pain - Josh Baer and Rafal Wojdyla
    2. From Source to Solution: Building A System for Machine and Event-Oriented Data - Eric Sammer
    3. Design Patterns for Real Time Streaming Data Analytics - Sheetal Dolas
    4. Stock Market Order Flow Reconstruction in HBase on AWS - Tigran Khrimian
    5. Ticketmaster: Marketing and Selling the World's Tickets - John Carnahan
    6. Designing Data Architectures for Robust Decision Making - Gwen Shapira
    7. Friction-Free ETL: Automating Data Transformation with Impala - Marcel Kornacker
    8. The Truth About MapReduce Performance on SSDs - Yanpei Chen and Karthik Kambatla
    9. Hadoop as a Platform for Genomics - Allen Day and Sungwook Yoon
  8. Law, Ethics Open Data
    1. Data Scientists and Lawyers - a Marriage made in Silicon Valley - Laura Fennell and Bill Loconzolo
    2. Big Data Ethics and a Future for Privacy - Jonathan King
    3. How Minority Becomes Majority - A Study of Gerrymandering - Tatsiana Maskalevich
  9. Machine Data / IoT
    1. Transformational Case Studies in Machine Data Telemetry - Chad Meley and John Kreisa
    2. TSAR (the TimeSeries AggregatoR) - How to Count Tens of Billions of Daily Events in Real Time Using Open Source Technologies - Anirudh Todi
    3. An Open Source Approach to Gathering and Analyzing Device Sourced Health Data - Ian Eslick
    4. Building Adaptive Apps with APIs and Data - Anant Jhingran
    5. Dynamic Events in Massive Data Streams, from Astrophysics to Marketing Automation - Kirk Borne
    6. Forecasting Space-time Events - Jeremy Heffner
    7. The IoT P2P Backbone - Bruno Fernandez-Ruiz
    8. The Sushi Principle: Raw Data Is Better - Joseph Adler and Robert Johnson
    9. Practical Methods for Identifying Anomalies That Matter in Large Datasets - Robert Grossman
    10. Streaming Analytics: It’s Not The Same Game - Subutai Ahmad
    11. Machine Learning For Oil Exploration - Ben Hamner
  10. Security
    1. Data Science vs. The Bad Guys: Using Data to Defend LinkedIn Against Fraud and Abuse - David Freeman
    2. How to Ensure Your Hadoop Installation is Not the Next Big Data Breach - Terence Spies
    3. Securing the New Wearable World - Gary Davis
    4. The Physics of Apache Hadoop: Choosing the Right Hardware and OS Configuration Mix for Your Workloads - Woody Christy, Steve Anderson, Patrick Schots and Floris Grandvarlet
  11. Enterprise Adoption
    1. Database History from Codd to Brewer and Beyond - Douglas Turnbull
    2. Ideal Platform for Managing Log Data: Search or SQL? - Vinayak Borkar
    3. Getting Started with Data Governance: Paths Converge from Multiple Starting Points - Paula Wiles Sigmon
    4. Don’t Let Today’s Demands Kill Tomorrow’s Workforce! - Martin Waterhouse
  12. Spark in Action
    1. Lessons from Running Large Scale Spark Workloads - Reynold Xin and Matei Zaharia
    2. Introducing Hive's New Execution Engine - Spark - Xuefu Zhang and Chengxiang Li
    3. Machine Learning with H2O and Spark - Cliff Click and Michal Malohlava
    4. Spark Streaming - The State of the Union, and Beyond - Tathagata Das
    5. Why Spark Is the Next Top (Compute) Model - Dean Wampler
    6. Tuning and Debugging in Apache Spark - Patrick Wendell
    7. Everyday I’m Shuffling - Tips for Writing Better Spark Programs - Vida Ha and Holden Karau
  13. Hardcore Data Science
    1. Beyond DNNs towards New Architectures for Deep Learning, with Applications to Large Vocabulary Continuous Speech Recognition - Tara Sainath
    2. On the Computational and Statistical Interface and "Big Data" - Michael Jordan
    3. Interpretable Machine Learning in Practice - Maya Gupta
    4. Visual Understanding Beyond Naming - Alyosha Efros
    5. Finding Repeated Structure in Time Series Data: Commercial and Scientific Opportunities - Eamonn Keogh
    6. Tensor Methods for Large-scale Unsupervised Learning: Applications to Topic and Community Modeling - Anima Anandkumar
    7. A Quest for Visual Intelligence in Computers - Fei-Fei Li
    8. Graph Mining for Log Data - David Andrzejewski
    9. Why Julia's Important for Data Science - John Myles White
    10. Drugs, DNA, and Dinosaurs: Building High Quality Knowledge Bases with DeepDive - Chris Re
  14. Data-Driven Business Day
    1. Don't Let Data Get in the Way of a Good Story - Mark Madsen
    2. Big Data Stories: Decisions That Drive Successful Projects - Ellen Friedman
    3. Making Business Model Innovation More of a (Data) Science - Jerry Overton
    4. Data "Driven" is Really Data "Accessible” - Ann Johnson
    5. When Ones and Zeros Can Put Billions at Risk... - Anne Johnson
    6. Find the Business in Your Data - Arnab Chakraborty, Dr. Alexander Prinz, Reena Tiwari and Anne Johnson
    7. Tech Magic: 10 Disruptors Shaping the Sensed World - Leah Hunter
    8. Leveraging Big Data and Data Science in Upstream Oil and Gas Industry - Satyam Priyadarshy
    9. Using Data from Many Streams to Drive Social Impact - India Swearingen
    10. Smartphone Data: Tell the Story of People's Lives - Joerg Blumtritt
    11. Big Data Impacts Marketing Productivity at Cisco - Reena Tiwari
    12. National Drug Index: Revealing Prescription Inflation in the US - AJ Loiacono
    13. Digital Business Era: Stretch Your Boundaries - Prith Banerjee
    14. Data Products and the Wearables Revolution - Emi Nomura
    15. Unlocking the Data in Paper: A Case Study of New York Life - Kuang Chen
  15. R Day
    1. An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 1
    2. An Easy System for Data Wrangling With tidyr and dplyr - Garrett Grolemund - Part 2
    3. A Reactive Grammar of Graphics with ggvis - Winston Chang
    4. Reproducible R Reports with R Markdown - Garrett Grolemund - Part 1
    5. Reproducible R Reports with R Markdown - Garrett Grolemund - Part 2
    6. Analytic Web Applications with Shiny - Winston Chang - Part 1
    7. Analytic Web Applications with Shiny - Winston Chang - Part 2
  16. PyData
    1. Machine Learning with scikit-learn - Andreas Mueller - Part 1
    2. Machine Learning with scikit-learn - Andreas Mueller - Part 2
    3. Slicing Through Data with NumPy - Jennifer Klay - Part 1
    4. Slicing Through Data with NumPy - Jennifer Klay - Part 2
    5. Intro to Numba and Performance Python - Travis Oliphant - Part 1
    6. Intro to Numba and Performance Python - Travis Oliphant - Part 2
    7. Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 1
    8. Python Data Applications with Blaze and Bokeh - Andy Terrel and Matthew Rocklin - Part 2
    9. Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 1
    10. Analytics Beyond the Basics with pandas and SQL - Wes McKinney - Part 2
  17. Large-scale Machine Learning Day
    1. Large-scale Machine Learning Day - Yucheng Low - Part 2
    2. Large-scale Machine Learning Day - Yucheng Low - Part 3
    3. Large-scale Machine Learning Day - Alice Zheng - Part 4
    4. Large-scale Machine Learning Day - Chris DuBois - Part 5
    5. Large-scale Machine Learning Day - Alice Zheng - Part 6
    6. Large-scale Machine Learning Day - Shawn Scully - Part 7
  18. Sponsored
    1. Bringing OLAP Fully Online: Analyze Changing Datasets in MemSQL and Spark with Pinterest Demo - Eric Frenkiel
    2. From Domain-specific Solutions to an Open Platform Architecture for Big Data Analytics Based on Hadoop and Spark - Vin Sharma and Jason (Jinquan) Dai
    3. SAS Analytic Solutions Running on a Hadoop Cluster using YARN - James Kochuba
    4. Global Hadoop: Storage and Compute Challenges in Multi-Data Center Deployments - Jagane Sundar
    5. SQL in Hadoop: To Boldly Go where No Data Warehouse has Gone Before - Emma McGrattan
    6. A Simple, Fast Approach to Analytics for Big Data/IoT with kdb+ - Fintan Quill and Doug Talbott
    7. Scalable Realtime Analytics with declarative SQL like Complex Event Processing Scripts - Srinath Perera
    8. The Data Unification Imperative - Andy Palmer
    9. From Monitoring To Monetization With The Data Lake - Bill Schmarzo
    10. Breaking Through the Top 5 Enterprise Data Quality Roadblocks Inside Hadoop - George Corugedo
    11. Data Dexterity: Immediate Visibility Into All Information - Greg Goldsmith
    12. Extreme Sports and Beyond: Exploring a New Frontier in Data - Josh Byrd and Darren Chinen
    13. Cloud Machine Learning - Joseph Sirosh
    14. Credit Suisse Puts Vendors in the Hot Seat on Data Quality and Governance - Nitesh Ambastha, David Brewster and Nenshad Bardoliwalla
    15. Hive on Spark is Blazing Fast... Or Is It? - Carter Shanklin and Mostafa Mokhtar
    16. Tackling the World’s Biggest Data: Human Data - Richard Caudle
    17. Case Study: Data Warehousing in the Cloud with Snowflake at Kixeye - Jon Bock
    18. PostgreSQL Rising: The Other Elephant in the Room - Ozgun Erdogan
    19. Your First Big Data Application on AWS - Rahul Pathak
    20. Smart Enterprise Big Data Bus for the Modern Responsive Enterprise - Anand Venugopal
    21. Driving Better Business Results at Allstate with Machine Learning on Hadoop - Ryan Michaluk and Alexander Gray
    22. Big Data Architectural Pattern - Clint Sharp
    23. Perform Fast Analytics on Hadoop Data Scalable Predictive Analytics with Open Innovations from HP Vertica - Steve Sarsfield and Sunil Venkayala
    24. Running Hadoop-as-a-Service in the Cloud - Lance Olson
    25. Real World Use Cases: Hadoop and NoSQL in Production - Ted Dunning and Ellen Friedman
  19. Keynotes
    1. Hadoop's Impact on the Future of Data Management - Amr Awadallah
    2. Close Encounters with the Third Kind of Database - Eric Frenkiel
    3. Impacting Business as it Happens - Anil Gadre
    4. A Bigger Lens Through which to View the World- the IBM Twitter Alliance - Adam Kocoloski
    5. Data Science: Where are We Going? - DJ Patil
    6. The Emerging Age of Data-Driven Policy Design: Examples from Trying to Manage the Global Climate - Solomon Hsiang
    7. Data: Open for Good and Secure by Default - Eddie Garcia
    8. Year Zero: How We’ll Run Our Lives in Ten Years’ Time - Alistair Croll
    9. Intel and the Role of Open Source in Delivering on the Promise of Big Data - Michael Greene
    10. Big Data Lessons from Our Cybernetic Past - Eden Medina
    11. New Directions for Spark in 2015 - Matei Zaharia
    12. A New Approach to Big Data - Roman Shaposhnik
    13. Charting a Path Forward: The Future of Data Visualization - Jeffrey Heer
    14. Connected Cows? - Joseph Sirosh
    15. Startup Showcase Winner Announcement
  20. Solutions Showcase Theater
    1. The Briefcase Cluster - Enabling Big Data Everywhere - Jim Scott
    2. Why Event Analytics Matter - Rohit Shrivastava
    3. Cracking the Data Conundrum - Steffin Harris
    4. Smart Data for Smarter Utilities - Irshad Raihan
    5. The Value of Churn Analytics at Cisco - Ivan Chen and Phil Hodsdon
    6. Big Data Governance - Felix Van de Maele
    7. Early Warnings for Customer Churn at a Leading Cloud Technology Firm! - Umair Rauf
    8. Harnessing Big Social Data to Deliver Human Data Intelligence - Jason Rose
    9. Operationalizing Hadoop – Are You Ready? - Valerie Fowler
    10. Multimedia Giant Turns Big Data into Real-Time Customer Insights - Brian Garrett
    11. Data Wrangling in the Wild - Sean Ma
    12. StreamAnalytix-Developing Enterprise Class, Real-time Streaming Applications on Apache Storm - Anand Venugopal
    13. Gaining Value From Data Where It's Born - Ryan Peterson
    14. Build a Foundation for Self-Service Data Prep, Analytics, and Governance - Oliver Claude
    15. Connecting the Big-Data Driven Enterprise in Online Retail - Ashley Stirrup
    16. Leading Telecommunications Company Uses BlueData to Spin Up Local, On Demand Hadoop and Spark Clusters to Enable Agile Deployment of Big Data Tools and Technologies - Nanda Vijaydev
    17. Taming Data Variety: Intelligent Solutions Using Machine Learning and Expert Crowdsourcing - Alan Wagner
    18. Everything You Need To Know About HBase in 10 Minutes or Less - Alex Newman
    19. The Emergence of the Data Refinery - Chuck Yarbrough
    20. Big Data Cluster Planning and Optimization Using Wolf Island Simulation Technology - Laurent Isenegger
    21. Prosthetic Implant Surgery - Where Big Data Means Big Savings - Rola Shaar
    22. Close the Skills Gap and Deliver Rapid Business Value with Big Data Apps - Manan Goel
    23. Distributed R - Scaling the R Language for Even Bigger Data - Sunil Venkayala
    24. Transforming Big Data Landscape with Apache Spark - Rishi Yadav
    25. Data Warehousing in the Cloud - Jon Bock
    26. Proactive Product Intelligence for Electronics - Rami Lokas
    27. Massive-Scale Security Incident Response Leveraging a Hadoop Architecture - Michael A. Davis
    28. Don’t be a Hadoop Breach Headline - Discovery and Sensitive Data in Hadoop - Jeremy Stieglitz
    29. Big Data vs. Climate Change - Srivatsan Ramanujam and John Cardente
    30. ZEAS – Enabling anyone to create Hadoop Enterprise applications fast using a GUI - Aditya Agrawal
    31. Power Tools for Big Data Analytics - Dan Steinberg
    32. Big Data on OpenStack - Kirk Lewis and Frank Rego
    33. Fighting ATM Fraud in Real Time with Hadoop Analytics - Christy Maver
    34. Scale Big Data cost down, while scaling performance out. An NTT mobile personalization retrospective, re-thinking the Big Data solution stack. - Robert Greene
    35. Dato Enables Large-Scale Deduplication at Zillow using GraphLab Create - Rajat Arya
    36. To Catch a Thief with Big Data - Kevin Petrie
    37. Jump into the Data Lake with Hadoop-Scale Data Integration - Greg Benson
    38. Predicting The Future To Improve Customer Satisfaction - Joe Rossi
    39. The Practical, Profitable Magic of Prescriptive Analytics - Andy Flint
    40. Changing the Culture Around Data: Empowering More People with Analytics - Gary Cottrell
    41. How Havas Media Found New Revenue Streams with UNIFi Software - Sean Keenan
    42. What Enterprises Can Learn From Real-Time Bidding - Peter Corless
    43. Big Data and the Data Quality Imperative - Ed Wrazen
    44. Tapjoy Scales and Saves Costs with Riak - Tom Sigler
    45. Smart Execution: How to Optimize Performance by Intelligently Leveraging Multiple Hadoop Analytics Engines - Matt Schumpert
    46. Jagex Game Studio Case Study - Gregory McPhee
    47. Supercharge Sqoop with magical JDBC drivers - Sumit Sarkar
    48. Big Data Analytics: Diverse Use Cases, Diverse Architectures - Ben Conners
    49. Accelerate your data with SequoiaDB - Tao Wang
    50. Building reliable Hadoop clusters with two copies - Iyer Venkatesan

Product information

  • Title: Strata + Hadoop World San Jose 2015: Video Compilation
  • Author(s):
  • Release date: March 2015
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 9781491924143