Strata Conference Santa Clara 2012: Video Compilation

Video description

The future clearly belongs to those who understand how to collect and use their data successfully, and the time to get started is now. At the Strata Conference, people from tech, marketing, and many other fields gathered to learn the latest skills, tools, and technologies for making a data-driven business work. This video compilation offers you a front-row seat for every tutorial, session, and keynote at the conference.

View thought-provoking keynotes from industry leaders such as Avinash Kaushik (Market Motive), Coco Krumme (MIT Media Lab), Dave Campbell (Microsoft), and Doug Cutting (Cloudera). Then sit back and take in practical and inspiring sessions in tracks including Data Science, Business & Industry, Visualization & Interface, Hadoop & Big Data, Policy & Privacy, and Domain Data.

You’ll also get the complete Strata Jumpstart, a day-long bootcamp for business leaders who want to become data-driven. Download these videos or view them through our HD player, and discover how the world of big data can, and will, affect your organization.

Here are just a few of the sessions you’ll receive in this video package:

Data Science:

  • The Two Most Important Algorithms in Predictive Modeling Today—Jeremy Howard (Kaggle), Mike Bowles (Sole Proprietor)
  • Architecting Virtualized Infrastructure for Big Data—Richard McDougall (VMware)

Business & Industry:

  • Data Jujitsu: The Art of Turning Data into Product—DJ Patil (Greylock Partners)
  • Improving Productivity Using Real-Time Data—Jacomo Corbo (QuantumBlack)

Visualization & Interface:

  • Science of Visualization—Jock Mackinlay (Tableau Software)
  • Roll Your Own Front End: A Survey of Creative Coding Frameworks—Michael Edgcumbe (Columbia University), Eric Mika (The Department of Objects)

Hadoop & Big Data:

  • I Didn't Know You Could Do All that with Hadoop—Jack Norris (MapR)
  • Storm: distributed and fault-tolerant realtime computation—Nathan Marz (Twitter)

Domain Data:

  • Understanding Social Contagion—Marcel Salathé (Penn State University)
  • Changing Data Standards from Wall Street to DC and Beyond—John Mulholland (Fannie Mae)

Table of contents

  1. Strata Conference 2012: Day 1
    1. SQL and NoSQL Are Two Sides Of The Same Coin
    2. From Knowing What To Understanding Why
    3. The Model and the Train Wreck: A Training Data How-to
    4. Corpus Bootstrapping with NLTK
    5. The Importance of Importance: An Introduction to Feature Selection
    6. Social Network Analysis Isn't Just For People
    7. Array Theory vs. Set Theory in Managing Data
    8. Survival Analysis for Cache Time-to-Live Optimization
    9. The Data Science Debate
    10. Introduction to Apache Hadoop Part 1
    11. Introduction to Apache Hadoop Part 2
    12. Introduction to Apache Hadoop Part 3
    13. Introduction to Apache Hadoop Part 4
    14. The Two Most Important Algorithms in Predictive Modeling Today Part 1
    15. The Two Most Important Algorithms in Predictive Modeling Today Part 2
    16. The Two Most Important Algorithms in Predictive Modeling Today Part 3
    17. The Two Most Important Algorithms in Predictive Modeling Today Part 4
    18. Large scale web mining Part 1
    19. Large scale web mining Part 2
    20. Large scale web mining Part 3
    21. The Craft of Data Journalism Part 1
    22. The Craft of Data Journalism Part 2
    23. The Craft of Data Journalism Part 3
    24. The Craft of Data Journalism Part 4
    25. Big Data Without the Heavy Lifting Part 1
    26. Big Data Without the Heavy Lifting Part 2
    27. Big Data Without the Heavy Lifting Part 3
    28. Big Data Without the Heavy Lifting Part 4
    29. Big Data Entity Extraction With Less Work and Less Code Part 1
    30. Big Data Entity Extraction With Less Work and Less Code Part 2
    31. Big Data Entity Extraction With Less Work and Less Code Part 3
    32. Big Data Entity Extraction With Less Work and Less Code Part 4
    33. Introduction to R for Data Mining Part 1
    34. Introduction to R for Data Mining Part 2
    35. Introduction to R for Data Mining Part 3
    36. Introduction to R for Data Mining Part 4
    37. Building Applications with Apache Cassandra Part 1
    38. Building Applications with Apache Cassandra Part 2
    39. Building Applications with Apache Cassandra Part 3
    40. Building Applications with Apache Cassandra Part 4
    41. Hadoop Data Warehousing with Hive Part 1
    42. Hadoop Data Warehousing with Hive Part 2
    43. Hadoop Data Warehousing with Hive Part 3
    44. Hadoop Data Warehousing with Hive Part 4
    45. Hands-on Visualization with Tableau Part 1
    46. Hands-on Visualization with Tableau Part 2
    47. Hands-on Visualization with Tableau Part 3
    48. Hands-on Visualization with Tableau Part 4
    49. Designing Data Visualizations Workshop Part 1
    50. Designing Data Visualizations Workshop Part 2
    51. Designing Data Visualizations Workshop Part 3
    52. Designing Data Visualizations Workshop Part 4
    53. Developing applications for Apache Hadoop Part 1
    54. Developing applications for Apache Hadoop Part 2
    55. Developing applications for Apache Hadoop Part 3
    56. Developing applications for Apache Hadoop Part 4
    57. What Marketers Can Learn From Analysts
    58. Jumpstart Welcome
    59. Big Data and Supply Chain Management: Evolution or Disruptive Force?
    60. Ammunition for the CFO: How to be a Hard-Nosed Business Customer for Analytics
    61. 3 Essential Skills of a Data Driven CEO
    62. Business Intelligence: What have we been missing?
    63. Do it Right: Proven Techniques for Exploiting Big Data Analytics
    64. The Business of Big Data
    65. Big Data, Serious Games, and the Future of Work
    66. It's Not Just About the Data... the Power of Driving Impact Through Intent and Interconnectedness
    67. Wrap-up Session
  2. Strata Conference 2012: Day 2
    1. The Apache Hadoop Ecosystem
    2. Decoding the Great American ZIP myth
    3. Guns, Drugs and Oil: Attacking Big Problems with Big Data
    4. Machine Learning and Big Data: Sustainable Value or Hype?
    5. Learning Analytics: What Could You Do With Five Orders of Magnitude More Data About Learning?
    6. A Big Data Imperative: Driving Big Action
    7. The Information Architecture of Medicine is Broken
    8. Do We Have The Tools We Need To Navigate The New World Of Data?
    9. Street Fighting Data Science
    10. Data Ingest, Linking, and Data Integration via Automatic Code Generation
    11. Disambiguation: Embrace wrong answers and find truth
    12. Netflix recommendations: beyond the 5 stars
    13. Data Science in Product Development
    14. Mo' Data, Mo' Problems
    15. Business Management Strategies for Big Data
    16. Becoming a Data-Driven Organization
    17. Building a Data Strategy: Data Enabling Toys at Leapfrog
    18. Analytics in a Community-Driven Fashion Retailer
    19. Data Science in Marketing Analytics
    20. Science of Visualization
    21. Effective Data Visualization
    22. Building a Data Narrative: Discovering Haight Street
    23. Crafting Meaningful Data Experiences
    24. Roll Your Own Front End: A Survey of Creative Coding Frameworks
    25. Sketching With Data
    26. The Future of Hadoop: Becoming an Enterprise Standard
    27. Hadoop + JavaScript: what we learned
    28. Architecting Virtualized Infrastructure for Big Data
    29. Aggregating and serving local places data and ads at Citygrid
    30. Exploring Social Data: Use Cases for Real-World Application
    31. Understanding Social Contagion
    32. Changing Data Standards from Wall Street to DC and Beyond
    33. Big Data: Wall Street Style
    34. Big Data = Bigger Metadata
    35. Linked Data: Turning the Web into a Context Graph
    36. Data as a Strategic Weapon - Walmart, Netflix and Apigee Panel Discussion
    37. Creating Real Business Value with Big Data Analytics
    38. Getting the Most from Your Hadoop Big Data Cluster
    39. Amazon DynamoDB: A seamlessly scalable NoSQL service
    40. Turning Big Data Into Competitive Advantage
    41. Unleash Insights On All Data With Microsoft Big Data
    42. SQLFire - An Ultra-fast, Memory-optimized Distributed SQL Database
    43. MapReduce for the Rest of Us: Unlocking Data Science for the Business User
    44. Automated Understanding - The Next Evolution in Big Data Analytics
    45. RHadoop, R meets Hadoop
    46. Monitoring Apache Hadoop - a big data problem?
    47. How to develop Big Data Pipelines for Hadoop
    48. How Crunch Makes Writing, Testing and Running of MapReduce Pipelines Easy, Efficient and Even Fun!
    49. Analyzing Hadoop Source Code with Hadoop
    50. Strata 2012 Startup Showcase
  3. Strata Conference 2012: Day 3
    1. Democratization of Data Platforms
    2. 5 Big Questions about Big Data
    3. The Trouble with Taste
    4. Embrace the Chaos
    5. Open Data and the Internet of Things
    6. Big Data's Next Step: Applications
    7. Heritage Provider Network Announces the Winner of the Second Heritage Health Progress Prize
    8. Using Google Data for Short-term Economic Forecasting
    9. Is this normal? Finding anomalies in real-time data
    10. From Predictive Modeling to Optimization: The Next Frontier
    11. Mining Unstructured Data: Practical Applications
    12. Migratory data: the distributed data you carry with you
    13. Humans, Machines, and the Dimensions of Microwork
    14. Big Data and Bibliometrics: Crowdsourcing the World's Largest Database of Research
    15. Democratizing BI at Microsoft: 40,000 Users and Counting
    16. Mining the Eventbrite Social Graph for Recommending Events
    17. Data Jujitsu: The Art of Turning Data into Product
    18. Data Marketplaces for your extended enterprise: Why Corporations Need These to Gain Value from Their Data
    19. Big Data Meets Big Weather
    20. Improving Productivity Using Real-Time Data
    21. Video Graphics - Engaging and Informing
    22. Rich Sports Data and Augmented Reality
    23. Visualizing Geo Data
    24. Beautiful Vectors: Emerging Geospatial technologies in the browser
    25. From Big Data to Big Insights
    26. Exploring the Stories Behind the Data
    27. Hadoop Analytics in Financial Services
    28. Using Map/Reduce To Speed Analysis of Video Surveillance
    29. Beyond Map/Reduce: Getting Creative With Parallel Processing
    30. Petabyte Scale, Automated Support for Remote Devices
    31. Big Analytics Beyond the Elephants
    32. If Data Wants to Be Free, is Privacy a Prison?
    33. Pretty Simple Data Privacy
    34. OODA Loop: How to Understand the Use Cases for Big Data
    35. It's Not Junk [Data] Anymore
    36. Big Data for the Common Good
    37. Personalized Medicine and Individual Cancer Care, it is a data problem
    38. Solving big data analytics with an emerging data-centric language
    39. Big Data and Machine Learning: A Reality Check
    40. Big Data Big Costs?
    41. Big Data Meets the Big Cloud: How To Monitor Thousands of Servers
    42. Big Data and the Social Firehose
    43. Big Data Applications in Action
    44. Start Innovating! Crowdsourcing and Big Data
    45. Apache Cassandra: NoSQL Applications in the Enterprise Today
    46. Storm: distributed and fault-tolerant realtime computation
    47. Analytics from 330 million smartphones
    48. Connecting Millions of Mobile Devices to the Cloud
    49. Open Source Ceph Storage: Scaling from Gigabytes to Exabytes with Intelligent Nodes
    50. Mapping social media networks (with no coding) using NodeXL

Product information

  • Title: Strata Conference Santa Clara 2012: Video Compilation
  • Author(s): O'Reilly Media, Inc.
  • Release date: March 2012
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 0636920025467