You are previewing Time Series Databases: New Ways to Store and Access Data.
O'Reilly logo
Time Series Databases: New Ways to Store and Access Data

Book Description

Time series data is of growing importance, especially with the rapid expansion of the Internet of Things. This concise guide shows you effective ways to collect, persist, and access large-scale time series data for analysis. You’ll explore the theory behind time series databases and learn practical methods for implementing them. Authors Ted Dunning and Ellen Friedman provide a detailed examination of open source tools such as OpenTSDB and new modifications that greatly speed up data ingestion.

Table of Contents

  1. Preface
    1. In This Book
  2. 1. Time Series Data: Why Collect It?
    1. Time Series Data Is an Old Idea
    2. Time Series Data Sets Reveal Trends
    3. A New Look at Time Series Databases
  3. 2. A New World for Time Series Databases
    1. Stock Trading and Time Series Data
    2. Making Sense of Sensors
    3. Talking to Towers: Time Series and Telecom
    4. Data Center Monitoring
    5. Environmental Monitoring: Satellites, Robots, and More
    6. The Questions to Be Asked
  4. 3. Storing and Processing Time Series Data
    1. Simplest Data Store: Flat Files
    2. Moving Up to a Real Database: But Will RDBMS Suffice?
    3. NoSQL Database with Wide Tables
    4. NoSQL Database with Hybrid Design
    5. Going One Step Further: The Direct Blob Insertion Design
    6. Why Relational Databases Aren’t Quite Right
    7. Hybrid Design: Where Can I Get One?
  5. 4. Practical Time Series Tools
    1. Introduction to Open TSDB: Benefits and Limitations
    2. Architecture of Open TSDB
    3. Value Added: Direct Blob Loading for High Performance
    4. A New Twist: Rapid Loading of Historical Data
    5. Summary of Open Source Extensions to Open TSDB for Direct Blob Loading
    6. Accessing Data with Open TSDB
    7. Working on a Higher Level
    8. Accessing Open TSDB Data Using SQL-on-Hadoop Tools
    9. Using Apache Spark SQL
    10. Why Not Apache Hive?
    11. Adding Grafana or Metrilyx for Nicer Dashboards
    12. Possible Future Extensions to Open TSDB
      1. Cache Coherency Through Restart Logs
  6. 5. Solving a Problem You Didn’t Know You Had
    1. The Need for Rapid Loading of Test Data
    2. Using Blob Loader for Direct Insertion into the Storage Tier
  7. 6. Time Series Data in Practical Machine Learning
    1. Predictive Maintenance Scheduling
  8. 7. Advanced Topics for Time Series Databases
    1. Stationary Data
    2. Wandering Sources
    3. Space-Filling Curves
  9. 8. What’s Next?
    1. A New Frontier: TSDBs, Internet of Things, and More
    2. New Options for Very High-Performance TSDBs
    3. Looking to the Future
  10. A. Resources
    1. Tools for Working with NoSQL Time Series Databases
    2. More Information About Use Cases Mentioned in This Book
    3. Additional O’Reilly Publications by Dunning and Friedman
  11. About the Authors
  12. Colophon
  13. Copyright