Data persistence

As discussed in previous chapters, S3 is an object store for large files, and a combination of S3, HDFS, and databases such as Cassandra and HBase enables us to store IoT datasets in a distributed way. S3 lets us persist the data efficiently and query it with high performance. AWS provides the S3 API for object storage in public cloud deployments. Scality is another option: it provides an S3-compatible API for storing large files, which comes in handy for on-premises deployments. For more details on Scality S3, please refer to https://www.scality.com/topics/s3-compatible-storage/.
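To make the idea of a shared S3 API concrete, here is a minimal sketch in Python, assuming the third-party boto3 library is installed. The function names, the key layout (device ID plus UTC date), and the bucket name are hypothetical choices for illustration and do not come from the book. The only point it illustrates is that the same client code can target AWS S3 or an S3-compatible service by changing the endpoint URL.

```python
from datetime import datetime, timezone


def build_object_key(device_id, ts, suffix="json"):
    """Build a hypothetical key layout: iot/<device>/<YYYY>/<MM>/<DD>/<HHMMSS>.<suffix>.

    Expects a timezone-aware datetime; it is normalized to UTC so that
    readings from one device and day share a common key prefix.
    """
    ts = ts.astimezone(timezone.utc)
    return f"iot/{device_id}/{ts:%Y/%m/%d}/{ts:%H%M%S}.{suffix}"


def make_s3_client(endpoint_url=None):
    """Create an S3 client with boto3 (third-party, imported lazily).

    With endpoint_url=None boto3 talks to AWS S3; passing the URL of an
    S3-compatible service (an assumption about your deployment) points the
    same client at that service instead.
    """
    import boto3

    return boto3.client("s3", endpoint_url=endpoint_url)


def persist_reading(client, bucket, device_id, ts, payload):
    """Store one reading (bytes) as an object and return the key used."""
    key = build_object_key(device_id, ts)
    client.put_object(Bucket=bucket, Key=key, Body=payload)
    return key
```

A call might look like persist_reading(make_s3_client(), "iot-bucket", "sensor-1", datetime.now(timezone.utc), b"{}"), where the bucket must already exist and credentials are configured in the usual boto3 way. For an on-premises deployment, the endpoint URL of the S3-compatible service would be passed to make_s3_client instead of relying on the default.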
