AWS Athena

AWS Athena is a relatively new service from AWS that lets you write SQL queries to analyze data stored on S3. It is specific to AWS, although other cloud providers may offer comparable services. Before you can analyze your data with SQL, you must create a virtual database with associated tables over your structured or semi-structured S3 files. You can create these tables manually or with another AWS service called Glue.
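
To make the idea of tables over S3 files concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the setup used in this chapter: the database, table, and column names and the S3 paths are hypothetical, the files are assumed to be comma-delimited text, and the `athena` argument is assumed to be a boto3 Athena client (for example, `boto3.client("athena")`).

```python
import time


def build_create_table_ddl(database, table, s3_location):
    """Return DDL for an external table over comma-delimited files on S3.

    The column list is a hypothetical web-log layout chosen for illustration.
    """
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {database}.{table} (\n"
        "  source_ip STRING,\n"
        "  dest_url STRING,\n"
        "  visit_date STRING,\n"
        "  ad_revenue DOUBLE\n"
        ")\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        f"LOCATION '{s3_location}'"
    )


def run_query(athena, sql, database, output_location, poll_seconds=1.0):
    """Submit SQL to Athena, wait for it to finish, and return result rows.

    `athena` is assumed to behave like a boto3 Athena client.
    """
    started = athena.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        # Athena writes query results to this S3 location.
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Queries run asynchronously, so poll until a terminal state is reached.
    while True:
        execution = athena.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"Athena query {query_id} ended in state {state}")

    # Only the first page of results is read here; for a SELECT,
    # the first row holds the column headers.
    result = athena.get_query_results(QueryExecutionId=query_id)
    return [
        [column.get("VarCharValue") for column in row["Data"]]
        for row in result["ResultSet"]["Rows"]
    ]
```

With boto3 installed and AWS credentials configured, you could pass the string from `build_create_table_ddl` to `run_query` once to register the table, then call `run_query` again with an ordinary `SELECT` statement. The client is passed in as a parameter so the polling logic can be exercised with a stand-in object and no AWS account.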

I won't go into all of the details of setting up a new Athena database or its tables. Instead, I'll show you the results and how easy Athena is to use once those are in place. In this example, I've created a database and a table for web server logs from the Big Data Benchmark dataset.

This data is publicly ...
