A brief tour of AWS Athena

In Athena, the data is not stored in a database; it remains in S3. When you create a table in Athena, you are creating an information layer that tells Athena where to find the data, how it is structured, and what format it is in. The schema in Athena is a logical namespace of objects. Once the data structure and location are known to Athena, you can query the data via standard SQL statements. 

Athena uses Hive Data Definition Language (DDL) to create or drop databases and tables (more information on Hive DDL can be found at https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL). Athena can understand multiple formats (CSV, TSV, JSON, and so on) through the use of serializer-deserializer (SerDes) libraries. ...

Get Effective Amazon Machine Learning now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.