O'Reilly logo

Practical Hadoop Ecosystem: A Definitive Guide to Hadoop-Related Frameworks and Tools by Deepak Vohra

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

© Deepak Vohra 2016

Deepak Vohra, Practical Hadoop Ecosystem, 10.1007/978-1-4842-2199-0_3

3. Apache Hive

Deepak Vohra

(1)Apt 105, White Rock, British Columbia, Canada

Apache Hive is a data warehouse framework for querying and managing large datasets stored in Hadoop distributed filesystems (HDFS) . Hive also provides a SQL-like query language called HiveQL . The HiveQL queries may be run in the Hive CLI shell . By default, Hive stores data in the HDFS, but also supports the Amazon S3 filesystem.

Hive stores data in tables. A Hive table is an abstraction and the metadata for a Hive table is stored in an embedded Derby database called a Derby metastore. Other databases such as MySQL and Oracle Database could also be configured as the Hive metastore ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required