AWS Hive

What is Hive?

Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis.
Hive gives an SQL-like interface to query data stored in various databases and file systems that integrates with Hadoop.

Hive advantages:

  1. Hive is very flexible and has many options to transform data which is complex such as JSON, AVRO and Parquet.
  2. Hive Supports both External over S3 and local Tables.

Hive use cases:

Transformation & Cleansing of data.

Hive antipattern use cases:

Real time- as hive’s performance is disk based. 

Our Hive Blogs

Architectures and meetups which includes Hive

English

Hebrew

Top Video English

Top Video Hebrew