Multiple records in the same table is a well-known technique. But in this post, we’ll look at inserting into multiple tables using a single expression.
Category: Apache Hive
Skewed Tables – All Articles
This is a collection of articles regarding skewed tables in Hive that have been published on this website.
Partitioning vs Bucketing in Hive
Partitioning in Hive divides huge tables into smaller logical tables depending on column values; one logical table is created for each individual value. By defining
Apache Hive Data Model
Apache Hive is built on top of Apache Hadoop, which is a distributed, fault-tolerant, and open source data warehouse platform for reading, writing, and handling
Partitioned, Bucketed and Skewed Tables in Hive
When working with a large amount of data on a Hadoop file system, both partitioning and bucketing in Hive are used to avoid table scans
When to avoid bucketing in Hive
When working with large datasets that need to be divided into chunks for better management and the possibility to connect queries with other large datasets,