Blog
Apache Hive
Hive SerDe RegEx
Sqoop
SQL Server
Courses
- CCA Data Analyst (CCA159) Exam
- Online Courses
General
Downloads
Contact Us

Category: Apache Hive

Multi Insert in Apache Hive

3rd Feb 2022 SHAFI SHAIK

Multiple records in the same table is a well-known technique. But in this post, we’ll look at inserting into multiple tables using a single expression.

Skewed Tables – All Articles

28th Jan 2022 SHAFI SHAIK

This is a collection of articles regarding skewed tables in Hive that have been published on this website.

Partitioning vs Bucketing in Hive

20th Jan 2022 SHAFI SHAIK

Partitioning in Hive divides huge tables into smaller logical tables depending on column values; one logical table is created for each individual value. By defining

Apache Hive Data Model

20th Jan 2022 SHAFI SHAIK

Apache Hive is built on top of Apache Hadoop, which is a distributed, fault-tolerant, and open source data warehouse platform for reading, writing, and handling

Partitioned, Bucketed and Skewed Tables in Hive

14th Jan 2022 SHAFI SHAIK

When working with a large amount of data on a Hadoop file system, both partitioning and bucketing in Hive are used to avoid table scans

When to avoid bucketing in Hive

13th Jan 2022 SHAFI SHAIK

When working with large datasets that need to be divided into chunks for better management and the possibility to connect queries with other large datasets,

« Previous Posts 1 … 3 4 5 6 7 … 47 Next Posts»

Search for:

Subscribe to Blog via Email

Enter your email address to subscribe to this blog and receive notifications of new posts by email.

Email Address:

Join 1,611 other subscribers.

Blog Stats

528,179 hits

LinkedIn
X
Facebook
Tumblr

Big Data & SQL

Blog at WordPress.com.

Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy

Subscribe Subscribed
- Big Data & SQL
- Already have a WordPress.com account? Log in now.