Apache Hive Course Contents

Apache Hive is built on top of Apache Hadoop, which is a distributed, fault-tolerant, and open source data warehouse platform for reading, writing, and handling massive datasets stored directly in HDFS or other data management structures such as Apache HBase. Hive is characterized by the ability to query massive datasets using Apache Tez or MapReduce.

This page combines all Hive-related subjects into a single article to make it more helpful and beneficial to learners. It is presented in a hierarchical manner, beginning with the basics and progressing to more advanced topics.

APACHE HIVE

HIVE DDL

IMPORTING DATA

TABLE

DATA TYPES

HIVE QUERIES – OPERATORS – CLAUSES

SYSTEM FUNCTIONS

VIRTUAL COLUMNS

FILE FORMATS & COMPRESSION

SUB-QUERIES

JOINS

VIEWS

PARTITIONS

SerDe

COMPLEX DATA TYPES

BUCKETING

SAMPLING

EXPORT DATA

NON-SQL STATEMENTS

METADATA & PROPERTIES

BATCH PROCESSING

PERFORMANCE TUNING

IMPALA

CCA 159

ASSIGNMENTS

Hope you find this page helpful.

Please subscribe for more interesting updates.

2 comments

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s