Apache Hive Course Contents

Apache Hive is built on top of Apache Hadoop, which is a distributed, fault-tolerant, and open source data warehouse platform for reading, writing, and handling massive datasets stored directly in HDFS or other data management structures such as Apache HBase. Hive is characterized by the ability to query massive datasets using Apache Tez or MapReduce.

This page combines all Hive-related subjects into a single article to make it more helpful and beneficial to learners. It is presented in a hierarchical manner, beginning with the basics and progressing to more advanced topics.

APACHE HIVE

HIVE DDL

IMPORTING DATA

TABLE

DATA TYPES

HIVE QUERIES – OPERATORS – CLAUSES

SYSTEM FUNCTIONS

VIRTUAL COLUMNS

FILE FORMATS & COMPRESSION

SUB-QUERIES

JOINS

VIEWS

PARTITIONS

SerDe

COMPLEX DATA TYPES

BUCKETING

SAMPLING

EXPORT DATA

NON-SQL STATEMENTS

METADATA & PROPERTIES

BATCH PROCESSING

PERFORMANCE TUNING

IMPALA

CCA 159

ASSIGNMENTS

Hope you find this page helpful.

Please subscribe for more interesting updates.

2 comments

Leave a Reply