The majority of the blog entries are written in a style that novices may understand. I will give some practice tasks to be completed using Apache Hive in order to develop learning abilities and to provide practical exposure and knowledge growth.
There will be a set of assignments with data sets, but without solutions.
Title : ArrayStructArray.
Dataset: Click here.
Assignment: A CSV data file including celebrities and their family details (extracted from wiki, however, do not depend on it’s reliability) is supplied to you. Your assignment as follows:
1) Load the data based on below table-structure.
Field Name | Data Type |
ID | INT |
CelebName | STRING |
Marriages | ARRAY<STRUCT< |
SPOUSE:STRING, | |
Children:ARRAY<STRING> | |
>> |
2) Get the first kid of each celebrity.
Hints: Click here.
All the best.
One comment