The majority of the blog entries are written in a style that novices may understand. I will give some practice tasks to be completed using Apache Hive in order to develop learning abilities and to provide practical exposure and knowledge growth.
There will be a set of assignments with data sets, but without solutions.
Title : StructInArray.
Dataset: Click here.
Assignment: A CSV data file including website interaction information is supplied to you. Your assignment as follows:
1) Load the data based on below table-structure.
id INT,
url STRING,
date STRING,
publisher STRING,
advt STRING
2) Parse the data and fetch Query keyword, host and country.
3) Get number of records per country.
Hints: Click here.
All the best.
One comment