The majority of the blog entries are written in a style that novices may understand. I will give some practice tasks to be completed using Apache Hive in order to develop learning abilities and to provide practical exposure and knowledge growth.
There will be a set of assignments with data sets, but without solutions.
Title : Windowing Functions.
Dataset: Click here.
Assignment: A CSV data file including city-wise sales information is supplied to you. Your assignment as follows:
1) Skip the header of the CSV and Load the data based on below table-structure.
YYYYMM INT,
Location VARCHAR(10),
Amount DECIMAL(7,2)
2) Get cumulative average of the sales amount as shown below.
3) Get the running/moving average of the sales amount as shown below.
Hints: Click here.
All the best.
One comment