This is a collection of posts about Apache Hive Complex data types that were first published on this blog.
Apache Hive – Store & Access Nested Values – Complex Data Types- Part-1
This is going to be a series of articles to discuss about complex data types that are available in Apache Hive. There will be too many simple to complex scenarios for you to learn.Introduction:Hadoop is designed to manage big data as well as complex data that cannot be managed using primitive data types. Complex data types are nested data structures…
Complex Data Types – Part-2
The information below pertains to the top-ranked universities in the United States, the United Kingdom, and Australia. The Times Higher Education collects the data using carefully crafted success indicators that evaluate an institution’s teaching, studying, knowledge sharing, and global outlook. We are going to consider the below data sets to learn “Complex Data Types” in Apache Hive. In this post…
Complex Data Types – Part-3
In the previous article, we looked at how to work with the complex datatype “Array.” Now, we’ll look at the “MAP,” which is a another complex datatype.Before we proceed for the exercise, let us know what MAP data type is.Map is a complex data type in Apache Hive that can store Key-Value pairs. Values from a map can be accessed…
Complex Data Types – Part-4
Read: Array – Complex Data Type In Apache Hive Read: Map – Complex Data Type in Apache Hive In this post, we’ll look at another complex data type in Apache Hive called “STRUCT.” STRUCT in Hive is similar to STRUCT in the C programming language. It’s a record type that contains a collection of named fields that can be of…
Complex Data Types – Part-5
Previous posts explained how to deal with Complex Data Types Array, Map, Struct individually. In this post, we will learn how to handle them when they are all in a single table. Look at the dataset below if you have any doubts that it won’t be in a single table. Employee Name : John SmithReporting To …
Complex Data Types – STRUCT within ARRAY -1
We’ve learned how to deal with each complex datatype separately so far, and we’ve even looked into the possibilities of including all complex data types in a single table. Let’s move on to more complicated data now. A complex data type, as previously stated, is a representation of multiple fields of a single item. MAP is a key value combination…
Complex Data Types – STRUCT within ARRAY -2
Related to the previous post, the following is another example of complex data types being used within another complex data type. We’ll deal with STRUCT inside an ARRAY again in this case. In addition to extracting the required data, we’ll see how to use Apache Hive’s built-in functions to the nested fields. Consider the below dataset. The data in the…
Complex Data Types – ARRAY within STRUCT
This is one of the articles that required a significant amount of time and effort to complete. Once an idea was formed, I went to wiki and found some relevant data to combine in a way that justified the topic title. We learned how to embed a complex data type STRUCT into an ARRAY datatype in previous articles. Let’s explore…
Complex Data Types – ARRAY in MAP
In this blog, we’ll look at how to manage a complex datatype ARRAY within another complex datatype MAP. The dataset under discussion in this blog already contains complex data type that must be dealt in a more nested manner. This blog has various scenarios and use cases for learning and practicing ‘Complex Datatypes’. The goal of offering multiple examples, however,…
Complex Data Types – ARRAY within MAP
Please click here for “ARRAY in MAP” The data is available in a number of formats, volumes, and types. It may be well-organized, partially-organized, or completely disorganized. Numbers or text may be used. It could be in a relational format, or it could be too large for row-column or tabular databases. Finally, data aids in the understanding and improvement of business…
Complex Data Types – STRUCT within MAP
As we’ve seen in previous blogs, Hive complex data types such as arrays, maps, and structs are a composite of primitive or complex data types. This means that integer, string, decimal, and other complex data types can be represented as an Array. Similarly, MAP and STRUCT can also have primitive and complex data types. We learned how to embed a complex data…
Is Complex Data Really Complex To Handle?
Let’s split the title-question into two. What is the complex data and how complex it is to handle. First, let’s see what complex data is. Any data that does not fit into relational database structures can be considered as complex data. Maps, system or application created logs, images, videos, streaming data are examples of complex data. Now, answer to the…