Since Apache Pig does not have SELECT statements like SQL, there is no straightforward solution for loading, filtering, and retrieving particular columns in a single
Author: SHAFI SHAIK
Hi, this is Shafi Shaik, Microsoft Solutions Specialist in Data platform as well as in Data Management & Analytics. I am a certified associate in Oracle SQL*Plus and extensively trained in MongoDB Administration. My current role is Database administration with hands-on expertise in almost all relational databases namely Microsoft SQL Server, MySQL, Oracle SQL*Plus, PostGreSQL & Teradata. I'm also specialized in database development, warehousing, traditional data analysis and Big Data analysis.
Limiting Rows in Result – Apache Pig
Data sampling is the best practice to understand the data patterns and trends of large datasets by looking at the smaller portion of the data.
Quoted Identifiers in Column Names – Apache Hive
In SQL, an identifier is a string of alphanumeric and underscore (_) characters surrounded by backtick (`). In Hive, quoted IDs are case-insensitive. For example,
Apache Pig Utility Commands
This post aims to let you know the shell and utility commands that help in various situations. Shell Commands (1) fs: Any FsShell command can
Sorting in Apache Pig
Rearranging the rows returned from a query result set in ascending or descending order is the most commonly utilized function by analysts. Sorting can be