Since Apache Pig does not have SELECT statements like SQL, there is no straightforward solution for loading, filtering, and retrieving particular columns in a single
Author: SHAFI SHAIK
Hi, this is Shafi Shaik, Microsoft Solutions Specialist in Data platform as well as in Data Management & Analytics. I am a certified associate in Oracle SQL*Plus and extensively trained in MongoDB Administration. My current role is Database administration with hands-on expertise in almost all relational databases namely Microsoft SQL Server, MySQL, Oracle SQL*Plus, PostGreSQL & Teradata. I'm also specialized in database development, warehousing, traditional data analysis and Big Data analysis.
Limiting Rows in Result – Apache Pig
Data sampling is the best practice to understand the data patterns and trends of large datasets by looking at the smaller portion of the data.
​Quoted Identifiers in Column Names – Apache Hive
In SQL, an identifier is a string of alphanumeric and underscore (_) characters surrounded by backtick (`). In Hive, quoted IDs are case-insensitive. For example,
Apache Pig Utility Commands
This post aims to let you know the shell and utility commands that help in various situations. Shell Commands (1) fs: Any FsShell command can
Sorting in Apache Pig
Rearranging the rows returned from a query result set in ascending or descending order is the most commonly utilized function by analysts. Sorting can be