The example below shows how to load comma-separated values in Apache Pig. Let’s consider the popular Emp data.

OracleEmpData.csv

7839,KING,PRESIDENT,0,1981-11-17,5000,0,10
7698,BLAKE,MANAGER,7839,1981-05-01,2850,0,30
7782,CLARK,MANAGER,7839,1981-09-06,2450,0,10
7566,JONES,MANAGER,7839,1981-02-04,2975,0,20
7788,SCOTT,ANALYST,7566,1987-07-13,3000,0,20
7902,FORD,ANALYST,7566,1981-12-03,3000,0,20
7369,SMITH,CLERK,7902,1980-12-17,800,0,20
7499,ALLEN,SALESMAN,7698,1981-02-20,1600,300,30
7521,WARD,SALESMAN,7698,1981-02-22,1250,500,30
7654,MARTIN,SALESMAN,7698,1981-09-28,1250,1400,30
7844,TURNER,SALESMAN,7698,1981-08-09,1500,0,30
7876,ADAMS,CLERK,7788,1987-07-13,1100,0,20
7900,JAMES,CLERK,7698,1981-12-03,950,0,30
7934,MILLER,CLERK,7782,1982-01-23,1300,0,10

You can
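As a minimal sketch of the load step, the file above could be read with Pig's built-in PigStorage loader using a comma as the field delimiter. The relation name emp, the field names and the field types are assumptions based on the conventional Oracle EMP table layout (the source does not name the columns), and the path assumes OracleEmpData.csv sits in the current working directory.

```pig
-- Hypothetical path: assumes OracleEmpData.csv is in the current working
-- directory (HDFS home directory, or the local directory in local mode).
-- Field names and types are assumed from the usual Oracle EMP layout.
emp = LOAD 'OracleEmpData.csv' USING PigStorage(',')
      AS (empno:int, ename:chararray, job:chararray, mgr:int,
          hiredate:chararray, sal:int, comm:int, deptno:int);

-- Print the schema that Pig attached to the relation.
DESCRIBE emp;

-- Run the load and print every tuple to the console.
DUMP emp;
```

In this sketch hiredate is kept as a chararray to keep the load simple; it could be converted to a datetime later if date arithmetic is needed.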
Author: SHAFI SHAIK
Hi, this is Shafi Shaik, a Microsoft Solutions Specialist in Data Platform as well as in Data Management & Analytics. I am a certified associate in Oracle SQL*Plus and extensively trained in MongoDB administration. My current role is database administration, with hands-on expertise in almost all relational databases, namely Microsoft SQL Server, MySQL, Oracle SQL*Plus, PostgreSQL and Teradata. I also specialize in database development, warehousing, traditional data analysis and Big Data analysis.