Loading Data In Apache Impala

In Apache Impala, similar to Hive, the LOAD command can be used to import data into tables. The source files might be in text, csv, or other formats. It may also come from the local system, the network, or the HDFS directory.

The LOAD DATA command speeds up the ETL process for an internal Impala table by copying a data file or all of the data files in a directory from HDFS to the table’s Impala data directory.

Loading data from HDFS:
LOAD DATA INPATH ‘/user/docs/empdata.txt’ into table tbStudentDetails;

Loading data from local system:
LOAD DATA LOCAL INPATH ‘Desktop/docs/empdata.txt’ into table tbStudentDetails;

Hope you find this article helpful.

Please subscribe for more interesting updates.

One comment

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s