In Apache Impala, similar to Hive, the LOAD command can be used to import data into tables. The source files might be in text, csv, or other formats. It may also come from the local system, the network, or the HDFS directory.
The LOAD DATA command speeds up the ETL process for an internal Impala table by copying a data file or all of the data files in a directory from HDFS to the table’s Impala data directory.
Loading data from HDFS:
LOAD DATA INPATH ‘/user/docs/empdata.txt’ into table tbStudentDetails;
Loading data from local system:
LOAD DATA LOCAL INPATH ‘Desktop/docs/empdata.txt’ into table tbStudentDetails;
Hope you find this article helpful.
Please subscribe for more interesting updates.
One comment