Throughout the year, “Big Data & SQL” published a number of articles about Apache Impala. It covers the majority of its features and functions. It’s time to turn them into a course and make it available for free to beginners, analysts, and developers alike.

To read the article, please click on each topic.
Definitions Of Big Data, Hadoop And Ecosystem Components
CDH And Hdp Legacy Virtual Machines
Starting Hosts And Roles In Cloudera VM
Introduction to Impala
- Impala an Introduction
- Cloudera Impala OR Apache Impala?
- Apache Impala Architecture
- Apache Hive vs Apache Impala
- Advantages & Limitations of Impala
- Connecting to Impala
- Impala Shell Commands and Interface
Querying with Hive and Impala
- Working with Hive and Impala
- SQL Language Statements
- Big Data: Apache Hive & Impala Data Types Quick Reference
- DDL Statements
- DML Statements
- Loading Data into Impala Table
- Data Types
- Literals
- Operators
- Is Distinct From Operator
Built-in Functions
- Impala Mathematical Functions
- ABS
- ABS
- ACOS
- ASIN
- ATAN
- ATAN2
- BIN
- CEIL, CEILING, DCEIL
- CONV
- COS
- COSH
- COT
- DEGREES
- E
- EXP
- FACTORIAL
- FLOOR, DFLOOR
- FMOD
- FNV_HASH
- GREATEST
- HEX
- IS_INF
- IS_NAN
- LEAST
- LN
- LOG
- LOG10
- LOG2
- MAX_INT, MAX_TINYINT, MAX_SMALLINT, MAX_BIGINT
- MIN_INT, MIN_TINYINT, MIN_SMALLINT, MIN_BIGINT
- MOD
- MURMUR_HASH
- NEGATIVE
- PI
- PMOD
- POSITIVE
- POW, POWER, DPOW, FPOW
- PRECISION
- QUOTIENT
- RADIANS
- RAND, RANDOM
- ROUND, DROUND
- SCALE
- SIGN
- SIN
- SINH
- SQRT
- TAN
- TANH
- TRUNCATE, DTRUNC, TRUNC
- UNHEX
- WIDTH_BUCKET
- Impala Type Conversion Functions
- Impala Date and Time Functions
- ADD_MONTHS
- ADDDATE
- CURRENT_TIMESTAMP
- DATE_ADD
- DATE_PART
- DATE_SUB
- DATE_TRUNC
- DATEDIFF
- DAY
- DAYNAME
- DAYOFWEEK
- DAYOFYEAR
- DAYS_ADD
- DAYS_SUB
- EXTRACT
- FROM_TIMESTAMP
- FROM_UNIXTIME
- FROM_UTC_TIMESTAMP
- HOUR
- HOURS_ADD
- HOURS_SUB
- INT_MONTHS_BETWEEN
- MICROSECONDS_ADD
- MICROSECONDS_SUB
- MILLISECOND
- MILLISECONDS_ADD
- MILLISECONDS_SUB
- MINUTE
- MINUTES_ADD
- MINUTES_SUB
- MONTH
- MONTHNAME
- MONTHS_ADD
- MONTHS_BETWEEN
- MONTHS_SUB
- NANOSECONDS_ADD
- NANOSECONDS_SUB
- NEXT_DAY
- NOW
- QUARTER
- SECOND
- SECONDS_ADD
- SECONDS_SUB
- SUBDATE
- TIMEOFDAY
- TIMESTAMP_CMP
- TO_DATE
- TO_TIMESTAMP
- TO_UTC_TIMESTAMP
- TRUNC
- UNIX_TIMESTAMP
- UTC_TIMESTAMP
- WEEKOFYEAR
- WEEKS_ADD
- WEEKS_SUB
- YEAR
- YEARS_ADD
- YEARS_SUB
- Impala Conditional Functions
- Impala String Functions
- ASCII
- BASE64DECODE
- BASE64ENCODE
- BTRIM
- CHAR_LENGTH
- CHR
- CONCAT
- CONCAT_WS
- FIND_IN_SET
- GROUP_CONCAT
- INITCAP
- INSTR
- LEFT
- LENGTH
- LEVENSHTEIN, LE_DST
- LOCATE
- LOWER, LCASE
- LPAD
- LTRIM
- PARSE_URL
- REGEXP_ESCAPE
- REGEXP_EXTRACT
- REGEXP_LIKE
- REGEXP_REPLACE
- REPEAT
- REPLACE
- REVERSE
- RIGHT
- RPAD
- RTRIM
- SPACE
- SPLIT_PART
- STRLEFT
- STRRIGHT
- SUBSTR, SUBSTRING
- TRA`1NSLATE
- TRIM
- UPPER, UCASE
- Impala Aggregate Functions
- Impala Analytic Functions
- OVER Clause
- Window Clause
- AVG Function – Analytic Context
- COUNT Function – Analytic Context
- CUME_DIST Function (CDH 5.5 or higher only)
- DENSE_RANK Function
- FIRST_VALUE Function
- LAG Function
- LAST_VALUE Function
- LEAD Function
- MAX Function – Analytic Context
- MIN Function – Analytic Context
- NTILE Function (CDH 5.5 or higher only)
- PERCENT_RANK Function (CDH 5.5 or higher only)
- RANK Function
- ROW_NUMBER Function
- SUM Function – Analytic Context
- Impala Bit Functions
- BITAND
- BITNOT
- BITOR
- BITXOR
- COUNTSET
- GETBIT
- ROTATELEFT
- ROTATERIGHT
- SETBIT
- SHIFTLEFT
- SHIFTRIGHT
- Impala Miscellaneous Functions
- CURRENT_DATABASE
- EFFECTIVE_USER
- GET_JSON_OBJECT
- LOGGED_IN_USER
- PID
- SLEEP
- USER
- UUID
- VERSION
SUB-QUERIES
VIEWS
JOINS
- Joins In HiveQL
- Outer Joins
- SQL-92 And SQL-89 Joins
- Does ‘ANTI-JOIN’ Perform Better Than ‘LEFT JOIN’?
Complex Data Types
- Complex Data Types – Part-1
- Complex Data Types – Part-2
- Complex Data Types – Part-3
- Complex Data Types – Part-4
- Complex Data Types – Part-5
- Complex Data Types – Array In Map
- Complex Data Types – Struct Within Map
- Complex Data Types – Array Within Map
- Complex Data Types – Array Within Struct
- Complex Data Types – Struct Within Array -1
- Complex Data Types – Struct Within Array -2
INDEXES & Performance Tuning
- Column Statistics
- Table Statistics
- Terminate Query Execution In Apache Hive & Impala
- Analyze Table
- Explain Table
Common Table Expressions & Temporary Tables
- Chaining Common Table Expressions
- Get the unique values from Common Table Expressions
- Creating tables from existing tables
Data Storage and File Format
- Choosing File Type and Compression Technique
- Compression algorithms for ORC
- Input File Formats-1
- Input File Formats-2
- Input File Format Constraints In Hive
- Apache Avro
- Using Avro Schema Url In Apache Hive
- Avro Schema & Sample Data
- Partitioning Tables
- Dividing An Ordered Partition Into Groups
Metadata Queries
- Show Files Statement
- Show All available Functions
- Show Databases
- Show Tables
- Show Tables with Search Pattern
- Show Create Table
- Show Create View
- Show Create Function
- Describe Database and Table
- Get IP Address Of the Instance
- Clearing Screen – Hdfs Terminal, Hive, Impala
- Version And Release Build Of Impala Daemon
- Refreshing Metadata In Impala
- CDH Compatible Versions – Hive, Impala And Hue
- Host In Which Impala Is Running
- Roles and Users Metadata
- Show Create Statements in Impala
- Invalidate Metada Command in Impala
Impala Daemon – Interactive Shell Scripting – Batch Processing
- Limit – Offset
- Executing SQL Script File In Hive
- Batch Processing
- Querying In Non-interactive Mode
- Connecting To Impala Shell
- Reserved Keywords As Columns
- Export Query Results To A File
- Executing SQL File In Non-interactive Mode
Happy learning..!!