This page combines all Apache Pig related posts into a single to make it more helpful and beneficial to learners. It is presented in a hierarchical manner, beginning with the basics and progressing to more advanced topics.
Apache Pig
Versions & Connectivity
Relations in Apache Pig
Loading Data
Sample Datasets
Querying & Storing Data
- Sampling Data
- Filtering Data (Where, Where..And.., Where..Or..)
- Selecting Data
- Selecting Specific Columns Data
- Storing Data
- Removing Duplicates
- Sorting
Merging & Combining Data
- GROUP BY Clause
- UNION Operator
- COGROUP Operator
- Normal / Natural / Equi Joins
- Self Joins
- Outer Joins
- Cross Joins
Shell & CLI Commands in Apache Pig
Complex Types
- Tuples & Bag
- Map
One comment