Commonly used terms in Hadoop / Big Data are described in this post. VMWare: A hypervisor is installed on the physical server using VMware server
Category: Spark & SparkSQL
Spark SQL – Print Line Command
We utilize the “Println” command in Spark SQL using Scala to display the result-set on the screen, and you’ll see how to use it in
Spark SQL – Working With Variables
This post will show you how to use Scala with Spark SQL to define variables and assign values to them. Variables are just reserved memory
Spark SQL – Arithmetic Operations
A symbol that instructs the compiler to perform specified mathematical or logical operations is known as an operator. Scala has a lot of built-in operators,
Get IP Address of Hadoop & Hive
This article will show you how to get the IP address of a Hadoop server or a Hadoop NameNode. If you are using Cloudera Quickstart
Creating a DataFrame in SparkSQL
A DataFrame in Spark is a distributed collection of data with named columns. It’s similar to a table in a relational database or a data