As all (or most) of us know, SELECT DISTINCT eliminates duplicate records from the results and returns only unique values. But it’s very expensive in
Author: SHAFI SHAIK
Hi, this is Shafi Shaik, a Microsoft Solutions Specialist in Data Platform as well as in Data Management & Analytics. I am a certified associate in Oracle SQL*Plus and extensively trained in MongoDB administration. My current role is database administration, with hands-on expertise in almost all relational databases, namely Microsoft SQL Server, MySQL, Oracle SQL*Plus, PostgreSQL and Teradata. I also specialize in database development, warehousing, traditional data analysis and Big Data analysis.
Creating Table From Existing Table in Hive, Impala, SQL Server, Oracle SQL*Plus, PostgreSQL and MySQL
In some scenarios we may need to create a table based on an existing one. Sometimes we may need the table along with the data
Best Usage of NTILE function in SQL Server and other RDBMSs
SQL Server NTILE() is a window function that distributes the rows of an ordered partition into a specified number of approximately equal groups
Calculating “Approximate Median” in Cloudera Impala, Apache Hive, SQL Server, Oracle and MySQL
APPROX_MEDIAN is an approximate inverse distribution function that assumes a continuous distribution model. It takes a numeric or datetime value and returns an estimated middle
Multiple Ways to Find Missing Serial Numbers in SQL
In my previous blogs, I mentioned that there are often many ways to solve a problem. Below is one more example. Often, in
Handling “Json” and “Unstructured” Data in SQL
This post shows how we can handle JSON data in versions of SQL Server prior to 2016. Sample JSON data is: {"accountNumber": 2020112,