What
Is Spark SQL?
The Spark
SQL is used for real-time, in-memory and parallelized SQL-on-Hadoop engine.
The Spark SQL is not a general purpose SQL layer and it’s used to allow us to
do several advanced analytics with data.
The Spark SQL supports only a subset of SQL functionality and users have to
write code in Java, Python and so on to execute a query.
Great
Features of Spark SQL -
ü Spark
SQL provides security through encryption using SSL for HTTP protocols.
ü The
Spark SQL supports lots of features to analysis the large scale of data.
ü The
Spark SQL supports lots of data types for machine learning.
ü In
the Spark SQL, you can easily to write data pipelines.
ü In
the Spark SQL, easy to add optimization rules, data types and data source by
using the Scala programming language
When
To Use Spark SQL?
Spark SQL is the best SQL-on-Hadoop tool and best
used of Spark SQL is fetch data for diverse machine learning tasks.
Disadvantage
of Spark SQL -
The Spark SQL is lacks advanced security
features.
I hope you are enjoying with this post! Please share with you friends!! Thank you!!!