Skip to main content

Posts

Showing posts with the label Disadvantage of Spark SQL

Spark SQL vs. Apache Drill

Spark SQL - The Spark SQL is used for real-time, in-memory and parallelized SQL-on-Hadoop engine. The Spark SQL is not a general purpose SQL layer and it’s used to allow us to do several advanced analytics with data. The Spark SQL supports only a subset of SQL functionality and users have to write code in Java , Python and so on to execute a query. Great Features of Spark SQL - ü   Spark SQL provides security through encryption using SSL for HTTP protocols. ü   The Spark SQL supports lots of features to analysis the large scale of data. ü   The Spark SQL supports lots of data types for machine learning . ü   In the Spark SQL, you can easily to write data pipelines. ü   In the Spark SQL, easy to add optimization rules, data types and data source by using the Scala programming language When To Use Spark SQL? Spark SQL is the best SQL-on-Hadoop tool and best used of Spark SQL is fetch data for diverse machine learning tasks. ...

Apache Spark - Features, Advantage and Disadvantage!

What Is Apache Spark? The Apache Spark is an open source, very fast, in-memory data processing and general engine and used for the large amount of data processing. Apache Spark is a cluster-computing framework. The Advantage of Spark - ü   Ease of Use ü   Open Source ü   Spark is in-memory cluster computing so it Speed is very fast. ü   Combine SQL, streaming, and complex analytics ü   Spark runs everywhere - on Hadoop , Mesos, and standalone and so on. ü   Supports multiple languages The Spark is not a modified version of Hadoop and the Spark uses Hadoop for - ü   Storage ü   Data Processing ü   Spark supports the following languages - ü   Java ü   Python ü   Scala ü   R ü   Clojure Is Apache Spark going to replace Hadoop? My answer Is Yes! What Is your Opinions about the same? Hadoop will be replaced by Spark and both Apache Spark and Hadoop are big-data framewor...

Spark SQL - Features, Advantage and Disadvantage!

What Is Spark SQL ? The Spark SQL is used for real-time, in-memory and parallelized SQL-on-Hadoop engine. The Spark SQL is not a general purpose SQL layer and it’s used to allow us to do several advanced analytics with data. The Spark SQL supports only a subset of SQL functionality and users have to write code in Java , Python and so on to execute a query. Great Features of Spark SQL - ü   Spark SQL provides security through encryption using SSL for HTTP protocols. ü   The Spark SQL supports lots of features to analysis the large scale of data. ü   The Spark SQL supports lots of data types for machine learning . ü   In the Spark SQL, you can easily to write data pipelines. ü   In the Spark SQL, easy to add optimization rules, data types and data source by using the Scala programming language When To Use Spark SQL? Spark SQL is the best SQL-on-Hadoop tool and best used of Spark SQL is fetch data for diverse machine learning...