Powered by GitBook

Limitations of Apache Spark

Some of the limitations of Apache Spark are-

Does not have its file management system, so you need to integrate with Hadoop, or other cloud based data platform.
In-memory capability can become a bottleneck when it comes to cost-efficient processing of Bigdata.
Memory consumption is very high. And the issue for the same does not resolve in user friendly manner.
It requires large data.
MLlib lacking in a number of available algorithms (Tanimoto distance).

results matching ""

No results matching ""