Limitations of Apache Spark
Some of the limitations of Apache Spark are-
Does not have its file management system, so you need to integrate with Hadoop, or other cloud based data platform.
In-memory capability can become a bottleneck when it comes to cost-efficient processing of Bigdata.
Memory consumption is very high. And the issue for the same does not resolve in user friendly manner.
It requires large data.
MLlib lacking in a number of available algorithms (Tanimoto distance).