Limitations of Apache Spark

Some of the limitations of Apache Spark are-

  1. Does not have its file management system, so you need to integrate with Hadoop, or other cloud based data platform.

  2. In-memory capability can become a bottleneck when it comes to cost-efficient processing of Bigdata.

  3. Memory consumption is very high. And the issue for the same does not resolve in user friendly manner.

  4. It requires large data.

  5. MLlib lacking in a number of available algorithms (Tanimoto distance).

results matching ""

    No results matching ""