IBM Cloud Docs
watsonx.data Spark engine

watsonx.data Spark engine

Native Spark engine is a compute engine in IBM® watsonx.data. You can use Native Spark engine to submit applications that involve complex analytical operations.

In watsonx.data, native Spark engine is used to achieve the following use cases:

  • Ingest large volumes of data into watsonx.data tables.
  • Handle complex analytical workload.
  • Table maintenance operation to enhance watsonx.data performance of the table
  • Develop, run, and debug applications written in Python, R, and Scala.

For more information about using Spark engine, see Working with watsonx.data Spark.

Supported Spark version for watsonx.data Spark engine

IBM® watsonx.data supports the following Spark runtime versions to run Spark workloads by using watsonx.data.

Supported Spark versions
Name Status Release date End-of-support date
Apache Spark 3.4.4 Deprecated JUNE 2023 JUNE 2026
Apache Spark 3.5.4 Supported FEB 2025 FEB 2028
Apache Spark 4.0 Supported AUG 2025 AUG 2028