watsonx.data Spark engine

Native Spark engine is a compute engine in IBM® watsonx.data. You can use Native Spark engine to submit applications that involve complex analytical operations.

In watsonx.data, native Spark engine is used to achieve the following use cases:

Ingest large volumes of data into watsonx.data tables.
Handle complex analytical workload.
Table maintenance operation to enhance watsonx.data performance of the table
Develop, run, and debug applications written in Python, R, and Scala.

For more information about using Spark engine, see Working with watsonx.data Spark.

Supported Spark version for watsonx.data Spark engine

IBM® watsonx.data supports the following Spark runtime versions to run Spark workloads by using watsonx.data.

Supported Spark versions
Name	Status	Release date	End-of-support date
Apache Spark 3.4.4	Deprecated	JUNE 2023	JUNE 2026
Apache Spark 3.5.4	Supported	FEB 2025	FEB 2028
Apache Spark 4.0	Supported	AUG 2025	AUG 2028