Provisioning a Spark engine

IBM® watsonx.data allows you to add Spark engines. You can provision a native Spark engine.

Native Spark engine is a compute engine that resides within IBM® watsonx.data.

Support for Spark 3.4 runtime is deprecated and the default version will be changed to Spark 3.5 runtime. To ensure a seamless experience and to leverage the latest features and improvements, switch to Spark 3.5.

To add a Spark engine, complete the following steps.

Log in to watsonx.data console.
From the navigation menu, select Infrastructure manager.
To add a Spark engine, click Add component and click Next.
In the Add component page, from the Engines section, select IBM Spark.

In the Add component - IBM Spark page, choose the engine type from the Type list. You can select Spark or Apache Gluten accelerated Spark. Configure the following details:

a. In the Add component - IBM Spark window, enter the Display name for your Spark engine.

c. Configure the following details:

Provisioning Spark engine
Field	Description
Default Spark version	Select the Spark runtime version that must be considered for processing the applications. For supported Spark versions, see Supported Spark version.
Engine home bucket	Select the registered Cloud Object Storage bucket from the list to store the Spark events and logs that are generated while running spark applications. Note Make sure you do not select the IBM-managed bucket as Spark Engine home. If you select an IBM-managed bucket, you cannot access it to view the logs. For more information, see Before you begin.
Reserve capacity (Conditional)	Specify this field if you are using a version earlier than 2.3 of watsonx.data. Select the Node Type. Enter the number of nodes in the No of nodes field.
Associated catalogs (optional)	Select the catalogs that must be associated with the engine.

Note Provisioning time of the native Spark engine varies depending on the number and type of nodes that you add to the engine.

Click Create. The engine is provisioned and is displayed in the Infrastructure Manager page.

To submit Spark applications using native Spark engine, see Submitting a Spark application.

Related API

For information on related API, see