Quick start watsonx.data console
When you log in to the IBM® watsonx.data web console for the first time, you are presented with the quick start wizard. In this tutorial, you learn how to use the quick start wizard to configure the core components and get started with watsonx.data in a few minutes.
The wizard guides you through the initial configuration process for the infrastructure components of watsonx.data.
Configure a storage
Your watsonx.data needs an object storage bucket to store your raw data files. You can provision a new IBM-managed bucket or register your own bucket. You can add more buckets and register them later. You can also configure the query monitoring details. You can enable or disable the query monitoring to store and manage your diagonostic data.
In the Configure storage section, complete the following steps:
-
Select one of the following options and provide details.
-
Discover COS instance : Selects an existing IBM COS instance and an attached bucket on your IBM Cloud account. If multiple IBM COS instances and buckets are detected, select the IBM COS instance that contains the desired bucket to register with watsonx.data.
-
Register my own : You can use any existing IBM COS bucket from an existing instance or provision a new instance. To provision a new IBM COS instance, provide the following details:
Add bucket Field Description Bucket Type Select from Amazon S3, IBM Storage Ceph, or IBM Cloud Object Storage. Region The region where the data bucket is available. Bucket Name Enter your bucket name. Display name Enter the bucket name to be displayed on-screen. Endpoint Enter the endpoint URL. Access key Enter your access key. Secret key Enter your secret key. Connection status Click the Test connection link to test whether the bucket connection with watsonx.data is successful or not. The system displays the status message.
If you select an existing IBM COS bucket, the default size is 10 GB. It is meant for an exploratory purpose and cannot be used to store production or sensitive data. The watsonx.data instance administrators can disable this bucket for compliance reasons.
When you register your own bucket, ensure to provide the correct details for bucket configuration. The quick start wizard does not validate the bucket configuration details and you cannot modify them later.
-
In the Query monitoring section, complete the following steps:
-
Use the toggle switch to enable (or disable) the query monitoring feature.
The associated catalog that appears with the query monitoring bucket is of type Hive.
-
If you enable the QHMM feature, you must configure the storage details for storing QHMM data. Select one of the following options and provide details.
-
Discover COS instance : Selects an existing IBM COS instance and an attached bucket on your IBM Cloud account. If multiple IBM COS instances and buckets are detected, select the IBM COS instance that contains the desired bucket to register with watsonx.data.
-
Register my own : You can register an existing bucket as a QHMM bucket. Only the following bucket types can be registered as a QHMM bucket: Amazon S3, IBM Storage Ceph, or IBM Cloud Object Storage. To register an existing bucket as QHMM bucket, provide the following details:
Add bucket Field Description Bucket Type Select from Amazon S3, IBM Storage Ceph, or IBM Cloud Object Storage. Region The region where the data bucket is available. Bucket Name Enter your bucket name. Display name Enter the bucket name to be displayed on-screen. Endpoint Enter the endpoint URL. Access key Enter your access key. Secret key Enter your secret key. Connection status Click the Test connection link to test whether the bucket connection with watsonx.data is successful or not. The system displays the status message.
The storage (default or BYOB) can be changed at later point from the watsonx.data console page. See Query monitoring.
-
-
Click Next.
Configure a catalog
Your watsonx.data needs metadata catalogs to manage your table schemas. Creating the support services for the metadata catalog adds 3 RUs/Hr to your instance run rate when you complete the quickstart process.
In the Configure catalog page, complete the following steps:
- Select the table format for managing your data. Apache Hive and Apache Iceberg catalogs are available.
To enable Query monitoring feature, you must select Apache Hive catalog.
- Click Next.
Configure an engine
Your watsonx.data needs a query engine to work with your data.
In the Configure engine page, complete the following steps:
-
Select the engine to run and process the data that you attached.
-
Select the size of the engine based on the requirements of your workload.
Engine size Size Description Starter/Lite (IBM) (2 RUs/hour) Includes 1 coordinator node and 1 worker node. All nodes are Starter. Starter (AWS) (5.6 RUs/hour) Includes 1 coordinator node and 1 worker node. All nodes are cache-optimized. Small (11.2 RUs/hour) Includes 1 coordinator node and 3 worker nodes. All nodes are cache-optimized. Medium (19.6 RUs/hour) Includes 1 coordinator node and 6 worker nodes. All nodes are cache-optimized. Large (36.4 RUs/hour) Includes 1 coordinator node and 12 worker nodes. All nodes are cache-optimized. -
Click Next.
Review the configuration details
In the Summary page, complete the following steps:
-
Review the configurations before you finish setting up your watsonx.data.
-
Click Finish and go.
When the setup is complete, the watsonx.data home page appears. Resource Unit consumption begins soon after creating the support services by using the quick start wizard. You can view the run rate that is submitted for billing from the billing and usage tab. For more information, see Billing and usage.
Next steps
You are all set to use the watsonx.data or you can configure it further.