IBM Cloud Docs
Monitoring for watsonx.data

Monitoring for watsonx.data

Provide insight about your IBM® watsonx.data instance. View metrics on the health of key watsonx.data components such as the metastore and engine. Also, understand the usage of your instances by actively running tasks and queries.

You can use the IBM Cloud Monitoring service to monitor your watsonx.data instance. watsonx.data forwards selected information about your instance to Monitoring so that you can monitor specific metrics such as cpu and memory utilization.

Before you begin

You must have view access to the Sysdig dashboard. For more information about providing the access, see Getting started with IBM Cloud Monitoring.

Set up your watsonx.data service instance

To set up Monitoring for watsonx.data, you must create a service instance and then enable Platform Metrics in the same region as the watsonx.data instance that you want to monitor. If you have deployments in more than one region, you must provision Monitoring and enable platform metrics for each region.

To set up Monitoring,

  1. From the IBM Cloud navigation menu, select Observability.
  2. Select Monitoring. The Monitoring pane opens.
  3. Either use an existing Monitoring service instance or create a new one.
  4. After the instance is ready, enable platform metrics by clicking Configure platform metrics.
  5. Select a region and then a Monitoring instance from that region. If you have deployments in more than one region, you must provision Monitoring and enable platform metrics for each region.

Accessing your IBM Cloud Monitoring metrics

To see your IBM® watsonx.data customer metrics dashboards in Monitoring:

  1. From the IBM Cloud navigation menu, select Observability.

  2. Select Monitoring. Click Open Dashboard.

  3. Click Dashboards in the sidepane, open the IBM® watsonx.data dashboard to view your watsonx.data metrics.

  4. For more information, see Monitoring Getting started tutorial.

Metrics available by Service Plan

HMS health status

Table 2: HMS Health Status metric metadata
Metadata Description
Metric Name ibm_watsonx_data_hms_health
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description Checks the status of Postgres DB connection in HMS pod.

Number of total running queries

Table 6: Number Of Total Running Queries metric metadata
Metadata Description
Metric Name ibm_watsonx_data_queries_running_total
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description Runs Query to get the total queries that are in Running state in Presto Server.

Number of total running tasks

Table 7: Number Of Total Running Tasks metric metadata
Metadata Description
Metric Name ibm_watsonx_data_tasks_running_total
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description Shows the data that is collected by name-value pair collection for tasks in Presto server and populates the tables in the database.

Number of total transactions

Table 8: Number Of Total Transactions metric metadata
Metadata Description
Metric Name ibm_watsonx_data_transactions_total
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description Shows the data that is collected by name-value pair collection for transactions in Presto server and populates the tables in the database.

List of currently available catalogs

Table 3: List Of Currently Available Catalogs metric metadata
Metadata Description
Metric Name ibm_watsonx_data_catalogs
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group

Presto health status

Table 9: Presto Health Status metric metadata
Metadata Description
Metric Name ibm_watsonx_data_presto_health
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description Tracks the heartbeat status by checking if the Presto server is running or not running.

Total number of active nodes

Table 10: Total Number Of Active Nodes metric metadata
Metadata Description
Metric Name ibm_watsonx_data_active_nodes_total
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description The total number of active nodes in the Presto Co-ordinator Pod.

Total number of inactive nodes

Table 11: Total Number Of Inactive Nodes metric metadata
Metadata Description
Metric Name ibm_watsonx_data_inactive_nodes_total
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description The total number of inactive nodes in the Presto Co-ordinator Pod.

Total number of nodes

Table 12: Total Number Of Nodes metric metadata
Metadata Description
Metric Name ibm_watsonx_data_nodes_total
Metric Type gauge
Value Type none
Segment By Service Instance, Resource group
Metric Description The total number of nodes in the Presto Co-ordinator Pod.

Attributes for segmentation

Global attributes

The following attributes are available for segmenting all of the metrics that are listed above.

Table 13: Global attributes
Attribute Attribute Name Attribute Description
Cloud Type ibm_ctype The cloud type is a value of public, dedicated, or local
Location ibm_location The location of the monitored resource - this can be a region, data center or global
Resource ibm_resource The resource being measured by the service - typically an indentifying name or GUID
Resource Type ibm_resource_type The type of the resource being measured by the service
Resource group ibm_resource_group_name The resource group where the service instance was created
Scope ibm_scope The scope is the account, organization, or space GUID associated with this metric
Service name ibm_service_name Name of the service generating this metric

Additional attributes

The following attributes are available for segmenting one or more attributes as described in the reference above. See the individual metrics for segmentation options.

Table 14: Additional attributes
Attribute Attribute Name Attribute Description
Catalog in the Presto Server ibm_watsonx_data_catalog Catalog in the Presto Server
Catalog name ibm_watsonx_data_catalog_name Catalog name
Resource role ibm_watsonx_data_resource_role Role in the resource group
Role associated with Catalog ibm_watsonx_data_catalog_role Role associated with Catalog
Schema in the Presto Server ibm_watsonx_data_schema Schema in the Presto Server
Service instance ibm_service_instance The service instance segment identifies the instance that the metric is associated with.