Monitoring for watsonx.data
Monitoring provides insight into your IBM® watsonx.data instance. You can view metrics on the health of key watsonx.data components, such as the metastore and the engine, and understand how your instances are used by tracking actively running tasks and queries.
You can use the IBM Cloud Monitoring service to monitor your watsonx.data instance. watsonx.data forwards selected information about your instance to Monitoring so that you can monitor specific metrics such as CPU and memory utilization.
Before you begin
You must have view access to the Sysdig dashboard. For more information about granting access, see Getting started with IBM Cloud Monitoring.
Set up your watsonx.data service instance
To set up Monitoring for watsonx.data, you must create a Monitoring service instance and then enable platform metrics in the same region as the watsonx.data instance that you want to monitor. If you have deployments in more than one region, you must provision Monitoring and enable platform metrics for each region.
To set up Monitoring:
- From the IBM Cloud navigation menu, select Observability.
- Select Monitoring. The Monitoring pane opens.
- Either use an existing Monitoring service instance or create a new one.
- After the instance is ready, enable platform metrics by clicking Configure platform metrics.
- Select a region and then a Monitoring instance from that region.
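The per-region requirement can be checked with a short script. The following is a minimal sketch, not an IBM tool: the function name `regions_missing_monitoring` and the region names are hypothetical, and the two input lists are assumed to come from your own inventory of deployments and Monitoring instances.

```python
def regions_missing_monitoring(deployment_regions, monitored_regions):
    """Return, in sorted order, the regions that host a watsonx.data
    deployment but have no Monitoring instance with platform metrics enabled."""
    return sorted(set(deployment_regions) - set(monitored_regions))


# Hypothetical inventory: the region names are examples only.
deployments = ["us-south", "eu-de", "us-south"]
platform_metrics_enabled = ["us-south"]

# Every region printed here still needs a Monitoring instance
# and platform metrics enabled before its metrics appear.
print(regions_missing_monitoring(deployments, platform_metrics_enabled))
```

With the sample inventory, only `eu-de` is reported, because `us-south` already has platform metrics enabled.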
Accessing your IBM Cloud Monitoring metrics
To see your IBM® watsonx.data customer metrics dashboards in Monitoring:
- From the IBM Cloud navigation menu, select Observability.
- Select Monitoring. Click Open Dashboard.
- Click Dashboards in the sidepane, and then open the IBM® watsonx.data dashboard to view your watsonx.data metrics.
For more information, see Monitoring Getting started tutorial.
Metrics available by Service Plan
- HMS health status
- Number of total running queries
- Number of total running tasks
- Number of total transactions
- List of currently available catalogs
- Presto health status
- Total number of active nodes
- Total number of inactive nodes
- Total number of nodes
HMS health status
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_hms_health |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | Checks the status of the Postgres DB connection in the HMS pod. |
Number of total running queries
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_queries_running_total |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | Runs a query to get the total number of queries that are in the Running state in the Presto server. |
Number of total running tasks
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_tasks_running_total |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | Shows the data that is collected by name-value pair collection for tasks in the Presto server and populates the tables in the database. |
Number of total transactions
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_transactions_total |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | Shows the data that is collected by name-value pair collection for transactions in the Presto server and populates the tables in the database. |
List of currently available catalogs
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_catalogs |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Presto health status
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_presto_health |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | Tracks the heartbeat status by checking whether the Presto server is running. |
Total number of active nodes
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_active_nodes_total |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | The total number of active nodes in the Presto coordinator pod. |
Total number of inactive nodes
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_inactive_nodes_total |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | The total number of inactive nodes in the Presto coordinator pod. |
Total number of nodes
Metadata | Description |
---|---|
Metric Name | ibm_watsonx_data_nodes_total |
Metric Type | gauge |
Value Type | none |
Segment By | Service Instance, Resource group |
Metric Description | The total number of nodes in the Presto coordinator pod. |
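The node gauges can be combined into a single availability figure. The following is a minimal sketch under stated assumptions: the readings are hypothetical values that you already retrieved from Monitoring, and the helper `active_node_ratio` is an illustrative name, not part of any IBM library.

```python
def active_node_ratio(active_nodes, total_nodes):
    """Return the share of nodes that are active, or None when no nodes are reported."""
    if total_nodes <= 0:
        return None  # avoid dividing by zero when the engine reports no nodes
    return active_nodes / total_nodes


# Hypothetical gauge readings, keyed by the metric names listed above.
readings = {
    "ibm_watsonx_data_active_nodes_total": 3,
    "ibm_watsonx_data_nodes_total": 4,
}

ratio = active_node_ratio(
    readings["ibm_watsonx_data_active_nodes_total"],
    readings["ibm_watsonx_data_nodes_total"],
)
print(ratio)  # 0.75
```

A ratio below 1.0 suggests that some nodes are not active, which might be worth an alert threshold of your own choosing.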
Attributes for segmentation
Global attributes
The following attributes are available for segmenting all of the metrics that are listed above.
Attribute | Attribute Name | Attribute Description |
---|---|---|
Cloud Type | ibm_ctype | The cloud type, which is one of public, dedicated, or local |
Location | ibm_location | The location of the monitored resource, which can be a region, a data center, or global |
Resource | ibm_resource | The resource being measured by the service, typically an identifying name or GUID |
Resource Type | ibm_resource_type | The type of the resource being measured by the service |
Resource group | ibm_resource_group_name | The resource group where the service instance was created |
Scope | ibm_scope | The scope is the account, organization, or space GUID associated with this metric |
Service name | ibm_service_name | Name of the service generating this metric |
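Segmenting a metric means grouping its values by one of these attributes. The following is a minimal sketch of that idea on data held in memory: the sample values and the helper `segment_by` are hypothetical, and Monitoring performs the equivalent grouping for you in its dashboards.

```python
from collections import defaultdict


def segment_by(samples, attribute):
    """Group metric values by one attribute and sum the values in each group."""
    totals = defaultdict(float)
    for labels, value in samples:
        # Samples that lack the attribute are collected under "unknown".
        totals[labels.get(attribute, "unknown")] += value
    return dict(totals)


# Hypothetical samples: (attributes, value) pairs for a single metric.
samples = [
    ({"ibm_location": "us-south", "ibm_resource_group_name": "default"}, 4),
    ({"ibm_location": "us-south", "ibm_resource_group_name": "analytics"}, 2),
    ({"ibm_location": "eu-de", "ibm_resource_group_name": "default"}, 1),
]

print(segment_by(samples, "ibm_location"))
print(segment_by(samples, "ibm_resource_group_name"))
```

The same samples yield different totals depending on the attribute chosen, which is why each metric lists the attributes that it can be segmented by.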
Additional attributes
The following attributes are available for segmenting one or more of the metrics that are described in the reference above. See the individual metrics for segmentation options.
Attribute | Attribute Name | Attribute Description |
---|---|---|
Catalog in the Presto Server | ibm_watsonx_data_catalog | Catalog in the Presto Server |
Catalog name | ibm_watsonx_data_catalog_name | Catalog name |
Resource role | ibm_watsonx_data_resource_role | Role in the resource group |
Role associated with Catalog | ibm_watsonx_data_catalog_role | Role associated with Catalog |
Schema in the Presto Server | ibm_watsonx_data_schema | Schema in the Presto Server |
Service instance | ibm_service_instance | The service instance segment identifies the instance that the metric is associated with. |
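If you query these metrics with PromQL, the attribute names can serve as label filters and grouping keys. The following is a minimal sketch that only builds query strings. It assumes that your Monitoring instance accepts PromQL and exposes the attribute names as labels. The helpers `promql_selector` and `sum_by` and the instance value `my-instance-guid` are hypothetical.

```python
def promql_selector(metric, **labels):
    """Build a PromQL selector with exact-match label filters."""
    if not labels:
        return metric
    matchers = ",".join(
        # Escape backslashes and double quotes so the label value stays valid.
        '{}="{}"'.format(name, str(value).replace("\\", "\\\\").replace('"', '\\"'))
        for name, value in sorted(labels.items())
    )
    return "{}{{{}}}".format(metric, matchers)


def sum_by(selector, *attributes):
    """Wrap a selector in a sum aggregation grouped by the given attributes."""
    return "sum by ({}) ({})".format(", ".join(attributes), selector)


# Hypothetical instance identifier; replace it with your own value.
selector = promql_selector(
    "ibm_watsonx_data_queries_running_total",
    ibm_service_instance="my-instance-guid",
)
print(selector)
print(sum_by(selector, "ibm_resource_group_name"))
```

Building the strings in code keeps the metric and attribute names in one place, so a renamed label needs a change in only one location.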