| Deployment and Updates |
IBM-managed, with automatic updates and scaling. |
Self-managed, manual installation, updates, and scaling. |
| Milvus Support |
Full Milvus service is available (not just lite Milvus). Lite Milvus is also supported for lightweight vector search needs. |
Full Milvus service can be added manually. Lite Milvus is not supported. |
| High Availability and Disaster Recovery (HADR) |
Built-in HA and DR features managed by IBM. |
Requires manual setup for HA and DR; metadata backup supported. |
| Data Access Service (DAS) |
Supported |
Supported (in Tech Preview as of version 2.2.x). |
| OpenTelemetry Support |
Yes |
Yes |
| Governance Integrations (for example, watsonx.governance) |
Seamless native integration with IBM governance services. |
Possible but requires manual configuration and compatibility setup. |
| Security and IAM |
IAM managed through IBM Cloud Identity services with role-based access. |
Depends on local authentication (for example, LDAP, AD); custom IAM configuration required. Full control over data, encryption, firewall, and network; aligns with internal security policies. |
| Resource Scaling |
Elastic, managed automatically by IBM. |
Manual scaling via admin operations and resource management. |
| Service Availability Regions |
Restricted to IBM Cloud-supported regions. |
Available anywhere customer deploys on-premise. |
| Query Engine - Presto |
Available; needs provisioning. IBM manages deployment and scaling. |
Available; needs provisioning. Customer manages deployment and scaling. |
| Spark Support (Ingestion) |
Available out-of-the-box; used for ingestion and table management. Requires provisioning and setup. |
Available out-of-the-box; used for ingestion and table management. Requires provisioning and setup. |
| Storage |
Utilizes IBM Cloud Object Storage (COS) – S3-compatible service automatically provisioned by IBM. You pay for storage with flexible scaling. |
Uses customer-provided persistent storage via Red Hat OpenShift – typically Ceph, IBM Storage Ceph, or Storage Foundation, set up by the customer. |
| Storage format |
Supports both internal COS (automatically provisioned) and a broad range of external object/file stores, including IBM COS, Amazon S3, MinIO, HDFS, Google Cloud Storage, Azure Data Lake, Apache Ozone, NFS, and so forth. |
Same wide support as SaaS—built-in and external: IBM COS, S3, Ceph, MinIO, HDFS, Ozone, ADS, Storage Scale, Portworx, NFS, and so forth. |
| Data sources |
Supports broad set of connectors—including IBM Cloud services (COS, Db2, Cloud Databases), 3rd-party RDBMS (MySQL, PostgreSQL, SQL Server, Oracle), NoSQL (Cassandra, MongoDB), cloud storage (S3, GCS, Azure), files (FTP, HTTP, Box, Dropbox),
Milvus, Elasticsearch, and so forth. |
Virtually identical set supported on OpenShift/Pak—covering those same IBM services, relational and NoSQL databases, cloud stores, file connectors, Milvus, and so forth. |
| Custom JDBC connectors |
No supported |
Supported through Infrastructure Manager |
| Vault and Secret Storage |
Not available |
Supported |
| Kerberos Auth |
Not supported |
Supported for connectors |
| Integration |
|
|
| dbt (Data Build Tool) |
Supported: dbt‑watsonx‑presto for SQL/Presto, dbt for Spark engine (in‑place transforms). |
Supported: dbt‑watsonx‑presto works equally for on‑prem Presto; Spark engine also supports dbt . |
| IBM Knowledge Catalog (IKC) |
Native integration for governance on SQL views/tables across Presto/Spark. |
Same governance integration applies to on‑prem deployments using IKC . |
| Apache Ranger |
Policy support for Presto (C++) and Spark through Ranger plugin. |
Also supported on‑prem when Ranger is available in the environment. |
| Databand |
Supported for Spark monitoring beyond Spark UI. |
Available for on‑prem Spark pipelines too. |
| Birdwatcher |
Debugging tool for Milvus service included. |
Available for On‑Prem. |
| DataStage and Data Virtualization |
Integration with IBM DataStage and Data Virtualization on Cloud Pak for Data (CPD). |
Fully available in on‑prem installation through Cloud Pak integration. |
| BI Tool Integration |
Supports Superset, Tableau, Power BI, Cognos, and so forth through public JDBC/ODBC endpoints. |
Same tools supported; depends on local network setup and firewall rules. |
| Manta Lineage Integration |
Supported; Built-in integration with visualization in Manta UI. |
Supported; Requires manual configuration for Manta integration. |
| Data product hub |
Available as a managed service in the cloud. Users can publish, govern, discover, and consume "data products" created within IBM watsonx.data. Data Product Hub seamlessly integrates with the cloud version for lifecycle management
and cataloging. |
Also fully supported on-premises as software. Data Product Hub can be deployed within an OpenShift/Cloud Pak for Data environment alongside IBM watsonx.data. All features such as, data product lifecycle, metadata, governance, search are
available natively. |
| Extensibility and BYOL |
Limited BYOL (Bring Your Own License) in managed model. |
Full BYOL flexibility – integrate any compatible tools or engines. |
| Air-Gap/Offline Usage |
Not supported; Requires internet access to use. |
Fully supported; Suitable for air-gapped, highly regulated, or disconnected environments. |
| Billing and Licensing |
Subscription-based pricing (per usage or per user) |
Traditional enterprise license + infrastructure cost |