Adding a data source

You can register and use data sources in IBM® watsonx.data. A catalog defines the schemas and metadata for a data source.

When you add your own object storage bucket or data source, or query the data in these data sources through the query engines of watsonx.data, egress charges for pulling data out of these sources might apply depending on your service provider. If you are using managed services, consult your service provider's documentation or support for details about these charges.

To reduce latency, colocate your additional object storage buckets or data sources in the region where the watsonx.data instance is provisioned.
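The colocation recommendation can be checked before you register anything. The following is a minimal sketch, not part of watsonx.data: the `PlannedSource` class, the `sources_outside_region` helper, and the sample names and regions are all hypothetical, and the region of each source is assumed to be something you already know from your service provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlannedSource:
    """A data source or bucket you intend to register (hypothetical helper type)."""
    name: str
    region: str


def sources_outside_region(sources, instance_region):
    """Return the names of planned sources that are not in the instance's region."""
    target = instance_region.strip().lower()
    return [s.name for s in sources if s.region.strip().lower() != target]


# Example values only; replace with your own source names and regions.
planned = [
    PlannedSource("sales_pg", "us-south"),
    PlannedSource("logs_bucket", "eu-de"),
]

# Sources listed here may add latency and, depending on the provider, egress charges.
print(sources_outside_region(planned, "us-south"))
```

Any name that the helper returns is a candidate for relocation, or at least for a closer look at the provider's egress pricing.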

To add a data source-catalog pair, complete the following steps.

Log in to the watsonx.data instance.
From the navigation menu, select Infrastructure manager.
To add a data source, click Add component.
In the Add component window, select a data source from the Data source section and provide the details to establish the connection.

Two data sources with the same name cannot be added.
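If you plan to add several data sources, you can catch name collisions before you open the Add component window. The following is a minimal sketch under stated assumptions: the `duplicate_names` helper is hypothetical, and it treats names as equal only when they match exactly after surrounding whitespace is trimmed. How watsonx.data itself compares names (for example, whether it is case-sensitive) is not stated here, so adjust the comparison to match the behavior you observe.

```python
from collections import Counter


def duplicate_names(names):
    """Return the sorted list of names that appear more than once.

    Assumption: names are compared exactly after trimming whitespace.
    """
    counts = Counter(name.strip() for name in names)
    return sorted(name for name, count in counts.items() if count > 1)


# Example values only.
planned_names = ["sales_pg", "orders_mysql", "sales_pg"]
print(duplicate_names(planned_names))
```

An empty result means that the planned names are unique among themselves. It does not check them against data sources that are already registered in the instance.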

watsonx.data supports the following data source options:

Amazon Redshift
Apache Druid
Apache Kafka
Apache Phoenix
Apache Pinot
BigQuery
Cassandra
ClickHouse
Elasticsearch
HANA
IBM Data Virtualization Manager for z/OS
IBM Db2
IBM NPSaaS
Informix
MongoDB
MySQL
Oracle
PostgreSQL
Prometheus
Redis
SingleStore
Snowflake
SQL Server
Teradata
Custom data source
Arrow Flight Service:
    Apache Derby
    Greenplum
    MariaDB
    Salesforce

For more information on mixed-case feature flag behavior, supported SQL statements, and supported data type matrices, see Support content.

Related API

For information on related API, see