Adding data source

You can register and use data sources in IBM® watsonx.data. A catalog defines the schemas and metadata for a data source.

When you add your own object storage bucket or data source, or query the data in these sources through the watsonx.data query engines, egress charges for pulling data out of these sources might apply, depending on your service provider. If you are using managed services, consult your service provider's documentation or support for details about these charges.

To reduce latency, colocate your additional object storage buckets or data sources in the region where your watsonx.data instance is provisioned.

To add a data source-catalog pair, complete the following steps.

  1. Log in to the watsonx.data instance.

  2. From the navigation menu, select Infrastructure manager.

  3. To add a data source, click Add component.

  4. Under Add component, select a data source from the Data source section.

  5. Based on the data source type selected, configure the data source details.

  6. You can associate a catalog with the data source. This catalog can in turn be associated with an engine. A catalog defines the schemas and metadata for a storage or data source. Depending on the storage type, Apache Iceberg, Apache Hive, Apache Hudi, and Delta Lake catalogs are supported.

    You cannot add two data sources with the same name.
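The relationship described in the steps above (a data source, an optional associated catalog, and the rule that names must be unique) can be pictured with a small in-memory model. This is only an illustrative sketch: it does not call watsonx.data or any IBM API, and the class names `DataSource`, `DataSourceRegistry`, and `DuplicateNameError`, the sample values, and the exact-match (case-sensitive) name comparison are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


class DuplicateNameError(ValueError):
    """Raised when a data source name is already registered (hypothetical)."""


@dataclass(frozen=True)
class DataSource:
    # All fields are illustrative; they are not the product's schema.
    name: str                           # must be unique within the registry
    source_type: str                    # free-form label, e.g. "postgresql"
    catalog_name: Optional[str] = None  # associated catalog, if any


@dataclass
class DataSourceRegistry:
    # Maps data source name -> DataSource.
    sources: Dict[str, DataSource] = field(default_factory=dict)

    def add(self, source: DataSource) -> None:
        # Enforce the uniqueness rule; exact string match is an assumption.
        if source.name in self.sources:
            raise DuplicateNameError(
                f"a data source named {source.name!r} already exists"
            )
        self.sources[source.name] = source


# Usage: the second add reuses a name and is rejected.
registry = DataSourceRegistry()
registry.add(DataSource("sales_db", "postgresql", catalog_name="sales_catalog"))
try:
    registry.add(DataSource("sales_db", "mysql"))
except DuplicateNameError as err:
    print(f"rejected: {err}")
```

Keying the registry by name makes the uniqueness check a single dictionary lookup. How the product itself compares names (for example, with respect to case) is not stated here, so treat that part of the sketch as a placeholder.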

The following data sources are supported:

For more information on mixed-case feature flag behavior, supported SQL statements, and supported data type matrices, see Support content.

Related API

For information on related API, see