Choosing a deployment solution
Discovery is available both as a service that is hosted by IBM Cloud and as a service that you install on IBM Cloud Pak for Data. Learn about these deployment solutions and how they differ.
The product user interface and APIs are mostly equivalent regardless of whether you use the managed or installed version of the service. The few differences between the two solutions include:
- How you deploy and set up the service
- The underlying technology that is used to crawl data sources
- Limits for things like the maximum number of documents and enrichments or file sizes
- Who to contact for support and how you share product feedback
Although the documentation that describes how to use the product is the same (what you're reading now), more documentation about how to install and administer the service in IBM Cloud Pak for Data is available from the IBM Cloud Pak for Data product documentation that is hosted in IBM Documentation.
To keep up with product changes, check the following topics periodically:
- IBM Software Hub Release notes for IBM Software Hub service instances
- IBM Cloud Pak for Data Release notes for IBM Cloud Pak for Data service instances
- IBM Cloud Release notes for IBM Cloud service instances
Comparing features
The following table describes the feature support differences between the two deployment types.
The dynamic website web crawl feature, which is controlled by the Execute JavaScript during crawl switcher in Crawl settings, is deprecated and will be removed by September 2025. For more information, see Release notes.
The features that are listed in the IBM Cloud column apply to service instances that are deployed from IBM Cloud Pak for Data as a Service also.
Feature | IBM Cloud | IBM Cloud Pak for Data |
---|---|---|
Crawl the local file system, Window file system, databases, LDAP directories, FileNet P8, and HCL Notes | ||
Schedule crawls with more precision | ||
Apply document-level security to crawled collections | ||
Enable JavaScript execution for web pages that you want to crawl | ||
Crawl IBM Cloud Object Storage | ||
Preview .pdf files that are crawled from external data sources | ||
Build custom crawlers | ||
Use App Connect to crawl other external data sources | ||
Apply answer finding to search queries | ||
Optical Character Recognition v2 | ||
Patterns enrichment | ||
App switcher menu where you can get service instance information and usage statistics | ||
Import and apply UIMA text analysis models created in Watson Explorer Content Analytics Studio | ||
Monitor usage with activity tracker events | ||
Monitor usage of the Analyze API from the product user interface |