IBM Cloud Docs
Release notes for IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data

Release notes for IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data

Learn about features and changes that were included for each release and update of the product software.

IBM Cloud Pak for Data

This information applies only to instances of IBM Watson® Discovery that are installed on IBM Cloud Pak® for Data. For information about releases and updates for managed deployments, see Release notes for Watson Discovery for IBM Cloud.

For the list of Discovery known issues, see Limitations and known issues in Watson Discovery.

Knowledge Studio for IBM Cloud Pak for Data deprecation announcement

After version 4.7, the operator for IBM Knowledge Studio will no longer be supported and will be removed from the IBM Watson Discovery Cartridge for IBM Cloud Pak for Data and from github.com. The service will not be displayed in the Cloud Pak for Data catalog. This change will not impact existing deployments of the operator.

Migrate your solutions to Watson Discovery, which has powerful custom natural language processing capabilities. Any existing Watson Knowledge Studio for Cloud Pak for Data rules-based or machine learning models can be imported to Watson Discovery and applied to your data as custom enrichments. And the recent release of the custom entities extraction feature brings equivalent function to label and train custom entity models into Watson Discovery. For more information about these features, see Choose enrichments.

For more information about migrating your solutions, see Migrating Knowledge Studio solutions.

4.8.0 release, 29 November 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.0 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding

4.7.3 release, 27 September 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.7.3 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding

4.7.1 release, 26 July 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.7.1 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding

4.7.0 release, 28 June 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.7.0 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Optical Character Recognition v2

4.6.6 release, 18 May 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data was not refreshed as part of 4.6.6. You can use Discovery 4.6.5 with IBM Cloud Pak for Data 4.6.6.

4.6.5 release, 2 May 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6.5 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Manage the data in a collection from the new Manage data page

You can now access a Manage data page for a collection. From the new page, you can see a list of the documents in your collection and get a quick view of information about the documents. You can also delete documents from a collection with just a few clicks. For more information, see Excluding content from query results.

You have more control over the data that is crawled by the database connector

When you connect to a database as an external data source, you can now specify the column from which to extract data. If you don't specify the column, a column with text or with a single large object is chosen to be crawled. You can also specify the MIME type of the data in the column that you want to crawl.

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Optical Character Recognition v2

4.6.4 release, 29 March 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data was not refreshed as part of 4.6.4. You can use Discovery 4.6.3 with IBM Cloud Pak for Data 4.6.4 on Red Hat OpenShift Container Platform versions 4.10 or 4.12.

4.6.3 release, 23 february 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6.3 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Manage data page
Important: Back up your data before upgrading to version 4.6.3

Before upgrading to version 4.6.3, you must make a backup of your data. Preserve the backup in a safe location. For more information about backing up your data, see Backing up and restoring data in IBM Cloud Pak for Data. That topic also includes information about restoring your data if that becomes necessary.

4.6.2 release, 30 January 2023

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6.2 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Manage data page

4.6.1 release, December 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data was not refreshed as part of 4.6.1. However, the product documentation was updated with fixes and enhancements.

4.6 release, 30 November 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Manage data page

4.5.3 release, 13 October 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.5.3 is available.

There are no new features in this release. For a list of bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Manage data page
  • Advanced document view for search results
  • The similar parameter of the Query method
  • The smart_document_understanding field in the Get collection method response

15 August 2022

SDKs were updated to reflect the latest API changes.

The following Discovery v2 API changes are now reflected in the SDKs:

  • Use the new document classifier API to get, add, update, or delete a document classifier.

  • A new document status API is available. You can use it to get a list of the documents in a collection and to get details about a single document.

  • You can now get, add, and remove a stop words or expansion list for a collection.

  • The suggested_refinements parameter of the Query method is deprecated.

4.5.1 release, 3 August 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.5.1 is available.

There are no new features in this release. For a list of bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Manage data page
  • Advanced document view for search results
  • The similar parameter of the Query method
  • The smart_document_understanding field in the Get collection method response

4.5 release, 29 June 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.5 is available.

For a list of new features and bug fixes, see What's new and changed in Watson Discovery

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Answer finding
  • Manage data page
  • Advanced document view for search results

4.0.9 release, 25 May 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.9 is available.

API usage information is now available from the user interface

You can now get information about analyze API usage from the Data usage>API usage page in the product user interface. For more information about the analyze API, see Analyze API.

A new document status API is supported in IBM Cloud Pak® for Data instances

Use the new document status API to programmatically get a list of the documents in a collection and to get details about a single document.

  • The API is supported for collections that are created after 23 March 2022.

    If you want to get status information about a collection that was created earlier, trigger a process that runs the conversion step of ingestion on the documents. For example, from the Activity page for the collection, click Recrawl.

  • The API is not supported from the SDKs currently.

For more information about the new API, see the API reference documentation.

Security vulnerabilities were addressed

The following security patches were applied:

4.0.8 release, 27 April 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.8 is available.

The Development deployment type was changed to Starter

When you install Watson Discovery, you can optionally specify the type of deployment by including the deploymentType parameter in your custom resource. The Development option is now called the Starter option.

The Development and Starter options are functionally the same, and both values are accepted by the service.

Security vulnerabilities were addressed

The following security patches were applied:

IBM Watson® Discovery for IBM Cloud Private (ICP) for Data 2.2.x End Of Support

Effective 30 April 2022, IBM will withdraw support for the following programs:

  • IBM Watson Discovery for ICP for Data 2.2.x
  • IBM Watson Discovery for ICP for Data Add-on 2.2.x

For more information, see announcement ENUS921-134.PDF.

4.0.7 release, 30 March 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.7 is available.

IBM Cloud Block Storage is now supported

When you install Discovery, you can specify IBM Cloud Block Storage Gold tier (ibmc-block-gold) as your storage class. For more information about the storage class, see Storing data on classic IBM Cloud Block Storage.

Security vulnerabilities were addressed

The following security patches were applied:

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Home page updates
  • Answer finding
  • Manage data page
  • Advanced document view for search results

30 March 2020

A new document classifier API is available

Use the new document classifier to programmatically get, add, update, or delete a document classifier. The following notes apply to this release:

  • The enrichments property of the Document Classifier object is documented as being optional. However, the property is required currently.
  • The field property in the federated_classification object is documented as a string. However, it is currently an array.

For more information about the new API, see the API reference documentation. For more information about adding a document classifier by using the product user interface, see Using the Content Mining application.

The document classifier endpoints are not supported in the SDKs currently.

4.0.6 release, 1 March 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.6 is available.

Multitenancy is now supported

An administrator can now create up to 10 instances of the Discovery service per deployment, which means that more teams can work on discrete Discovery projects at the same time.

Simpler installation and management of custom connectors

The manage_custom_crawler.sh script was improved to make it easier for you to install and manage your custom connectors in a multitenant environment. For more information, see Installing a custom crawler.

Security vulnerabilities were addressed

The following security patches were applied:

Features that are not available in this release

The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Home page updates
  • Answer finding
  • Access to guided tours from the page header

4.0.5 release, 26 January 2022

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.5 is available.

A security vulnerability was addressed

The following security patch was applied: Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Log4j

Features that are not available in this release

The following features are generally available from IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Home page updates
  • Answer finding
  • Guided tours

4.0.4 release, 20 December 2021

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.4 is available.

Guided tours are available

Access guided tours from anywhere in the product user interface by clicking the Guided tours button in the page header.

Security vulnerabilities were addressed

The following security patches were applied:

Features that are not available in this release

The following features are generally available from IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Home page updates
  • Answer finding

4.0.3 release, 30 November 2021

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.3 is available.

Another storage option is supported

IBM Spectrum Scale Container Native storage is now supported in addition to Red Hat OpenShift Container Storage and Portworx.

Microsoft SharePoint Online data source improvement

The Sharepoint Online data source now supports crawling your data as a service principal, which means you can access your data without disabling multifactor authentication. For more information, see Microsoft Sharepoint Online.

Microsoft Windows File System improvements

Extra configuration options mean you can specify the following information:

  • The types of files (by file extension) to include or exclude from a crawl of a Windows directory.
  • The character encoding of the data to be crawled. Typically, the encoding is detected automatically. However, you can choose to specify the character encoding as a Java character set yourself.

For more information, see Windows File System.

Field selection is improved

When you apply an enrichment to a field or choose a field to use as the source for a facet, the fields that are displayed for you to choose from now shows only fields that are valid choices.

Search settings change

The spelling correction setting changed from being enabled automatically in new projects to being disabled by default. If you want to alert users when they misspell a term in their query, turn on Spelling suggestions. For more information, see Customizing the search bar.

A Salesforce crawling issue was fixed

Previously, Discovery had an issue where it timed out before it crawled some of the object types in a Salesforce collection. If your collection is configured to crawl the following object types, run a full data source crawl to make sure that your collection contains the most up-to-date data from all of the objects in your Salesforce data source:

  • Attachment
  • ContentVersion
  • Document
Security vulnerabilities were addressed

The following security patches were applied:

Features that are not available in this release

The following features are available from IBM Cloud deployments at the time of this release, but not from installed deployments:

  • Home page updates
  • Guided tours
  • Answer finding

4.0.2 release, 5 October 2021

IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.2 is available.

Support for newer platform software
IBM Cloud Pak® for Data 4.0.2 can be installed on Red Hat® OpenShift® on IBM Cloud® 4.8.
New scoring for NLU enrichments
Relevance and confidence scores are displayed for NLU enrichments that are returned by search. For example, when you open the JSON view of the document preview from a query result, you can see confidence scores for Entities mentions and relevance scores for Keyword mentions.
Improved Web crawl
The Web crawl data source supports more customization options, including the ability to ignore a site's robots.txt file. For more information, see Web crawl.
New upgrade support
The 4.0.2 release supports in-place upgrade from IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.0. For more information, see Upgrading Watson Discovery to a newer 4.0 refresh

IBM Cloud Private End Of Support

Effective 30 September 2021, IBM withdrew support for the following programs:

  • IBM Watson Assistant Discovery Extension for IBM Cloud Private 2.1.0–2.1.4
  • IBM Watson Discovery for ICP for Data 2.1.0–2.1.4
  • IBM Watson Discovery for ICP for Data Add-on 2.1.0–2.1.4

For more information, see announcements ENUS921-005.PDF and ENUSLP21-0099.PDF.

4 release, 13 July 2021

New version now available

Discovery for Cloud Pak for Data 4 is available

This release is supported on IBM Cloud Pak® for Data 4.0.0.

Change to service name

The new name is IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data.

New Smart Document Understanding (SDU) predefined model

When you identify fields, instead of annotating documents with the SDU tool, you can choose to use a pretrained model. The pretrained model applies a non-customizable model that automatically extracts text and identifies tables, lists, and sections.

Improved contract analysis

To enable the Contracts enrichment that recognizes and tags contract-related concepts in your data, you can choose to create a Document Retrieval project type, and then select Apply contracts enrichment. You no longer need to use an installation override YAML file to enable it. This change also means that you can choose which Document Retrieval projects use the Contracts enrichment; it is not applied to all Document Retrieval projects automatically.

New LDAP directory data source

Connect to data that is stored in an external directory that supports the Lightweight Directory Access Protocol (LDAP), such as a corporate email directory. As the directory data is added to your collection, Discovery interprets and stores key attributes of each record, such as department and location information. Later, you can find relevant records by filtering on these attribute categories. For more information, see LDAP directory.

Improved SharePoint OnPrem connection process

The steps you follow to connect to a SharePoint instance that is hosted on-premises were simplified. You no longer need to deploy a web services package on the SharePoint server before you can connect to the SharePoint OnPrem data source. For more information, see SharePoint OnPrem.

New Salesforce proxy support

You can now connect to a Salesforce data source when using a proxy server. For more information, see Salesforce.

Improved custom connector improvements

Support was added for Optical character recognition (OCR)

Support was added for Document-level security

For more information about the custom connector, see Building a Cloud Pak for Data custom connector.

Change to Dynamic Faceted Search

Support for Dynamic Faceted Search and its associated suggested_refinements API query parameter was removed.

2.2.1 release, 26 February 2021

New release now available
IBM Watson™ Discovery for IBM Cloud Pak for Data version 2.2.1 is available.
Support for upgrade
Discovery for Cloud Pak for Data supports an in-place upgrade from version 2.2.0 to 2.2.1 so that you do not need to manually uninstall an earlier version and then install the latest version of the service. For more information, see Upgrading Discovery for Cloud Pak for Data.
New SDK download support
You can now download the custom connector SDK package from your Discovery for Cloud Pak for Data cluster, instead of retrieving the images and the SDK package from the Docker registry. For more information, see Downloading the custom-crawler-docs.zip file in Discovery 2.2.1 and later.
Change to Invoices and Purchase orders
Invoices and Purchase orders models can no longer be enabled in the tooling. If you need these models, please contact IBM Cloud Support to obtain instructions for enabling these models.
Change to Contracts enrichment tables
In a Document Retrieval project that has the Contracts enrichment applied, tables are not included inside the contracts field, as they were previously in projects that had the Contracts enrichment enabled. Tables will continue to be included in a separate tables field when the Table Understanding enrichment is applied.
Change to support for Oracle Database 11g and Postgres 9.5
Support for connecting to Oracle Database 11g was removed because the vendor ended version support on 31 December 2020.
Support for connecting to Postgres 9.5 was removed because the vendor ended version support on 11 February 2021.

2.2.0 release, 8 December 2020

New release now available
IBM Watson™ Discovery for IBM Cloud Pak for Data version 2.2 is available.
Discovery for Cloud Pak for Data now works with IIBM Cloud Pak® for Data 3.5.
New support for Notes attachments
Added support for attachments in the Notes data source. For more information, see Notes
New web crawl scheduling option
You can specify the exact time that you would like your crawls to run for any data source, giving you the flexibility to run them at the times you prefer. For more information, see Configuring Cloud Pak for Data data sources.
New Facet creation in Content Miner
You can now create Facet groups in a Content Miner application.
New custom crawler creation
Added the option to create your own custom crawler plug-in. For more information, see Building a Cloud Pak for Data crawler plug-in Note: Any custom code used with Watson Discovery is the responsibility of the developer and is not covered by IBM support.
Change to Dynamic Facets
Dynamic Facets are no longer enabled by default in Document Retrieval projects.

2.1.4 release, 2 September 2020

New release now available
IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.4 is available.
New Notes connector
Crawl Notes version 9.0.1 systems. For more information, see Notes connector.
New Enable proxy settings in multiple connectors
You can now select the option to enable proxy settings in Box, Microsoft SharePoint Online, and Microsoft SharePoint OnPrem connectors.
New options for Database connector
Added support for multiple tables and the Row filter option to the Database connector.
New authentication types for Web crawler
You can select from three new authentication types in Web crawler: Basic authentication, NTLM authentication, and FORM authentication.
New Analyze API usage monitoring
You can now monitor the usage of the Analyze API using the tooling. For more information, see Monitoring usage.

30 August 2020

Update to API version
The current API version (v2) is now 2020-08-30. The following change was made with this version:
Change to 'options' object
The List enrichments method no longer returns the options object per enrichment. Use the Get enrichment method to return the options object for a single enrichment.

2.1.3 release, 19 June 2020

New release now available
IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.3 is available.
Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 3.0.1.
New Finnish and Hebrew language support
Added basic support for Finnish and Hebrew. For more information, see Language support.
Change to Analyze endpoint
The Analyze endpoint, which supports stateless document ingestion workflows. For details, see the Analyze API. The Analyze API supports JSON documents only. Use of the Analyze API affects license usage.
New options for Content Miner
The content mining application includes two new options: Cyclic time scale on the Time series dashboard, and the Contextual view tab.
New shortcut for Content Mining projects
For Content Mining projects only, the Improve and customize page includes a shortcut: the Launch application button. Previously, you were required to open the Integrate and deploy page, select the Launch application tab, and click the Launch button.
Improved segment limit
The segment limit when splitting documents has been increased to 1,000. For details, see Split documents to make query results more succinct.
Improved Filenet connector
The Filenet connector has document level security.
New beta Curations feature
You can specify up to 1,000 curations. For details about this beta feature, see Curations.
Fixed defects in the 2.1.3 release
In versions 2.1.2, 2.1.1, and 2.1.0, PNG, TIFF, and JPG individual image files are not scanned, and no text is extracted from those files. PNG, TIFF, and JPEG images embedded in PDF, Word, PowerPoint, and Excel files are also not scanned, and no text is extracted from those image files.

2.1.2 release, 31 March 2020

New release now available
IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.2 is available.
New IBM FileNet connector
You can now crawl IBM FileNet systems. For more information, see FileNet connector.
New Swedish, Norwegian, and Danish language support
Added basic support for Swedish, Norwegian (Bokmål and Nynorsk), and Danish. For more information, see Language support.
Change to Advanced rules models enrichment
The Advanced rules models enrichment is now GA.
New document preview for search results
You can now view your search results in a document preview for the following source documents: PDF, Word, PowerPoint, Excel, and all image files. See supported file types for the list of image files. This view makes it easier for you to see search results as highlighted passages within the text of the original document, making the context clearer.
New proxy support for Web Crawl
Support was added to the Web Crawl connector for proxy support.
Change to empty aggregations parameter
Running a query with an empty aggregations parameter returns zero aggregations in the response.
Change to Postgres support
Support for connecting to Postgres 9.4 was removed because the vendor ended version support was ended by the vendor on 13 February 2020.
Fixed the following defects in the 2.1.2 release
When installing Discovery for Cloud Pak for Data on OpenShift, the ranker-rest service might intermittently fail to startup, due to an incompatible jar in the classpath.
When you upload documents to a collection with existing documents, a Documents uploaded! message displays on the Activity page, but no further processing status displays until the number of documents increases.
Running a query with an empty aggregations parameter returns an empty aggregations array.
Deprovisioning a IBM Watson® Discovery for IBM Cloud Pak® for Data Instance will not delete the underlying data. Delete the collections and documents manually.

2.1.1 release, 24 January 2020

New release now available
IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.1 is available.
Fixed the following defects in the 2.1.1 release:
In Document Retrieval project types, when you perform an empty search, and the search results source is set to passages, the query results will display excerpt unavailable in the Project workspace.
When visiting the Storybook links on the Integrate and deploy page, the links do not go to the correct location. Please visit Storybook instead to view documentation.
If you are using Smart Document Understanding, two variables no longer need to be set during installation or reinstallation. For more information, see Environment variable settings for Smart Document Understanding.
Discovery for Content Intelligence and Table Understanding enrichments are configured out of the box to be applied on a field named html. When a user uploads a JSON document without a root-level field named html, these enrichments will not yield results in the index. To run the enrichments on this kind of JSON documents, users must re-configure the enrichments to run on an existing field (or fields) in the JSON document.

2.1.0 release, 27 November 2019

New release now available
IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.0 is available.
Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 2.5.0.0.
New Project-based interface
Test your application like an end-user would with the Document retrieval, Conversational Search, and Content Mining project types. For more information, see Creating projects.
New Content Mining app
Build an end user interface for extracting insights proactively from your entire corpus. For more information, see Analyzing your data with the Content Mining application.
New Content Intelligence add-on
Option to enrich your documents with pre-built domain knowledge for Contracts. For more information, see Document Retrieval for Contracts.
New reusable components
Use reusable components to quickly build your application with Discovery. We ship an autocomplete, rich preview, results and facets component. For more information, see Building and deploying components.
New Czech, Polish, Romanian, Russian, and Slovak language support
Basic support for Czech, Slovak, Russian, Polish and Romanian is added. For more information, see Language support.
New built-in table understanding
Extract tables from your documents without training, and optionally return tables as answers to natural language queries. For more information, see Understanding tables.
New SDK connector
Build custom connectors your Discovery users can use to build their own applications. For more information, see Building and implementing a custom connector.
New pre-built sample project
The sample project is preloaded with data, so you can learn about Discovery. For more information, see Getting started with Watson Discovery.
New passage retrieval
Will return the most relevant passages from your documents, plus you can specify the number of passages returned per document. See Passages.
New project-level querying and relevancy training
Query multiple collections at once including relevance training.
Improved Web crawl connector
Additional options now available for the Web crawl connector - For more information, see Web crawl.
New Local File System connector
Crawl Linux or other file systems. For more information, see Local file system
New dynamic Facets
Automatically generate facets based on the understanding of your data. For more information, see Facets.
New Dictionary suggestions
Dictionary terms are suggested based on your content. For more information, see Dictionary.
New beta Curations
Specify a particular result for a given query. For more information, see the API reference.

2.0.1 release, 30 August 2019

New release now available
IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.0.1 is available.
Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 2.1.0.1.
New Windows File System and Database connectors
Added the Windows File System and Database connectors. For more information, see Database connector and Windows File System connector.
New Chinese language support
Added support for Traditional Chinese. For more information, see Language support.
New FISMA support
Federal Information Security Management Act (FISMA) support is available for IBM Watson® Discovery for IBM Cloud Pak® for Data offerings purchased on or after August 30, 2019. FISMA support is also available to those who purchased the June 28, 2019 version and upgrade to the August 30, 2019 version. IBM Watson® Discovery for IBM Cloud Pak® for Data is FISMA High Ready.
New Classifier enrichment
Released the Classifier enrichment. For more information, see Classifier.
New Red Hat OpenShift support
Added support for installing IBM Cloud Pak® for Data on Red Hat OpenShift.
Fixed the following defects in Discovery for Cloud Pak for Data offerings purchased on or after August 30, 2019
During an active web crawl, if you add an enrichment, then click the Recrawl collection button on the Activity page, the collection will stop processing. If the collection does not return to a Syncing state on its own, clicking the Recrawl collection button an additional time might be required.
While training a collection in the tooling , if you rate the relevancy of a result (for example, asRelevant), then switch to the opposite rating (Not relevant), the page may go blank. To restore the page, refresh the browser. Your updated rating will be retained.
Chinese, Japanese, and Korean language Microsoft Word, Excel, and PowerPoint documents will not display correctly in the index or the Smart Document Understanding editor.
If you upload a zip, gzip, or tar file to your collection, and that file contains multiple files/file types supported by Smart Document Understanding (PDF, Word, Excel, PowerPoint, PNG, TIFF, JPEG), only one of the files in that zip, gzip, or tar file will be available for training in the SDU editor (unless the SDU document limit has already been met). All of the documents will be available in the index. Unzip the file before uploading to avoid this issue.
Query expansion and autocomplete return the wrong error code when the collection_id is invalid. Query expansion will return a 500 error code instead of a 404. Autocomplete will return a 400 when the collection_id is invalid and the prefix parameter isn’t set. It should also return a 404.
When crawling Microsoft SharePoint 2019 collections, only HTML documents will be crawled and indexed. This is a SharePoint issue with how it processes mime-types. See this Microsoft blog post for a workaround.
If you delete an installation of the Discovery for Cloud Pak for Data add-on, the instance will not uninstall completely and your re-installation will fail. See the Discovery for Cloud Pak for Data Readme for post-cleanup steps.
If a JSON document that contains nested JSON objects is ingested, the nested JSON will be indexed as a JSON string.

2.0.0, General Availability (GA) release, 28 June 2019

Discovery for Cloud Pak for Data now available
The IBM Watson® Discovery for IBM Cloud Pak® for Data service brings the cognitive capabilities of IBM Watson® Discovery to the IBM Cloud Pak® for Data platform.