Release notes for IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data
Learn about features and changes that were included for each release and update of the product software.
This information applies only to instances of IBM Watson® Discovery that are installed on IBM Cloud Pak® for Data. For information about releases and updates for managed deployments, see Release notes for Watson Discovery for IBM Cloud.
For the list of Discovery known issues, see Limitations and known issues in Watson Discovery.
Knowledge Studio for IBM Cloud Pak for Data deprecation announcement
After version 4.7, the operator for IBM Knowledge Studio will no longer be supported and will be removed from the IBM Watson Discovery Cartridge for IBM Cloud Pak for Data and from github.com. The service will not be displayed in the Cloud Pak for Data catalog. This change will not impact existing deployments of the operator.
Migrate your solutions to Watson Discovery, which has powerful custom natural language processing capabilities. Any existing Watson Knowledge Studio for Cloud Pak for Data rules-based or machine learning models can be imported to Watson Discovery and applied to your data as custom enrichments. In addition, the recently released custom entities extraction feature brings to Watson Discovery the equivalent ability to label and train custom entity models. For more information about these features, see Choose enrichments.
For more information about migrating your solutions, see Migrating Knowledge Studio solutions.
4.8.7 release, 30 October 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.7 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
5.0.3 release, 27 September 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 5.0.3 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
4.8.6 release, 28 August 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.6 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
5.0.1 release, 31 July 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 5.0.1 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
5.0.0 release, 19 June 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 5.0.0 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.8.5 release, 24 April 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.5 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.8.4 release, 28 March 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.4 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.8.3 release, 28 February 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.3 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.8.2 release, 31 January 2024
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.2 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.8.0 release, 29 November 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.8.0 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.7.3 release, 27 September 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.7.3 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.7.1 release, 26 July 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.7.1 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following feature is generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
4.7.0 release, 28 June 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.7.0 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Optical Character Recognition v2
4.6.6 release, 18 May 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data was not refreshed as part of 4.6.6. You can use Discovery 4.6.5 with IBM Cloud Pak for Data 4.6.6.
4.6.5 release, 2 May 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6.5 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Manage the data in a collection from the new Manage data page
-
You can now access a Manage data page for a collection. From the new page, you can see a list of the documents in your collection and get a quick view of information about the documents. You can also delete documents from a collection with just a few clicks. For more information, see Excluding content from query results.
- You have more control over the data that is crawled by the database connector
-
When you connect to a database as an external data source, you can now specify the column from which to extract data. If you don't specify the column, a column with text or with a single large object is chosen to be crawled. You can also specify the MIME type of the data in the column that you want to crawl.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Optical Character Recognition v2
4.6.4 release, 29 March 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data was not refreshed as part of 4.6.4. You can use Discovery 4.6.3 with IBM Cloud Pak for Data 4.6.4 on Red Hat OpenShift Container Platform versions 4.10 or 4.12.
4.6.3 release, 23 February 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6.3 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Manage data page
- Important: Back up your data before upgrading to version 4.6.3
-
Before upgrading to version 4.6.3, you must make a backup of your data. Preserve the backup in a safe location. For more information about backing up your data, see Backing up and restoring data in IBM Cloud Pak for Data. That topic also includes information about restoring your data if that becomes necessary.
4.6.2 release, 30 January 2023
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6.2 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Manage data page
4.6.1 release, December 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data was not refreshed as part of 4.6.1. However, the product documentation was updated with fixes and enhancements.
4.6 release, 30 November 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.6 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Manage data page
4.5.3 release, 13 October 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.5.3 is available.
There are no new features in this release. For a list of bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Manage data page
- Advanced document view for search results
- The similar parameter of the Query method
- The smart_document_understanding field in the Get collection method response
15 August 2022
- SDKs were updated to reflect the latest API changes.
-
The following Discovery v2 API changes are now reflected in the SDKs:
-
Use the new document classifier API to get, add, update, or delete a document classifier.
-
A new document status API is available. You can use it to get a list of the documents in a collection and to get details about a single document.
-
You can now get, add, and remove a stop words or expansion list for a collection.
-
The suggested_refinements parameter of the Query method is deprecated.
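The stop words operations listed above can also be expressed as plain REST requests. The following is a minimal sketch, not the SDK's own implementation: it only builds the requests and does not send them. The endpoint path, the {"stopwords": [...]} body shape, and the 2020-08-30 version date are assumptions based on the v2 API as described elsewhere in these notes, and the function name and all parameter values are hypothetical. Check the API reference before relying on any of them.

```python
import json
from urllib.parse import quote
from urllib.request import Request


def stopwords_request(base_url, project_id, collection_id, token,
                      method="GET", stopwords=None, version="2020-08-30"):
    """Build (but do not send) a request against a collection's stop words list.

    GET reads the list, POST adds or replaces it, DELETE removes it.
    The path and body shape are assumptions; verify them in the API reference.
    """
    if method not in ("GET", "POST", "DELETE"):
        raise ValueError("method must be GET, POST, or DELETE")
    url = "{}/v2/projects/{}/collections/{}/stopwords?version={}".format(
        base_url.rstrip("/"), quote(project_id, safe=""),
        quote(collection_id, safe=""), version)
    headers = {"Authorization": "Bearer " + token}
    body = None
    if method == "POST":
        # Assumed request body shape: {"stopwords": ["word", ...]}
        body = json.dumps({"stopwords": list(stopwords or [])}).encode("utf-8")
        headers["Content-Type"] = "application/json"
    return Request(url, data=body, headers=headers, method=method)
```

Passing the returned object to urllib.request.urlopen would send the request. Keeping construction separate from sending makes the sketch easy to inspect without a running instance.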
4.5.1 release, 3 August 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.5.1 is available.
There are no new features in this release. For a list of bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Manage data page
- Advanced document view for search results
- The similar parameter of the Query method
- The smart_document_understanding field in the Get collection method response
4.5 release, 29 June 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.5 is available.
For a list of new features and bug fixes, see What's new and changed in Watson Discovery.
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Answer finding
- Manage data page
- Advanced document view for search results
4.0.9 release, 25 May 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.9 is available.
- API usage information is now available from the user interface
-
You can now get information about analyze API usage from the Data usage > API usage page in the product user interface. For more information about the analyze API, see Analyze API.
- A new document status API is supported in IBM Cloud Pak® for Data instances
-
Use the new document status API to programmatically get a list of the documents in a collection and to get details about a single document.
-
The API is supported for collections that are created after 23 March 2022.
If you want to get status information about a collection that was created earlier, trigger a process that runs the conversion step of ingestion on the documents. For example, from the Activity page for the collection, click Recrawl.
-
The API is not supported from the SDKs currently.
For more information about the new API, see the API reference documentation.
-
- Security vulnerabilities were addressed
-
The following security patches were applied:
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Node.js
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Xerces
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in OpenSSL
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Google Protocol Buffers
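Because the document status API introduced in this release is not yet available from the SDKs, one option is to call it over REST. The following is a minimal sketch under stated assumptions: it builds the GET request without sending it, the endpoint path and the version query parameter reflect the v2 API as generally documented, and the host, IDs, and token are hypothetical placeholders. Confirm the details in the API reference.

```python
from urllib.parse import quote, urlencode
from urllib.request import Request

API_VERSION = "2020-08-30"  # assumed v2 API version date


def document_status_request(base_url, project_id, collection_id, token,
                            document_id=None):
    """Build (but do not send) a GET request for the document status API.

    Without document_id the request lists the documents in a collection;
    with document_id it asks for the details of a single document.
    """
    path = "/v2/projects/{}/collections/{}/documents".format(
        quote(project_id, safe=""), quote(collection_id, safe=""))
    if document_id is not None:
        path += "/" + quote(document_id, safe="")
    url = base_url.rstrip("/") + path + "?" + urlencode({"version": API_VERSION})
    return Request(url, headers={"Authorization": "Bearer " + token},
                   method="GET")


# Hypothetical values for illustration only.
req = document_status_request(
    "https://cpd.example.com/discovery/instances/1234/api",
    "my-project-id", "my-collection-id", token="<access-token>")
print(req.full_url)
```

For collections created before 23 March 2022, the response is only expected to be useful after the conversion step has been rerun, for example by a recrawl, as noted above.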
4.0.8 release, 27 April 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.8 is available.
- The Development deployment type was changed to Starter
-
When you install Watson Discovery, you can optionally specify the type of deployment by including the deploymentType parameter in your custom resource. The Development option is now called the Starter option. The Development and Starter options are functionally the same, and both values are accepted by the service.
- Security vulnerabilities were addressed
-
The following security patches were applied:
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Google Protocol Buffers
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Node.js
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Java
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in PostgreSQL
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Kotlin
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache POI
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data is affected by a remote code execution in Spring Framework (CVE-2022-22965)
IBM Watson® Discovery for IBM Cloud Private (ICP) for Data 2.2.x End Of Support
Effective 30 April 2022, IBM will withdraw support for the following programs:
- IBM Watson Discovery for ICP for Data 2.2.x
- IBM Watson Discovery for ICP for Data Add-on 2.2.x
For more information, see announcement ENUS921-134.PDF.
4.0.7 release, 30 March 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.7 is available.
- IBM Cloud Block Storage is now supported
-
When you install Discovery, you can specify IBM Cloud Block Storage Gold tier (ibmc-block-gold) as your storage class. For more information about the storage class, see Storing data on classic IBM Cloud Block Storage.
- Security vulnerabilities were addressed
-
The following security patches were applied:
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in NumPy
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Spring
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in FasterXML jackson-databind
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in TensorFlow
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in XStream
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Go
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Home page updates
- Answer finding
- Manage data page
- Advanced document view for search results
30 March 2022
- A new document classifier API is available
-
Use the new document classifier to programmatically get, add, update, or delete a document classifier. The following notes apply to this release:
- The enrichments property of the Document Classifier object is documented as being optional. However, the property is required currently.
- The field property in the federated_classification object is documented as a string. However, it is currently an array.
For more information about the new API, see the API reference documentation. For more information about adding a document classifier by using the product user interface, see Using the Content Mining application.
The document classifier endpoints are not supported in the SDKs currently.
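The two notes about the document classifier API can be turned into a quick pre-flight check on a classifier definition before you submit it. This is a minimal sketch, not part of the API or the SDKs: the helper name is hypothetical, and the sample keys other than enrichments and federated_classification.field (for example answer_field and enrichment_id) are assumptions with placeholder values.

```python
def check_classifier_definition(classifier):
    """Return a list of problems, based on the release notes for this API.

    - `enrichments` must be present, even though it is documented as optional.
    - `federated_classification.field`, when present, must be a list
      rather than a single string.
    """
    problems = []
    if "enrichments" not in classifier:
        problems.append("enrichments is required")
    federated = classifier.get("federated_classification")
    if federated is not None and not isinstance(federated.get("field"), list):
        problems.append("federated_classification.field must be an array")
    return problems


# Hypothetical classifier definition; every value is a placeholder.
classifier = {
    "name": "ticket-classifier",
    "answer_field": "label",
    "enrichments": [{"enrichment_id": "<enrichment-id>", "fields": ["text"]}],
    "federated_classification": {"field": ["source"]},
}
print(check_classifier_definition(classifier))  # prints []
```

A definition that omits enrichments, or that passes field as a plain string, produces one message per problem.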
4.0.6 release, 1 March 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.6 is available.
- Multitenancy is now supported
-
An administrator can now create up to 10 instances of the Discovery service per deployment, which means that more teams can work on discrete Discovery projects at the same time.
- Simpler installation and management of custom connectors
-
The manage_custom_crawler.sh script was improved to make it easier for you to install and manage your custom connectors in a multitenant environment. For more information, see Installing a custom crawler.
- Security vulnerabilities were addressed
-
The following security patches were applied:
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Java
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Logback
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Log4j
- Features that are not available in this release
-
The following features are generally available from managed IBM Cloud deployments at the time of this release, but not from installed deployments:
- Home page updates
- Answer finding
- Access to guided tours from the page header
4.0.5 release, 26 January 2022
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.5 is available.
- A security vulnerability was addressed
-
The following security patch was applied: Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Log4j
- Features that are not available in this release
-
The following features are generally available from IBM Cloud deployments at the time of this release, but not from installed deployments:
- Home page updates
- Answer finding
- Guided tours
4.0.4 release, 20 December 2021
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.4 is available.
- Guided tours are available
-
Access guided tours from anywhere in the product user interface by clicking the Guided tours button in the page header.
- Security vulnerabilities were addressed
-
The following security patches were applied:
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in LibTIFF
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in TensorFlow
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Node.js
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Netty
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Log4j
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Log4j 1.2
- Features that are not available in this release
-
The following features are generally available from IBM Cloud deployments at the time of this release, but not from installed deployments:
- Home page updates
- Answer finding
4.0.3 release, 30 November 2021
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.3 is available.
- Another storage option is supported
-
IBM Spectrum Scale Container Native storage is now supported in addition to Red Hat OpenShift Container Storage and Portworx.
- Microsoft SharePoint Online data source improvement
-
The SharePoint Online data source now supports crawling your data as a service principal, which means you can access your data without disabling multifactor authentication. For more information, see Microsoft SharePoint Online.
- Microsoft Windows File System improvements
-
Extra configuration options mean you can specify the following information:
- The types of files (by file extension) to include or exclude from a crawl of a Windows directory.
- The character encoding of the data to be crawled. Typically, the encoding is detected automatically. However, you can choose to specify the character encoding as a Java character set yourself.
For more information, see Windows File System.
- Field selection is improved
-
When you apply an enrichment to a field or choose a field to use as the source for a facet, the list of fields that is displayed for you to choose from now shows only fields that are valid choices.
- Search settings change
-
The spelling correction setting changed from being enabled automatically in new projects to being disabled by default. If you want to alert users when they misspell a term in their query, turn on Spelling suggestions. For more information, see Customizing the search bar.
- A Salesforce crawling issue was fixed
-
Previously, Discovery had an issue where it timed out before it crawled some of the object types in a Salesforce collection. If your collection is configured to crawl the following object types, run a full data source crawl to make sure that your collection contains the most up-to-date data from all of the objects in your Salesforce data source:
- Attachment
- ContentVersion
- Document
- Security vulnerabilities were addressed
-
The following security patches were applied:
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Node.js
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Axios
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Python Pillow
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Apache Commons Compress
- Security Bulletin: IBM Watson Discovery for IBM Cloud Pak for Data affected by vulnerability in Java
- Features that are not available in this release
-
The following features are available from IBM Cloud deployments at the time of this release, but not from installed deployments:
- Home page updates
- Guided tours
- Answer finding
4.0.2 release, 5 October 2021
IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.2 is available.
- Support for newer platform software
- IBM Cloud Pak® for Data 4.0.2 can be installed on Red Hat® OpenShift® on IBM Cloud® 4.8.
- New scoring for NLU enrichments
- Relevance and confidence scores are displayed for NLU enrichments that are returned by search. For example, when you open the JSON view of the document preview from a query result, you can see confidence scores for Entities mentions and relevance scores for Keyword mentions.
- Improved Web crawl
- The Web crawl data source supports more customization options, including the ability to ignore a site's robots.txt file. For more information, see Web crawl.
- New upgrade support
- The 4.0.2 release supports in-place upgrade from IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data 4.0.0. For more information, see Upgrading Watson Discovery to a newer 4.0 refresh.
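To work with the NLU enrichment scores described for this release outside the product user interface, you can read them from the JSON of a query result. The following is a minimal sketch. The sample document is hypothetical, and the layout it assumes (an enriched_text array, confidence on entity mentions, relevance on each keyword) is an assumption, so compare it with the JSON view of your own results before reusing it.

```python
import json

# Hypothetical excerpt of a query result document; real field layouts
# may differ, so inspect the JSON view of your own results.
SAMPLE = json.loads("""
{
  "enriched_text": [{
    "entities": [{"text": "IBM", "type": "Organization",
                  "mentions": [{"text": "IBM", "confidence": 0.97}]}],
    "keywords": [{"text": "release notes", "relevance": 0.81,
                  "mentions": [{"text": "release notes"}]}]
  }]
}
""")


def collect_scores(document, field="enriched_text"):
    """Return (entity mention confidences, keyword relevances) for one field."""
    confidences, relevances = [], []
    for enriched in document.get(field, []):
        for entity in enriched.get("entities", []):
            for mention in entity.get("mentions", []):
                if "confidence" in mention:
                    confidences.append((mention["text"], mention["confidence"]))
        for keyword in enriched.get("keywords", []):
            if "relevance" in keyword:
                relevances.append((keyword["text"], keyword["relevance"]))
    return confidences, relevances


print(collect_scores(SAMPLE))
```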
IBM Cloud Private End Of Support
Effective 30 September 2021, IBM withdrew support for the following programs:
- IBM Watson Assistant Discovery Extension for IBM Cloud Private 2.1.0–2.1.4
- IBM Watson Discovery for ICP for Data 2.1.0–2.1.4
- IBM Watson Discovery for ICP for Data Add-on 2.1.0–2.1.4
For more information, see announcements ENUS921-005.PDF and ENUSLP21-0099.PDF.
4 release, 13 July 2021
- New version now available
-
Discovery for Cloud Pak for Data 4 is available.
-
This release is supported on IBM Cloud Pak® for Data 4.0.0.
- Change to service name
-
The new name is IBM Watson® Discovery Cartridge for IBM Cloud Pak® for Data.
- New Smart Document Understanding (SDU) predefined model
-
When you identify fields, instead of annotating documents with the SDU tool, you can choose to use a pretrained model. The pretrained model is not customizable; it automatically extracts text and identifies tables, lists, and sections.
- Improved contract analysis
-
To enable the Contracts enrichment that recognizes and tags contract-related concepts in your data, you can choose to create a Document Retrieval project type, and then select Apply contracts enrichment. You no longer need to use an installation override YAML file to enable it. This change also means that you can choose which Document Retrieval projects use the Contracts enrichment; it is not applied to all Document Retrieval projects automatically.
- New LDAP directory data source
-
Connect to data that is stored in an external directory that supports the Lightweight Directory Access Protocol (LDAP), such as a corporate email directory. As the directory data is added to your collection, Discovery interprets and stores key attributes of each record, such as department and location information. Later, you can find relevant records by filtering on these attribute categories. For more information, see LDAP directory.
- Improved SharePoint OnPrem connection process
-
The steps you follow to connect to a SharePoint instance that is hosted on-premises were simplified. You no longer need to deploy a web services package on the SharePoint server before you can connect to the SharePoint OnPrem data source. For more information, see SharePoint OnPrem.
- New Salesforce proxy support
-
You can now connect to a Salesforce data source when using a proxy server. For more information, see Salesforce.
- Custom connector improvements
-
Support was added for optical character recognition (OCR).
-
Support was added for document-level security.
For more information about the custom connector, see Building a Cloud Pak for Data custom connector.
- Change to Dynamic Faceted Search
-
Support for Dynamic Faceted Search and its associated suggested_refinements API query parameter was removed.
2.2.1 release, 26 February 2021
- New release now available
- IBM Watson™ Discovery for IBM Cloud Pak for Data version 2.2.1 is available.
- Support for upgrade
- Discovery for Cloud Pak for Data supports an in-place upgrade from version 2.2.0 to 2.2.1 so that you do not need to manually uninstall an earlier version and then install the latest version of the service. For more information, see Upgrading Discovery for Cloud Pak for Data.
- New SDK download support
- You can now download the custom connector SDK package from your Discovery for Cloud Pak for Data cluster, instead of retrieving the images and the SDK package from the Docker registry. For more information, see Downloading the custom-crawler-docs.zip file in Discovery 2.2.1 and later.
- Change to Invoices and Purchase orders
- Invoices and Purchase orders models can no longer be enabled in the tooling. If you need these models, please contact IBM Cloud Support to obtain instructions for enabling these models.
- Change to Contracts enrichment tables
- In a Document Retrieval project that has the Contracts enrichment applied, tables are not included inside the contracts field, as they were previously in projects that had the Contracts enrichment enabled. Tables will continue to be included in a separate tables field when the Table Understanding enrichment is applied.
- Change to support for Oracle Database 11g and Postgres 9.5
- Support for connecting to Oracle Database 11g was removed because the vendor ended version support on 31 December 2020.
- Support for connecting to Postgres 9.5 was removed because the vendor ended version support on 11 February 2021.
2.2.0 release, 8 December 2020
- New release now available
- IBM Watson™ Discovery for IBM Cloud Pak for Data version 2.2 is available.
- Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 3.5.
- New support for Notes attachments
- Added support for attachments in the Notes data source. For more information, see Notes.
- New web crawl scheduling option
- You can specify the exact time that you would like your crawls to run for any data source, giving you the flexibility to run them at the times you prefer. For more information, see Configuring Cloud Pak for Data data sources.
- New Facet creation in Content Miner
- You can now create Facet groups in a Content Miner application.
- New custom crawler creation
- Added the option to create your own custom crawler plug-in. For more information, see Building a Cloud Pak for Data crawler plug-in.
Note: Any custom code used with Watson Discovery is the responsibility of the developer and is not covered by IBM support.
- Change to Dynamic Facets
- Dynamic Facets are no longer enabled by default in Document Retrieval projects.
2.1.4 release, 2 September 2020
- New release now available
- IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.4 is available.
- New Notes connector
- Crawl Notes version 9.0.1 systems. For more information, see Notes connector.
- New Enable proxy settings in multiple connectors
- You can now select the option to enable proxy settings in Box, Microsoft SharePoint Online, and Microsoft SharePoint OnPrem connectors.
- New options for Database connector
- Added support for multiple tables and the Row filter option to the Database connector.
- New authentication types for Web crawler
- You can select from three new authentication types in Web crawler: Basic authentication, NTLM authentication, and FORM authentication.
- New Analyze API usage monitoring
- You can now monitor the usage of the Analyze API using the tooling. For more information, see Monitoring usage.
30 August 2020
- Update to API version
- The current API version (v2) is now 2020-08-30. The following change was made with this version:
- Change to 'options' object
- The List enrichments method no longer returns the options object per enrichment. Use the Get enrichment method to return the options object for a single enrichment.
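Code that relied on List enrichments to return the options object now needs one Get enrichment call per enrichment. The following is a minimal sketch of that pattern, assuming you wrap your own client calls in two callables. The helper name, the stub data, and the keys inside options are hypothetical, and the response shapes are assumptions to verify against the API reference.

```python
def options_by_enrichment(list_enrichments, get_enrichment):
    """Collect the `options` object for every enrichment in a project.

    list_enrichments() -> dict with an "enrichments" list (no `options`)
    get_enrichment(enrichment_id) -> dict for one enrichment (with `options`)
    Both callables are stand-ins for whatever client you use.
    """
    result = {}
    for summary in list_enrichments().get("enrichments", []):
        enrichment_id = summary["enrichment_id"]
        detail = get_enrichment(enrichment_id)
        result[enrichment_id] = detail.get("options", {})
    return result


# Stubbed responses so the sketch runs without a Discovery instance.
_FAKE = {
    "e1": {"enrichment_id": "e1", "name": "Dictionary",
           "options": {"entity_type": "product"}},
    "e2": {"enrichment_id": "e2", "name": "Regex",
           "options": {"regular_expression": "[0-9]+"}},
}


def fake_list():
    return {"enrichments": [{"enrichment_id": key, "name": value["name"]}
                            for key, value in _FAKE.items()]}


def fake_get(enrichment_id):
    return _FAKE[enrichment_id]


print(options_by_enrichment(fake_list, fake_get))
```

Because this makes one extra request per enrichment, consider caching the result if you list enrichments often.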
2.1.3 release, 19 June 2020
- New release now available
- IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.3 is available.
- Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 3.0.1.
- New Finnish and Hebrew language support
- Added basic support for Finnish and Hebrew. For more information, see Language support.
- Change to Analyze endpoint
- The Analyze endpoint supports stateless document ingestion workflows. For details, see the Analyze API. The Analyze API supports JSON documents only. Use of the Analyze API affects license usage.
- New options for Content Miner
- The content mining application includes two new options: Cyclic time scale on the Time series dashboard, and the Contextual view tab.
- New shortcut for Content Mining projects
- For Content Mining projects only, the Improve and customize page includes a shortcut: the Launch application button. Previously, you were required to open the Integrate and deploy page, select the Launch application tab, and click the Launch button.
- Improved segment limit
- The segment limit when splitting documents has been increased to 1,000. For details, see Split documents to make query results more succinct.
- Improved FileNet connector
- The FileNet connector now supports document-level security.
- New beta Curations feature
- You can specify up to 1,000 curations. For details about this beta feature, see Curations.
- Fixed defects in the 2.1.3 release
- In versions 2.1.2, 2.1.1, and 2.1.0, PNG, TIFF, and JPG individual image files are not scanned, and no text is extracted from those files. PNG, TIFF, and JPEG images embedded in PDF, Word, PowerPoint, and Excel files are also not scanned, and no text is extracted from those image files.
2.1.2 release, 31 March 2020
- New release now available
- IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.2 is available.
- New IBM FileNet connector
- You can now crawl IBM FileNet systems. For more information, see FileNet connector.
- New Swedish, Norwegian, and Danish language support
- Added basic support for Swedish, Norwegian (Bokmål and Nynorsk), and Danish. For more information, see Language support.
- Change to Advanced rules models enrichment
- The Advanced rules models enrichment is now GA.
- New document preview for search results
- You can now view your search results in a document preview for the following source documents: PDF, Word, PowerPoint, Excel, and all image files. See supported file types for the list of image files. This view makes it easier for you to see search results as highlighted passages within the text of the original document, making the context clearer.
- New proxy support for Web Crawl
- Proxy support was added to the Web Crawl connector.
- Change to empty aggregations parameter
- Running a query with an empty 'aggregations' parameter returns zero aggregations in the response.
- Change to Postgres support
- Support for connecting to Postgres 9.4 was removed because the vendor ended support for that version on 13 February 2020.
- Fixed the following defects in the 2.1.2 release
- When installing Discovery for Cloud Pak for Data on OpenShift, the 'ranker-rest' service might intermittently fail to start up, due to an incompatible jar in the 'classpath'.
- When you upload documents to a collection with existing documents, a 'Documents uploaded!' message displays on the Activity page, but no further processing status displays until the number of documents increases.
- Running a query with an empty 'aggregations' parameter returns an empty aggregations array.
- Deprovisioning an IBM Watson® Discovery for IBM Cloud Pak® for Data instance will not delete the underlying data. Delete the collections and documents manually.
2.1.1 release, 24 January 2020
- New release now available
- IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.1 is available.
- Fixed the following defects in the 2.1.1 release:
- In Document Retrieval project types, when you perform an empty search, and the search results source is set to 'passages', the query results will display 'excerpt unavailable' in the Project workspace.
- When visiting the Storybook links on the Integrate and deploy page, the links do not go to the correct location. Please visit Storybook instead to view documentation.
- If you are using Smart Document Understanding, two variables no longer need to be set during installation or reinstallation. For more information, see Environment variable settings for Smart Document Understanding.
- Discovery for Content Intelligence and Table Understanding enrichments are configured out of the box to be applied on a field named 'html'. When a user uploads a JSON document without a root-level field named 'html', these enrichments will not yield results in the index. To run the enrichments on this kind of JSON document, users must reconfigure the enrichments to run on an existing field (or fields) in the JSON document.
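The root-level 'html' condition described above can be checked before upload. The following Python sketch is an illustration only and is not part of the product; the function name is hypothetical. It reports whether a JSON document lacks a root-level field named 'html', which is the case in which the enrichments would need to be reconfigured to run on another field.

```python
import json


def missing_root_html(document_text, field="html"):
    """Return True when a JSON document has no root-level field with the given name.

    A field with that name nested inside another object does not count,
    because the note above refers to a root-level field.
    """
    document = json.loads(document_text)
    return not (isinstance(document, dict) and field in document)
```

For example, a document such as {"data": {"html": "..."}} is reported as missing the field, because 'html' is nested rather than at the root.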
2.1.0 release, 27 November 2019
- New release now available
- IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.1.0 is available.
- Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 2.5.0.0.
- New Project-based interface
- Test your application like an end-user would with the Document retrieval, Conversational Search, and Content Mining project types. For more information, see Creating projects.
- New Content Mining app
- Build an end user interface for extracting insights proactively from your entire corpus. For more information, see Analyzing your data with the Content Mining application.
- New Content Intelligence add-on
- Option to enrich your documents with pre-built domain knowledge for Contracts. For more information, see Document Retrieval for Contracts.
- New reusable components
- Use reusable components to quickly build your application with Discovery. Autocomplete, rich preview, results, and facets components are included. For more information, see Building and deploying components.
- New Czech, Polish, Romanian, Russian, and Slovak language support
- Basic support for Czech, Slovak, Russian, Polish and Romanian is added. For more information, see Language support.
- New built-in table understanding
- Extract tables from your documents without training, and optionally return tables as answers to natural language queries. For more information, see Understanding tables.
- New SDK connector
- Build custom connectors your Discovery users can use to build their own applications. For more information, see Building and implementing a custom connector.
- New pre-built sample project
- The sample project is preloaded with data, so you can learn about Discovery. For more information, see Getting started with Watson Discovery.
- New passage retrieval
- Returns the most relevant passages from your documents. You can also specify the number of passages returned per document. See Passages.
- New project-level querying and relevancy training
- Query multiple collections at once, with relevancy training applied at the project level.
- Improved Web crawl connector
- Additional options are now available for the Web crawl connector. For more information, see Web crawl.
- New Local File System connector
- Crawl Linux or other file systems. For more information, see Local file system.
- New dynamic Facets
- Automatically generate facets based on the understanding of your data. For more information, see Facets.
- New Dictionary suggestions
- Dictionary terms are suggested based on your content. For more information, see Dictionary.
- New beta Curations
- Specify a particular result for a given query. For more information, see the API reference.
2.0.1 release, 30 August 2019
- New release now available
- IBM Watson® Discovery for IBM Cloud Pak® for Data version 2.0.1 is available.
- Discovery for Cloud Pak for Data now works with IBM Cloud Pak® for Data 2.1.0.1.
- New Windows File System and Database connectors
- Added the Windows File System and Database connectors. For more information, see Database connector and Windows File System connector.
- New Chinese language support
- Added support for Traditional Chinese. For more information, see Language support.
- New FISMA support
- Federal Information Security Management Act (FISMA) support is available for IBM Watson® Discovery for IBM Cloud Pak® for Data offerings purchased on or after August 30, 2019. FISMA support is also available to those who purchased the June 28, 2019 version and upgrade to the August 30, 2019 version. IBM Watson® Discovery for IBM Cloud Pak® for Data is FISMA High Ready.
- New Classifier enrichment
- Released the Classifier enrichment. For more information, see Classifier.
- New Red Hat OpenShift support
- Added support for installing IBM Cloud Pak® for Data on Red Hat OpenShift.
- Fixed the following defects in Discovery for Cloud Pak for Data offerings purchased on or after August 30, 2019
- During an active web crawl, if you add an enrichment, then click the Recrawl collection button on the Activity page, the collection will stop processing. If the collection does not return to a Syncing state on its own, clicking the Recrawl collection button an additional time might be required.
- While training a collection in the tooling, if you rate the relevancy of a result (for example, as 'Relevant'), then switch to the opposite rating ('Not relevant'), the page may go blank. To restore the page, refresh the browser. Your updated rating will be retained.
- Chinese, Japanese, and Korean language Microsoft Word, Excel, and PowerPoint documents will not display correctly in the index or the Smart Document Understanding editor.
- If you upload a zip, gzip, or tar file to your collection, and that file contains multiple files/file types supported by Smart Document Understanding (PDF, Word, Excel, PowerPoint, PNG, TIFF, JPEG), only one of the files in that zip, gzip, or tar file will be available for training in the SDU editor (unless the SDU document limit has already been met). All of the documents will be available in the index. Unzip the file before uploading to avoid this issue.
- Query expansion and autocomplete return the wrong error code when the 'collection_id' is invalid. Query expansion will return a 500 error code instead of a 404. Autocomplete will return a 400 when the 'collection_id' is invalid and the 'prefix' parameter isn’t set. It should also return a 404.
- When crawling Microsoft SharePoint 2019 collections, only HTML documents will be crawled and indexed. This is a SharePoint issue with how it processes mime-types. See this Microsoft blog post for a workaround.
- If you delete an installation of the Discovery for Cloud Pak for Data add-on, the instance will not uninstall completely and your re-installation will fail. See the Discovery for Cloud Pak for Data Readme for post-cleanup steps.
- If a JSON document that contains nested JSON objects is ingested, the nested JSON will be indexed as a JSON string.
2.0.0, General Availability (GA) release, 28 June 2019
- Discovery for Cloud Pak for Data now available
- The IBM Watson® Discovery for IBM Cloud Pak® for Data service brings the cognitive capabilities of IBM Watson® Discovery to the IBM Cloud Pak® for Data platform.