Getting the most from Discovery

Discovery was redesigned to introduce new features and a simpler way to build solutions.

The redesigned product is referred to as Discovery v2. When you create an instance on IBM Cloud, or install and provision an instance on IBM Cloud Pak for Data, you get Discovery v2.

Advantages of using the latest version

Discovery v2 offers the following features and enhancements:

  • A project-based experience that supports many different use cases within a single environment.
  • Built-in customization tools for adding dictionaries, patterns, and classifiers to help business users build projects that understand the language of their domain.
  • Connectors to popular data sources that can quickly access valuable data where it resides.
  • Smart Document Understanding that learns from the structure of human-readable documents, such as PDFs.
  • Natural language query support across all document types, optimized with machine learning to find targeted answers.
  • Advanced search capabilities, such as answer finding, curations, and table retrieval.
  • An out-of-the-box contract understanding function that helps you search and interpret legal contracts.
  • A full-featured Content Mining application that you can use to conduct in-depth analysis of unstructured text.
  • Customizable user interface components that help you to deploy custom applications.
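Several of the search capabilities in the list above (natural language queries, passage retrieval with answer finding, and table retrieval) are requested through parameters on a v2 project query. The following is a minimal sketch of how such a request might be assembled, not a definitive client. It only builds the URL and JSON body and sends nothing. The service URL, project ID, and question are placeholders, the helper name `build_query_request` is hypothetical, and the version date is an assumption that should be checked against the API reference for your instance.

```python
import json
from urllib.parse import quote, urlencode


def build_query_request(service_url, project_id, question,
                        version="2023-03-31", count=5):
    """Return (url, body) for a Discovery v2 project query. Nothing is sent."""
    # Assumed endpoint shape: {service_url}/v2/projects/{project_id}/query
    url = "{}/v2/projects/{}/query?{}".format(
        service_url.rstrip("/"),
        quote(project_id, safe=""),
        urlencode({"version": version}),
    )
    body = {
        # Plain-language question instead of a structured query.
        "natural_language_query": question,
        # Maximum number of documents to return.
        "count": count,
        # Passage retrieval, with answer finding inside each passage.
        "passages": {"enabled": True, "find_answers": True},
        # Table retrieval.
        "table_results": {"enabled": True},
    }
    return url, json.dumps(body)


url, body = build_query_request(
    "https://example.invalid/instances/my-instance",  # placeholder URL
    "my-project-id",                                   # placeholder ID
    "What is the termination notice period?",          # placeholder question
)
print(url)
print(body)
```

To send the request, you would POST the body to the URL with a `Content-Type: application/json` header and the credentials for your deployment. Keeping the request construction separate from the network call makes it easy to inspect the parameters before any query runs.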

For more information, see Migrating to Discovery v2.

Comparing v1 and v2 features

If you are already familiar with Discovery v1, review how Discovery v2 compares with it.

Discovery v2 has new features that were previously unavailable. The following table describes feature support in both versions.

Feature support details
This table has row and column headers. The row headers identify features. The column headers identify the different versions of the product. To understand which features are supported by a product version, go to the row that describes the feature, and find the column for the product version that you are interested in.
Feature Product redesign (v2) Earlier version (v1)
Use projects to organize your work checkmark icon
Use Smart Document Understanding (SDU) to annotate your documents checkmark icon checkmark icon
Leverage intuitive user interface tools to add domain-specific artifacts, such as dictionaries and custom machine learning models checkmark icon
Create a content mining project type and then use the built-in Content Mining application to do in-depth data analysis (IBM Cloud Pak for Data, Enterprise, and Premium plans only) checkmark icon
Perform real-time NLP with the Analyze API (IBM Cloud Pak for Data and Enterprise plans only) checkmark icon
Apply a pretrained Smart Document Understanding model to your collection for similar benefits with less effort checkmark icon
Process text from scanned documents or other images checkmark icon checkmark icon
Extract meaning from tables checkmark icon
Get insights from contracts (IBM Cloud Pak for Data, Enterprise, and Premium plans only) checkmark icon
Apply the Part of Speech enrichment to your data checkmark icon
Use the Entity Extraction, Document and Phrase Sentiment Analysis, and Keyword Extraction enrichments checkmark icon checkmark icon
Use the Category Classification, Concept Tagging, Relation Extraction, Emotion Analysis, Semantic Role Extraction, and Sentiment of Keywords and Entities enrichments, which are available with the Natural Language Understanding service checkmark icon
Build a custom entity type system checkmark icon
Apply Watson Knowledge Studio NLP models to your data checkmark icon checkmark icon
Support for more connectors from an IBM Cloud Pak for Data deployment, including databases, file systems, FileNet P8, and HCL Notes checkmark icon
Some connectors support document-level security from an IBM Cloud Pak for Data deployment checkmark icon
Programmatically configure external data source crawls checkmark icon
Configure the normalization processes of document segmentation and HTML file inclusion or exclusion rules during ingestion checkmark icon
Configure the JSON normalization process during ingestion and after enrichment checkmark icon checkmark icon
Configure dictionary tokenization checkmark icon
Advanced question-answering capabilities, such as returning the exact answer checkmark icon
Discovery Query Language (DQL) API support checkmark icon checkmark icon
Retrieve passages from documents checkmark icon checkmark icon
Perform relevancy training to improve query results checkmark icon checkmark icon
Configure continuous relevancy training checkmark icon
Retrieve tables checkmark icon
Query result deduplication checkmark icon
Identify document similarity in query results checkmark icon checkmark icon
Indicate a preference (bias) in queries checkmark icon
Review query logging and metrics checkmark icon