IBM Cloud Docs
Creating projects

Creating projects

A project is a convenient way to collect and manage the resources in your IBM Watson® Discovery application. You can assign a Project type and connect your data to the project by creating a collection.

Before you create a project, decide which project type best fits your needs.

Project descriptions

Project type use cases
Need Goal Project type
IBM Cloud I want to extract data to support automation of repetitive document processing tasks. I want to understand quickly what data is extracted from my documents and improve the data by applying enrichments. Intelligent Document Processing
Which document contains the answer to my question? Find meaningful information in sources that contain a mix of structured and unstructured data, and surface it in a stand-alone enterprise search application or in the search field of a business application. Document Retrieval
Where is the part of the contract that I need for my task? Quickly extract critical information from contracts. Document Retrieval for Contracts
I want the chatbot I'm building to use knowledge that I own. Give a virtual assistant quick access to technical information that is stored in various external data sources and document formats to answer customer questions. Conversational Search
I want to uncover insights I didn't know to ask about. Gain insights from pattern analysis or perform root cause analysis. Content Mining

If you created the Discovery service as part of a IBM Cloud Pak for Data as a Service deployment, the Discovery project is separate and distinct from the deployment project that is displayed in IBM Cloud.

To create a project, complete the following steps:

  1. Open the Projects page by selecting My Projects.

  2. Click New project. Name your project, and then choose the project type.

    For more information about each type, see Project types.

  3. If you choose a Document Retrieval project type and your data sources are in English, decide whether to enable the Content Intelligence feature.

    If your data source contains contracts, enable the feature by selecting Apply contracts enrichment. Scroll to see the checkbox, if necessary.

  4. Click Next.

  5. Choose and configure a data source.

    For more information about supported data sources, see Creating collections.

Take advantage of the following resources that are available from the page header:

  • To open the product documentation, click the Help icon Help icon.
  • To see all of your projects, click My projects.

Project types

Choose a project type to get the correct set of enrichments applied to your documents automatically. The improvement tools that are available differ by project type, as do the deployment methods, which are optimized for each use case.

The following project types are available:

Intelligent Document Processing IBM Cloud

Use this project type to understand quickly what data Discovery has extracted from your documents. You can view the extracted data in a rich document preview (default view is PDF). If the extracted data does not meet your requirements, you can apply enrichments to improve the data.

This project type is available from IBM Cloud-managed instances only.

Documents that you add to a project of this type are automatically enriched in the following ways:

  • Entities, such as proper nouns, are identified and tagged.

Document Retrieval

Use this project type to search and find the most relevant answers from your data. Projects of this type are typically deployed as search field components that are added to websites or other applications.

Documents that you add to a project of this type are automatically enriched in the following ways:

  • Entities, such as proper nouns, are identified and tagged.

This tagged information is used later when a natural language phrase is submitted as a search query to return a smarter response.

A sample Document Retrieval project is available for you to explore. For more information, see Getting started with Watson Discovery.

Document Retrieval for Contracts

If you are working with English-language legal contracts, enable the Content Intelligence feature to apply a contracts enrichment that can recognize and tag contract-related concepts in your data. Use this project type to automate complex business processes, such as contract review and negotiation. This project type can help to increase productivity, minimize costs, and reduce your legal exposure.

Only users of installed deployments (IBM Cloud Pak for Data) or Premium or Enterprise plan managed deployments can create this type of project.

In addition to the enrichments that are applied to a typical document retrieval project, the following enrichments are made automatically:

  • Content from tables in the source document is tagged so that it can be found later.
  • Contract details, such as payment terms or parties that are involved in the contract, are identified and tagged.

For any collection that you add to the project, optical character recognition (OCR) is enabled automatically so that text from scanned documents or other images is processed.

When you apply the contracts enrichment, you cannot use Smart Document Understanding to annotate documents. A pretrained SDU model that can recognize contract-related information is applied automatically. The Table understanding enrichment is automatically applied.

For more information, see Understanding contracts.

Conversational Search

The Conversational Search project returns information from a connected data collection as answers to questions that customers ask a chatbot, which is also known as an assistant.

Use IBM® watsonx™ Assistant and Discovery together to give your assistant access to technical content and other knowledge base resources without having to relocate or copy your corporate data. The built-in synchronization capabilities mean that your assistant can share the most up-to-date information available. Use the integrations that are provided with watsonx Assistant to deploy an assistant that connects to this project to various platforms, including your company website, in minutes.

The documents that you add to this type of project are not enriched automatically.

If you need to perform more complex searches from your virtual assistant, you might want to create a Document Retrieval project instead of Conversational Search project. For more information, see Choosing the right project type for a chatbot.

IBM Cloud Another feature to consider enabling is the Emphasize the answer feature. When enabled, the answers that are returned to customers who interact with the assistant show the exact answer highlighted in bold font within the search response. For more information about how the exact answer is determined, see Answer finding.

For more information about building a watsonx Assistant search skill, see the appropriate documentation for your deployment:

Content Mining

Use this project type to discover hidden insights, trends, and relationships in your data.

Only users of installed deployments (IBM Cloud Pak for Data) or Premium or Enterprise plan managed deployments can create this type of project.

This project type is especially useful for analyzing structured data, such as data that you add by uploading a CSV file or by connecting to a database data source. You can add only one collection to a project of this type from the Discovery user interface.

Documents that you add as part of the initial collection are automatically enriched in the following way:

  • Parts of speech are identified and tagged.

After you add a collection and optionally apply more enrichments to the data, a full-featured application is available for you to deploy. You can use the application to research your data in depth. For more information about using the application, see Analyzing your data with the deployed Content Mining application.

From the Content Mining application, you can create the following enrichment types which are not available in other project types:

You can create a collection from the deployed Content Mining application. The collection that you create is not added to your existing Content Mining project. A new Content Mining project is created to store the collection. The collection can contain an uploaded CSV file only. The project that is generated is given the name that you specify for the collection.

Because the data that you add to this type of project is often structured, consider using the API to submit queries in the Discovery Query Language (DQL). With DQL queries, you can get information from specific fields or find specific enrichment type mentions. You cannot apply relevancy training to a Content Mining project.

Custom

Choose this type if you prefer not to use one of the other project types. No enrichments are applied automatically, so you can add only those enrichments that are necessary for your use case.

Basic project defaults

Some enrichments and query result settings are applied to each project type by default.

Basic project defaults
Project type Default enrichments Default query result settings
IBM Cloud Intelligent Document Processing Entities Facets (by Entity), Passages
Document Retrieval Entities Facets (by Entity), Passages
Document Retrieval for Contracts Entities, Table Understanding, and Contracts Facets (by Category, Nature, Contract Term, Contract Payment Term, Contract Type, Contract Currency, Invoice Buyer, Invoice supplier, Invoice Currency, Purchase Order Buyer, Purchase Order Supplier, Purchase Order Payment Term) and Table Retrieval
Conversational Search None Passages
Content Mining Part of Speech None
Custom None Passages

Project limits

The number of projects you can create depends on your Discovery plan type.

Plan details
Plan Projects per service instance
Cloud Pak for Data Unlimited
Premium 100
Enterprise 100
Plus (includes Trial) 20

The Sample project is excluded from the total number of projects.

Renaming a project

You cannot rename the Sample Project.

To rename a project after you create it, complete the following steps:

  1. Go to the My Projects page.
  2. Find the project that you want to rename, click the Project actions icon Overflow menu, and then choose Rename.
  3. Edit the project name, and then click Apply.

Deleting a project

If you want to delete a project, but keep a collection from the project, share the collection with another project before you complete these steps. From another project (a type that allows multiple collections), open the Manage collections tab. Click New collection, and then click Reuse data from an existing collection. Select the collection that you want to keep, and then click Finish.

You cannot delete the Sample Project.

To delete a project, complete the following steps:

  1. Go to the My Projects page.
  2. Find the project that you want to delete, click the Project actions icon Overflow menu, and then choose Delete.
  3. Click Delete.