Release notes for Natural Language Understanding

The following new features and changes to the service are available.

Service API versioning

Current API version: 2022-08-10

API requests require a version parameter that takes the date in the format version=YYYY-MM-DD. Send the version parameter with every API request.

When we change the API in a backwards-incompatible way, we release a new minor version. To take advantage of the changes in a new version, change the value of the version parameter to the new date. If you're not ready to update to that version, don't change your version date.

Active version dates

The following table shows the service behavior changes for each version date. Switching to a later version date will activate all changes introduced in earlier versions.

Version date	Changes summary
`2022-08-10`	Language expansion of entities with a new model for improved accuracy and confidence scores. Version 2 Russian entity type system. Version 2 Swedish entity type system.
`2022-04-07`	Bug fix for Version 2 Categories type system.
`2021-08-15`	Classifications GA and Categories type system Version 2 features.
`2021-03-25`	Custom categories (beta) and custom classifications (beta) features.
`2020-12-09`	Version 2 English entity type system.
`2020-12-02`	Version 2 Korean entity type system. Version 2 Spanish entity type system.
`2020-08-01`	Taxonomy changes aimed towards standardization of the label names in the default taxonomy.
`2019-07-12`	New English entities model with improved accuracy and confidence scores.
`2019-06-04`	Fixed a bug that caused entities requests with custom models to ignore the `limit` option. The default `limit` value for all entities requests is now 50 for all models. The maximum `limit` value of 250 entities has been removed.
`2018-11-16`	Version 2 Italian entity type system.
`2018-09-21`	Version 2 Portuguese entity type system.
`2018-03-16`	Version 2 French entity type system. Version 2 German entity type system.
`2017-02-27`	Base version.

17 October 2024

Retired Summarization (Experimental) Feature: The Summarization (Experimental) feature of Watson Natural Language Understanding has been retired and is no longer available.

Entity types (Version 1) deprecation announcement

Entity types (Version 1) is deprecated. As of 11 July 2023, the v1 Entities type system will no longer be available. Consider switching to the v2 Entities type system. By using the more generic v2 Entities type system together with other features, such as Concepts and Categories, you can achieve similar outcomes with more flexibility. For more information about v2 entity types, see Entity types (Version 2).

9 June 2023

Retired Custom Sentiment Feature: The Custom Sentiment feature is retired. Custom sentiment models can no longer be created nor can they be used with Analyze API calls. To ensure we continue providing our clients with robust and powerful text classification capabilities, IBM recently announced the general availability of a new single-label text classification capability. This new feature includes extended language support and training data customizations suited for building a custom sentiment classifier.

16 February 2023

Updated training process for Custom Classifications: Updates were made to the Custom Classifications training process including improvements in the preprocessing stage, updating underlying libraries, and fixing minor bugs.

2 February 2023

Sentiment support for additional languages: Support for sentiment is now available, for all public service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Hindi, Norwegian Bokmål, Norwegian Nynorsk, Polish, Romanian, Slovak, Swedish, Turkish. For details, see Language support.

11 January 2023

Improved English and Korean Entities Model: Fixed a bug for English and Korean models using Version 2 Entity type system where not all entities are returned properly.

3 November 2022

Improved Error Handling and Validation for Sentiment and Custom Sentiment

If a request contains both an error in the sentiment feature and a valid feature request for another feature, the response returns a 200 with a warning for the sentiment feature.

Requests for Custom Sentiment that include at least one target found in the document will now return a 200 with sentiment analysis for the found targets.

The error log has changed from sentiment request must specify at least one of: document, targets to sentiment request must specify at least one of: document or targets.

1 September 2022

Improved Categories and Custom Categories (beta) Models: The model for Categories and Custom Categories (beta) now supports more contemporary vocabulary. For example, newer words such as "Covid-19" can be interpreted.

10 August 2022

Entities support for additional languages: Support for entities is now available, for all public and premium service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Hindi, Norwegian Bokmål, Norwegian Nynorsk, Polish, Romanian, Slovak, Turkish. For details, see Language support.
Version 2 entity support for Russian and Swedish: Version 2 entity type support is now available, for all public and premium service instances, for the following languages: Russian and Swedish. For details, see Entity type systems.

3 August 2022

Support for Single Label Classifications: Classifications now allows users to pass in a model_type training parameter when creating or updating a model, in order to train a single label classifier. See Classifications training parameters for more details.

11 July 2022

Improved custom classifications model - Japanese: Japanese custom classifications models now train and perform inference faster, with improved model results.
Improved custom categories model: Model contains improved word filtering, resulting in better category determination.

7 April 2022

Categories bug fix: Fixed a bug in the Version 2 Categories type system.

14 February 2022

Emotion support: Support for emotion is now available, for all public service instances, for French. For details, see Language support.
Improved error handling and validation for emotion: If a request contains both an error in the emotion feature and a valid feature request for another feature, the response returns a 200 with a warning for the emotion feature.; The error log has changed from emotion request must specify at least one of: document, targets to emotion request must specify at least one of: document or targets.; Requests that include the emotion feature with any nonstring elements in the targets list returns an error.

1 December 2021

Tone analytics: Tone analytics is now available, for English and French languages only. The tone analytics feature detects excited, frustrated, impolite, polite, sad, satisfied, and sympathetic tones from text.

12 October 2021

Keyword and syntax support for additional languages: Support for keywords and syntax is now available, for all public and premium service instances, for the following languages: Hindi, Romanian, and Turkish. In addition, syntax is available for all supported languages. For details, see Language support.

15 August 2021

Custom classification feature: The custom classifications feature is now generally available (GA).
Categories stock model updated: The categories stock model has been updated to use the IAB Tech Lab 2.0 taxonomy.

5 May 2021

Advanced Rules beta feature deprecated: The advanced rules beta feature is deprecated. As of June 10, 2021, you will not be able to deploy advanced rules models to Natural Language Understanding; existing models will keep running. After June 24, 2021, no advanced rules models will run in Natural Language Understanding.

25 March 2021

Custom classifications beta feature: The custom classifications (Beta) feature allows you to train a multi-label text classifier (English-only) by using your own labeled data.

18 February 2021

Custom categories beta feature: The custom categories (Beta) feature allows you to train custom categories models (English only) with service instances deployed in the Dallas location.

December 2020

9 December 2020

Version 2 entity support for English: Version 2 entity type support is now available, for all public and premium service instances, for English language. For details, see Entity type systems.

2 December 2020

Version 2 entity support for Korean and Spanish: Version 2 entity type support is now available, for all public and premium service instances, for the following languages: Korean and Spanish. For details, see Entity type systems.

November 2020

12 November 2020

Entity support for Arabic, Chinese (Simplified), and Dutch: Support for entities is now available, for all public and premium service instances, for the following languages: Arabic, Chinese (Simplified) and Dutch. For details, see Language support.; Relevance ranking is not supported.

4 November 2020

Keyword support for additional languages: Support for keywords is now available, for all public and premium service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Norwegian, Norwegian-Bokmal, Norwegian-Nynorsk, Polish, and Slovak. For details, see Language support.
Improved keyword support for Russian and Swedish: The code that is used for keyword support for Russian and Swedish has been improved, so different, higher quality results are expected.

September 2020

25 September 2020

version parameter deprecated: The version form parameter for model objects in the API definition has been deprecated; use the model_version parameter instead. The model_version parameter will override version whenever there is a conflict until April 9, 2021.

3 September 2020

Summarization: Summarization (Experimental) returns a summary of input source content. Summarization is currently available for English only.
Categories support for Dutch and Chinese: Support for Chinese and Dutch categories is now available, for all public and premium service instances. For details, see Language support.

August 2020

6 August 2020

Syntax support: Syntax support is no longer "experimental", and is supported in all public and premium environments, for the following languages: English, Arabic, German, Spanish, French, Italian, Japanese, Korean, Dutch, Portuguese, and Chinese (Simplified). For details, see Language support.

1 August 2020

Taxonomy changes: The following changes to the default taxonomy are activated when you use the version date 2020-08-01 or later:

Previous entry	Updated entry
`/food and drink/health and lowfat cooking`	`/food and drink/health and low-fat cooking`
`/society/crime/sexual offence`	`/society/crime/sexual offense`
`/society/crime/sexual offence/paedophilia`	`/society/crime/sexual offense/pedophilia`
`/society/crime/sexual offence/prostitution`	`/society/crime/sexual offense/prostitution`
`/society/crime/sexual offence/rape`	`/society/crime/sexual offense/rape`
`/style and fashion/men 's fashion`	`/style and fashion/men's fashion`
`/technology and computing/consumer electronics/ebook reader`	`/technology and computing/consumer electronics/e-book reader`

8 July 2020

Arabic keyword support: Support for Arabic keywords is now available, for all public and premium service instances. For details, see Language support.

10 April 2020

Custom Sentiment (Beta): Custom Sentiment (Beta) is now available. The Beta supports English and is available only for service instances in the Dallas location. For details, see Creating custom sentiment models.

March 2020

20 March 2020

Error response format change for advanced rules model endpoints (Beta): Changed the error response format for advanced rules model endpoints (Beta) to be consistent with other endpoints.

Renamed status to code.

Renamed message to error.

Previous format:

{
    "status": 404,
    "message": "not found"
}

New format:

{
    "code": 404,
    "error": "not found"
}

11 March 2020

Enhanced sentiment analysis for English: Enhanced English sentiment analysis to better identify and understand idioms in text.

December 2019

12 December 2019

Full support for IBM Cloud IAM: Natural Language Understanding now supports the full implementation of IBM Cloud Identity and Access Management (IAM). API keys for Watson services are no longer limited to a single service instance. You can create access policies and API keys that apply to more than one service, and you can grant access between services.

To support this change, the API service endpoints use a different domain and include the service instance ID.

The pattern is api.{location}.{offering}.watson.cloud.ibm.com/instances/{instance_id}.
Example URL for an instance hosted in the Dallas location: api.us-south.natural-language-understanding.watson.cloud.ibm.com/instances/6bbda3b3-d572-45e1-8c54-22d6ed9e52c2
The previous public endpoint domain was watsonplatform.net.

For more information about the URLs, see the API reference.

These URLs do not introduce a breaking change. The new URLs work both for your existing service instances and for new instances. The original URLs continue to work on your existing service instances for at least one year (until December 2020).

For more information about IAM, see Authenticating to Watson services.

New network and data security features: Support for data encryption with customer-managed keys Users of new premium and dedicated instances can integrate IBM® Key Protect for IBM Cloud® with Natural Language Understanding to encrypt their data and manage encryption keys. For more information, see Protecting sensitive information in your Watson service.; Support for private network endpoints Users of Premium plans can create private network endpoints to connect to Natural Language Understanding over a private network. Connections to private network endpoints do not require public internet access. For more information, see Public and private network endpoints.

11 December 2019

Custom categories (experimental) to be retired: The custom categories experimental feature will be retired on 19 December 2019. On that date, deployed custom categories models will no longer be accessible in Natural Language Understanding. The feature will be removed from Knowledge Studio on an earlier date. Custom categories models will no longer be accessible in Knowledge Studio on 17 December 2019.

November 2019

25 November 2019

Advanced rules (Beta): Analyze text with custom advanced rules models that you create in IBM Watson® Knowledge Studio.

13 November 2019

New South Korea location: You can now create Natural Language Understanding instances in the Seoul location. As with other locations, the IBM Cloud Seoul location uses token-based Identity and Access Management (IAM) authentication.

10 October 2019

Improved Sentiment response times: Improved response times for Japanese and Korean sentiment requests.

September 2019

17 September 2019

Added Keyword and Sentiment support for Dutch and Chinese (Simplified): Added support for Dutch keywords and sentiment. Added support for Chinese (Simplified) keywords and sentiment.

5 September 2019

Improved Sentiment support for Spanish and Portuguese: Improved accuracy for Spanish and Portuguese sentiment. Improved response times for Portuguese sentiment requests.

29 August 2019

Improved Sentiment support for English: Improved accuracy for English sentiment.

12 July 2019

The following changes are activated when you use the version date 2019-07-12 or later.: New English entities model with improved accuracy.; Confidence scores are returned in English entities results.

June 2019

21 June 2019

Confidence score returned for entities: Entity confidence score is now returned for entities requests that use custom machine learning models.

4 June 2019

The following changes are activated when you use the version date 2019-06-04 or later.: Fixed a bug that caused entities requests with custom models to ignore the limit option.; The default limit value for all entities requests is now 50 for all models.; The maximum limit value of 250 entities has been removed.

May 2019

29 May 2019

Sentiment improved for Spanish: Improved Spanish sentiment.
Bug fix for special characters: Fixed a bug that may have affected targeted sentiment results if the target contained special characters.
Service instance changes: The following changes are enabled for service instances in Washington DC, Tokyo, London, and Sydney:

Sentiment option for entities requests that use custom models is enabled for all sentiment-supported languages except Arabic and Russian.

24 May 2019

Keywords improved for German: Improved German keywords.
Sentiment updates for English to locations: English sentiment updates released to service instances in all IBM Cloud locations other than Dallas.

3 May 2019

Sentiment detection for German improved: Improved German document-level sentiment detection.

25 April 2019

Entity mentions for rule-based custom models: Enabled entity mentions for rule-based custom models. To view mentions in your results, set the mentions entities option to true.
Explanations for English categories results: Released explanations for English categories results. To see relevant text from the source that contributed to each result, set the explanation categories option to true.

19 March 2019

Custom categories models created with Knowledge Studio: Introduced experimental support for custom categories models created with Knowledge Studio.
Keywords and concepts improved for Spanish: Improved Spanish keywords and concepts.

10 January 2019

Requests with no version date parameter return error: List models and Delete model requests that don't include a version date parameter now return 400 errors instead of successful responses.
Entites performance improved: Improved entities performance for languages other than English.

December 2018

14 December 2018

Support improvements: You can now create Natural Language Understanding service instances in the IBM Cloud London location.; Added support for Portuguese relations.; Added support for Italian relations.; Added a limit parameter for categories requests that controls the number of categories returned up to a maximum of 10.; Improved accuracy for Japanese and German sentiment.

6 December 2018

Improved accuracy for Italian: Improved accuracy for Italian categories, keywords, entities, and categories.

November 2018

27 November 2018

Improvements and bug fixes: Improved quality of keywords results for English, French, Japanese, and Portuguese input.; Keywords with different capitalizations now appear in results as the same keyword.; The Get models method now returns additional fields that you can use to manage custom models across several deployments.

version: user-provided version string from Knowledge Studio
version_description: user-provided description of this version from Knowledge Studio (for example, what changed since the previous version)
workspace_id: An ID provided by Knowledge Studio that remains constant over repeated deployments from the same Knowledge Studio workspace.
created: A datetime string that indicates when the model was deployed to NLU.

16 November 2018

New algorithm, improvements, and bug fixes: Released new algorithms for English keywords and sentiment to improve accuracy and performance.; Improved keywords accuracy and performance for English, French, Japanese, and Portuguese input.; Improved Italian sentiment accuracy and performance, including better accuracy for large text samples.; Fixed a bug that caused URL-encoded text to appear in Spanish entity disambiguation results.; Released a new Italian entities model with the latest entity type system. You can learn about the latest type system on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to 2018-11-16 to use the new model.

9 November 2018

Categories improvement: Major improvement in accuracy for Categories in all supported languages.

8 November 2018

Service instances can be created in Tokyo location: You can now create Natural Language Understanding service instances in the IBM Cloud Tokyo location.

5 November 2018

Support for Concepts in Italian: Added support for Italian concepts.

30 October 2018

IAM used for service instances: As of 30 October 2018, new service instances created in the Germany and US South regions use Identity and Access Management (IAM) authentication.

21 September 2018

Language support improvements: Added support for French relations and Portuguese concepts.; The targeted sentiment option is now supported for French and Portuguese keywords.; Improved French keywords.; Improved Korean sentiment.; Improved Portuguese keywords and sentiment.; Released a new Portuguese entities model with the latest entity type system. You can learn about the latest type system on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to 2018-09-21 to use the new model.

June 2018

26 June 2018

Added support for Japanese entities and keywords.: For Japanese input, the following entities are not yet supported:

Number
Percent
PhoneNumber
URL

IPv6 addresses in Japanese input are not yet detectable as IPAddress entities.

12 June 2018

IAM used for US East region: As of 12 June 2018, new service instances created in the US East region use Identity and Access Management (IAM) authentication.

6 June 2018

Korean categories improvement: Minor improvements for Korean categories results.

May 2018

29 May 2018

IAM used in Sydney region: As of 29 May 2018, new service instances created in the Sydney region use Identity and Access Management (IAM) authentication.

9 May 2018

German entities improvement: Improved German entities.

4 May 2018

Bug fixes and performance improvements: Added support for Japanese sentiment and semantic roles.; Improved performance for metadata requests.; Fixed a bug that caused NAN relevance scores to appear in some entities results.; Fixed a bug that returned 400 error codes in German and Korean keywords requests when 500 error codes were more appropriate.

April 2018

19 April 2018

Bug fixes and performance improvements: Added support for Japanese relations.; Improved German keywords.; Fixed a bug that caused incorrect entity mention text to be returned.; Fixed a bug that could cause poor results for targeted sentiment.; Fixed a bug that caused the returned analyzed_text to include characters that were not analyzed.

5 April 2018

Webpage improvements: Improved webpage content fetching. If you use the url parameter to analyze webpages, you'll see better results, especially from webpages that use framesets and cookies.
Korean concept improvement: Minor improvements to Korean concepts.

16 March 2018

Language support improvements: Added support for German categories, relations, and semantic roles.; A new relation type system is used for German relations. To view details, see the Relation types (Version 2) page.; Improved German keywords and sentiment.; Added support for Japanese categories and concepts.; Language detection improvements.; Improved webpage cleaning.; Improved French and German entities models. The models use a new entity type system. Check out the new entity types and subtypes on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to 2018-03-16 to use the new model. The following are the differences between the Version 1 type system and the Version 2 type system.

New entity types:
- Date
- Duration
- Measure
- Money
- Number*
- Percent*
- PhoneNumber*
- Ordinal
- Time
- URL*
Removed entity types:
- Anatomy
- Award
- Broadcaster
- Company
- Crime
- Drug
- HealthCondition
- Movie
- MusicGroup
- NaturalEvent
- PrintMedia
- Quantity
- Sport
- SportingEvent
- TelevisionShow
- Vehicle
New entity subtype:
- Quantity

* This entity type is not yet detectable in French text.

January 2018

25 January 2018

Custom model support for Chinese (Simplified): Chinese (Simplified) custom model support is now available for entities and relations.

12 January 2018

Concepts support for German: Added support for German concepts.

15 December 2017

Language support improvements: Dutch custom model support is now available for entities and relations.; French language support now includes concepts.; The French sentiment model has been improved to deliver better results.; The language detection model is faster and detects more languages overall. For the complete list of languages, see Detectable languages.

The following languages are new additions to the list:
- Belarusian (be)
- Bihari (bh)
- Dhivehi (dv)
- Galician (gl)
- Ganda (lg)
- Inuktitut (iu)
- Javanese (jv)
- Kannada (kn)
- Khmer (km)
- Kinyarwanda (rw)
- Laothian (lo)
- Malayalam (ml)
- Marathi (mr)
- Oriya (or)
- Punjabi (pa)
- Scots Gaelic (gd)
- Sinhalese (si)
- Tamil (ta)
- Telugu (te)
- Yiddish (yi)
The following languages are no longer detectable:
- Breton (br)
- Chamorro (ch)
- Esperanto (eo)
- Faroese (fo)
- Hausa (ha)
- Ndebele (nr)
- Ojibwa (oj)

November 2017

28 November 2017

Entity mentions: Get the locations of entity mentions in your entities requests by adding the option "mentions": true.

Example POST /analyze request body:

{
  "text": "Intel planned to announce Monday a laptop-computer chip that combines an Intel processor and an AMD graphics unit.",
  "features": {
    "entities": {
      "mentions": true
    }
  }
}

Example response:

{
  "usage": {
    "text_units": 1,
    "text_characters": 114,
    "features": 1
  },
  "language": "en",
  "entities": [
    {
      "type": "Company",
      "text": "Intel",
      "relevance": 0.33,
      "mentions": [
        {
          "text": "Intel",
          "location": [
            0,
            5
          ]
        },
        {
          "text": "Intel",
          "location": [
            73,
            78
          ]
        }
      ],
      "disambiguation": {
        "subtype": [
          "OperatingSystemDeveloper",
          "ProcessorManufacturer",
          "SoftwareDeveloper"
        ],
        "name": "Intel",
        "dbpedia_resource": "http://dbpedia.org/resource/Intel"
      },
      "count": 2
    },
    {
      "type": "Company",
      "text": "AMD",
      "relevance": 0.33,
      "mentions": [
        {
          "text": "AMD",
          "location": [
            96,
            99
          ]
        }
      ],
      "disambiguation": {
        "subtype": [
          "ProcessorManufacturer"
        ],
        "name": "Advanced Micro Devices",
        "dbpedia_resource": "http://dbpedia.org/resource/Advanced_Micro_Devices"
      },
      "count": 1
    }
  ]
}

17 November 2017

Korean language support improved: Korean language support has expanded to include the following features:

Entities
Keywords
Semantic Roles

30 July 2017

Limit number of characters processed: You can control cost by using the optional limit_text_characters parameter to limit the number of characters that are processed. For example,

{
  "text": "The United States of America, commonly known as the United States (U.S.) or America, is a federal republic composed of 50 states, a federal district, five major self-governing territories, and various possessions. Forty-eight of the fifty states and the federal district are contiguous and located in North America between Canada and Mexico. The state of Alaska is in the northwest corner of North America, bordered by Canada to the east and across the Bering Strait from Russia to the west.",
  "features": {
    "entities": {}
  },
  "limit_text_characters": 50,
  "return_analyzed_text": true
}

Each character counts as one character, regardless of whether the character is a single-byte character or a multibyte character.
For metering, one Natural Language Understanding Item continues to be one feature (also known as an enrichment) per one text unit. One text unit is 10,000 characters or less.
For detailed pricing information, see Natural Language Understanding in the IBM Cloud® Catalog.

In addition to adding the limit_text_characters parameter, the following changes were made to text size limits and truncation:

All text greater than 50,000 characters is truncated before processing. Previously, truncation was based on kilobytes, where one kilobyte equaled 1024 bytes.
An informational message is returned in the response when text beyond 50,000 characters is truncated.
Text limit size for custom models has increased from 10,000 characters to 50,000 characters.
Usage information is added to the response for clarity around number of Natural Language Understanding Items used for each request. For example,

{
  "usage": {
    "text_units": 5,
    "text_characters": 41186,
    "features": 1
  },
  ...
}

8 May 2017

Emotion tone score updates: Updated the emotion tone score model. The training data set was expanded and feature engineering was altered and as a result, the model has higher precision on the IBM benchmark data set.

March 2017

27 March 2017

Model improvements: The entity and sentiment models have been improved, which means that when you call those features, you will get better results.; Relations now support Knowledge Studio custom models in French, German, Italian, and Portuguese.

15 March 2017

Emotion tone score updated: We released updates to the emotion tone score model. The training data set was expanded and as a result, the model has higher precision on the IBM benchmark data set.

27 February 2017

Service GA: The Natural Language Understanding service is now GA.