IBM Cloud Docs
Release notes for Natural Language Understanding

Release notes for Natural Language Understanding

The following new features and changes to the service are available.

Service API versioning

Current API version: 2022-08-10

API requests require a version parameter that takes the date in the format version=YYYY-MM-DD. Send the version parameter with every API request.

When we change the API in a backwards-incompatible way, we release a new minor version. To take advantage of the changes in a new version, change the value of the version parameter to the new date. If you're not ready to update to that version, don't change your version date.

Active version dates

The following table shows the service behavior changes for each version date. Switching to a later version date will activate all changes introduced in earlier versions.

Version date Changes summary
2022-08-10
  • Language expansion of entities with a new model for improved accuracy and confidence scores.
  • Version 2 Russian entity type system.
  • Version 2 Swedish entity type system.
  • 2022-04-07
  • Bug fix for Version 2 Categories type system.
  • 2021-08-15
  • Classifications GA and Categories type system Version 2 features.
  • 2021-03-25
  • Custom categories (beta) and custom classifications (beta) features.
  • 2020-12-09
  • Version 2 English entity type system.
  • 2020-12-02
  • Version 2 Korean entity type system.
  • Version 2 Spanish entity type system.
  • 2020-08-01
  • Taxonomy changes aimed towards standardization of the label names in the default taxonomy.
  • 2019-07-12
  • New English entities model with improved accuracy and confidence scores.
  • 2019-06-04
  • Fixed a bug that caused entities requests with custom models to ignore the limit option.
  • The default limit value for all entities requests is now 50 for all models.
  • The maximum limit value of 250 entities has been removed.
  • 2018-11-16
  • Version 2 Italian entity type system.
  • 2018-09-21
  • Version 2 Portuguese entity type system.
  • 2018-03-16
  • Version 2 French entity type system.
  • Version 2 German entity type system.
  • 2017-02-27 Base version.

    Entity types (Version 1) deprecation announcement

    Entity types (Version 1) is deprecated. As of 11 July 2023, the v1 Entities type system will no longer be available. Consider switching to the v2 Entities type system. By using the more generic v2 Entities type system together with other features, such as Concepts and Categories, you can achieve similar outcomes with more flexibility. For more information about v2 entity types, see Entity types (Version 2).

    9 June 2023

    Retired Custom Sentiment Feature
    The Custom Sentiment feature is retired. Custom sentiment models can no longer be created nor can they be used with Analyze API calls. To ensure we continue providing our clients with robust and powerful text classification capabilities, IBM recently announced the general availability of a new single-label text classification capability. This new feature includes extended language support and training data customizations suited for building a custom sentiment classifier.

    16 February 2023

    Updated training process for Custom Classifications
    Updates were made to the Custom Classifications training process including improvements in the preprocessing stage, updating underlying libraries, and fixing minor bugs.

    2 February 2023

    Sentiment support for additional languages
    Support for sentiment is now available, for all public service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Hindi, Norwegian Bokmål, Norwegian Nynorsk, Polish, Romanian, Slovak, Swedish, Turkish. For details, see Language support.

    11 January 2023

    Improved English and Korean Entities Model
    Fixed a bug for English and Korean models using Version 2 Entity type system where not all entities are returned properly.

    3 November 2022

    Improved Error Handling and Validation for Sentiment and Custom Sentiment

    If a request contains both an error in the sentiment feature and a valid feature request for another feature, the response returns a 200 with a warning for the sentiment feature.

    Requests for Custom Sentiment that include at least one target found in the document will now return a 200 with sentiment analysis for the found targets.

    The error log has changed from sentiment request must specify at least one of: document, targets to sentiment request must specify at least one of: document or targets.

    1 September 2022

    Improved Categories and Custom Categories (beta) Models
    The model for Categories and Custom Categories (beta) now supports more contemporary vocabulary. For example, newer words such as "Covid-19" can be interpreted.

    10 August 2022

    Entities support for additional languages
    Support for entities is now available, for all public and premium service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Hindi, Norwegian Bokmål, Norwegian Nynorsk, Polish, Romanian, Slovak, Turkish. For details, see Language support.
    Version 2 entity support for Russian and Swedish
    Version 2 entity type support is now available, for all public and premium service instances, for the following languages: Russian and Swedish. For details, see Entity type systems.

    3 August 2022

    Support for Single Label Classifications
    Classifications now allows users to pass in a model_type training parameter when creating or updating a model, in order to train a single label classifier. See Classifications training parameters for more details.

    11 July 2022

    Improved custom classifications model - Japanese
    Japanese custom classifications models now train and perform inference faster, with improved model results.
    Improved custom categories model
    Model contains improved word filtering, resulting in better category determination.

    7 April 2022

    Categories bug fix
    Fixed a bug in the Version 2 Categories type system.

    14 February 2022

    Emotion support
    Support for emotion is now available, for all public service instances, for French. For details, see Language support.
    Improved error handling and validation for emotion
    If a request contains both an error in the emotion feature and a valid feature request for another feature, the response returns a 200 with a warning for the emotion feature.
    The error log has changed from emotion request must specify at least one of: document, targets to emotion request must specify at least one of: document or targets.
    Requests that include the emotion feature with any nonstring elements in the targets list returns an error.

    1 December 2021

    Tone analytics
    Tone analytics is now available, for English and French languages only. The tone analytics feature detects excited, frustrated, impolite, polite, sad, satisfied, and sympathetic tones from text.

    12 October 2021

    Keyword and syntax support for additional languages
    Support for keywords and syntax is now available, for all public and premium service instances, for the following languages: Hindi, Romanian, and Turkish. In addition, syntax is available for all supported languages. For details, see Language support.

    15 August 2021

    Custom classification feature
    The custom classifications feature is now generally available (GA).
    Categories stock model updated
    The categories stock model has been updated to use the IAB Tech Lab 2.0 taxonomy.

    5 May 2021

    Advanced Rules beta feature deprecated
    The advanced rules beta feature is deprecated. As of June 10, 2021, you will not be able to deploy advanced rules models to Natural Language Understanding; existing models will keep running. After June 24, 2021, no advanced rules models will run in Natural Language Understanding.

    25 March 2021

    Custom classifications beta feature
    The custom classifications (Beta) feature allows you to train a multi-label text classifier (English-only) by using your own labeled data.

    18 February 2021

    Custom categories beta feature
    The custom categories (Beta) feature allows you to train custom categories models (English only) with service instances deployed in the Dallas location.

    December 2020

    9 December 2020

    Version 2 entity support for English
    Version 2 entity type support is now available, for all public and premium service instances, for English language. For details, see Entity type systems.

    2 December 2020

    Version 2 entity support for Korean and Spanish
    Version 2 entity type support is now available, for all public and premium service instances, for the following languages: Korean and Spanish. For details, see Entity type systems.

    November 2020

    12 November 2020

    Entity support for Arabic, Chinese (Simplified), and Dutch

    Support for entities is now available, for all public and premium service instances, for the following languages: Arabic, Chinese (Simplified) and Dutch. For details, see Language support.

    Relevance ranking is not supported.

    4 November 2020

    Keyword support for additional languages
    Support for keywords is now available, for all public and premium service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Norwegian, Norwegian-Bokmal, Norwegian-Nynorsk, Polish, and Slovak. For details, see Language support.
    Improved keyword support for Russian and Swedish
    The code that is used for keyword support for Russian and Swedish has been improved, so different, higher quality results are expected.

    September 2020

    25 September 2020

    version parameter deprecated
    The version form parameter for model objects in the API definition has been deprecated; use the model_version parameter instead. The model_version parameter will override version whenever there is a conflict until April 9, 2021.

    3 September 2020

    Summarization
    Summarization (Experimental) returns a summary of input source content. Summarization is currently available for English only.
    Categories support for Dutch and Chinese
    Support for Chinese and Dutch categories is now available, for all public and premium service instances. For details, see Language support.

    August 2020

    6 August 2020

    Syntax support
    Syntax support is no longer "experimental", and is supported in all public and premium environments, for the following languages: English, Arabic, German, Spanish, French, Italian, Japanese, Korean, Dutch, Portuguese, and Chinese (Simplified). For details, see Language support.

    1 August 2020

    Taxonomy changes
    The following changes to the default taxonomy are activated when you use the version date 2020-08-01 or later:
    Previous entry Updated entry
    /food and drink/health and lowfat cooking /food and drink/health and low-fat cooking
    /society/crime/sexual offence /society/crime/sexual offense
    /society/crime/sexual offence/paedophilia /society/crime/sexual offense/pedophilia
    /society/crime/sexual offence/prostitution /society/crime/sexual offense/prostitution
    /society/crime/sexual offence/rape /society/crime/sexual offense/rape
    /style and fashion/men 's fashion /style and fashion/men's fashion
    /technology and computing/consumer electronics/ebook reader /technology and computing/consumer electronics/e-book reader

    8 July 2020

    Arabic keyword support
    Support for Arabic keywords is now available, for all public and premium service instances. For details, see Language support.

    10 April 2020

    Custom Sentiment (Beta)
    Custom Sentiment (Beta) is now available. The Beta supports English and is available only for service instances in the Dallas location. For details, see Creating custom sentiment models.

    March 2020

    20 March 2020

    Error response format change for advanced rules model endpoints (Beta)
    Changed the error response format for advanced rules model endpoints (Beta) to be consistent with other endpoints.
    • Renamed status to code.

    • Renamed message to error.

      • Previous format:
      {
          "status": 404,
          "message": "not found"
      }
      
      • New format:
      {
          "code": 404,
          "error": "not found"
      }
      

    11 March 2020

    Enhanced sentiment analysis for English
    Enhanced English sentiment analysis to better identify and understand idioms in text.

    December 2019

    12 December 2019

    Full support for IBM Cloud IAM
    Natural Language Understanding now supports the full implementation of IBM Cloud Identity and Access Management (IAM). API keys for Watson services are no longer limited to a single service instance. You can create access policies and API keys that apply to more than one service, and you can grant access between services.

    To support this change, the API service endpoints use a different domain and include the service instance ID.

    • The pattern is api.{location}.{offering}.watson.cloud.ibm.com/instances/{instance_id}.

    • Example URL for an instance hosted in the Dallas location: api.us-south.natural-language-understanding.watson.cloud.ibm.com/instances/6bbda3b3-d572-45e1-8c54-22d6ed9e52c2

    • The previous public endpoint domain was watsonplatform.net.

    For more information about the URLs, see the API reference.

    These URLs do not introduce a breaking change. The new URLs work both for your existing service instances and for new instances. The original URLs continue to work on your existing service instances for at least one year (until December 2020).

    For more information about IAM, see Authenticating to Watson services.

    New network and data security features

    Support for data encryption with customer-managed keys Users of new premium and dedicated instances can integrate IBM® Key Protect for IBM Cloud® with Natural Language Understanding to encrypt their data and manage encryption keys. For more information, see Protecting sensitive information in your Watson service.

    Support for private network endpoints Users of Premium plans can create private network endpoints to connect to Natural Language Understanding over a private network. Connections to private network endpoints do not require public internet access. For more information, see Public and private network endpoints.

    11 December 2019

    Custom categories (experimental) to be retired
    The custom categories experimental feature will be retired on 19 December 2019. On that date, deployed custom categories models will no longer be accessible in Natural Language Understanding. The feature will be removed from Knowledge Studio on an earlier date. Custom categories models will no longer be accessible in Knowledge Studio on 17 December 2019.

    November 2019

    25 November 2019

    Advanced rules (Beta)
    Analyze text with custom advanced rules models that you create in IBM Watson® Knowledge Studio.

    13 November 2019

    New South Korea location
    You can now create Natural Language Understanding instances in the Seoul location. As with other locations, the IBM Cloud Seoul location uses token-based Identity and Access Management (IAM) authentication.

    10 October 2019

    Improved Sentiment response times
    Improved response times for Japanese and Korean sentiment requests.

    September 2019

    17 September 2019

    Added Keyword and Sentiment support for Dutch and Chinese (Simplified)
    Added support for Dutch keywords and sentiment. Added support for Chinese (Simplified) keywords and sentiment.

    5 September 2019

    Improved Sentiment support for Spanish and Portuguese
    Improved accuracy for Spanish and Portuguese sentiment. Improved response times for Portuguese sentiment requests.

    29 August 2019

    Improved Sentiment support for English
    Improved accuracy for English sentiment.

    12 July 2019

    The following changes are activated when you use the version date 2019-07-12 or later.
    New English entities model with improved accuracy.
    Confidence scores are returned in English entities results.

    June 2019

    21 June 2019

    Confidence score returned for entities
    Entity confidence score is now returned for entities requests that use custom machine learning models.

    4 June 2019

    The following changes are activated when you use the version date 2019-06-04 or later.
    Fixed a bug that caused entities requests with custom models to ignore the limit option.
    The default limit value for all entities requests is now 50 for all models.
    The maximum limit value of 250 entities has been removed.

    May 2019

    29 May 2019

    Sentiment improved for Spanish
    Improved Spanish sentiment.
    Bug fix for special characters
    Fixed a bug that may have affected targeted sentiment results if the target contained special characters.
    Service instance changes
    The following changes are enabled for service instances in Washington DC, Tokyo, London, and Sydney:
    • Sentiment option for entities requests that use custom models is enabled for all sentiment-supported languages except Arabic and Russian.

    24 May 2019

    Keywords improved for German
    Improved German keywords.
    Sentiment updates for English to locations
    English sentiment updates released to service instances in all IBM Cloud locations other than Dallas.

    3 May 2019

    Sentiment detection for German improved
    Improved German document-level sentiment detection.

    25 April 2019

    Entity mentions for rule-based custom models
    Enabled entity mentions for rule-based custom models. To view mentions in your results, set the mentions entities option to true.
    Explanations for English categories results
    Released explanations for English categories results. To see relevant text from the source that contributed to each result, set the explanation categories option to true.

    19 March 2019

    Custom categories models created with Knowledge Studio
    Introduced experimental support for custom categories models created with Knowledge Studio.
    Keywords and concepts improved for Spanish
    Improved Spanish keywords and concepts.

    10 January 2019

    Requests with no version date parameter return error
    List models and Delete model requests that don't include a version date parameter now return 400 errors instead of successful responses.
    Entites performance improved
    Improved entities performance for languages other than English.

    December 2018

    14 December 2018

    Support improvements
    You can now create Natural Language Understanding service instances in the IBM Cloud London location.
    Added support for Portuguese relations.
    Added support for Italian relations.
    Added a limit parameter for categories requests that controls the number of categories returned up to a maximum of 10.
    Improved accuracy for Japanese and German sentiment.

    6 December 2018

    Improved accuracy for Italian
    Improved accuracy for Italian categories, keywords, entities, and categories.

    November 2018

    27 November 2018

    Improvements and bug fixes
    Improved quality of keywords results for English, French, Japanese, and Portuguese input.
    Keywords with different capitalizations now appear in results as the same keyword.
    The Get models method now returns additional fields that you can use to manage custom models across several deployments.
    • version: user-provided version string from Knowledge Studio
    • version_description: user-provided description of this version from Knowledge Studio (for example, what changed since the previous version)
    • workspace_id: An ID provided by Knowledge Studio that remains constant over repeated deployments from the same Knowledge Studio workspace.
    • created: A datetime string that indicates when the model was deployed to NLU.

    16 November 2018

    New algorithm, improvements, and bug fixes
    Released new algorithms for English keywords and sentiment to improve accuracy and performance.
    Improved keywords accuracy and performance for English, French, Japanese, and Portuguese input.
    Improved Italian sentiment accuracy and performance, including better accuracy for large text samples.
    Fixed a bug that caused URL-encoded text to appear in Spanish entity disambiguation results.
    Released a new Italian entities model with the latest entity type system. You can learn about the latest type system on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to 2018-11-16 to use the new model.

    9 November 2018

    Categories improvement
    Major improvement in accuracy for Categories in all supported languages.

    8 November 2018

    Service instances can be created in Tokyo location
    You can now create Natural Language Understanding service instances in the IBM Cloud Tokyo location.

    5 November 2018

    Support for Concepts in Italian
    Added support for Italian concepts.

    30 October 2018

    IAM used for service instances
    As of 30 October 2018, new service instances created in the Germany and US South regions use Identity and Access Management (IAM) authentication.

    21 September 2018

    Language support improvements
    Added support for French relations and Portuguese concepts.
    The targeted sentiment option is now supported for French and Portuguese keywords.
    Improved French keywords.
    Improved Korean sentiment.
    Improved Portuguese keywords and sentiment.
    Released a new Portuguese entities model with the latest entity type system. You can learn about the latest type system on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to 2018-09-21 to use the new model.

    June 2018

    26 June 2018

    Added support for Japanese entities and keywords.
    For Japanese input, the following entities are not yet supported:
    • Number
    • Percent
    • PhoneNumber
    • URL

    IPv6 addresses in Japanese input are not yet detectable as IPAddress entities.

    12 June 2018

    IAM used for US East region
    As of 12 June 2018, new service instances created in the US East region use Identity and Access Management (IAM) authentication.

    6 June 2018

    Korean categories improvement
    Minor improvements for Korean categories results.

    May 2018

    29 May 2018

    IAM used in Sydney region
    As of 29 May 2018, new service instances created in the Sydney region use Identity and Access Management (IAM) authentication.

    9 May 2018

    German entities improvement
    Improved German entities.

    4 May 2018

    Bug fixes and performance improvements
    Added support for Japanese sentiment and semantic roles.
    Improved performance for metadata requests.
    Fixed a bug that caused NAN relevance scores to appear in some entities results.
    Fixed a bug that returned 400 error codes in German and Korean keywords requests when 500 error codes were more appropriate.

    April 2018

    19 April 2018

    Bug fixes and performance improvements
    Added support for Japanese relations.
    Improved German keywords.
    Fixed a bug that caused incorrect entity mention text to be returned.
    Fixed a bug that could cause poor results for targeted sentiment.
    Fixed a bug that caused the returned analyzed_text to include characters that were not analyzed.

    5 April 2018

    Webpage improvements
    Improved webpage content fetching. If you use the url parameter to analyze webpages, you'll see better results, especially from webpages that use framesets and cookies.
    Korean concept improvement
    Minor improvements to Korean concepts.

    16 March 2018

    Language support improvements
    Added support for German categories, relations, and semantic roles.
    A new relation type system is used for German relations. To view details, see the Relation types (Version 2) page.
    Improved German keywords and sentiment.
    Added support for Japanese categories and concepts.
    Language detection improvements.
    Improved webpage cleaning.
    Improved French and German entities models. The models use a new entity type system. Check out the new entity types and subtypes on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to 2018-03-16 to use the new model. The following are the differences between the Version 1 type system and the Version 2 type system.
    • New entity types:
      • Date
      • Duration
      • Measure
      • Money
      • Number*
      • Percent*
      • PhoneNumber*
      • Ordinal
      • Time
      • URL*
    • Removed entity types:
      • Anatomy
      • Award
      • Broadcaster
      • Company
      • Crime
      • Drug
      • HealthCondition
      • Movie
      • MusicGroup
      • NaturalEvent
      • PrintMedia
      • Quantity
      • Sport
      • SportingEvent
      • TelevisionShow
      • Vehicle
    • New entity subtype:
      • Quantity

    * This entity type is not yet detectable in French text.

    January 2018

    25 January 2018

    Custom model support for Chinese (Simplified)
    Chinese (Simplified) custom model support is now available for entities and relations.

    12 January 2018

    Concepts support for German
    Added support for German concepts.

    15 December 2017

    Language support improvements
    Dutch custom model support is now available for entities and relations.
    French language support now includes concepts.
    The French sentiment model has been improved to deliver better results.
    The language detection model is faster and detects more languages overall. For the complete list of languages, see Detectable languages.
    • The following languages are new additions to the list:

      • Belarusian (be)
      • Bihari (bh)
      • Dhivehi (dv)
      • Galician (gl)
      • Ganda (lg)
      • Inuktitut (iu)
      • Javanese (jv)
      • Kannada (kn)
      • Khmer (km)
      • Kinyarwanda (rw)
      • Laothian (lo)
      • Malayalam (ml)
      • Marathi (mr)
      • Oriya (or)
      • Punjabi (pa)
      • Scots Gaelic (gd)
      • Sinhalese (si)
      • Tamil (ta)
      • Telugu (te)
      • Yiddish (yi)
    • The following languages are no longer detectable:

      • Breton (br)
      • Chamorro (ch)
      • Esperanto (eo)
      • Faroese (fo)
      • Hausa (ha)
      • Ndebele (nr)
      • Ojibwa (oj)

    November 2017

    28 November 2017

    Entity mentions
    Get the locations of entity mentions in your entities requests by adding the option "mentions": true.
    • Example POST /analyze request body:

      {
        "text": "Intel planned to announce Monday a laptop-computer chip that combines an Intel processor and an AMD graphics unit.",
        "features": {
          "entities": {
            "mentions": true
          }
        }
      }
      
    • Example response:

      {
        "usage": {
          "text_units": 1,
          "text_characters": 114,
          "features": 1
        },
        "language": "en",
        "entities": [
          {
            "type": "Company",
            "text": "Intel",
            "relevance": 0.33,
            "mentions": [
              {
                "text": "Intel",
                "location": [
                  0,
                  5
                ]
              },
              {
                "text": "Intel",
                "location": [
                  73,
                  78
                ]
              }
            ],
            "disambiguation": {
              "subtype": [
                "OperatingSystemDeveloper",
                "ProcessorManufacturer",
                "SoftwareDeveloper"
              ],
              "name": "Intel",
              "dbpedia_resource": "http://dbpedia.org/resource/Intel"
            },
            "count": 2
          },
          {
            "type": "Company",
            "text": "AMD",
            "relevance": 0.33,
            "mentions": [
              {
                "text": "AMD",
                "location": [
                  96,
                  99
                ]
              }
            ],
            "disambiguation": {
              "subtype": [
                "ProcessorManufacturer"
              ],
              "name": "Advanced Micro Devices",
              "dbpedia_resource": "http://dbpedia.org/resource/Advanced_Micro_Devices"
            },
            "count": 1
          }
        ]
      }
      

    17 November 2017

    Korean language support improved
    Korean language support has expanded to include the following features:
    • Entities
    • Keywords
    • Semantic Roles

    30 July 2017

    Limit number of characters processed
    You can control cost by using the optional limit_text_characters parameter to limit the number of characters that are processed. For example,
    {
      "text": "The United States of America, commonly known as the United States (U.S.) or America, is a federal republic composed of 50 states, a federal district, five major self-governing territories, and various possessions. Forty-eight of the fifty states and the federal district are contiguous and located in North America between Canada and Mexico. The state of Alaska is in the northwest corner of North America, bordered by Canada to the east and across the Bering Strait from Russia to the west.",
      "features": {
        "entities": {}
      },
      "limit_text_characters": 50,
      "return_analyzed_text": true
    }
    
    • Each character counts as one character, regardless of whether the character is a single-byte character or a multibyte character.

    • For metering, one Natural Language Understanding Item continues to be one feature (also known as an enrichment) per one text unit. One text unit is 10,000 characters or less.

    • For detailed pricing information, see Natural Language Understanding in the IBM Cloud® Catalog.

    In addition to adding the limit_text_characters parameter, the following changes were made to text size limits and truncation:

    • All text greater than 50,000 characters is truncated before processing. Previously, truncation was based on kilobytes, where one kilobyte equaled 1024 bytes.

    • An informational message is returned in the response when text beyond 50,000 characters is truncated.

    • Text limit size for custom models has increased from 10,000 characters to 50,000 characters.

    • Usage information is added to the response for clarity around number of Natural Language Understanding Items used for each request. For example,

    {
      "usage": {
        "text_units": 5,
        "text_characters": 41186,
        "features": 1
      },
      ...
    }
    

    8 May 2017

    Emotion tone score updates
    Updated the emotion tone score model. The training data set was expanded and feature engineering was altered and as a result, the model has higher precision on the IBM benchmark data set.

    March 2017

    27 March 2017

    Model improvements
    The entity and sentiment models have been improved, which means that when you call those features, you will get better results.
    Relations now support Knowledge Studio custom models in French, German, Italian, and Portuguese.

    15 March 2017

    Emotion tone score updated
    We released updates to the emotion tone score model. The training data set was expanded and as a result, the model has higher precision on the IBM benchmark data set.

    27 February 2017

    Service GA
    The Natural Language Understanding service is now GA.