Release notes for Natural Language Understanding
The following new features and changes to the service are available.
Service API versioning
Current API version: 2022-08-10
API requests require a version parameter that takes the date in the format version=YYYY-MM-DD
. Send the version parameter with every API request.
When we change the API in a backwards-incompatible way, we release a new minor version. To take advantage of the changes in a new version, change the value of the version parameter to the new date. If you're not ready to update to that version, don't change your version date.
Active version dates
The following table shows the service behavior changes for each version date. Switching to a later version date will activate all changes introduced in earlier versions.
Version date | Changes summary |
---|---|
2022-08-10 |
|
2022-04-07 |
|
2021-08-15 |
|
2021-03-25 |
|
2020-12-09 |
|
2020-12-02 |
|
2020-08-01 |
|
2019-07-12 |
|
2019-06-04 |
limit option.limit value for all entities requests is now 50 for all models.limit value of 250 entities has been removed. |
2018-11-16 |
|
2018-09-21 |
|
2018-03-16 |
|
2017-02-27 |
Base version. |
17 October 2024
- Retired Summarization (Experimental) Feature
- The Summarization (Experimental) feature of Watson Natural Language Understanding has been retired and is no longer available.
Entity types (Version 1) deprecation announcement
Entity types (Version 1) is deprecated. As of 11 July 2023, the v1 Entities type system will no longer be available. Consider switching to the v2 Entities type system. By using the more generic v2 Entities type system together with other features, such as Concepts and Categories, you can achieve similar outcomes with more flexibility. For more information about v2 entity types, see Entity types (Version 2).
9 June 2023
- Retired Custom Sentiment Feature
- The Custom Sentiment feature is retired. Custom sentiment models can no longer be created nor can they be used with Analyze API calls. To ensure we continue providing our clients with robust and powerful text classification capabilities, IBM recently announced the general availability of a new single-label text classification capability. This new feature includes extended language support and training data customizations suited for building a custom sentiment classifier.
16 February 2023
- Updated training process for Custom Classifications
- Updates were made to the Custom Classifications training process including improvements in the preprocessing stage, updating underlying libraries, and fixing minor bugs.
2 February 2023
- Sentiment support for additional languages
- Support for sentiment is now available, for all public service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Hindi, Norwegian Bokmål, Norwegian Nynorsk, Polish, Romanian, Slovak, Swedish, Turkish. For details, see Language support.
11 January 2023
- Improved English and Korean Entities Model
- Fixed a bug for English and Korean models using Version 2 Entity type system where not all entities are returned properly.
3 November 2022
- Improved Error Handling and Validation for Sentiment and Custom Sentiment
-
If a request contains both an error in the sentiment feature and a valid feature request for another feature, the response returns a 200 with a warning for the sentiment feature.
Requests for Custom Sentiment that include at least one target found in the document will now return a 200 with sentiment analysis for the found targets.
The error log has changed from
sentiment request must specify at least one of: document, targets
tosentiment request must specify at least one of: document or targets
.
1 September 2022
- Improved Categories and Custom Categories (beta) Models
- The model for Categories and Custom Categories (beta) now supports more contemporary vocabulary. For example, newer words such as "Covid-19" can be interpreted.
10 August 2022
- Entities support for additional languages
- Support for entities is now available, for all public and premium service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Hindi, Norwegian Bokmål, Norwegian Nynorsk, Polish, Romanian, Slovak, Turkish. For details, see Language support.
- Version 2 entity support for Russian and Swedish
- Version 2 entity type support is now available, for all public and premium service instances, for the following languages: Russian and Swedish. For details, see Entity type systems.
3 August 2022
- Support for Single Label Classifications
- Classifications now allows users to pass in a
model_type
training parameter when creating or updating a model, in order to train a single label classifier. See Classifications training parameters for more details.
11 July 2022
- Improved custom classifications model - Japanese
- Japanese custom classifications models now train and perform inference faster, with improved model results.
- Improved custom categories model
- Model contains improved word filtering, resulting in better category determination.
7 April 2022
- Categories bug fix
- Fixed a bug in the Version 2 Categories type system.
14 February 2022
- Emotion support
- Support for emotion is now available, for all public service instances, for French. For details, see Language support.
- Improved error handling and validation for emotion
- If a request contains both an error in the emotion feature and a valid feature request for another feature, the response returns a 200 with a warning for the emotion feature.
- The error log has changed from
emotion request must specify at least one of: document, targets
toemotion request must specify at least one of: document or targets
. - Requests that include the emotion feature with any nonstring elements in the targets list returns an error.
1 December 2021
- Tone analytics
- Tone analytics is now available, for English and French languages only. The tone analytics feature detects
excited
,frustrated
,impolite
,polite
,sad
,satisfied
, andsympathetic
tones from text.
12 October 2021
- Keyword and syntax support for additional languages
- Support for keywords and syntax is now available, for all public and premium service instances, for the following languages: Hindi, Romanian, and Turkish. In addition, syntax is available for all supported languages. For details, see Language support.
15 August 2021
- Custom classification feature
- The custom classifications feature is now generally available (GA).
- Categories stock model updated
- The categories stock model has been updated to use the IAB Tech Lab 2.0 taxonomy.
5 May 2021
- Advanced Rules beta feature deprecated
- The advanced rules beta feature is deprecated. As of June 10, 2021, you will not be able to deploy advanced rules models to Natural Language Understanding; existing models will keep running. After June 24, 2021, no advanced rules models will run in Natural Language Understanding.
25 March 2021
- Custom classifications beta feature
- The custom classifications (Beta) feature allows you to train a multi-label text classifier (English-only) by using your own labeled data.
18 February 2021
- Custom categories beta feature
- The custom categories (Beta) feature allows you to train custom categories models (English only) with service instances deployed in the Dallas location.
December 2020
9 December 2020
- Version 2 entity support for English
- Version 2 entity type support is now available, for all public and premium service instances, for English language. For details, see Entity type systems.
2 December 2020
- Version 2 entity support for Korean and Spanish
- Version 2 entity type support is now available, for all public and premium service instances, for the following languages: Korean and Spanish. For details, see Entity type systems.
November 2020
12 November 2020
- Entity support for Arabic, Chinese (Simplified), and Dutch
-
Support for entities is now available, for all public and premium service instances, for the following languages: Arabic, Chinese (Simplified) and Dutch. For details, see Language support.
-
Relevance ranking is not supported.
4 November 2020
- Keyword support for additional languages
- Support for keywords is now available, for all public and premium service instances, for the following languages: Czech, Danish, Finnish, Hebrew, Norwegian, Norwegian-Bokmal, Norwegian-Nynorsk, Polish, and Slovak. For details, see Language support.
- Improved keyword support for Russian and Swedish
- The code that is used for keyword support for Russian and Swedish has been improved, so different, higher quality results are expected.
September 2020
25 September 2020
version
parameter deprecated- The
version
form parameter for model objects in the API definition has been deprecated; use themodel_version
parameter instead. Themodel_version
parameter will overrideversion
whenever there is a conflict until April 9, 2021.
3 September 2020
- Summarization
- Summarization (Experimental) returns a summary of input source content. Summarization is currently available for English only.
- Categories support for Dutch and Chinese
- Support for Chinese and Dutch categories is now available, for all public and premium service instances. For details, see Language support.
August 2020
6 August 2020
- Syntax support
- Syntax support is no longer "experimental", and is supported in all public and premium environments, for the following languages: English, Arabic, German, Spanish, French, Italian, Japanese, Korean, Dutch, Portuguese, and Chinese (Simplified). For details, see Language support.
1 August 2020
- Taxonomy changes
- The following changes to the default taxonomy are activated when you use the version date
2020-08-01
or later:
Previous entry | Updated entry |
---|---|
/food and drink/health and lowfat cooking |
/food and drink/health and low-fat cooking |
/society/crime/sexual offence |
/society/crime/sexual offense |
/society/crime/sexual offence/paedophilia |
/society/crime/sexual offense/pedophilia |
/society/crime/sexual offence/prostitution |
/society/crime/sexual offense/prostitution |
/society/crime/sexual offence/rape |
/society/crime/sexual offense/rape |
/style and fashion/men 's fashion |
/style and fashion/men's fashion |
/technology and computing/consumer electronics/ebook reader |
/technology and computing/consumer electronics/e-book reader |
8 July 2020
- Arabic keyword support
- Support for Arabic keywords is now available, for all public and premium service instances. For details, see Language support.
10 April 2020
- Custom Sentiment (Beta)
- Custom Sentiment (Beta) is now available. The Beta supports English and is available only for service instances in the Dallas location. For details, see Creating custom sentiment models.
March 2020
20 March 2020
- Error response format change for advanced rules model endpoints (Beta)
- Changed the error response format for advanced rules model endpoints (Beta) to be consistent with other endpoints.
-
Renamed
status
tocode
. -
Renamed
message
toerror
.- Previous format:
{ "status": 404, "message": "not found" }
- New format:
{ "code": 404, "error": "not found" }
11 March 2020
- Enhanced sentiment analysis for English
- Enhanced English sentiment analysis to better identify and understand idioms in text.
December 2019
12 December 2019
- Full support for IBM Cloud IAM
- Natural Language Understanding now supports the full implementation of IBM Cloud Identity and Access Management (IAM). API keys for Watson services are no longer limited to a single service instance. You can create access policies and API keys that apply to more than one service, and you can grant access between services.
To support this change, the API service endpoints use a different domain and include the service instance ID.
-
The pattern is
api.{location}.{offering}.watson.cloud.ibm.com/instances/{instance_id}
. -
Example URL for an instance hosted in the Dallas location:
api.us-south.natural-language-understanding.watson.cloud.ibm.com/instances/6bbda3b3-d572-45e1-8c54-22d6ed9e52c2
-
The previous public endpoint domain was
watsonplatform.net
.
For more information about the URLs, see the API reference.
These URLs do not introduce a breaking change. The new URLs work both for your existing service instances and for new instances. The original URLs continue to work on your existing service instances for at least one year (until December 2020).
For more information about IAM, see Authenticating to Watson services.
- New network and data security features
-
Support for data encryption with customer-managed keys Users of new premium and dedicated instances can integrate IBM® Key Protect for IBM Cloud® with Natural Language Understanding to encrypt their data and manage encryption keys. For more information, see Protecting sensitive information in your Watson service.
-
Support for private network endpoints Users of Premium plans can create private network endpoints to connect to Natural Language Understanding over a private network. Connections to private network endpoints do not require public internet access. For more information, see Public and private network endpoints.
11 December 2019
- Custom categories (experimental) to be retired
- The custom categories experimental feature will be retired on 19 December 2019. On that date, deployed custom categories models will no longer be accessible in Natural Language Understanding. The feature will be removed from Knowledge Studio on an earlier date. Custom categories models will no longer be accessible in Knowledge Studio on 17 December 2019.
November 2019
25 November 2019
- Advanced rules (Beta)
- Analyze text with custom advanced rules models that you create in IBM Watson® Knowledge Studio.
13 November 2019
- New South Korea location
- You can now create Natural Language Understanding instances in the Seoul location. As with other locations, the IBM Cloud Seoul location uses token-based Identity and Access Management (IAM) authentication.
10 October 2019
- Improved Sentiment response times
- Improved response times for Japanese and Korean sentiment requests.
September 2019
17 September 2019
- Added Keyword and Sentiment support for Dutch and Chinese (Simplified)
- Added support for Dutch keywords and sentiment. Added support for Chinese (Simplified) keywords and sentiment.
5 September 2019
- Improved Sentiment support for Spanish and Portuguese
- Improved accuracy for Spanish and Portuguese sentiment. Improved response times for Portuguese sentiment requests.
29 August 2019
- Improved Sentiment support for English
- Improved accuracy for English sentiment.
12 July 2019
- The following changes are activated when you use the version date
2019-07-12
or later. - New English entities model with improved accuracy.
- Confidence scores are returned in English entities results.
June 2019
21 June 2019
- Confidence score returned for entities
- Entity confidence score is now returned for entities requests that use custom machine learning models.
4 June 2019
- The following changes are activated when you use the version date
2019-06-04
or later. - Fixed a bug that caused entities requests with custom models to ignore the
limit
option. - The default
limit
value for all entities requests is now 50 for all models. - The maximum
limit
value of 250 entities has been removed.
May 2019
29 May 2019
- Sentiment improved for Spanish
- Improved Spanish sentiment.
- Bug fix for special characters
- Fixed a bug that may have affected targeted sentiment results if the target contained special characters.
- Service instance changes
- The following changes are enabled for service instances in Washington DC, Tokyo, London, and Sydney:
- Sentiment option for entities requests that use custom models is enabled for all sentiment-supported languages except Arabic and Russian.
24 May 2019
- Keywords improved for German
- Improved German keywords.
- Sentiment updates for English to locations
- English sentiment updates released to service instances in all IBM Cloud locations other than Dallas.
3 May 2019
- Sentiment detection for German improved
- Improved German document-level sentiment detection.
25 April 2019
- Entity mentions for rule-based custom models
- Enabled entity mentions for rule-based custom models. To view mentions in your results, set the
mentions
entities option totrue
. - Explanations for English categories results
- Released explanations for English categories results. To see relevant text from the source that contributed to each result, set the
explanation
categories option totrue
.
19 March 2019
- Custom categories models created with Knowledge Studio
- Introduced experimental support for custom categories models created with Knowledge Studio.
- Keywords and concepts improved for Spanish
- Improved Spanish keywords and concepts.
10 January 2019
- Requests with no version date parameter return error
List models
andDelete model
requests that don't include a version date parameter now return400
errors instead of successful responses.- Entites performance improved
- Improved entities performance for languages other than English.
December 2018
14 December 2018
- Support improvements
- You can now create Natural Language Understanding service instances in the IBM Cloud London location.
- Added support for Portuguese relations.
- Added support for Italian relations.
- Added a
limit
parameter for categories requests that controls the number of categories returned up to a maximum of 10. - Improved accuracy for Japanese and German sentiment.
6 December 2018
- Improved accuracy for Italian
- Improved accuracy for Italian categories, keywords, entities, and categories.
November 2018
27 November 2018
- Improvements and bug fixes
- Improved quality of keywords results for English, French, Japanese, and Portuguese input.
- Keywords with different capitalizations now appear in results as the same keyword.
- The Get models method now returns additional fields that you can use to manage custom models across several deployments.
version
: user-provided version string from Knowledge Studioversion_description
: user-provided description of this version from Knowledge Studio (for example, what changed since the previous version)workspace_id
: An ID provided by Knowledge Studio that remains constant over repeated deployments from the same Knowledge Studio workspace.created
: A datetime string that indicates when the model was deployed to NLU.
16 November 2018
- New algorithm, improvements, and bug fixes
- Released new algorithms for English keywords and sentiment to improve accuracy and performance.
- Improved keywords accuracy and performance for English, French, Japanese, and Portuguese input.
- Improved Italian sentiment accuracy and performance, including better accuracy for large text samples.
- Fixed a bug that caused URL-encoded text to appear in Spanish entity disambiguation results.
- Released a new Italian entities model with the latest entity type system. You can learn about the latest type system on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to
2018-11-16
to use the new model.
9 November 2018
- Categories improvement
- Major improvement in accuracy for Categories in all supported languages.
8 November 2018
- Service instances can be created in Tokyo location
- You can now create Natural Language Understanding service instances in the IBM Cloud Tokyo location.
5 November 2018
- Support for Concepts in Italian
- Added support for Italian concepts.
30 October 2018
- IAM used for service instances
- As of 30 October 2018, new service instances created in the Germany and US South regions use Identity and Access Management (IAM) authentication.
21 September 2018
- Language support improvements
- Added support for French relations and Portuguese concepts.
- The targeted sentiment option is now supported for French and Portuguese keywords.
- Improved French keywords.
- Improved Korean sentiment.
- Improved Portuguese keywords and sentiment.
- Released a new Portuguese entities model with the latest entity type system. You can learn about the latest type system on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to
2018-09-21
to use the new model.
June 2018
26 June 2018
- Added support for Japanese entities and keywords.
- For Japanese input, the following entities are not yet supported:
- Number
- Percent
- PhoneNumber
- URL
IPv6 addresses in Japanese input are not yet detectable as IPAddress entities.
12 June 2018
- IAM used for US East region
- As of 12 June 2018, new service instances created in the US East region use Identity and Access Management (IAM) authentication.
6 June 2018
- Korean categories improvement
- Minor improvements for Korean categories results.
May 2018
29 May 2018
- IAM used in Sydney region
- As of 29 May 2018, new service instances created in the Sydney region use Identity and Access Management (IAM) authentication.
9 May 2018
- German entities improvement
- Improved German entities.
4 May 2018
- Bug fixes and performance improvements
- Added support for Japanese sentiment and semantic roles.
- Improved performance for metadata requests.
- Fixed a bug that caused
NAN
relevance scores to appear in some entities results. - Fixed a bug that returned
400
error codes in German and Korean keywords requests when500
error codes were more appropriate.
April 2018
19 April 2018
- Bug fixes and performance improvements
- Added support for Japanese relations.
- Improved German keywords.
- Fixed a bug that caused incorrect entity mention text to be returned.
- Fixed a bug that could cause poor results for targeted sentiment.
- Fixed a bug that caused the returned
analyzed_text
to include characters that were not analyzed.
5 April 2018
- Webpage improvements
- Improved webpage content fetching. If you use the
url
parameter to analyze webpages, you'll see better results, especially from webpages that use framesets and cookies. - Korean concept improvement
- Minor improvements to Korean concepts.
16 March 2018
- Language support improvements
- Added support for German categories, relations, and semantic roles.
- A new relation type system is used for German relations. To view details, see the Relation types (Version 2) page.
- Improved German keywords and sentiment.
- Added support for Japanese categories and concepts.
- Language detection improvements.
- Improved webpage cleaning.
- Improved French and German entities models. The models use a new entity type system. Check out the new entity types and subtypes on the Entity types and subtypes (Version 2) page. When your application is compatible with the new type system, change the version date parameter in your requests to
2018-03-16
to use the new model. The following are the differences between the Version 1 type system and the Version 2 type system.
- New entity types:
- Date
- Duration
- Measure
- Money
- Number*
- Percent*
- PhoneNumber*
- Ordinal
- Time
- URL*
- Removed entity types:
- Anatomy
- Award
- Broadcaster
- Company
- Crime
- Drug
- HealthCondition
- Movie
- MusicGroup
- NaturalEvent
- PrintMedia
- Quantity
- Sport
- SportingEvent
- TelevisionShow
- Vehicle
- New entity subtype:
- Quantity
* This entity type is not yet detectable in French text.
January 2018
25 January 2018
- Custom model support for Chinese (Simplified)
- Chinese (Simplified) custom model support is now available for entities and relations.
12 January 2018
- Concepts support for German
- Added support for German concepts.
15 December 2017
- Language support improvements
- Dutch custom model support is now available for entities and relations.
- French language support now includes concepts.
- The French sentiment model has been improved to deliver better results.
- The language detection model is faster and detects more languages overall. For the complete list of languages, see Detectable languages.
-
The following languages are new additions to the list:
- Belarusian (be)
- Bihari (bh)
- Dhivehi (dv)
- Galician (gl)
- Ganda (lg)
- Inuktitut (iu)
- Javanese (jv)
- Kannada (kn)
- Khmer (km)
- Kinyarwanda (rw)
- Laothian (lo)
- Malayalam (ml)
- Marathi (mr)
- Oriya (or)
- Punjabi (pa)
- Scots Gaelic (gd)
- Sinhalese (si)
- Tamil (ta)
- Telugu (te)
- Yiddish (yi)
-
The following languages are no longer detectable:
- Breton (br)
- Chamorro (ch)
- Esperanto (eo)
- Faroese (fo)
- Hausa (ha)
- Ndebele (nr)
- Ojibwa (oj)
November 2017
28 November 2017
- Entity mentions
- Get the locations of entity mentions in your
entities
requests by adding the option"mentions": true
.
-
Example
POST /analyze
request body:{ "text": "Intel planned to announce Monday a laptop-computer chip that combines an Intel processor and an AMD graphics unit.", "features": { "entities": { "mentions": true } } }
-
Example response:
{ "usage": { "text_units": 1, "text_characters": 114, "features": 1 }, "language": "en", "entities": [ { "type": "Company", "text": "Intel", "relevance": 0.33, "mentions": [ { "text": "Intel", "location": [ 0, 5 ] }, { "text": "Intel", "location": [ 73, 78 ] } ], "disambiguation": { "subtype": [ "OperatingSystemDeveloper", "ProcessorManufacturer", "SoftwareDeveloper" ], "name": "Intel", "dbpedia_resource": "http://dbpedia.org/resource/Intel" }, "count": 2 }, { "type": "Company", "text": "AMD", "relevance": 0.33, "mentions": [ { "text": "AMD", "location": [ 96, 99 ] } ], "disambiguation": { "subtype": [ "ProcessorManufacturer" ], "name": "Advanced Micro Devices", "dbpedia_resource": "http://dbpedia.org/resource/Advanced_Micro_Devices" }, "count": 1 } ] }
17 November 2017
- Korean language support improved
- Korean language support has expanded to include the following features:
- Entities
- Keywords
- Semantic Roles
30 July 2017
- Limit number of characters processed
- You can control cost by using the optional
limit_text_characters
parameter to limit the number of characters that are processed. For example,
{
"text": "The United States of America, commonly known as the United States (U.S.) or America, is a federal republic composed of 50 states, a federal district, five major self-governing territories, and various possessions. Forty-eight of the fifty states and the federal district are contiguous and located in North America between Canada and Mexico. The state of Alaska is in the northwest corner of North America, bordered by Canada to the east and across the Bering Strait from Russia to the west.",
"features": {
"entities": {}
},
"limit_text_characters": 50,
"return_analyzed_text": true
}
-
Each character counts as one character, regardless of whether the character is a single-byte character or a multibyte character.
-
For metering, one Natural Language Understanding Item continues to be one feature (also known as an enrichment) per one text unit. One text unit is 10,000 characters or less.
-
For detailed pricing information, see Natural Language Understanding in the IBM Cloud® Catalog.
In addition to adding the limit_text_characters
parameter, the following changes were made to text size limits and truncation:
-
All text greater than 50,000 characters is truncated before processing. Previously, truncation was based on kilobytes, where one kilobyte equaled 1024 bytes.
-
An informational message is returned in the response when text beyond 50,000 characters is truncated.
-
Text limit size for custom models has increased from 10,000 characters to 50,000 characters.
-
Usage information is added to the response for clarity around number of Natural Language Understanding Items used for each request. For example,
{
"usage": {
"text_units": 5,
"text_characters": 41186,
"features": 1
},
...
}
8 May 2017
- Emotion tone score updates
- Updated the emotion tone score model. The training data set was expanded and feature engineering was altered and as a result, the model has higher precision on the IBM benchmark data set.
March 2017
27 March 2017
- Model improvements
- The entity and sentiment models have been improved, which means that when you call those features, you will get better results.
- Relations now support Knowledge Studio custom models in French, German, Italian, and Portuguese.
15 March 2017
- Emotion tone score updated
- We released updates to the emotion tone score model. The training data set was expanded and as a result, the model has higher precision on the IBM benchmark data set.
27 February 2017
- Service GA
- The Natural Language Understanding service is now GA.