Detectable languages
When you analyze text or a web page, Natural Language Understanding detects the source language automatically and returns the corresponding ISO 639-1 code in the response. For automatic language detection to work best, it is recommended that you use text with at least 100 characters.
If automatic language detection isn't working well for your use case, you can manually specify the language of your content in each request.
The following table lists the detectable languages and corresponding ISO 639-1 codes.
Language | ISO 639-1 code |
---|---|
Afrikaans | af |
Albanian | sq |
Amharic | am |
Arabic | ar |
Armenian | hy |
Azerbaijani | az |
Basque | eu |
Belarusian | be |
Bengali | bn |
Bihari | bh |
Bulgarian | bg |
Catalan | ca |
Chinese | zh |
Croatian | hr |
Czech | cs |
Danish | da |
Dhivehi | dv |
Dutch | nl |
English | en |
Estonian | et |
Fijian | fj |
Finnish | fi |
French | fr |
Galician | gl |
Ganda | lg |
Georgian | ka |
German | de |
Greek | el |
Gujarati | gu |
Haitian Creole | ht |
Hebrew | he |
Hindi | hi |
Hungarian | hu |
Icelandic | is |
Indonesian | id |
Inuktitut | iu |
Irish | ga |
Italian | it |
Javanese | jv |
Japanese | ja |
Kannada | kn |
Khmer | km |
Kinyarwanda | rw |
Kirghiz | ky |
Korean | ko |
Latin | la |
Laothian | lo |
Latvian | lv |
Lithuanian | lt |
Macedonian | mk |
Malay | ms |
Malayalam | ml |
Maltese | mt |
Maori | mi |
Marathi | mr |
Nepali | ne |
Norwegian | no |
Oriya | or |
Persian | fa |
Polish | pl |
Portuguese | pt |
Punjabi | pa |
Pashto | ps |
Romanian | ro |
Russian | ru |
Scots Gaelic | gd |
Serbian | sr |
Shona | sn |
Sinhalese | si |
Slovak | sk |
Slovenian | sl |
Spanish | es |
Swahili | sw |
Swedish | sv |
Tagalog | tl |
Tamil | ta |
Telugu | te |
Thai | th |
Turkish | tr |
Ukrainian | uk |
Urdu | ur |
Vietnamese | vi |
Welsh | cy |
Wolof | wo |
Xhosa | xh |
Yiddish | Yi |