IBM Cloud Docs
Detectable languages

Detectable languages

When you analyze text or a web page, Natural Language Understanding detects the source language automatically and returns the corresponding ISO 639-1 code in the response. For automatic language detection to work best, it is recommended that you use text with at least 100 characters.

If automatic language detection isn't working well for your use case, you can manually specify the language of your content in each request.

The following table lists the detectable languages and corresponding ISO 639-1 codes.

Language ISO 639-1 code
Afrikaans af
Albanian sq
Amharic am
Arabic ar
Armenian hy
Azerbaijani az
Basque eu
Belarusian be
Bengali bn
Bihari bh
Bulgarian bg
Catalan ca
Chinese zh
Croatian hr
Czech cs
Danish da
Dhivehi dv
Dutch nl
English en
Estonian et
Fijian fj
Finnish fi
French fr
Galician gl
Ganda lg
Georgian ka
German de
Greek el
Gujarati gu
Haitian Creole ht
Hebrew he
Hindi hi
Hungarian hu
Icelandic is
Indonesian id
Inuktitut iu
Irish ga
Italian it
Javanese jv
Japanese ja
Kannada kn
Khmer km
Kinyarwanda rw
Kirghiz ky
Korean ko
Latin la
Laothian lo
Latvian lv
Lithuanian lt
Macedonian mk
Malay ms
Malayalam ml
Maltese mt
Maori mi
Marathi mr
Nepali ne
Norwegian no
Oriya or
Persian fa
Polish pl
Portuguese pt
Punjabi pa
Pashto ps
Romanian ro
Russian ru
Scots Gaelic gd
Serbian sr
Shona sn
Sinhalese si
Slovak sk
Slovenian sl
Spanish es
Swahili sw
Swedish sv
Tagalog tl
Tamil ta
Telugu te
Thai th
Turkish tr
Ukrainian uk
Urdu ur
Vietnamese vi
Welsh cy
Wolof wo
Xhosa xh
Yiddish Yi