Available languages
We offer flexible, curated datasets for 54 of the world’s major languages in off-the-shelf packages and bespoke bundles tailored to your individual requirements.
Get in touch for more information on our available languages and to discuss how our language datasets can enhance your products.
Languages | Monolingual | Bilingual | Bilingualized | Wordlist | Corpus output |
---|---|---|---|---|---|
Afrikaans | x | x | |||
Amharic | x | ||||
Arabic | x | x | x | x | |
Bengali | x | x | |||
Catalan | x | ||||
Chinese (simplified) | x | x | |||
Chinese (traditional) | x | x | |||
Czech | x | ||||
Danish | x | ||||
Dutch | x | x | |||
English (Aus) | x | ||||
English (CA) | x | ||||
English (NZ) | x | ||||
English (UK) | x | ||||
English (US) | x | ||||
French | x | x | x | x | |
German | x | x | x | ||
Greek | x | ||||
Hebrew | x | x | |||
Hindi | x | x | x | x | |
Hungarian | x | ||||
Igbo | x | x | |||
Indonesian | x | x | x | ||
Italian | x | x | |||
Japanese | x | x | |||
Kannada | x | ||||
Korean | x | x | |||
Latin | x | ||||
Latvian | x | x | |||
Malay | x | x | |||
Malayalam | x | ||||
Marathi | x | x | |||
Northern Sotho | x | ||||
Norwegian | x | x | |||
Persian | x | x | |||
Polish | x | x | |||
Portuguese | x | x | |||
Portuguese (Brazilian) | x | x | x | x | |
Punjabi | x | x | |||
Romanian | x | x | x | ||
Russian | x | x | x | ||
Scottish Gaelic | x | ||||
Serbian | x | ||||
Spanish | x | x | x | ||
Swahili | x | ||||
Swedish | x | x | |||
Tamil | x | x | x | ||
Telugu | x | x | |||
Thai | x | x | x | x | |
Tok Pisin | x | ||||
Turkish | x | x | x | ||
Urdu | x | ||||
Vietnamese | x | x | |||
Xhosa | x | ||||
Zulu | x |
Want to know more about our available languages?