How can our language datasets enhance your products?

Available languages

 

We offer flexible, curated datasets for 54 of the world’s major languages in off-the-shelf packages and bespoke bundles tailored to your individual requirements.

 

Get in touch for more information on our available languages and to discuss how our language datasets can enhance your products.

LanguagesMonolingualBilingualBilingualizedWordlistCorpus output
Afrikaansxx
Amharicx
Arabicxxx
Bengalixxx
Catalanx
Chinese (simplified)xx
Chinese (traditional)xx
Czechx
Danishxxx
Dutchxx
English (Aus)x
English (CA)x
English (NZ)x
English (UK)xx
English (US)x
Finnishx
Frenchxxxx
Germanx
Greekx
Hebrewxxxx
Hindixxxx
Hungarianx
Igbox
Indonesianxx
Italianxx
Japanesex
Kannadax
Koreanxx
Latinx
Latvianxx
Malayx
Malayalamxx
Marathixx
Northern Sothox
Norwegianxx
Persianx
Polishx
Portuguesexxx
Portuguese (Brazilian)x
Punjabixxx
Romanianxx
Russianxxxx
Scottish Gaelicx
Serbianxx
Spanishxx
Swahilix
Swedishxxx
Tamilx
Telugux
Thaixxxxx
Turkishxxx
Urdux
Vietnamesexx
Xhosax
Zuluxxx

Want to know more about our available languages?