Sample our English lexical datasets

At Oxford Languages, we are the leading provider of lexical datasets. Our data can be used for a range of purposes, including:

Training data: Lexical data to help train your AI and NLP processes.

Dictionary display: Find the definition of a word within your product to contain user experience.

Advanced feature support: Such as confirming whether your users have correctly spelled and/or used a word.

We have a range of structured lexical datasets that support a wide variety of use cases.

Monolingual	Ideal for dictionary look up and display.
Bilingual	Ideal for translation.
Bilingualized	Ideal for language learners.
Thesaurus	Ideal for synonyms suggestions and NLP.
Pronunciations	Ideal for demonstrating the pronunciation of a word.
Sentences	Ideal for understanding how a word is used in context.