Pronunciation data for text-to-speech


Text-to-speech is an assistive technology that reads text aloud, and it can be enhanced with Oxford Languages' fully mapped audio and IPA transcriptions. Our data has been carefully compiled by our in-house team of language experts as an output of our language research programme, one of the largest in the world.

 

This dataset is aligned to the words and inflections found in the Oxford Dictionary of English and the New Oxford American Dictionary. Its unparalleled coverage and accuracy ensure higher-quality output from speech synthesizers, and with the data features listed below we are confident your text-to-speech application will produce more accurate speech.

Data features


 

Our datasets contain features that enable the most accurate and comprehensive text-to-speech applications:

  • Over 500,000 transcriptions, with over 200,000 each for British and American English
  • Syllabified and non-syllabified IPA (International Phonetic Alphabet) transcriptions for each wordform
  • Variant spellings of each word (# is used as a separator)
  • Variety of English (British or American)
  • Pronunciation group identifier, a unique identifier for each pronunciation group. Pronunciations that share an identifier can be used interchangeably, e.g. engross /ɪnˈɡroʊs/ /ɛnˈɡroʊs/
  • Parts of speech (# is used as a separator), to aid disambiguation where the correct pronunciation is not clear from the spelling
  • Sense-level caution, a further warning that the pronunciation may vary according to sense (i.e. the correct pronunciation is not made clear by the spelling and the given parts of speech)
  • Sound file mappings to the appropriate audio file
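
To make the features above more concrete, here is a minimal Python sketch of how an application might read such records. It is an illustration under stated assumptions, not a description of the delivered file format: the field names (`wordform`, `variant_spellings`, `variety`, `ipa`, `group_id`, `parts_of_speech`), the row layout, and the helper functions are all hypothetical. Only the `#` separator, the shared pronunciation group identifier, and the two engross transcriptions come from the list above; the remaining sample values are placeholders.

```python
from __future__ import annotations

from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Pronunciation:
    # Field names are hypothetical; they mirror the features listed above.
    wordform: str
    variant_spellings: tuple[str, ...]
    variety: str                      # "British" or "American"
    ipa: str                          # non-syllabified transcription
    group_id: str                     # pronunciation group identifier
    parts_of_speech: tuple[str, ...]


def split_hash(value: str) -> tuple[str, ...]:
    """Split a '#'-separated field, dropping empty entries."""
    return tuple(part.strip() for part in value.split("#") if part.strip())


def parse_row(row: dict[str, str]) -> Pronunciation:
    """Build a record from a row of strings (assumed layout)."""
    return Pronunciation(
        wordform=row["wordform"],
        variant_spellings=split_hash(row["variant_spellings"]),
        variety=row["variety"],
        ipa=row["ipa"],
        group_id=row["group_id"],
        parts_of_speech=split_hash(row["parts_of_speech"]),
    )


def select(records, wordform, part_of_speech=None):
    """Return records matching a spelling, optionally narrowed by part of speech."""
    matches = []
    for record in records:
        spellings = (record.wordform, *record.variant_spellings)
        if wordform not in spellings:
            continue
        if part_of_speech is not None and part_of_speech not in record.parts_of_speech:
            continue
        matches.append(record)
    return matches


def group_interchangeable(records):
    """Map each pronunciation group identifier to its interchangeable transcriptions."""
    groups = defaultdict(list)
    for record in records:
        groups[record.group_id].append(record.ipa)
    return dict(groups)


# Placeholder rows: only the two IPA strings are taken from the text above.
SAMPLE_ROWS = [
    {"wordform": "engross", "variant_spellings": "", "variety": "American",
     "ipa": "ɪnˈɡroʊs", "group_id": "g1", "parts_of_speech": "verb"},
    {"wordform": "engross", "variant_spellings": "", "variety": "American",
     "ipa": "ɛnˈɡroʊs", "group_id": "g1", "parts_of_speech": "verb"},
]

if __name__ == "__main__":
    records = [parse_row(row) for row in SAMPLE_ROWS]
    print(group_interchangeable(select(records, "engross", "verb")))
```

Grouping by identifier lets a synthesizer treat the two engross transcriptions as equally valid choices, while the optional part-of-speech filter reflects the disambiguation role described above.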

 

An example of what's in our data, focusing on the word friendliness:

Our British English audio pronunciation

Our American English audio pronunciation

Get in touch to arrange a meeting with one of our experts and explore how our pronunciation data could enhance your products.


Our Privacy Policy sets out how Oxford University Press handles your personal information, and your rights to object to your personal information being used for marketing to you or being processed as part of our business activities.