What is a dictionary dataset?

What is a dictionary dataset?

 

At Oxford Languages we provide lexical and language datasets for a wide range of technologies and applications. We offer dictionary data in more than 60 languages, and these are made up of a number of different components.

Types of dictionary datasets


 

There are many different types of dictionaries. The three main types are monolingual, bilingual, and semi-bilingual. There are also thesauruses, which are not dictionaries but are closely related.

 

 

Monolingual dictionaries

 

A monolingual dictionary gives definitions of words in a single language. The main categories within monolingual dictionaries are:

  • Current dictionaries, which give the current meanings of a word. These are what most people think of when they hear the word ‘dictionary’.
  • Learner’s dictionaries, which give definitions in simple language, and are most suitable for learners of the language.
  • Historical dictionaries, which list all meanings a word has ever had, from the first to the most recent, e.g. the Oxford English Dictionary

 

 

Bilingual dictionaries

 

A bilingual dictionary gives the translation of the headword in a different language than the headword. The language of the headword is called the source language, the language of the translation the target language. A bilingual dictionary can be monodirectional, where it only translates from one source language into the target language, or bidirectional, which translates from ‘language A’ into ‘language B’ and from ‘language B’ into ‘language A.’

 

 

Semi-bilingual dictionaries

 

A semi-bilingual dictionary is a hybrid dictionary that combines monolingual and bilingual information. In a semi-bilingual dictionary, a headword will be presented with definitions and examples in the language the user is learning to facilitate comprehension, followed by a translation equivalent of the headword and/or the definition to reassure the user that they have understood the meaning of the word. Suitable for language learners with different levels of dictionary usage skills.

 

 

Thesauruses

 

Thesauruses give lists of synonyms and near-synonyms for words. Some also list antonyms.

 

 

Types of data in dictionaries

Data as seen in a monolingual dictionary, from Oxford Dictionaries Premium.
 
See below for more details.

Data as seen in a bilingual dictionary, from Oxford Dictionaries Premium.

Entry:One word in the dictionary with all its information: senses, translations/definitions, examples, etc.
Headword:A headword is a word as you would look it up in a dictionary, e.g. the word cat, but not the word cats. All other features, like examples, phrasal verbs, idioms, etc. are listed under the headword. A headword can be a single word, but also e.g. an abbreviation (ETA), or a compound word (cat flap).
Definition: 
monolingual and semi-bilingual dictionaries only
Explains what the headword means, in the same language as the headword.
Translation: 
bilingual and semi-bilingual dictionaries only
The translation of the headword. This is the equivalent of the headword that would be used in the same position in the sentence with the same function. Sometimes a one-word headword can have a translation that consists of more than one word, e.g. English advisedly has the Spanish translation con conocimiento de causa, or vice versa, e.g. English in any case has the German translation jedenfalls.
Part of speech:Tells you whether a word is a noun, verb, etc.
Pronunciation:Tells you how a word is pronounced, either by IPA, respell, transliteration, or audio.
Sense:Many words have more than one meaning, e.g. table as furniture or table as the periodic table. These different meanings are called senses, and they are separated and each given their own definition or translation.
Indicator:  
bilingual dictionaries only
Gives a short description of each sense. Indicators help the user find the sense of the headword they want to translate and guide them to the right translation.
Label:Gives extra information on word or sense. There are three types of label:

  • Register label: Tells you whether a certain word or sense is formal, vulgar, offensive, etc. These labels are especially important for identifying sensitive content.

  • Region label: Tells you whether a certain word or sense is only found in a particular area, e.g. US or British English.

  • Domain label: Tells you that a certain word or sense is particularly used in a specific field, e.g. Medicine or Law.

Gloss:  
bilingual dictionaries only
If there is no translation for a particular word or sense, a short description can be given in the target language, so speakers of that language still can understand what the word means; e.g. there is no equivalent of the Arabic word musahharati in English, so the Arabic-English dictionary gives person who walks the streets during Ramadan to wake up people for the morning meal, typically using a drum and calling. This is like a definition, but it is in the target language.
Glossarial note:  bilingual dictionaries onlyGives extra information on the translation; if the translation is less specific than the headword; e.g. harvest moon: pleine lune (de l’équinoxe d’automne).
Example:An example shows the headword inside a sentence or phrase. Usually examples in monolingual dictionaries show a typical use of the headword. Examples in bilingual dictionaries may do the same, or show a use that is particularly noteworthy for learners or translators. In bilingual (and some semi-bilingual) dictionaries, examples usually have translations.
Idiom/phrase:An idiom is a phrase whose meaning can’t be derived from its individual words, e.g. over the moon, upset the apple cart, it’s raining cats and dogs.
Phrasal verb:A phrasal verb is a verb used with an adverb or preposition carrying a specific meaning, e.g. break down, see to, look down on.
Etymology:An etymology explains the historical origin of the word. Depending on the style of the dictionary, an etymology can be as simple as stating the language the word came from or may give an extensive narrative description.
Synonyms:Synonyms are different words which mean the same thing as a headword or sense. Some dictionaries give synonyms as well as translations or definitions.
Antonyms:Antonyms are words which mean the opposite of a headword or sense. Some dictionaries give antonyms, and some thesauruses do.