Skip to content

Latest commit

 

History

History
38 lines (25 loc) · 3.41 KB

Glossary.md

File metadata and controls

38 lines (25 loc) · 3.41 KB

Glossary


Backchannel
  Speech is an *interaction*, where parties are in a constant communication. Though speakers generally take turns such that only one speaker 'has the turn' and speaks at a time, the other, listening participants actively participate in the interaction by nodding or shaking their heads in agreement or disagreement, or by corresponding interjection such as '*Uh-oh*', '*Yeah*', and '*Huh?*'. See also [Backchannel (linguistics) / Wikipedia](https://en.wikipedia.org/wiki/Backchannel_(linguistics)).
  
Formants
  The vocal tract has acoustic resonances, which emphasise some frequency ranges while attenuating others. Such high-energy regions of the spectrum are known as formants. They are important in speech since their location in frequency (and amplitude) uniquely identify vowels. By changing the shape of the vocal tract, we can change the location of those resonances and formants, to choose the vowel we want to utter. See also [Acoustic properties of speech signals](content:acoustic-properties) and [Formant / Wikipedia](https://en.wikipedia.org/wiki/Formant).

Fundamental frequency ($F_0$)
  The vocal folds can oscillate when air flows through them and when they are appropriately tensioned. The frequency of such oscillation is known as the fundamental frequency, often abbreviated as $F_0$, and it is perceived as the pitch of a speech sound. See also [Acoustic properties of speech signals](content:acoustic-properties) and [Voice frequency / Wikipedia](https://en.wikipedia.org/wiki/Voice_frequency).

Objective test
   An evaluation methodology based on a computational algorithm. See also [Objective quality evaluation](Evaluation/Objective_quality_evaluation.md)

Perceptual model
  A model which simulates human perception. Typically used as a quality evaluation method, to judge how important different characteristics of a signal are for a human.

Phonation
  The phsyiological process of producing a speech sound is referred to phonation. In some areas, phonation is limited to voiced sounds or just those sounds with some sort of oscillation. See also [Phonation / Wikipedia](https://en.wikipedia.org/wiki/Phonation).

Phone
  Phones are the elementary units of speech, associated with articulatory gestures responsible for producing them and with acoustic cues that make them distinct from other phones. See also [Phones](content:phones) and [Phone (phonetics) / Wikipedia](https://en.wikipedia.org/wiki/Phone_(phonetics)).
  
Phoneme
  Phonemes are defined in terms of their meaning contrasting function: two different phones of a language are also different phonemes, if they can change the meaning of a word. See also [Phonemes](content:phonemes) and [Phoneme / Wikipedia](https://en.wikipedia.org/wiki/Phoneme).

Sampling rate
  The frequency at which the time-domain signal is sampled (measured). See also [Waveform/Sampling rate](content:samplingrate).
  
Subjective test
  An evaluation methodology where a human subject rates a characteristic of a system or signal. See also [Subjective quality evaluation](content:subjectiveevaluation)

Turn-taking
  Humans are generally able to follow only one speech message at a time. In a dialogue, it is therefore important that only one person is speaking at a time. The organization of a dialogue to agree on who 'has the turn' and is currently 'in turn' to speak, is known as turn-taking. See also [Turn taking / Wikipedia](https://en.wikipedia.org/wiki/Turn-taking).