Top Qs
Timeline
Chat
Perspective
Viseme
Any of several speech sounds that look the same, for example when lip reading From Wikipedia, the free encyclopedia
Remove ads
A viseme is any of several speech sounds that look the same, for example when lip reading.[1]
![]() | This article includes a list of general references, but it lacks sufficient corresponding inline citations. (January 2023) |

Visemes and phonemes do not share a one-to-one correspondence. Often several phonemes correspond to a single viseme, as several phonemes look the same on the face when produced, such as /k, ɡ, ŋ/; as well as /t, d, n, l/ and /p, b, m/). Thus words such as pet, bell, and men are difficult for lip-readers to distinguish, as all look like alike. On one account, visemes offer (phonetic) information about place of articulation, while manner of articulation requires auditory input.[2]
However, there may be differences in timing and duration during natural speech in terms of the visual "signature" of a given gesture that cannot be captured by simply concatenating (stilled) images of each of the mouth patterns in sequence.[3] Conversely, some sounds which are hard to distinguish acoustically are clearly distinguished by the face. For example, in spoken English /l/ and /r/ can often sound quite similar (especially in clusters, such as 'grass' vs. 'glass'), yet the visual information can disambiguate. Some linguists have argued that speech is best understood as bimodal (aural and visual), and comprehension can be compromised if one of these two domains is absent.[4]
Visemes can often be humorous, as in the phrase "elephant juice", which when lip-read appears identical to "I love you".
Applications for the study of visemes include speech processing, speech recognition, and computer facial animation.
Remove ads
See also
References
Further reading
Wikiwand - on
Seamless Wikipedia browsing. On steroids.
Remove ads