Visyllable Based Speech Animation

Kshirsagar, Sumedha; Magnenat-Thalmann, Nadia

Visyllable Based Speech Animation

dc.contributor.author	Kshirsagar, Sumedha	en_US
dc.contributor.author	Magnenat-Thalmann, Nadia	en_US
dc.date.accessioned	2015-02-16T08:01:30Z
dc.date.available	2015-02-16T08:01:30Z
dc.date.issued	2003	en_US
dc.description.abstract	Visemes are visual counterpart of phonemes. Traditionally, the speech animation of 3D synthetic faces involvesextraction of visemes from input speech followed by the application of co-articulation rules to generate realisticanimation. In this paper, we take a novel approach for speech animation - using visyllables, the visual counterpartof syllables. The approach results into a concatenative visyllable based speech animation system. The key contributionof this paper lies in two main areas. Firstly, we define a set of visyllable units for spoken English along withthe associated phonological rules for valid syllables. Based on these rules, we have implemented a syllabificationalgorithm that allows segmentation of a given phoneme stream into syllables and subsequently visyllables. Secondly,we have recorded the database of visyllables using a facial motion capture system. The recorded visyllableunits are post-processed semi-automatically to ensure continuity at the vowel boundaries of the visyllables. We defineeach visyllable in terms of the Facial Movement Parameters (FMP). The FMPs are obtained as a result of thestatistical analysis of the facial motion capture data. The FMPs allow a compact representation of the visyllables.Further, the FMPs also facilitate the formulation of rules for boundary matching and smoothing after concatenatingthe visyllables units. Ours is the first visyllable based speech animation system. The proposed technique iseasy to implement, effective for real-time as well as non real-time applications and results into realistic speechanimation.Categories and Subject Descriptors (according to ACM CCS): 1.3.7 [Computer Graphics]: Three-Dimensional Graphics and Realism	en_US
dc.description.number	3	en_US
dc.description.seriesinformation	Computer Graphics Forum	en_US
dc.description.volume	22	en_US
dc.identifier.doi	10.1111/1467-8659.t01-2-00711	en_US
dc.identifier.issn	1467-8659	en_US
dc.identifier.pages	631-639	en_US
dc.identifier.uri	https://doi.org/10.1111/1467-8659.t01-2-00711	en_US
dc.publisher	Blackwell Publishers, Inc and the Eurographics Association	en_US
dc.title	Visyllable Based Speech Animation	en_US

Collections

Issue 3

Visyllable Based Speech Animation

Files

Collections