VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track

Garrido, Pablo; Valgaerts, Levi; Sarmadi, Hamid; Steiner, Ingmar; Varanasi, Kiran; Perez, Patrick; Theobalt, Christian

VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track

dc.contributor.author	Garrido, Pablo	en_US
dc.contributor.author	Valgaerts, Levi	en_US
dc.contributor.author	Sarmadi, Hamid	en_US
dc.contributor.author	Steiner, Ingmar	en_US
dc.contributor.author	Varanasi, Kiran	en_US
dc.contributor.author	Perez, Patrick	en_US
dc.contributor.author	Theobalt, Christian	en_US
dc.contributor.editor	Olga Sorkine-Hornung and Michael Wimmer	en_US
dc.date.accessioned	2015-04-16T07:43:56Z
dc.date.available	2015-04-16T07:43:56Z
dc.date.issued	2015	en_US
dc.description.abstract	In many countries, foreign movies and TV productions are dubbed, i.e., the original voice of an actor is replaced with a translation that is spoken by a dubbing actor in the country's own language. Dubbing is a complex process that requires specific translations and accurately timed recitations such that the new audio at least coarsely adheres to the mouth motion in the video. However, since the sequence of phonemes and visemes in the original and the dubbing language are different, the video-to-audio match is never perfect, which is a major source of visual discomfort. In this paper, we propose a system to alter the mouth motion of an actor in a video, so that it matches the new audio track. Our paper builds on high-quality monocular capture of 3D facial performance, lighting and albedo of the dubbing and target actors, and uses audio analysis in combination with a space-time retrieval method to synthesize a new photo-realistically rendered and highly detailed 3D shape model of the mouth region to replace the target performance. We demonstrate plausible visual quality of our results compared to footage that has been professionally dubbed in the traditional way, both qualitatively and through a user study.	en_US
dc.description.number	2	en_US
dc.description.sectionheaders	All About Faces	en_US
dc.description.seriesinformation	Computer Graphics Forum	en_US
dc.description.volume	34	en_US
dc.identifier.doi	10.1111/cgf.12552	en_US
dc.identifier.pages	193-204	en_US
dc.identifier.uri	https://doi.org/10.1111/cgf.12552	en_US
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.title	VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track	en_US

Collections

34-Issue 2
EG 2015 - Full Papers - CGF 34-Issue 2

VDub: Modifying Face Video of Actors for Plausible Visual Alignment to a Dubbed Audio Track

Files

Collections