Personalized Visual Dubbing through Virtual Dubber and Full Head Reenactment

Date
2025
Publisher
The Eurographics Association
Abstract
Visual dubbing aims to modify facial expressions to "lip-sync" a new audio track. While person-generic talking-head generation methods achieve expressive lip synchronization across arbitrary identities, they usually lack person-specific details and fail to generate high-quality results. Conversely, person-specific methods require extensive training. Our method combines the strengths of both approaches by incorporating a virtual dubber, a person-generic talking head, as an intermediate representation. We then employ an autoencoder-based person-specific identity-swapping network to transfer the actor's identity, enabling full-head reenactment that includes hair, face, ears, and neck. This eliminates artifacts while ensuring temporal consistency. Our quantitative and qualitative evaluations demonstrate that our method achieves a superior balance between lip-sync accuracy and realistic facial reenactment.
Description

CCS Concepts: Computing methodologies → Image manipulation; Animation

        
@inproceedings{10.2312:egs.20251034,
  booktitle = {Eurographics 2025 - Short Papers},
  editor    = {Ceylan, Duygu and Li, Tzu-Mao},
  title     = {{Personalized Visual Dubbing through Virtual Dubber and Full Head Reenactment}},
  author    = {Jeon, Bobae and Paquette, Eric and Mudur, Sudhir and Popa, Tiberiu},
  year      = {2025},
  publisher = {The Eurographics Association},
  ISSN      = {1017-4656},
  ISBN      = {978-3-03868-268-4},
  DOI       = {10.2312/egs.20251034}
}