@m-toman

Training a talking head

Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI), pages 499-504. Pittsburgh, PA, USA (October 2002)
DOI: 10.1109/ICMI.2002.1167046

Abstract

A Cyberware laser scan of DWM was made, Baldi's generic morphology was mapped into the form of DWM, this head was trained on real data recorded with Optotrak LED markers, and the quality of its speech was evaluated. Participants were asked to recognize auditory sentences presented in noise either alone, aligned with the newly trained synthetic texture-mapped target face, or aligned with the original natural face. There was a significant advantage when the noisy auditory sentence was paired with either head; the synthetic texture-mapped target face gave as much of an improvement as the original recordings of the natural face.
