@m-toman

XIMERA: A New TTS from ATR Based on Corpus-Based Technologies

, , , , and . Proceedings of the 5th ISCA Workshop on Speech Synthesis (SSW), page 179-184. Pittsburgh, PA, USA, (June 2004)

Abstract

This paper describes a new concatenative TTS system under development at ATR. The system, named XIMERA, is based on corpus-based technologies, as was the case for the preceding TTS systems from ATR, namely ν-talk and CHATR. The prominent features of XIMERA are (1) large corpora (a 110-hours corpus of a Japanese male, a 60-hours corpus of a Japanese female, and a 20-hours corpus of a Chinese female), (2) HMM-based generation of prosodic parameters, and (3) a cost function for segment selection optimized based on perceptual experiments. A perception test that evaluated the naturalness of synthetic speech for XIMERA and 10 TTS products, including CHATR, showed that XIMERA outperformed the other ten.

Links and resources

Tags

community

  • @m-toman
  • @dblp
@m-toman's tags highlighted