Reactive Control of Expressive Speech Synthesis Using Kinect Skeleton Tracking
スポンサーリンク
概要
- 論文の詳細を見る
Naturally expressive speech is important for an increasing number of real world speech synthesis appli-cations including augmentative and alternative communication aids and entertainment based applications. One of the important challenges facing speech synthesis development today is how to produce reactive expressive speech, that is speech where various aspects of the way in which speech is said can be controlled in real-time as the speech is produced. This is both a challenge in terms of the adaptability and latency of speech synthesis systems and in terms of how to provide a control mechanism for different situations. To explore these issues and generally raise awareness of these issues we present a reactive speech synthesiser where pitch and duration are controlled by hand movement via the skeleton tracking of a Microsoft Kinect sensor. We see that the manipulation of pitch and duration in realtime is possible (and fun), but it is difficult to produce meaningful expressiveness without an underlying model to allow a high-level representation of expressiveness to be used.
- 一般社団法人電子情報通信学会の論文
- 2012-12-13
著者
-
Yamagishi Junichi
The Centre For Speech Technol. Res. Univ. Of Edinburgh
-
CLARK ROBERT
The Centre for Speech Technology Research University of Edinburgh Informatics Forum 10 Crichton Street EDINBURGH EH89AB United Kingdom
-
KONKIEWICZ MAGDALENA
The Centre for Speech Technology Research University of Edinburgh Informatics Forum 10 Crichton Street EDINBURGH EH89AB United Kingdom
-
ASTRINAK MARIA
Facult Polytechnique de Mons(FPMs)Department of Electrical Engineering TCTS Lab 31 University of Mons Boulevard Dolez B-7000 Mons, Belgium
-
YAMAGISHI JUNICHI
The Centre for Speech Technology Research University of Edinburgh Informatics Forum 10 Crichton Street EDINBURGH EH89AB United Kingdom
関連論文
- Unsupervised speaker adaptation for speech-to-speech translation system (言語理解とコミュニケーション)
- Speech synthesis technologies for individuals with vocal disabilities: Voice banking and reconstruction
- Reactive Control of Expressive Speech Synthesis Using Kinect Skeleton Tracking