Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we propose topic extraction models based on statistical relevance scores between topic words and words in articles, and report results obtained in topic extraction experiments using continuous speech recognition for Japanese broadcast news utterances. We attempt to represent a topic of news speech using a combination of multiple topic words, which are important words in the news article or words relevant to the news. We assume a topic of news is represented by a combination of words. We statistically model mapping from words in an article to topic words. Using the mapping, the topic extraction model can extract topic words even if they do not appear in the article. We train a topic extraction model capable of computing the degree of relevance between a topic word and a word in an article by using newspaper text covering a five-year period. The degree of relevance between those words is calculated based on measures such as mutual information or the X^2-method. In experiments extracting five topic words using a X^2-based model, we achieve 72% precision and 12% recall for speech recognition results. Speech recognition results generally include a number of recognition errors, which degrades topic extraction performance. To avoid this, we employ N-best candidates and likelihood given by acoustic and language models. In experiments, we find that extracting five topic words using N-best candidate and likelihood values achieves significantly improved precision.
- 社団法人電子情報通信学会の論文
- 2002-07-01
著者
-
FURUI Sadaoki
Department of Computer Science, Graduate School of Information Science and Engineering, Tokyo Instit
-
OHTSUKI Katsutoshi
NTT Cyber Space Laboratories, NTT Corporation
-
Ohtsuki Katsutoshi
Ntt Cyber Space Laboratories Ntt Corporation
-
Furui Sadaoki
Department Of Computer Science Graduate School Of Information Science And Engineering Tokyo Institut
-
MATSUNAGA Shoichi
NTT Cyber Space Laboratories, NTT Corporation
-
MATSUOKA Tatsuo
Broadband Business Department, NTT East Corporation
-
Matsuoka Tatsuo
Broadband Business Department Ntt East Corporation
-
Matsunaga Shoichi
Ntt Cyber Space Laboratories Ntt Corporation
関連論文
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Gait-based Person Identification Robust against Speed Variation using CHLAC features and HMMs
- Tree-Structured Clustering Methods for Piecewise Linear-Transformation-Based Noise Adaptation(Speech and Hearing)
- Incremental Language Modeling for Automatic Transcription of Broadcast News(Speech and Hearing)
- Robust Scene Extraction Using Multi-Stream HMMs for Baseball Broadcast(Image Processing and Video Processing)
- Automatic recognition of Indonesian declarative questions and statements using polynomial coefficients of the pitch contours
- Initial evaluation of the drivers' Japanese speech corpus in a car environment (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Accent analysis for Mandarin large vocabulary continuous speech recognition (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology")
- Evaluation of a Noise-Robust Multi-Stream Speaker Verification Method Using F_0 Information
- Improved Phoneme-History-Dependent Search Method for Large-Vocabulary Continuous-Speech Recognition
- Topic Extraction Based on Continuous Speech Recognition in Broadcast News Speech
- Noise Robust Speech Recognition Using F_0 Contour Information(Speech Dynamics by Ear, Eye, Mouth and Machine)
- Phonology and Morphology Modeling in a Very Large Vocabulary Hungarian Dictation System(Speech and Hearing)
- 連続発話認識のための言語モデル
- Dynamic Bayesian Network-Based Acoustic Models Incorporating Speaking Rate Effects(Speech and Hearing)
- Neural-network-based HMM adaptation for noisy speech recognition
- Speaker Verification Using MMAP Adaptation (言語理解とコミュニケーション)
- Speaker Verification Using MMAP Adaptation (音声)
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Two-pass Approach for Recognizing Code-Switching Speech
- Two-pass Approach for Recognizing Code-Switching Speech
- Two-pass Approach for Recognizing Code-Switching Speech
- Subject Adaptation and Adaptive Training for Gait-based Person Identification
- Speaker Verification Using MMAP Adaptation
- Two-pass Approach for Recognizing Code-Switching Speech