リアルタイムニュース字幕修正作業のための音声認識誤り自動検出法(<小特集>ヒューマンインフォメーション)

概要

論文の詳細を見る
A speech recognition system with a manual error correction has been developed to produce closed captions in live broadcasting programs such as news programs. Speech recognition errors, however, are not corrected completely because the correctors often miss errors due to lack of care or successive erroneous words. In this paper, we propose a method that detects errors automatically to assist manual correction. Acoustic parameters were extracted from both correct and erroneous results of all morphemes produced by speech recognition systems. Templates which can precisely distinguish between errors and correct results, were then constructed by genetic algorithms (GA) and discriminative training. Consequently, the errors are detected by comparing the acoustic parameters of an unknown recognition result with the templates. Experiments have confirmed that presenting erroneous words by using the proposed method is effective for improving corrector error detection.
社団法人映像情報メディア学会の論文
2003-12-01