A Robust Speech Communication into Smart Info-Media System
スポンサーリンク
概要
- 論文の詳細を見る
This paper introduces our developed noise robust speech communication techniques and describes its implementation to a smart info-media system, i.e., a small robot. Our designed speech communication system consists of automatic speech detection, recognition, and rejection. By using automatic speech detection and recognition, an observed speech waveform can be recognized without a manual trigger. In addition, using speech rejection, this system only accepts registered speech phrases and rejects any other words. In other words, although an arbitrary input speech waveform can be fed into this system and recognized, the system responds only to the registered speech phrases. The developed noise robust speech processing can reduce various noises in many environments. In addition to the design of noise robust speech recognition, the LSI design of this system has been introduced. By using the design of speech recognition application specific IC (ASIC), we can simultaneously realize low power consumption and real-time processing. This paper describes the LSI architecture of this system and its performances in some field experiments. In terms of current speech recognition accuracy, the system can realize 85-99% under 0-20dB SNR and echo environments.
著者
-
Miyanaga Yoshikazu
Graduate School Of Engineering Hokkaido University
-
YOSHIZAWA Shingo
Department of Electrical and Electronic Engineering, Kitami Institute of Technology
-
TAKAHASHI Wataru
Graduate School of Information Science and Technology, Hokkaido University
関連論文
- A-20-12 Data Frame Format for OFDM System with Variable FFT Point of Data
- Robust Speech Spectra Restoration against Unspecific Noise Conditions for Pitch Detection
- A-20-9 A Study of Phase and Distance Histogram Compensation for OFDM Blind Modulation Detection in Adaptive OFDM Communication
- Performance evaluation of quasi-cyclic LDPC codes for IEEE802.11n based MIMO-OFDM systems (スマートインフォメディアシステム)
- Tunable Wordlength Architecture for a Low Power Wireless OFDM Demodulator(VLSI Design Technology and CAD)
- VLSI Implementation of a Complete Pipeline MMSE Detector for a 4 × 4 MIMO-OFDM Receiver
- A Flexible Architecture for Digital Signal Processing(VLSI System)
- Noise-Robust Speech Analysis Using Running Spectrum Filtering(Speech and Hearing)
- Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition(Speech and Hearing)
- Acoustic Analysis of Vocal Tract Using Auto-Mesh Generation of Finite Element Modeling(Digital Signal Processing)
- High-Speed Finite Element Computation in 3-D Acoustical Analysis of Vocal Tract
- VLSI Implementation of a Scalable Pipeline MMSE MIMO Detector for a 4 x 4 MIMO-OFDM Receiver
- W-04 NEXUS-the Next Generation e-Learning System-and FPGA Hardware Design Platform(International Session)
- Performance and Complexity of MIMO Detectors for Advanced Wireless Communications Systems
- Connectivity Modeling Analysis in Flight-Path Based Aviation Ad Hoc Networks
- New Error Resilience Technique Using Adaptive FMO and Intra Refresh for H.264 Video Transmission
- Design of Area- and Power-Efficient Pipeline FFT Processors for 8x8 MIMO-OFDM Systems
- Development and Outdoor Evaluation of an Experimental Platform in an 80-MHz Bandwidth 2×2 MIMO-OFDM System in 5.2-GHz Band
- A Noise-Robust Continuous Speech Recognition System Using Block-Based Dynamic Range Adjustment
- A Dynamically Reconfigurable FPGA-Based Pattern Matching Hardware for Subclasses of Regular Expressions
- A Dynamically Reconfigurable FPGA-Based Pattern Matching Hardware for Subclasses of Regular Expressions
- A Low Power Tone Recognition for Automatic Tonal Speech Recognizer
- Low-Power Dynamic MIMO Detection for a 4×4 MIMO-OFDM Receiver
- A VLSI Design of a Tomlinson-Harashima Precoder for MU-MIMO Systems Using Arrayed Pipelined Processing
- A Robust Speech Communication into Smart Info-Media System
- Efficiency Improvement in Dynamic Time Warping Algorithms for Isolated Word Recognition