A Low Power Tone Recognition for Automatic Tonal Speech Recognizer
スポンサーリンク
概要
- 論文の詳細を見る
This paper proposes a low power tone recognition suitable for automatic tonal speech recognizer (ATSR). The tone recognition estimates fundamental frequency (F0) only from vowels by using a new magnitude difference function (MDF), called vowel-MDF. Accordingly, the number of operations is considerably reduced. In order to apply the tone recognition in portable electronic equipment, the tone recognition is designed using parallel and pipeline architecture. Due to the pipeline and parallel computations, the architecture achieves high throughput and consumes low power. In addition, the architecture is able to reduce the number of input frames depending on vowels, making it more adaptable depending on the maximum number of frames. The proposed architecture is evaluated with words selected from voice activation for GPS systems, phone dialing options, and words having the same phoneme but different tones. In comparison with the autocorrelation method, the experimental results show 35.7% reduction in power consumption and 27.1% improvement of tone recognition accuracy (110 words comprising 187 syllables). In comparison with ATSR without the tone recognition, the speech recognition accuracy indicates 25.0% improvement of ATSR with tone recogntion (2,250 training data and 45 testing words).
著者
-
Chiracharit Werapon
Department Of Electronics And Telecommunication Engineering King Mongkut's University Of Techno
-
Miyanaga Yoshikazu
Graduate School Of Engineering Hokkaido University
-
CHAMNONGTHAI Kosin
Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut's University of Technology Thonburi
-
HIGUCHI Kohji
Department of Electronic Engineering, The University of Electro-Communications
-
CHAIWONGSAI Jirabhorn
Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut's University of Technology Thonburi
-
CHAIWONGSAI Jirabhorn
Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut's University of Technology Thonburi
-
CHIRACHARIT Werapon
Department of Electronic and Telecommunication Engineering, Faculty of Engineering, King Mongkut's University of Technology Thonburi
関連論文
- A-20-12 Data Frame Format for OFDM System with Variable FFT Point of Data
- Robust Speech Spectra Restoration against Unspecific Noise Conditions for Pitch Detection
- A-20-9 A Study of Phase and Distance Histogram Compensation for OFDM Blind Modulation Detection in Adaptive OFDM Communication
- Performance evaluation of quasi-cyclic LDPC codes for IEEE802.11n based MIMO-OFDM systems (スマートインフォメディアシステム)
- Tunable Wordlength Architecture for a Low Power Wireless OFDM Demodulator(VLSI Design Technology and CAD)
- VLSI Implementation of a Complete Pipeline MMSE Detector for a 4 × 4 MIMO-OFDM Receiver
- A Flexible Architecture for Digital Signal Processing(VLSI System)
- Noise-Robust Speech Analysis Using Running Spectrum Filtering(Speech and Hearing)
- Cepstral Amplitude Range Normalization for Noise Robust Speech Recognition(Speech and Hearing)
- Acoustic Analysis of Vocal Tract Using Auto-Mesh Generation of Finite Element Modeling(Digital Signal Processing)
- High-Speed Finite Element Computation in 3-D Acoustical Analysis of Vocal Tract
- VLSI Implementation of a Scalable Pipeline MMSE MIMO Detector for a 4 x 4 MIMO-OFDM Receiver
- Noninvasive Femur Bone Volume Estimation Based on X-Ray Attenuation of a Single Radiographic Image and Medical Knowledge
- W-04 NEXUS-the Next Generation e-Learning System-and FPGA Hardware Design Platform(International Session)
- Normal Mammogram Detection Based on Local Probability Difference Transforms and Support Vector Machines(Advanced Image Technology)
- Performance and Complexity of MIMO Detectors for Advanced Wireless Communications Systems
- Connectivity Modeling Analysis in Flight-Path Based Aviation Ad Hoc Networks
- New Error Resilience Technique Using Adaptive FMO and Intra Refresh for H.264 Video Transmission
- Design of Area- and Power-Efficient Pipeline FFT Processors for 8x8 MIMO-OFDM Systems
- Development and Outdoor Evaluation of an Experimental Platform in an 80-MHz Bandwidth 2×2 MIMO-OFDM System in 5.2-GHz Band
- A Noise-Robust Continuous Speech Recognition System Using Block-Based Dynamic Range Adjustment
- A Dynamically Reconfigurable FPGA-Based Pattern Matching Hardware for Subclasses of Regular Expressions
- A Dynamically Reconfigurable FPGA-Based Pattern Matching Hardware for Subclasses of Regular Expressions
- A Low Power Tone Recognition for Automatic Tonal Speech Recognizer
- Low-Power Dynamic MIMO Detection for a 4×4 MIMO-OFDM Receiver
- A VLSI Design of a Tomlinson-Harashima Precoder for MU-MIMO Systems Using Arrayed Pipelined Processing
- A Robust Speech Communication into Smart Info-Media System
- A Single Tooth Segmentation Using PCA-Stacked Gabor Filter and Active Contour
- Efficiency Improvement in Dynamic Time Warping Algorithms for Isolated Word Recognition
- Fuzzy Logic Based Autonomous Parallel Parking System with Kalman Filtering