CIAIR In-Car Speech Corpus : Influence of Driving Status(<Special Section>Corpus-Based Speech Technologies)
スポンサーリンク
概要
- 論文の詳細を見る
CIAIR, Nagoya University, has been compiling an in-car speech database since 1999. This paper discusses the basic information contained in this database and an analysis on the effects of driving status based on the database. We have developed a system called the Data Collection Vehicle (DCV), which supports synchronous recording of multichannel audio data from 12 microphones which can be placed throughout the vehicle, multi-channel video recording from three cameras, and the collection of vehicle-related data. In the compilation process, each subject had conversations with three types of dialog system: a human, a "Wizard of Oz" system, and a spoken dialog system. Vehicle information such as speed, engine RPM, accelerator/brake-pedal pressure, and steering-wheel motion were also recorded. In this paper, we report on the effect that driving status has on phenomena specific to spoken language
- 社団法人電子情報通信学会の論文
- 2005-03-01
著者
-
TAKEDA Kazuya
Nagoya University
-
Matsubara Shigeki
自治医科大学 産婦人科
-
Takeda Kazuya
Nagoya Univ.
-
Takeda Kazuya
Nagoya Univ. Nagoya‐shi Jpn
-
Matsubara S
Center For Integrated Acoustic Information Research Nagoya University:information Technology Center
-
Matsubara Shigeki
The Graduate School Of Engineering Nagoya University
-
ITAKURA Fumitada
Graduate School of Information Engineering, Meijo University
-
Itakura Fumitada
The Faculty Of Science And Technology Meijo University
-
Matsubara Shigeki
Nagoya Univ. Nagoya‐shi Jpn
-
KAWAGUCHI Nobuo
Nagoya University
-
ITAKURA Fumitada
Meijyo University
-
Kawaguchi N
Center For Integrated Acoustic Information Research Nagoya University:information Technology Center
関連論文
- Acoustic Feature Transformation Combining Average and Maximum Classification Error Minimization Criteria
- Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition
- Cytochemical Localization of Oxygen-intermediates Metabolizing Enzymes in the Human Placenta, with special refernece to NADH-oxidase
- Cytochemical Localizations of Ca^-ATPase, NDPase and 5'-nucleotidase in the human term placenta
- ULTRACYTOCHEMICAL LOCALIZATION OF ADP-DEGRADING ENZYME ACTIVITY IN THE HUMAN TERM PLACENTA : DIRECT CYTOCHEMICAL EVIDENCE
- ULTRACYTOCHEMICAL LOCALIZATIONS OF ADENOSINE NUCLEOTIDASE ACTIVITIES IN THE HUMAN TERM PLACENTA, WITH SPECIAL REFERENCE TO 5'-NUCLEOTIDASE ACTIVITY
- Ultrastructural localization of phosphatases in human term placenta
- CENSREC-1-C : An evaluation framework for voice activity detection under noisy environments
- Driver Identification Using Driving Behavior Signals(Human-computer Interaction)
- Leukocyte Migration through the Basement Membrane : Ultrastructural-Cytochemical Observation of the Human Fetal Membrane in Women with Chorioamnionitis-Related Preterm Delivery
- AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments(Speech and Hearing)
- Evaluation of HRTFs estimated using physical features
- Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones(Feature Extraction and Acoustic Medelings, Corpus-Based Speech Technologies)
- Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training
- Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition
- Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement(Speech Enhancement, Statistical Modeling for Speech Processing)
- Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- SNR and sub-band SNR estimation based on Gaussian mixture modeling in the log power domain with application for speech enhancements (第6回音声言語シンポジウム)
- Acoustic Feature Transformation Combining Average and Maximum Classification Error Minimization Criteria
- Driver's irritation detection using speech recognition results (音声・第10回音声言語シンポジウム)
- Driver's irritation detection using speech recognition results (音声言語情報処理)
- Driver's irritation detection using speech recognition results (言語理解とコミュニケーション・第10回音声言語シンポジウム)
- Predicting the Degradation of Speech Recognition Performance from Sub-band Dynamic Ranges (特集 音声言語情報処理とその応用)
- A model of perceptual distance for group delays based on ellipsoidal mapping
- An Acoustically Oriented Vocal-Tract Model
- Estimation of speaker and listener positions in a car using binaural signals
- Sound localization under conditions of covered ears on the horizontal plane
- Single-Channel Multiple Regression for In-Car Speech Enhancement
- Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition(Speech Enhancement, Multi-channel Acoustic Signal Processing)
- Speech Recognition Using Finger Tapping Timings(Speech and Hearing)
- CIAIR In-Car Speech Corpus : Influence of Driving Status(Corpus-Based Speech Technologies)
- Construction and Evaluation of a Large In-Car Speech Corpus(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- Robust Dependency Parsing of Spontaneous Japanese Spoken Language(Speech Corpora and Related Topics, Corpus-Based Speech Technologies)
- Example-Based Query Generation for Spontaneous Speech
- Incremental Transfer in English-Japanese Machine Translation
- Method for determining sound localization by auditory masking
- Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition
- CENSREC-4: An evaluation framework for distant-talking speech recognition in reverberant environments
- A Graph-Based Spoken Dialog Strategy Utilizing Multiple Understanding Hypotheses
- Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition