Using Semi-supervised Learning for Question Classification
スポンサーリンク
概要
- 論文の詳細を見る
Question classification, an important phase in question answering systems, is the task of identifying the type of a given question among a set of predefined types. This study uses unlabeled questions in combination with labeled questions for semi-supervised learning, to improve the precision of question classification task. For semi-supervised algorithm, we selected Tri-training because it is a simple but efficient co-training style algorithm. However, Tri-training is not well suitable for question data, so we give two proposals to modify Tri-training, to make it more suitable. In order to enable its three classifiers to have different initial hypotheses, Tri-training bootstrap-samples the originally labeled set to get different sets for training the three classifiers. The precisions of three classifiers are decreased because of the bootstrap-sampling. With the purpose to avoid this drawback by allowing each classifier to be initially trained on the originally labeled set while still ensuring the diversity of three classifiers, our first proposal is to use multiple algorithms for classifiers in Tri-training; the second proposal is to use multiple algorithms for classifiers in combination with multiple views, and our experiments show promising results.
- Information and Media Technologies 編集運営会議の論文
著者
-
Shimazu Akira
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Nguyen Tri
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Nguyen Le
School Of Information Science Japan Advanced Institute Of Science And Technology
関連論文
- Word Sense Disambiguation by Combining Classifiers with an Adaptive Selection of Context Representation
- Word Sense Disambiguation by Combining Classifiers with an Adaptive Selection of Context Representation
- Using Semi-supervised Learning for Question Classification
- Automatic Extraction of the Fine Category of Person Named Entities from Text Corpora
- A Semi Supervised Learning Model for Mapping sentences to logical form with ambiguous supervision
- Treatment of Legal Sentences Including Itemization Written in Japanese, English and Vietnamese —Towards Translation into Logical Forms—
- Clause Splitting with Conditional Random Fields
- Learning to Generate a Table-of-Contents with Supportive Knowledge
- Clause Splitting with Conditional Random Fields
- Using shallow semantic parsing and relation extraction for finding contradiction in text
- Treatment of Legal Sentences Including Itemization Written in Japanese, English and Vietnamese —Towards Translation into Logical Forms—
- Using Semi-supervised Learning for Question Classification