Learning to Generate a Table-of-Contents with Supportive Knowledge
スポンサーリンク
概要
- 論文の詳細を見る
In the text summarization field, a table-of-contents is a type of indicative summary that is especially suited for locating information in a long document, or a set of documents. It is also a useful summary for a reader to quickly get an overview of the entire contents. The current models for generating a table-of-contents produced relatively low quality output with many meaningless titles, or titles that have no overlapping meaning with the corresponding contents. This problem may be due to the lack of semantic information and topic information in those models. In this research, we propose to integrate supportive knowledge into the learning models to improve the quality of titles in a generated table-of-contents. The supportive knowledge is derived from a hierarchical clustering of words, which is built from a large collection of raw text, and a topic model, which is directly estimated from the training data. The relatively good results of the experiments showed that the semantic and topic information supplied by supportive knowledge have good effects on title generation, and therefore, they help to improve the quality of the generated table-of-contents.
- (社)電子情報通信学会の論文
- 2011-03-01
著者
-
Shimazu Akira
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Le Nguyen
School Of Information Science Japan Advanced Institute Of Science And Technology
-
NGUYEN Viet
School of Chemical Engineering, Yeungnam University
-
Nguyen Viet
School Of Information Science Japan Advanced Institute Of Science And Technology
-
Nguyen Viet
School Of Chemical Engineering Yeungnam University
関連論文
- Word Sense Disambiguation by Combining Classifiers with an Adaptive Selection of Context Representation
- Preparation methods of alginate micro-hydrogel particles and evaluation of their electrophoresis behavior for possible electronic paper ink application
- Word Sense Disambiguation by Combining Classifiers with an Adaptive Selection of Context Representation
- Using Semi-supervised Learning for Question Classification
- Automatic Extraction of the Fine Category of Person Named Entities from Text Corpora
- A Semi Supervised Learning Model for Mapping sentences to logical form with ambiguous supervision
- Treatment of Legal Sentences Including Itemization Written in Japanese, English and Vietnamese —Towards Translation into Logical Forms—
- Clause Splitting with Conditional Random Fields
- Optimization Based Approach for Industrial PI Controller Design for Optimal Servo Control of Integrating Process with Constraints
- Learning to Generate a Table-of-Contents with Supportive Knowledge
- Optimization Based Approach for Industrial PI Controller Design for Optimal Servo Control of Integrating Process with Constraints
- Clause Splitting with Conditional Random Fields
- Using shallow semantic parsing and relation extraction for finding contradiction in text
- Treatment of Legal Sentences Including Itemization Written in Japanese, English and Vietnamese —Towards Translation into Logical Forms—
- Using Semi-supervised Learning for Question Classification