Text Summarization based on Information Extraction and Categorization Using 5W1H
スポンサーリンク
概要
- 論文の詳細を見る
In an office, it is necessary for understanding the temporal transition and the overall situation on an event from various information to extract and abstract a large number of documents. This paper proposes two robust methods for generating an extract and an abstract from documents: an episodic extraction method which generates an extract on the temporal transition of an event and an overall abstraction method which generates an abstract of overall documents for survey. The episodic extraction method retrieves documents including the 5W1H (who, when, where, what, why, how and predicates) information which specifies an event and generates an extract on the temporal transition of the event. The overall abstraction method abstracts documents by replacing 5W1H elements in each document with their upper categories in a thesaurus. These methods proved to be effective for office work from an application to 10000 news articles and 2500 sales reports.
- 言語処理学会の論文
言語処理学会 | 論文
- 複合語の分野連想語の効率的決定法
- クラス指向事例収集手法による言い換えコーパスの構築
- 動詞項構造辞書への大規模用例付与
- 言い換え技術に関する研究動向
- Morpho-Syntactic Rules for Detecting Japanese Term Variation: Establishment and Evaluation