Text Structuring by Composition and Decomposition of Segments
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we present a structure model for editorial texts and discuss a text analysis method based on the model. A large amount of digitalized documents flow through the media of the INTERNET, CD-ROMs and so on even for personal surroundings. In order to proceed such documents at high speed, the process should be as "superficial" as possible and any specialized knowledge should be required as little as possible. The structuring in our method relies on the analysis of modalities which appear superficially at the tail of Japanese sentences. We define the text structure model of editorials. As a top-down approach for text analysis, we apply a text segmentation method, in which a text is incrementally divided according to the the evaluation function. As a bottom-up approach, based on the rhetorical relation between two neighboring segments, the segments are composed to one according to the strength of the relation. Our approach emploies only the merits of the two, that is, the leaves of a structure tree are analyzed in a bottom-up manner whereas nodes around the root are decomposed in a top-down manner. For the evaluation, we discuss our method from three points of view: (1) objectively agreements checking between formal paragraphs and the upper part around the root of structure trees, (2) agreements checking of the lower part around leaves of trees between human and our method, and (3) human checking of structures generated by our method.
- 言語処理学会の論文
言語処理学会 | 論文
- 複合語の分野連想語の効率的決定法
- クラス指向事例収集手法による言い換えコーパスの構築
- 動詞項構造辞書への大規模用例付与
- 言い換え技術に関する研究動向
- Morpho-Syntactic Rules for Detecting Japanese Term Variation: Establishment and Evaluation