Development and Evaluation of Japanese Clause Boundaries Annotation Program
スポンサーリンク
概要
- 論文の詳細を見る
Sentences generally tend to be long and complicated in monologues, and they cause problems for parsing and translation. It is desirable to define some short unit to process monologues efficiently. We developed "CBAP (Clause Boundaries Annotation Program), " which detects and labels every clause boundary in Japanese text. CBAP accepts a series of morphemes with part-of-speech information and detectsthe final boundary of every clause with more than 97% accuracy. It also inserts 147 kinds of labels which represent the types of the boundaries. Since clauses are syntactically and semantically sufficient constituents, we can use the annotated labels for effective and flexible sentence segmentation. In this paper, we show the method for annotating Japanese clause boundaries, and present the result of experiments to examine the performance of CBAP.
- 言語処理学会の論文
言語処理学会 | 論文
- 複合語の分野連想語の効率的決定法
- クラス指向事例収集手法による言い換えコーパスの構築
- 動詞項構造辞書への大規模用例付与
- 言い換え技術に関する研究動向
- Morpho-Syntactic Rules for Detecting Japanese Term Variation: Establishment and Evaluation