Dependency Parsing with Lattice Structures for Resource-Poor Languages
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we present a new dependency parsing method for languages which have very small annotated corpus and for which methods of segmentation and morphological analysis producing a unique (automatically disambiguated) result are very unreliable. Our method works on a morphosyntactic lattice factorizing all possible segmentation and part-of-speech tagging results. The quality of the input to syntactic analysis is hence much better than that of an unreliable unique sequence of lemmatized and tagged words. We propose an adaptation of Eisners algorithm for finding the k-best dependency trees in a morphosyntactic lattice structure encoding multiple results of morphosyntactic analysis. Moreover, we present how to use Dependency Insertion Grammar in order to adjust the scores and filter out invalid trees, the use of language model to rescore the parse trees and the k-best extension of our parsing model. The highest parsing accuracy reported in this paper is 74.32% which represents a 6.31% improvement compared to the model taking the input from the unreliable morphosyntactic analysis tools.
- (社)電子情報通信学会の論文
- 2009-10-01
著者
-
Kawtrakul Asanee
Naist (special Research Unit On Natural Language Processing And Intelligent Information System Techn
-
SUDPRASERT Sutee
NAiST (special research unit on NAtural language processing and Intelligent information System Techn
-
BOITET Christian
GETALP Group (Study Group on Machine Translation and Automated Processing of Languages and Speech),
-
BERMENT Vincent
GETALP Group (Study Group on Machine Translation and Automated Processing of Languages and Speech),
-
Berment Vincent
Getalp Group (study Group On Machine Translation And Automated Processing Of Languages And Speech) L
-
Boitet Christian
Getalp Group (study Group On Machine Translation And Automated Processing Of Languages And Speech) L
-
Sudprasert Sutee
Naist (special Research Unit On Natural Language Processing And Intelligent Information System Techn