Augmented Mutual Information for Multi-word Extraction
スポンサーリンク
概要
- 論文の詳細を見る
In order to extract multi-words from documents, mutual information (MI), as a statistical method, is the most popular solution under consideration. However, there are two kinds of deficiencies inherent in MI. One is the problem of unilateral cooccurrence, and the other is rare occurrence problem. To attack these two problems, augmented mutual information (AMI) is proposed in this paper to measure word dependency for multi-word extraction. We prove theoretically that AMI has the capacity to approximate MI to capture the independency of individual words, but it will amplify the significance of dependent individual words which may be possible multi-words. And ourexperimental results on Chinese multi-word extraction demonstrate that AMI method hassuperior performance to traditional MI method.
- ICIC Internationalの論文
- 2009-02-00
ICIC International | 論文
- An integrated spatial error concealment technique for H.264/AVC based-on boundary distortion estimation
- A technique of approximate estimation for localization in wireless sensor networks
- Analysis of energy savings using smart metering
- An author identification of in-class source codes by using the forward-backward coding models
- A study on an operation skill inheritance assistance function for virtual surgical simulation system