Automatic extraction of fixed multiword expressions
Abstract
Fixed multiword expressions are strings of words which together behave like a single word. This research establishes a method for the automatic extraction of such expressions. Our method involves three stages. In the first, statistical measures are used to extract candidate bigrams. In the second, we use this list to select occurrences of candidate expressions in a corpus, together with their surrounding contexts. These examples are used as training data for supervised machine learning, resulting in a classifier which can identify true multiword expressions. The final stage is the estimation of the parts of speech of the extracted expressions. Evaluation demonstrated that collocation measures alone are not effective in identifying target expressions. However, when trained on one million examples, the classifier identified true multiword expressions with precision greater than 90%. Part of speech estimation had precision and recall of over 95% for the part of speech types measured.
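The first stage described in the abstract (scoring candidate bigrams with a statistical measure) can be illustrated with a minimal sketch. The abstract does not say which collocation measures were used, so pointwise mutual information (PMI) is an assumption here, and the function name `pmi_bigrams` and the `min_count` threshold are hypothetical choices made only for illustration.

```python
import math
from collections import Counter


def pmi_bigrams(tokens, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information (PMI).

    PMI is an assumed measure; the source only says "statistical measures".
    Pairs seen fewer than `min_count` times are skipped, because PMI is
    unreliable for rare events.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)

    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue
        p_xy = count / n_bi            # joint probability of the pair
        p_x = unigrams[w1] / n_uni     # marginal probability of the first word
        p_y = unigrams[w2] / n_uni     # marginal probability of the second word
        scores[(w1, w2)] = math.log2(p_xy / (p_x * p_y))

    # Highest-scoring pairs first: these are the candidate expressions.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    text = "in spite of the rain we went out in spite of the cold"
    for pair, score in pmi_bigrams(text.split()):
        print(pair, round(score, 2))
```

A ranked list like this would only supply candidates. According to the abstract, such collocation measures alone were not effective, which is why the candidates and their surrounding contexts are then passed to a supervised classifier.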
- Paper published by the Information Processing Society of Japan (IPSJ)
- 2005-05-26
Authors
-
MATSUMOTO Yuji
Graduate School of Science and Center for Low Temperature Science, Tohoku University
-
HORE Campbell
Graduate School of Information Science, Nara Institute of Science and Technology
-
Matsumoto Yuji
Graduate School Of Agricultural And Life Sciences The University Of Tokyo
-
ASAHARA Masayuki
Graduate School of Information Science, Nara Institute of Science and Technology
-
Matsumoto Yuji
Graduate School of Information Science, Nara Institute of Science and Technology
Related papers
- Leaf litter decomposition of selected urban tree species during mulching
- Continuous Evolution of Fermi Surface Properties above Metamagnetic Transitions in CexLa1-xRu2Si2 (Condensed matter: electronic structure and electrical, magnetic, and optical properties)
- Isolation of lignin-carbohydrate bonds in wood. Model experiments and preliminary application to pine wood
- Ozonation of a lignin-carbohydrate complex model compound of the benzyl ether type
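- Extending Co-citation Analysis Using Kernel Functions (September 15) ("Active Mining" and General)
- Extending Co-citation Analysis Using Kernel Functions ("Active Mining" and General)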
- Structural difference between leaf blade and petiole of original and mulched leaf litter of Ginkgo biloba
- Proof of the presence of racemic forms of arylglycerol-β-aryl ether structure in lignin: studies on the stereo structure of lignin by ozonation
- Constructing a temporal relation identification system of Chinese based on dependency structure analysis (Natural Language Processing)
- Structural characteristics of lignin in primitive pteridophytes : Selaginella species
- Proof of the presence of guaiacyl-syringyl lignin in Selaginella tamariscina
- Automatic extraction of fixed multiword expressions
- Formation of methyl iodide from methoxyl-free compounds by hydriodic acid treatment
- Oxidative cleavage of lignin aromatics during chlorine bleaching of kraft pulp
- Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases
- Erythro/threo ratio of β-O-4 structures as an important structural characteristic of lignin. I : Improvement of ozonation method for the quantitative analysis of lignin side-chain structure
- Application of the amount of oxygen consumption to the investigation of the oxidation mechanism of lignin during oxygen-alkali treatment
- Semi-quantitative method to evaluate the α-carbonyl content in lignin
- Enhancement of the reaction between pulp components and hydroxyl radical produced by the decomposition of hydrogen peroxide under alkaline conditions
- Analysis of progress of oxidation reaction during oxygen-alkali treatment of lignin 2: significance of oxidation reaction of lignin during oxygen delignification
- Quantitative study on the possible formation of chloroform during chlorine bleaching of kraft pulp
- Analysis of progress of oxidation reaction during oxygen-alkali treatment of lignin I : method and its application to lignin oxidation
- Evaluation of the extent of the oxidation reaction during chlorine bleaching of pulp
- Reaction selectivity of active oxygen species produced by oxygen-alkali oxidation of a phenolic compound
- Computing Citation Relatedness Using Kernels (preliminary report) (Special Feature: "Active Mining" and General)
- Magnetic Phase Diagram and Fermi Surface Properties of CeRu2(Si1-xGex)2
- Anomalous Transport Properties via the Competition between the RKKY Interaction and the Kondo Effect in CexLa1-xRu2Si2
- Delocalization of the f Electron in CexLa1-xRu2Si2