A Comparative Study on Effective Context Selection for Distributional Similarity
Abstract
Distributional similarity is a widely adopted concept for capturing the semantic relatedness of words based on their contexts in various NLP tasks. While accurate similarity calculation requires a huge number of context types and co-occurrences, the contribution to the similarity calculation varies across individual context types, and some of them even act as noise. To select well-performing contexts and alleviate the high computational cost, we propose three context selection schemes and investigate their effectiveness: category-based, type-based, and co-occurrence-based selection. Category-based selection is the conventional and simplest method, which limits the context types based on their syntactic category. The finer-grained type-based selection assigns an importance score to each context type, which we make possible by proposing a novel formalization of distributional similarity as a classification problem and applying feature selection techniques. The finest-grained co-occurrence-based selection assigns an importance score to each co-occurrence of a word and a context type. We evaluate the effectiveness of each scheme and the trade-off between co-occurrence data size and synonym acquisition performance. Our experiments show that, on the whole, the finest-grained co-occurrence-based selection achieves better performance, although some of the simple category-based selections show a comparable performance/cost trade-off.
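As a rough illustration of the simplest scheme described in the abstract, category-based selection can be sketched as filtering each word's context vector by syntactic category before computing similarity. This is a minimal sketch and not the authors' implementation: the toy counts, the category labels (`subj`, `obj`, `mod`), the function names, and the choice of cosine as the similarity measure are all assumptions made for illustration.

```python
import math
from collections import Counter

# Toy co-occurrence data (invented for illustration):
# word -> Counter mapping (syntactic category, context word) -> count.
COOC = {
    "car": Counter({("obj", "drive"): 4, ("mod", "red"): 2, ("subj", "stop"): 1}),
    "automobile": Counter({("obj", "drive"): 3, ("mod", "red"): 1}),
    "banana": Counter({("obj", "eat"): 5, ("mod", "red"): 1}),
}


def select_by_category(vector, allowed):
    """Keep only the context types whose syntactic category is in `allowed`."""
    return Counter({ctx: n for ctx, n in vector.items() if ctx[0] in allowed})


def cosine(u, v):
    """Cosine similarity between two sparse count vectors (assumed measure)."""
    dot = sum(n * v[ctx] for ctx, n in u.items())
    norm_u = math.sqrt(sum(n * n for n in u.values()))
    norm_v = math.sqrt(sum(n * n for n in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def similarity(w1, w2, allowed):
    """Similarity of two words using only contexts from the allowed categories."""
    return cosine(
        select_by_category(COOC[w1], allowed),
        select_by_category(COOC[w2], allowed),
    )


if __name__ == "__main__":
    for cats in ({"obj"}, {"mod"}, {"subj", "obj", "mod"}):
        print(sorted(cats), round(similarity("car", "banana", cats), 3))
```

In this toy data, restricting contexts to the `obj` category separates "car" from "banana", whereas the `mod` category alone makes them look identical, which loosely mirrors the abstract's remark that some context types act as noise. Type-based and co-occurrence-based selection would replace the category filter with per-type or per-co-occurrence importance scores, whose exact definitions are not given in the abstract.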
- Paper published by the Information and Media Technologies Editorial Board
Authors
-
Ogawa Yasuhiro
Graduate School of Engineering, Osaka University
-
Hagiwara Masato
Graduate School of Information Science, Nagoya University
-
Toyama Katsuhiko
Graduate School of Information Science, Nagoya University
-
Ogawa Yasuhiro
Graduate School of Information Science, Nagoya University
Related papers
- Electric Field Tuning of Plasmonic Absorption of Metallic Grating with Twisted Nematic Liquid Crystal
- Fluorescence Enhancement of Conducting Polymer Coated on Biharmonic Metallic Grating
- Finite-Difference Time-Domain Analysis of Polarization-Dependent Transmission in Cholesteric Blue Phase II
- Supervised Synonym Acquisition Using Distributional Features and Syntactic Patterns
- A Comparative Study on Effective Context Selection for Distributional Similarity
- Effective Use of Indirect Dependency for Distributional Similarity (Special Issue on Natural Language Processing: Construction, Linking, and Use of Linguistic Ontologies)
- Effective Use of Indirect Dependency for Distributional Similarity