Measuring the Appropriateness of Automatically Generated Phrasal Paraphrases
Abstract
The most critical issue in generating and recognizing paraphrases is developing a wide-coverage paraphrase knowledge base. To cover paraphrases that need not be represented at the surface level, researchers have attempted to represent them with general transformation patterns. However, this approach does not prevent spurious paraphrases, because there is no practical method for assessing whether each instance of those patterns properly represents a pair of paraphrases. This paper addresses the measurement of the appropriateness of such automatically generated paraphrases, particularly targeting morpho-syntactic paraphrases of predicate phrases. We first specify the criteria that a pair of expressions must satisfy to be regarded as paraphrases. On the basis of these criteria, we then examine several measures for quantifying the appropriateness of a given pair of expressions as paraphrases of each other. In addition to existing measures, we examine a probabilistic model consisting of two distinct components. The first component is a structured N-gram language model that quantifies the grammaticality of automatically generated expressions. The second component approximates the semantic equivalence and substitutability of the given pair of expressions on the basis of the distributional hypothesis. Through an empirical experiment, we found (i) that contextual similarity is effective in combination with the constituent similarity of morpho-syntactic paraphrases and (ii) that the Web is versatile for representing the characteristics of predicate phrases.
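To make the distributional idea behind the second component concrete, the following is a minimal sketch of one common way to score contextual similarity: collect the words that surround each phrase in a corpus and compare the resulting count vectors with cosine similarity. This is an illustration only and is not the model described in the paper; the function names (`context_vector`, `cosine`, `contextual_similarity`), the fixed token window, and whitespace tokenization are all assumptions introduced here.

```python
from collections import Counter
from math import sqrt


def context_vector(phrase, corpus, window=2):
    """Count words within `window` tokens of each occurrence of `phrase`.

    Hypothetical helper: tokenization is a plain whitespace split.
    """
    target = phrase.split()
    n = len(target)
    counts = Counter()
    for sentence in corpus:
        tokens = sentence.split()
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == target:
                left = tokens[max(0, i - window):i]
                right = tokens[i + n:i + n + window]
                counts.update(left + right)
    return counts


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u if k in v)
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # a phrase that never occurs has no context evidence
    return dot / (norm_u * norm_v)


def contextual_similarity(source, candidate, corpus, window=2):
    """Score how similar the contexts of two phrases are (0.0 to 1.0)."""
    return cosine(
        context_vector(source, corpus, window),
        context_vector(candidate, corpus, window),
    )
```

Under these assumptions, a call such as `contextual_similarity("made a decision", "decided", corpus)` returns a higher score when the two phrases appear in similar surroundings. A score of this kind covers only the substitutability side; a separate grammaticality score, such as one from a language model, would still be needed to filter out ill-formed candidates.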