Syntax-based XML Subtree Segmentation, Matching and Integration
スポンサーリンク
概要
- 論文の詳細を見る
With the exponential increase in the amount of XML data on the Internet, XML subtree segmentation, matching and integration techniques are valuable and important for many application areas such as change detection, keyword retrieval and knowledge discoveries over XML documents. In this paper, we discuss syntax-based XML subtree matching and integration methods based on the PCDATA values of clustered leaf nodes and their path information. Comparing with the traditional edit distance-based methods, our proposed methods are cost-efficient and high-reliability for applications over large-scale XML documents.
- 社団法人電子情報通信学会の論文
- 2008-09-14
著者
-
Yokota Haruo
Gsic Tokyo Institute Of Technology
-
LIANG Wenxin
CREST, Japan Science and Technology Agency (JST)
-
YOKOTA Haruo
GSIC, Tokyo Institute of Technology
-
Liang Wenxin
Crest Japan Science And Technology Agency (jst)