Extraction of Semantic Text Portion Related to Anchor Link(Language,<Special Section>Human Communication II)
スポンサーリンク
概要
- 論文の詳細を見る
Recently, semantic text portion (STP) is getting popular in the field of Web mining. STP is a text portion in the original page which is semantically related to the anchor pointing to the target page. STPs may include the facts and the people's opinions about the target pages. STPs can be used for various upper-level applications such as automatic summarization and document categorization. In this paper, we concentrate on extracting STPs. We conduct a survey of STP to see the positions of STPs in original pages and find out HTML tags which can divide STPs from the other text portions in original pages. We then develop a method for extracting STPs based on the result of the survey. The experimental results show that our method achieves high performance.
- 社団法人電子情報通信学会の論文
- 2006-06-01
著者
-
Hung Bui
Graduate School Of Engineering Science Osaka University
-
OTSUBO Masanori
Graduate School of Engineering Science, Osaka University
-
HIJIKATA Yoshinori
Graduate School of Engineering Science, Osaka University
-
NISHIDA Shogo
Graduate School of Engineering Science, Osaka University
-
Nishida Shogo
Graduate School Of Engineering Science Osaka University
-
Otsubo Masanori
Graduate School Of Engineering Science Osaka University
-
Hijikata Yoshinori
Graduate School Of Engineering Science Osaka University
関連論文
- Web Page Classification using Anchor-related Text Extracted by a DOM-based Method
- Extraction of Semantic Text Portion Related to Anchor Link(Language,Human Communication II)
- NTM-Agent : Text Mining Agent for Net Auction(Human Communication I)
- Special Section on Human Communication I
- Estimating Reviewer Credibility Using Review Contents and Review Histories