9P-E-10 A method of Web Information Extraction Based on the Length of Nodes in the Html Tree(Room E International session)
スポンサーリンク
概要
- 論文の詳細を見る
The effective extraction of the information from web pages is the prerequisite to the full use of the web resources. We proposed a new method for information extraction from web pages based on the length of the nodes in the DOM tree. We will firstly represent the web page into a DOM tree using the html tags, then the content node of the tree will be identified according to the longest text node, and at last we will distinguish the body of the text block and extract the main content of the web page using the continuity of the structure of the main text content in the DOM tree. The experiment testified the accuracy and efficiency of this method.
- 2010-10-09
論文 | ランダム
- 外傷性尿道断裂に対する尿道再建術 : 開放性尿道再建術について
- 超音波パワードプラ法を用いた持続勃起症の病態評価
- 泌尿器悪性腫瘍における誘導型一酸化窒素合成酵素遺伝子発現の検討
- フッ化物徐放性修復材料のフッ化物溶出量と圧縮強さについて
- 外傷性尿道断裂および骨盤骨折後の勃起不全に対する治療経験