Web Page Classification Method Using Neural Networks
スポンサーリンク
概要
- 論文の詳細を見る
Automatic categorization is the only viable method to deal with the scaling problem of the World Wide Web (WWW). In this paper, we propose a news web page classification method (WPCM). The WPCM uses a neural network with inputs obtained by both the principal components and class profile-based features (CPBF). Each news web page is represented by the term-weighting scheme. As the number of unique words in the collection set is big, the principal component analysis (PCA) has been used to select the most relevant features for the classification. Then the final output of the PCA is combined with the feature vectors from the class-profile which contains the most regular words in each class before feeding them to the neural networks. We have manually selected the most regular words that exist in each class and weighted them using an entropy weighting scheme. The fixed number of regular words from each class will be used as a feature vectors together with the reduced principal components from the PCA. These feature vectors are then used as the input to the neural networks for classification. The experimental evaluation demonstrates that the WPCM method provides acceptable classification accuracy with the sports news datasets.
- 社団法人 電気学会の論文
- 2003-05-01
著者
-
Omatu S
Division Of Computer And Systems Sciences Graduate School Of Engineering Osaka Prefecture University
-
YOSHIOKA Michifumi
Division of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture Universi
-
FUJINAKA Toru
Division of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture Universi
-
OMATU Sigeru
Division of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture Universi
-
SELAMAT Ali
Division of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture Universi
-
YANAGIMOTO Hidekazu
Division of Computer and Systems Sciences, Graduate School of Engineering, Osaka Prefecture Universi
-
SELAMAT Ali
Osaka Prefecture University
-
YANAGIMOTO Hidekazu
Osaka Prefecture University
-
Selamat Ali
Division Of Computer And Systems Sciences Graduate School Of Engineering Osaka Prefecture University
-
Yoshioka M
Division Of Computer And Systems Sciences Graduate School Of Engineering Osaka Prefecture University
-
Yoshioka Michifumi
Division Of Computer And Systems Sciences Graduate School Of Engineering Osaka Prefecture University
-
Fujinaka Toru
Division Of Computer And Systems Sciences Graduate School Of Engineering Osaka Prefecture University
-
Yanagimoto Hidekazu
Division Of Computer And Systems Sciences Graduate School Of Engineering Osaka Prefecture University
関連論文
- Pole Placement Using Optimal Regulators (特集:21世紀の扉を開く記念部門誌)
- Bill Money Recognition by a Small Size Neural Network
- Detecting Household Burning Smell Using a Neuro-Electronic Nose System
- Identify Smells Using Time Series Data from Metal Oxide Gas Sensors
- A Reliable Classification Method for Paper Currency Based on the Non-Linear PCA
- Quality Evaluation of Transmission Devices Using the GA
- On Reliability of Paper Currency Classifiers Using Neural Networks and PCA
- Web Page Classification Method Using Neural Networks
- 1-307 Quality Test of Transmission Devices Using Acoustic Data
- Effectiveness of Mobile Agent for Query Retrieval
- An Electronic Nose System Using Artificial Neural Networks with an Effective Initial Training Data Set