Data Analysis by Positive Decision Trees (Special Issue on New Generation Database Technologies)
スポンサーリンク
概要
- 論文の詳細を見る
Decision trees are used as a convenient means to explain given positive examples and negative examples, which is a form of data mining and knowledge discovery. Standard methods such as ID3 may provide non-monotonic decision trees in the sense that data with larger values in all attributes are sometimes classified into a class with a smaller output value. (In the case of binary data, this is equivalent to saying that the discriminant Boolean function that the decision tree represents is not positive.) A motivation of this study comes from an observation that real world data are often positive, and in such cases it is natural to build decision trees which represent positive (i.e., monotone) discriminant functions. For this, we propose how to modify the existing procedures such as ID3, so that the resulting decision tree represents a positive discriminant function. In this procedure, we add some new data to recover the positivity of data, which the original data had but was lost in the process of decomposing data sets by such methods as ID3. To compare the performance of our method with existing methods, we test (1) positive data, which are randomly generated from a hidden positive Boolean function after adding dummy attributes, and (2) breast cancer data as an example of the real-world data. The experimental results on (1) tell that, although the sizes of positive decision trees are relatively larger than those without positivity assumption, positive decision trees exhibit higher accuracy and tend to choose correct attributes, on which the hidden positive Boolean function is defined. For the breast cancer data set, we also observe a similar tendency; i.e., positive decision trees are larger but give higher accuracy.
- 社団法人電子情報通信学会の論文
- 1999-01-25
著者
-
Ono Hirotaka
Graduate School Of Information Science And Electrical Engineering Kyushu University
-
Suda Takashi
Fujitsu Limited
-
Ono Hirotaka
Graduate School Of Engineering Kyoto University
-
MAKINO Kazuhisa
Graduate School of Engineering Science, Osaka University
-
IBARAKI Toshihide
Graduate School of Informatics, Kyoto University
-
Makino Kazuhisa
Graduate School Of Engineering Science Osaka University
-
Ibaraki Toshihide
Graduate School Of Informatics Kyoto University
関連論文
- Optimal Balanced Semi-Matchings for Weighted Bipartite Graphs(Algorithm Theory)
- Optimal Balanced Semi-Matchings for Weighted Bipartite Graphs
- Data Analysis by Positive Decision Trees (Special Issue on New Generation Database Technologies)
- K1 BUILDING GENERAL SOLVERS FOR SCHEDULING PROBLEMS(Keynote speech)
- Optimal Balanced Semi-Matchings for Weighted Bipartite Graphs