The Bump Hunting Method Using the Genetic Algorithm with the Extreme-Value Statistics
スポンサーリンク
概要
- 論文の詳細を見る
In difficult classification problems of the z-dimensional points into two groups giving 0-1 responses due to the messy data structure, we try to find the denser regions for the favorable customers of response 1, instead of finding the boundaries to separate the two groups. Such regions are called the bumps, and finding the boundaries of the bumps is called the bump hunting. The main objective of this paper is to find the largest region of the bumps under a specified ratio of the number of the points of response 1 to the total. Then, we may obtain a trade-off curve between the number of points of response 1 and the specified ratio. The decision tree method with the Gini's index will provide the simple-shaped boundaries for the bumps if the marginal density for response 1 shows a rather simple or monotonic shape. Since the computing time searching for the optimal trees will cost much because of the NP-hardness of the problem, some random search methods, e.g., the genetic algorithm adapted to the tree, are useful. Due to the existence of many local maxima unlike the ordinary genetic algorithm search results, the extreme-value statistics will be useful to estimate the global optimum number of captured points; this also guarantees the accuracy of the semi-optimal solution with the simple descriptive rules. This combined method of genetic algorithm search and extreme-value statistics use is new. We apply this method to some artificial messy data case which mimics the real customer database, showing a successful result. The reliability of the solution is discussed.
- 社団法人電子情報通信学会の論文
- 2006-08-01
著者
-
Hirose Hideo
Department of Systems Design and Informatics Kyushu Institute of Technology
-
Hirose Hideo
Department Of Applied Physics Faculty Of Engineering Nagoya University
-
Hirose Hideo
Department Of Systems Innovation And Informatics Kyushu Institute Of Technology
-
MIYANO Eiji
Department of Systems Design and Informatics, Kyushu Institute of Technology
-
YUKIZANE Takahiro
Department of Systems Innovation and Informatics, Kyushu Institute of Technology
-
OHI Shin-ya
Department of Systems Innovation and Informatics, Kyushu Institute of Technology
-
Miyano Eiji
Kyushu Inst. Technol. Iizuka‐shi Jpn
-
Miyano Eiji
Department Of Systems Design And Informatics Kyushu Institute Of Technology
-
Yukizane Takahiro
Department Of Systems Innovation And Informatics Kyushu Institute Of Technology
-
Ohi Shin-ya
Department Of Systems Innovation And Informatics Kyushu Institute Of Technology
-
MIYANO Eiji
Department of System Design and Informatics, Kyushu Institute of Technology
関連論文
- 2-2 Statistical Analysis for Climatic Data in Bangladesh
- The Consistency of the Pandemic Simulations between the SEIR Model and the MAS Model
- The effect of electric countershock on cardiac troponin T (cTnT) and heart-type fatty acid-binding protein (h-FABP)
- Far-Infrared Laser Action in Optically Pumped CH_3OD
- Far-Infrared Laser Action in Optically Pumped CD_3OD
- Computational Complexities of University Interview Timetabling
- The Bump Hunting Method Using the Genetic Algorithm with the Extreme-Value Statistics
- Forced CO_2 Laser Oscillation by Injection of a Weak External Optical Signal
- Intracavity Pumped Far-Infrared Lasers by TE CO_2 Laser
- More Accurate Breakdown Voltage Estimation for the New Step-up Test Method for Various Probability Distribution Models
- Maximum Likelihood Estimation in a Mixture Regression Model Using the Continuation Method
- Mode Characteristics of TEA CO_2 Laser Using an Unstable Resonator
- Special Section on Discrete Mathematics and Its Applications
- 3-1 Seasonal Prediction of Malaysia Climate Data
- Inapproximability of Maximum r-Regular Induced Connected Subgraph Problems