Modified Fuzzy Gap Statistic for Estimating Preferable Number of Clusters in Fuzzy k-Means Clustering(BIOINFORMATICS)
スポンサーリンク
概要
- 論文の詳細を見る
In clustering methods, the estimation of the optimal number of clusters is significant for subsequent analysis. Without detailed biological information on the genes involved, the evaluation of the number of clusters becomes difficult, and we have to rely on an internal measure that is based on the distribution of the data of the clustering result. The Gap statistic has been proposed as a superior method for estimating the number of clusters in crisp clustering. In this study, we proposed a modified Fuzzy Gap statistic(MFCS) and applied it to fuzzy k-means clustering. For estimating the number of clusters, fuzzy k-means clustering with the MFCS was applied to two artificial data sets with noise and to two experimentally observed gene expression data sets. For the artificial data sets, compared with other internal measures, the MFCS showed a higher performance in terms of robustness against noise for estimating the optimal number of clusters. Moreover, it could be used to estimate the optimal number of clusters in experimental data sets. It was confirmed that the proposed MFCS is a useful method for estimating the number of clusters for microarray data sets.
- 社団法人日本生物工学会の論文
- 2008-03-25
著者
-
Hakamada Kazumi
Dep. Of Bioengineering School Of Engineering The Univ. Of Tokyo 7-3-1 Hongo Bunkyo-ku Tokyo 113-8656
-
Hanai Taizo
Graduate School Of Systems Life Sciences Kyushu University
-
OKAMOTO Masahiro
Graduate School of Comprehensive Human Sciences, Laboratory of Exercise Biochemistry, University of
-
Hakamada Kazumi
Graduate School Of Systems Life Sciences Kyushu University
-
Arima Chinatsu
Graduate School of Systems Life Sciences, Kyushu University
-
Arima Chinatsu
Graduate School Of Systems Life Sciences Kyushu University
関連論文
- 2-O-39 Exercise-enhanced hippocampal neurosteroid metabolism in adult male rat(The Proceedings of the 17th Annual Meetings of Japan Society of Exercise and Sports Physiology Jury 25-26, (Tokyo))
- Modeling of Total Evaluation Process of Ginjo sake Using a Fuzzy Neural Network
- Modified Fuzzy Gap Statistic for Estimating Preferable Number of Clusters in Fuzzy k-Means Clustering(BIOINFORMATICS)