GroupAdaBoost : Accurate Prediction and Selection of Important Genes
スポンサーリンク
概要
- 論文の詳細を見る
In this paper, we propose GroupAdaBoost which is a variant of AdaBoost for statistical pattern recognition. The objective of the proposed algorithm is to solve the "p ≫ n" problem arisen in bioinformatics. In a microarray experiment, gene expressions are observed to extract any specific pattern of gene expressions related to a disease status. Typically, p is the number o」 investigated genes and n is number of individuals. The ordinary method for predicting the genetic causes of diseases is apt to over-learn from any particular training dataset because of the "p ≫ n" problem. We observed that GroupAdaBoost gave a robust performance for cases of the excess number p of genes. In several real datasets which are publicly available from web-pages, we compared the analysis of results among the proposed method and others, and a small scale of simulation study to confirm the validity of the proposed method. Additionally the proposed method effectively worked for the identification of important genes.
- 一般社団法人情報処理学会の論文
- 2007-03-15
著者
-
USHIJIMA Masaru
Genome Center, Japanese Foundation for Cancer Research
-
Eguchi Shinto
Inst. Statistical Mathematics Tokyo Jpn
-
TAKENOUCHI TAKASHI
Graduate School of Information Science, Nara Institute of Science and Technology
-
EGUCHI SHINTO
Institute of Statistical Mathematics, Japan and Department of Statistical Science, Graduate Universi
-
Ushijima Masaru
Genome Center Japanese Foundation For Cancer Research
-
Eguchi Shinto
Institute Of Statistical Mathematics Japan And Department Of Statistical Science Graduate University
-
Takenouchi Takashi
Graduate School Of Information Science Nara Institute Of Science And Technology
関連論文
- Identifying haplotype block structure using an ancestor-derived model
- GroupAdaBoost : Accurate Prediction and Selection of Important Genes
- Genotyping of Single Nucleotide Polymorphisms Based on a Mathematical Model for Two-Dimensional Data
- COLLABORATIVE PREDICTION BY MULTIPLE BAYESIAN NETWORKS AND ITS APPLICATION TO PRINTER USAGE MODELING(Bayesian networks and their application)
- GroupAdaBoost: Accurate Prediction and Selection of Important Genes
- GroupAdaBoost: Accurate Prediction and Selection of Important Genes