A novel bioinformatics tool for phylogenetic classification of genomic sequence fragments derived from mixed genomes of uncultured environmental microbes
Abstract
A Self-Organizing Map (SOM) is an effective tool for clustering and visualizing high-dimensional complex data on a two-dimensional map. We modified the conventional SOM for genome informatics, making the learning process and resulting map independent of the order of data input, and developed a novel bioinformatics tool for phylogenetic classification of sequence fragments obtained from pooled genome samples of microorganisms in environmental samples. The tool allows visualization of microbial diversity and the relative abundance of microorganisms on a map. First, we constructed SOMs of tri- and tetranucleotide frequencies from a total of 3.3 Gb of sequences derived from 113 prokaryotic and 13 eukaryotic genomes for which complete genome sequences are available. The SOMs classified the 330,000 10-kb sequences from these genomes mainly according to species, without being given any species information. Importantly, classification was possible without orthologous sequence sets and was thus useful for studies of novel sequences from poorly characterized species, such as those living only under extreme conditions, which have attracted wide scientific and industrial attention. Using the SOM method, sequences that were derived from a single genome but cloned independently in a metagenome library could be reassociated in silico. The usefulness of SOMs in metagenome studies is also discussed.
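The two ingredients named in the abstract, oligonucleotide frequency vectors and a SOM whose result does not depend on input order, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `kmer_frequencies` and `batch_som`, the grid size, the neighbourhood schedule, and the initialization (data mean plus small seeded noise) are all assumptions chosen for illustration. The sketch uses a batch update, in which every fragment contributes to one weight update per epoch, so permuting the inputs should leave the map unchanged up to floating-point rounding.

```python
from itertools import product

import numpy as np

BASES = "ACGT"


def kmer_frequencies(seq, k=4):
    """Relative frequency of every k-mer over ACGT in seq (k=3 or 4 in the abstract)."""
    kmers = ["".join(p) for p in product(BASES, repeat=k)]
    index = {kmer: i for i, kmer in enumerate(kmers)}
    counts = np.zeros(len(kmers))
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        j = index.get(seq[i:i + k])  # windows with other symbols (e.g. N) are skipped
        if j is not None:
            counts[j] += 1
    total = counts.sum()
    return counts / total if total > 0 else counts


def batch_som(data, rows=4, cols=4, epochs=20, sigma0=2.0, seed=0):
    """Batch SOM sketch: one weight update per epoch, computed from all inputs at once.

    Hypothetical parameters: rows/cols (map size), sigma0 (initial neighbourhood
    width), seed (for the illustrative noise-based initialization).
    """
    data = np.asarray(data, dtype=float)
    rng = np.random.default_rng(seed)
    # Lattice coordinates of each node on the two-dimensional map.
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    # Assumed initialization: data mean plus small seeded noise (not input-order dependent).
    noise = rng.normal(size=(rows * cols, data.shape[1]))
    weights = data.mean(axis=0) + 0.1 * data.std(axis=0) * noise
    for epoch in range(epochs):
        sigma = sigma0 * (1 - epoch / epochs) + 0.5  # shrinking neighbourhood width
        # Best-matching unit (closest node) for every input vector.
        dist = ((data[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2)
        bmu = dist.argmin(axis=1)
        # Gaussian neighbourhood between each node and each input's best-matching unit.
        lattice = ((grid[:, None, :] - grid[bmu][None, :, :]) ** 2).sum(axis=2)
        h = np.exp(-lattice / (2 * sigma ** 2))
        # Each node becomes the neighbourhood-weighted mean of all inputs.
        weights = (h @ data) / h.sum(axis=1, keepdims=True)
    return weights
```

In use, each 10-kb fragment would be converted with `kmer_frequencies(fragment, k=4)`, the resulting vectors stacked into a matrix, and that matrix passed to `batch_som`. A fragment is then placed on the map at the node whose weight vector is closest to its frequency vector.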
- Papers from the National Institute of Polar Research
Authors
-
IKEMURA Toshimichi
Nagahama Institute of Bio-Science and Technology
-
ABE Takashi
Center for Intellectual Property Strategies, Sponsored Laboratory, The Institute of Physical and Che
-
KANAYA SHIGEHIKO
Department of Bioinformatics and Genomics, Graduate School of Information Science, Nara Institute of
-
Kanaya Shigehiko
Department Of Bioinformatics And Genomes Graduate School Of Information Science Nara Institute Of Sc
-
Sugawara Hideaki
Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, and The G
-
Ikemura Toshimichi
The Graduate University for Advanced Studies (Sokendai), Hayama Center for Advanced Research
-
Sugawara Hideaki
Center For Information Biology And DNA Data Bank Of Japan National Institute Of Genetics And The Gra
-
Ikemura Toshimichi
Nagahama Inst. Of Bio-science And Technol.
-
Ikemura Toshimichi
The Graduate University For Advanced Studies (Sokendai) Hayama Center For Advanced Research
-
Abe Takashi
Center For Information Biology And DNA Data Bank Of Japan National Institute Of Genetics And The Gra
-
Sugawara Hideaki
Center for Information Biology and DDBJ, National Institute of Genetics
Related papers
- Novel bioinformatics for inter- and intraspecies comparison of genome signatures in plant genomes
- Amino Acid Mixture Identical to Vespa Larval Saliva Increases both Leptin Secretion and Basal Lipolysis in Rat Adipocytes
- Comparison of Protein Complexes Predicted from PPI Networks by DPClus and Newman Clustering Algorithms
- A novel bioinformatics strategy for searching industrially useful genome resources from metagenomic sequence libraries
- A novel bioinformatics tool for phylogenetic classification of genomic sequence fragments derived from mixed genomes of uncultured environmental microbes
- Repression of ∂^-consensus-like Sequence in Transcription Units of Escherichia coli Genome
- Estimation of Partition Coefficients for Bisubstituted Benzenes between 1-Octanol and Water
- The efficiency of entropy evolution rate for construction of phylogenetic trees
- Networking of Biological Resource Centers: WDCM experiences
- Systematization of the Protein Sequence Diversity in Enzymes Related to Secondary Metabolic Pathways in Plants, in the Context of Big Data Biology Inspired by the KNApSAcK Motorcycle Database