An outline of an informatical method for identifying the complete set of genes using the DNA sequence of a whole genome
スポンサーリンク
概要
- 論文の詳細を見る
The identification of open reading frames (ORFs) by using the DNA sequence of a whole genome involves a statistical process to separate candidates-i.e. sections that start with formal start colons and end with formal termination colons, into two groups, authentic ORFs and artifacts. A small number of genes known prior to the study can be used for the analysis of general informatical characteristics that are expected to be shared by all the ORFs present in the genome. The results can be summarized into the form of scoring systems that measure the relatedness of each candidate to the model ORE In order to identify the complete set of ORFs the rate of false negative identification needs to be minimized, so that no important ORE is missed. A number of non-ORE sections can be analyzed by the same systems in order to estimate the rate of false positive identification. This rate can be systematically reduced by combining multiple scoring systems that evaluate different ORE-specific characteristics.
- 日本学士院の論文
著者
関連論文
- Characterization and Expression of a New Class of Zinc Finger Protein that Binds to Silencer Region of Ascorbate Oxidase Gene
- Zinc finger proteins of the archaebacterial origin
- Enzymes identified using genomic DNA sequences suggest some atypical characteristics of de novo biosynthesis of purines in archaebacteria
- Archaebacterial ribosome is a chimera of eubacterial-type RNAs and eukaryotic-type proteins.
- Distribution and orientation of genes in the genome of Pyrococcus OT3
- Prophages inserted in archaebacterial genomes
- Comparison of archaebacterial central metabolic pathways using the genomic DNA sequences
- Identification of genes in the complete genomic DNA sequence of a hyper-thermophilic archaeon, Pyrococcus sp. OT3
- Visual presentation of complete genomic DNA sequences, and its application to identification of gene-coding regions
- Comparison of pathways for amino acid biosynthesis in archaebacteria using their genomic DNA sequences
- Determination of the complete genomic DNA sequence of Thermoplasma volcanium GSS1
- A sequence of thirty bases that is highly repetitive in archaebacterial genomes
- Genomic DNA shuffling in archaebacteria : Comparison of the genomes of two Pyrococcus species
- An outline of an informatical method for identifying the complete set of genes using the DNA sequence of a whole genome