Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones
スポンサーリンク
概要
- 論文の詳細を見る
The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, geneprediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level.Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following:integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates(1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for nonprotein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causingphenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.
- Public Library of Scienceの論文
著者
-
Sugano Sumio
放射線医学総合研究所
-
Sugano Sumio
Helix Research Institute
-
Hanada Hironori
Research Institute National Cardiovascular Center
-
Sugano Sumio
Department Of Virology Institute Of Medical Science University Of Tokyo
-
Suzuki Yoichi
Department Of Medical Genetics Tohoku University School Of Medicine
-
NISHIKAWA Ken
National Institute of Genetics
-
Sugano Sumio
University Of Tokyo
-
Miyazaki J
Univ. Tsukuba Ibaraki Jpn
-
Suzuki Yutaka
University of Tokyo
-
Kanai Akinori
University of Tokyo
-
Nakai Kenta
University of Tokyo
-
Yamamoto Takako
Department Of Health Science Shiga University Of Medical Science
-
Yura K
Department Of Biological Science Graduate School Of Science Nagoya University
-
Yura Kei
Quantum Bioinfo. Jaea
-
Ohtsubo Masafumi
Medical Photobiology Department Photon Medical Research Center Hamamatsu University School Of Medici
-
Sakata Yoshiyuki
Department Of Medical Genetics Tohoku University School Of Medicine
-
Osada Naoki
Division Of Genetic Resources National Institute Of Infectious Diseases
-
Suzuki Y
Department Of Medical Genetics Tohoku University School Of Medicine
-
SUZUKI Yasuyo
Department of Clinical Genetics, Medical Institute of Bioregulation, Kyushu University
-
Mashima Jun
Institute Of Biological Sciences University Of Tsukuba:(present Address)school Of Life Science Unive
-
Inoko Hidetoshi
Dep. Of Molecular Life Sci. Tokai Univ. School Of Medicine
-
Go Mitiko
Fac. Bio-sci. Nagahama Inst. Bio-sci. Tech.:ochanomizu Univ.
-
Hanada Kouichi
Departoment Of Neurology And Geriatrics Kagoshima University Graduate School Of Medical And Dental S
-
Suwa Makiko
Computational Biology Research Center (cbrc) National Institute Of Advanced Industrial Science And T
-
Hashimoto Katsuyuki
Division Of Genetic Resources National Institute Of Infectious Diseases
-
Homma Keiichi
National Institute Of Genetics
-
Fukuchi Satoshi
Maebashi Institute of Technology
-
MANO Shuhei
Nagoya City University
-
Shimizu Nobuyoshi
Department of Molecular Biology, Keio University School of Medicine
関連論文
- Fas promoter region gene polymorphism is associated with an increased risk for myocardial infarction
- OE-071 A Genom-Wide Association Study of SNPs in the risk of Atherosclerosis Obliterans(Heart failure, clinical(01)(M),Oral Presentation(English),The 72nd Annual Scientific Meeting of the Japanese Circulation Society)
- A novel KCNQ4 one-base deletion in a large pedigree with hearing loss : implication for the genotype-phenotype correlation
- Functional analysis of PTPN11/SHP-2 mutants identified in Noonan syndrome and childhood leukemia
- Novel IRF6 mutations in Japanese patients with Van der Woude Syndrome : two missense mutations (R45Q and P396S) and a 17-kb deletion
- Lack of evidence for a significant association between nonsyndromic cleft lip with or without cleft palate and the retinoic acid receptor alpha gene in the Japanese population
- Chromosomal localization, structure, single-nucleotide polymorphisms, and expression of the human H-protein gene of the glycine cleavage system (GCSH), a candidate gene for nonketotic hyperglycinemia
- Haplotype analysis suggests that the two predominant mutations in Japanese patients with holocarboxylase synthetase deficiency are founder mutations
- A one-base deletion (183delC) and a missense mutation (D276H) in the T-protein gene from a Japanese family with nonketotic hyperglycinemia
- Mutation and polymorphic marker analyses of 65K- and 67K-glutamate decarboxylase genes in two families with pyridoxine-dependent epilepsy
- DINUCLEOTIDE REPEAT POLYMORPHISM IN 65k-GLUTAMATE DECARBOXYLASE GENE
- Association of Genetic Polymorphisms of Endothelin-Converting Enzyme-1 Gene with Hypertension in a Japanese Population and Rare Missense Mutation in Preproendothelin-1 in Japanese Hypertensives
- Genetic Variations of CYP2C9 in 724 Japanese Individuals and Their Impact on the Antihypertensive Effects of Losartan
- No Association of Polymorphisms in the 5' Region of the CD14 Gene and Food Allergy in a Japanese Population
- CIPRO 2.5 : Ciona intestinalis protein database, a unique integrated repository of large-scale omics data, bioinformatic analyses and curated annotation, with user rating and reviewing functionality
- Airway Responsiveness to Acetylcholine in Congenitally Bronchial-Hypersensitive (BHS) and Bronchial-Hyposensitive (BHR) Guinea Pigs in Vivo and in Vitro
- Histamine-Induced Airway Contraction in Congenitally Bronchial-Hypersensitive (BHS) and Bronchial-Hyposensitive (BHR) Guinea Pigs
- Characterization of Folding Pathways of the Type-1 and Type-2 Periplasmic Binding Proteins MgIB and ArgT
- HLA Class I and II Alleles and Susceptibility to Generalized Pustular Psoriasis : Significant Associations with HLA-Cw1 and HLA-DQB1^*0303
- PE-062 Zyxin Progresses Epithelial-Mesenchymal Transition under the Control of Twist1 and Contributes to Endocardial Morphogenesis(PE011,Congenital Heart Disease (M),Poster Session (English),The 73rd Annual Scientific Meeting of The Japanese Circulation S
- Sequence analysis, gene expression, and chromosomal assignment of mouse Borg4 gene and its human orthologue
- Malaria Genome Project in Japan(Abstracts from the Malaria Workshop)
- Isolation and characterization of a human cDNA encoding a protein homologous to the 7.2-kDa protein (subunit X) of bovine ubiquinol-cytochrome C reductase
- S2d2-3 Comparative analysis of mamalian transcriptomes(S2-d2: "Bioinformatics of Transcriptional Regulation",Symposia,Abstract,Meeting Program of EABS & BSJ 2006)
- Adult onset limb-girdle type mitochondrial myopathy with a mitochondrial DNA np8291 A-to-G substitution
- Phenotypes of X-linked Charcot-Marie-Tooth disease and altered trafficking of mutant Connexin 32 (GJB1)
- Low-Risk Profile for Cardiovascular Disease and Mortality in Japanese
- 日本最高齢馬シンザンの心電図
- Neonatal-Onset Brainstem Reticular Reflex Myoclonus Following a Prenatal Brain Insult : Generalized Myoclonic Jerk and a Brainstem Lesion
- OE-040 FHL-2 Expresses in Endothelial Cell and Suppresses VEGF-induced Angiogenesis Via Sphingosine Kinase-1(Arrhythmia, diagnosis/Pathophysiology/EPS(01)(A),Oral Presentation(English),The 72nd Annual Scientific Meeting of the Japanese Circulation Society
- Genetic Variations of HSD11B2 in Hypertensive Patients and in the General Population, Six Rare Missense/Frameshift Mutations
- Down-regulation in Multiple Human Cancers of a Novel Gene, DMHC, from 17q25.1 That Encodes an Integral Membrane Protein
- Identification of DMC1, a novel gene in the TOC region on 17q25.1 that shows loss of expression in multiple human cancers
- Genetic Association Analysis using Microsatellite Markers in Atopic Dermatitis
- cDNA cloning, expression profile, and genomic structure of human and mouse RNF10/Rnf 10 genes, encoding a novel RING finger protein
- Cloning, expression analysis, and chromosomal localization of a novel butyrophilin-like receptor
- Isolation and chromosomal mapping of a novel human gene showing homology to Na^+/PO4 cotransporter
- A novel human gene whose product shares significant homology with the bovine brain-specific protein p25 on chromosome 5p15.3
- Isolation, tissue expression, and chromosomal assignment of a novel human gene which encodes a protein with RING finger motif
- Cloning, expression analysis, and chromosomal localization of HIP1R, an isolog of huntingtin interacting protein (HIP1)
- Isolation, tissue expression, and chromosomal assignment of human RGS5, a novel G-protein signaling regulator gene
- Hypertension Susceptibility Genes on Chromosome 2p24-p25 in a General Japanese Population(Hypertension, Clinical 6 (H), The 69th Annual Scientific Meeting of the Japanese Circulation Society)
- Combined Cardiovascular Risk Factors and Outcome : NIPPON DATA80, 1980-1994
- Evaluation of PTPN22 polymorphisms and Vogt-Koyanagi-Harada disease in Japanese patients
- Seroepidemiological Studies on Silk Road Ethnic Groups
- Ehlers-Danlos Syndrome and Congenital Heart Anomalies
- Primary Hypoparathyroidism in Turner's Syndrome
- A novel therapeutic trial of homogentisic aciduria in a murine model of alkaptonuria
- FUNCTIONAL DIFFERENCES OF TROPONIN T ISOFORMS(Cell Biology and Morphology)(Proceeding of the Seventy-Third Annual Meeting of the Zoological Society of Japan)
- STUDY ON FUNCTIONAL DIFFERENCES AMONG FAST SKELETAL MUSCLE TROPONIN TISOFORMS(Physiology)(Proceedings of the Sixty-Ninth Annual Meeting of the Zoological Society of Japan)
- Developmental Stage-Dependent Expression of Troponin T Isoforms in Chicken Embryonic Breast Muscles Grafted on Chorio-Allantoic Membrane(Developmental Biology)
- Stability of Chicken Troponin T Expression in Cultured Muscle Cells
- Expression of Chicken Troponin T Isoforms in Cultured Muscle Cells
- Pheophorbide a Content and Chlorophyllase Activity in Green Tea
- Polymorphism of IFN-gamma gene and Vogt-Koyanagi-Harada disease.
- トレ-ニング強度の違いが若馬の呼吸循環機能におよぼす影響
- 騎乗馴致が若馬の呼吸循環機能におよぼす影響
- Tyrosinase gene family and Vogt-Koyanagi-Harada disease in Japanese patients.
- HLA CLASS II GENOTYPING OF SARCOIDOSIS PATIENTS IN HOKKAIDO BY PCR-RFLP
- HLA SEROLOGICAL AND CLASS II GENOTYPING IN SARCOIDOSIS PATIENTS IN JAPAN
- Mutational Analysis of Genes Involved in Pilus Structure, Motility and Transformation Competency in the Unicellular Motile Cyanobacterium Synechocystis sp. PCC 6803
- MUTATIONAL ANALYSIS OF GENES INVOLVED IN MOTILITY AND TRANSFORMATION OF Synechocystis sp. PCC 6803
- II-C-12 The reduced expression of capillary tight junction comprised proteins in a patient with myopathy(THE 45TH ANNUAL MEETING OF THE JAPAN SOCIETY OF HISTOCHEMISTRY AND CYTOCHEMISTRY)
- Conversion of the Coenzyme Specificity of Isocitrate Dehydrogenase by Module Replacement
- Detection of Chlorophyll Breakdown Products in the Senescent Leaves of Higher Plants
- Het-PDB Navi. : A Database for Protein-Small Molecule Interactions
- Association of carotid atherosclerosis with genetic polymorphisms of the klotho gene in patients with hypertension
- RESOPS : A Database for Analyzing the Correspondence of RNA Editing Sites to Protein Three-Dimensional Structures
- 1P479 A possible effect of alternative splicing on multiple regions of a single protein(23. Bioinformatics, genomics and proteomics (I),Poster Session,Abstract,Meeting Program of EABS & BSJ 2006)
- Discrimination of Golgi Type II Membrane Proteins Based on Their Hydropathy Profiles and the Amino Acid Propensities of Their Transmembrane Regions
- MutationView/KMcancerDB : A database for cancer gene mutations
- 1E1348 ヒトプロテオームに存在する新規ホールド数の見積もり(ゲノム生物学、生命情報科学,第49回日本生物物理学会年会)
- Evola : Ortholog database of all human genes in H-InvDB with manual curation of phylogenetic trees
- Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones
- Improved genome assembly and evidence-based global gene model set for the chordate Ciona intestinalis : new insight into intron and operon populations
- The H-Invitational Database (H-InvDB), a comprehensive annotation resource for human genes and transcripts
- Profile of Participants and Genotype Distributions of 108 Polymorphisms in a Cross-Sectional Study of Associations of Genotypes With Lifestyle and Clinical Factors : A Project in the Japan Multi-Institutional Collaborative Cohort (J-MICC) Study
- A genome-wide association study identifies RNF213 as the first Moyamoya disease gene
- Association of the MMP9 gene with childhood cedar pollen sensitization and pollinosis
- Impact of insulin resistance, insulin and adiponectin on kidney stones in the Japanese population
- Complex divergence at a microsatellite marker C1_2_5 in the lineage of HLA-Cw/-B haplotype
- Identification of novel candidate loci for anorexia nervosa at 1q41 and 11q22 in Japanese by a genome-wide association analysis with microsatellite markers
- Replication of a microsatellite genome-wide association study of Behçet's disease in a Korean population
- Hunting for genes for hypertension : the Millennium Genome Project for Hypertension
- 2PT127 選択的スプライシングによる新規タンパク質アイソフォームの機能・構造予測情報を得るためのツールの開発と公開(日本生物物理学会第50回年会(2012年度))
- Characterization of Folding Pathways of the Type-1 and Type-2 Periplasmic Binding Proteins MglB and ArgT