The Human Gene Mutation Database (HGMD) and Its Exploitation in the Fields of Personalized Genomics and Molecular Evolution
互联网
- Abstract
- Table of Contents
- Figures
- Literature Cited
Abstract
The Human Gene Mutation Database (HGMD) constitutes a comprehensive core collection of data on germ?line mutations in nuclear genes underlying or associated with human inherited disease (http://www.hgmd.org). Data cataloged include single?base?pair substitutions in coding, regulatory, and splicing?relevant regions, micro?deletions and micro?insertions, indels, and triplet repeat expansions, as well as gross gene deletions, insertions, duplications, and complex rearrangements. Each mutation is entered into HGMD only once, in order to avoid confusion between recurrent and identical?by?descent lesions. By March 2012, the database contained in excess of 123,600 different lesions (HGMD Professional release 2012.1) detected in 4,514 different nuclear genes, with new entries currently accumulating at a rate in excess of 10,000 per annum. ?6,000 of these entries constitute disease?associated and functional polymorphisms. HGMD also includes cDNA reference sequences for more than 98% of the listed genes. Curr. Protoc. Bioinform. 39:1.13.1?1.13.20. © 2012 by John Wiley & Sons, Inc.
Keywords: HGMD; mutation; database; inherited disease; gene
Table of Contents
- Commentary
- Literature Cited
- Figures
- Tables
Materials
Figures
-
Figure 1.13.1 The HGMD home page. View Image -
Figure 1.13.2 Example of a gene page for the factor VIII ( F8 ) gene. All mutation data, plus external links, can be accessed through this page. View Image -
Figure 1.13.3 Example of missense/nonsense mutation data for the F8 gene. HGMD accession number, codon number, nucleotide change, amino acid change, phenotype, links to references and curatorial comments are provided. View Image -
Figure 1.13.4 Example of splice mutation data for the F8 gene. HGMD accession number, IVS number, location (expressed relative to the donor or acceptor splice site), nucleotide substitution, phenotype, links to references, and curatorial comments are provided. View Image -
Figure 1.13.5 Example of regulatory mutation data for the F8 gene. The nucleotide change is shown with 30 bp of flanking sequence either side. Phenotype, HGMD accession number, relative location, reference data, and curatorial comments are also provided. View Image -
Figure 1.13.6 Example of micro‐deletion mutation data for the F8 gene. The deletion is shown with 10 bp of flanking sequence either side. The caret (⁁) marks the preceding complete codon (number provided). HGMD accession number, phenotype, links to references, and curatorial comments are also provided. View Image -
Figure 1.13.7 Example of micro‐insertion mutation data for the F8 gene. HGMD accession number, location (nucleotide/codon), inserted base(s), phenotype, links to references, and curatorial comments are provided. View Image -
Figure 1.13.8 Example of small indel mutation data for the F8 gene. The indel is shown with 10 bp of flanking sequence either side of the deleted bases. Inserted bases are in lowercase. The caret (⁁) marks the preceding complete codon (number provided). HGMD accession number, phenotype, links to references, and curatorial comments are also provided. View Image -
Figure 1.13.9 Example of gross deletion mutation data for the F8 gene. A narrative‐style description, phenotype, links to references, and curatorial comments are provided. View Image -
Figure 1.13.10 Example of gross insertion mutation data for the F8 gene. A narrative‐style description, phenotype, links to references, and curatorial comments are provided. View Image -
Figure 1.13.11 Example of complex mutation data for the F8 gene. A narrative‐style description, phenotype, links to references, and curatorial comments are provided. View Image
Videos
Literature Cited
Literature Cited | |
1000 Genomes Project Consortium. 2010. A map of human genome variation from population‐scale sequencing. Nature 467:1061‐1073. | |
Amberger, J., Bocchini, C., and Hamosh, A. 2011. A new face and new challenges for Online Mendelian Inheritance in Man (OMIM). Hum. Mutat. 32:564‐567. | |
Ball, E.V., Stenson, P.D., Krawczak, M., Cooper, D.N., and Chuzhanova, N.A. 2005. Microdeletions and microinsertions causing human genetic disease: Common mechanisms of mutagenesis and the role of local DNA sequence complexity. Hum. Mutat. 26:205‐213. | |
Becker, K.G., Barnes, K.C., Bright, T.J., and Wang, S.A. 2004. The genetic association database. Nat. Genet. 36:431‐432. | |
Cooper, D.N., Nussbaum, R.L., and Krawczak, M. 2002. Proposed guidelines for papers describing DNA polymorphism‐disease associations. Hum. Genet. 110:207‐208. | |
Cooper, D.N., Bacolla, A., Férec, C., Vasquez, K.M., Kehrer‐Sawatzki, H., and Chen, J.M. 2011. On the sequence‐directed nature of human gene mutation: The role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease. Hum. Mutat. 32:1075‐1099. | |
den Dunnen, J.T. and Antonarakis, S.E. 2001. Nomenclature recommendations: Nomenclature for the description of human sequence variations. Hum. Genet. 109:121‐124. | |
Forbes, S.A., Tang, G., Bindal, N., Bamford, S., Dawson, E., Cole, C., Kok, C.Y., Jia, M., Ewing, R., Menzies, A., Teague, J.W., Stratton, M.R., and Futreal, P.A. 2010. COSMIC (the Catalogue of Somatic Mutations in Cancer): A resource to investigate acquired mutations in human cancer. Nucleic Acids Res. 38:D652‐D657. | |
Frézal, J. 1998. GenAtlas database, genes and development defects. C. R. Acad. Sci. III 321:805‐817. | |
Gibbs, R.A., Weinstock, G.M., Metzker, M.L., Muzny, D.M., Sodergren, E.J., Scherer, S., Scott, G., Steffen, D., Worley, K.C., Burch, P.E., Okwuonu, G., et al.; Rat Genome Sequencing Project Consortium. 2004. Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature 428:493‐521. | |
Hirakawa, M., Tanaka, T., Hashimoto, Y., Kuroda, M., Takagi, T., and Nakamura, Y. 2002. JSNP: A database of common gene variations in the Japanese population. Nucleic Acids Res. 30:158‐162. | |
Kerlavage, A., Bonazzi, V., di Tommaso, M., Lawrence, C., Li, P., Mayberry, F., Mural, R., Nodell, M., Yandell, M., Zhang, J., Thomas, P. 2002. The Celera Discovery System. Nucleic Acids Res. 30:129‐136. | |
Krawczak, M., Ball, E.V., and Cooper, D.N. 1998. Neighboring nucleotide effects on the rates of meiotic single base‐pair substitution in human genes. Am. J. Hum. Genet. 63:474‐488. | |
Krawczak, M., Chuzhanova, N.A., Stenson, P.D., Johansen, B.N., Ball, E.V., and Cooper, D.N. 2000. Changes in primary DNA sequence complexity influence the phenotypic consequences of mutations in human gene regulatory regions. Hum. Genet. 107:362‐365. | |
Levy, S., Sutton, G., Ng, P.C., Feuk, L., Halpern, A.L., Walenz, B.P., Axelrod, N., Huang, J., Kirkness, E.F., Denisov, G., Lin, Y., MacDonald, J.R., Pang, A.W., Shago, M., Stockwell, T.B., Tsiamouri, A., Bafna, V., Bansal, V., Kravitz, S.A., Busam, D.A., Beeson, K.Y., McIntosh, T.C., Remington, K.A., Abril, J.F., Gill, J., Borman, J., Rogers, Y.H., Frazier, M.E., Scherer, S.W., Strausberg, R.L., and Venter, J.C. 2007. The diploid genome sequence of an individual human. PLoS Biol. 5:e254. | |
MacArthur, D.G., Balasubramanian, S., Frankish, A., Huang, N., Morris, J., Walter, K., Jostins, L., Habegger, L., Pickrell, J.K., Montgomery, S.B., Albers, C.A., Zhang, Z.D., Conrad, D.F., Lunter, G., Zheng, H., Ayub, Q., DePristo, M.A., Banks, E., Hu, M., Handsaker, R.E., Rosenfeld, J.A., Fromer, M., Jin, M., Mu, X.J., Khurana, E., Ye, K., Kay, M., Saunders, G.I., Suner, M.M., Hunt, T., Barnes, I.H., Amid, C., Carvalho‐Silva, D.R., Bignell, A.H., Snow, C., Yngvadottir, B., Bumpstead, S., Cooper, D.N., Xue, Y., Romero, I.G.; 1000 Genomes Project Consortium, Wang, J., Li, Y., Gibbs, R.A., McCarroll, S.A., Dermitzakis, E.T., Pritchard, J.K., Barrett, J.C, Harrow, J., Hurles, M.E., Gerstein, M.B., and Tyler‐Smith, C. 2012. A systematic survey of loss‐of‐function variants in human protein‐coding genes. Science 335:823‐828. | |
Maglott, D., Ostell, J., Pruitt, K.D., and Tatusova, T. 2011. Entrez Gene: Gene‐centered information at NCBI. Nucleic Acids Res. 39:D52‐D57. | |
Marth, G.T., Yu, F., Indap, A.R., Garimella, K., Gravel, S., Leong, W.F., Tyler‐Smith, C., Bainbridge, M., Blackwell, T., Zheng‐Bradley, X., Chen, Y., Challis, D., Clarke, L., Ball, E.V., Cibulskis, K., Cooper, D.N., Fulton, B., Hartl, C., Koboldt, D., Muzny, D., Smith, R., Sougnez, C., Stewart, C., Ward, A., Yu, J., Xue, Y., Altshuler, D., Bustamante, C.D., Clark, A.G., Daly, M., Depristo, M., Flicek, P., Gabriel, S., Mardis, E., Palotie, A., Gibbs, R.; the 1000 Genomes Project. 2011. The functional spectrum of low‐frequency coding variation. Genome Biol. 12:R84. | |
Pagon, R.A. 2006. GeneTests: An online genetic information resource for health care providers. J. Med. Libr. Assoc. 94:343‐348. | |
Rhesus Macaque Genome Sequencing and Analysis Consortium, Gibbs, R.A., Rogers, J., Katze, M.G., Bumgarner, R., Weinstock, G.M., Mardis, E.R., Remington, K.A., Strausberg, R.L., Venter, J.C., Wilson, R.K., Batzer, M.A., Bustamante, C.D., Eichler, E.E., Hahn, M.W., Hardison, R.C., Makova, K.D., Miller, W., Milosavljevic, A., Palermo, R.E., Siepel, A., Sikela, J.M., Attaway, T., Bell, S., Bernard, K.E., Buhay, C.J., Chandrabose, M.N., Dao, M., Davis, C., Delehaunty, K.D., Ding, Y., Dinh, H.H., Dugan‐Rocha, S., Fulton, L.A., Gabisi, R.A., Garner, T.T., Godfrey, J., Hawes, A.C., Hernandez, J., Hines, S., Holder, M., Hume, J., Jhangiani, S.N., Joshi, V., Khan, Z.M., Kirkness, E.F., Cree, A., Fowler, R.G., Lee, S., Lewis, L.R., Li, Z., Liu, Y.S., Moore, S.M., Muzny, D., Nazareth, L.V., Ngo, D.N., Okwuonu, G.O., Pai, G., Parker, D., Paul, H.A., Pfannkoch, C., Pohl, C.S., Rogers, Y.H., Ruiz, S.J., Sabo, A., Santibanez, J., Schneider, B.W., Smith, S.M., Sodergren, E., Svatek, A.F., Utterback, T.R., Vattathil, S., Warren, W., White, C.S., Chinwalla, A.T., Feng, Y., Halpern, A.L., Hillier, L.W., Huang, X., Minx, P., Nelson, J.O., Pepin, K.H., Qin, X., Sutton, G.G., Venter, E., Walenz, B.P., Wallis, J.W., Worley, K.C., Yang, S.P., Jones, S.M., Marra, M.A., Rocchi, M., Schein, J.E., Baertsch, R., Clarke, L., Csürös, M., Glasscock, J., Harris, R.A., Havlak, P., Jackson, A.R., Jiang, H., Liu, Y., Messina, D.N., Shen, Y., Song, H.X., Wylie, T., Zhang, L., Birney, E., Han, K., Konkel, M.K., Lee, J., Smit, A.F., Ullmer, B., Wang, H., Xing, J., Burhans, R., Cheng, Z., Karro, J.E., Ma, J., Raney, B., She, X., Cox, M.J., Demuth, J.P., Dumas, L.J., Han, S.G., Hopkins, J., Karimpour‐Fard, A., Kim, Y.H., Pollack, J.R., Vinar, T., Addo‐Quaye, C., Degenhardt, J., Denby, A., Hubisz, M.J., Indap, A., Kosiol, C., Lahn, B.T., Lawson, H.A., Marklein, A., Nielsen, R., Vallender, E.J., Clark, A.G., Ferguson, B., Hernandez, R.D., Hirani, K., Kehrer‐Sawatzki, H., Kolb, J., Patil, S., Pu, L.L., Ren, Y., Smith, D.G., Wheeler, D.A., Schenck, I., Ball, E.V., Chen, R., Cooper, D.N., Giardine, B., Hsu, F., Kent, W.J., Lesk, A., Nelson, D.L., O'brien, W.E., Prüfer, K., Stenson, P.D., Wallace, J.C., Ke, H., Liu, X.M., Wang, P., Xiang, A.P., Yang, F., Barber, G.P., Haussler, D., Karolchik, D., Kern, A.D., Kuhn, R.M., Smith, K.E., and Zwieg, A.S. 2007. Evolutionary and biomedical insights from the rhesus macaque genome. Science 316:222‐234. | |
Ruiz‐Pesini, E., Lott, M.T., Procaccio, V., Poole, J.C., Brandon, M.C., Mishmar, D., Yi, C., Kreuziger, J., Baldi, P., and Wallace, D.C. 2007. An enhanced MITOMAP with a global mtDNA mutational phylogeny. Nucleic Acids Res. 35:D823‐D828. | |
Safran, M., Dalah, I., Alexander, J., Rosen, N., Iny Stein, T., Shmoish, M., Nativ, N., Bahir, I., Doniger, T., Krug, H., Sirota‐Madi, A., Olender, T., Golan, Y., Stelzer, G., Harel, A., and Lancet, D. 2010. GeneCards Version 3: The human gene integrator. Database (Oxford) 2010:baq020. | |
Sanford, J.R., Wang, X., Mort, M., Vanduyn, N., Cooper, D.N., Mooney, S.D., Edenberg, H.J., and Liu, Y. 2009. Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts. Genome Res. 19:381‐394. | |
Scally, A., Dutheil, J.Y., Hillier, L.W., Jordan, G.E., Goodhead, I., Herrero, J., Hobolth, A., Lappalainen, T., Mailund, T., Marques‐Bonet, T., McCarthy, S., Montgomery, S.H., Schwalie, P.C., Tang, Y.A., Ward, M.C., Xue, Y., Yngvadottir, B., Alkan, C., Andersen, L.N., Ayub, Q., Ball, E.V., Beal, K., Bradley, B.J., Chen, Y., Clee, C.M., Fitzgerald, S., Graves, T.A., Gu, Y., Heath, P., Heger, A., Karakoc, E., Kolb‐Kokocinski, A., Laird, G.K., Lunter, G., Meader, S., Mort, M., Mullikin, J.C., Munch, K., O'Connor, T.D., Phillips, A.D., Prado‐Martinez, J., Rogers, A.S., Sajjadian, S., Schmidt, D., Shaw, K., Simpson, J.T., Stenson, P.D., Turner, D.J., Vigilant, L., Vilella, A.J., Whitener, W., Zhu, B., Cooper, D.N., de Jong, P., Dermitzakis, E.T., Eichler, E.E., Flicek, P., Goldman, N., Mundy, N.I., Ning, Z., Odom, D.T., Ponting, C.P., Quail, M.A., Ryder, O.A., Searle, S.M., Warren, W.C., Wilson, R.K., Schierup, M.H., Rogers, J., Tyler‐Smith, C., and Durbin, R. 2012. Insights into hominid evolution from the gorilla genome sequence. Nature 483:169‐175. | |
Seal, R.L., Gordon, S.M., Lush, M.J., Wright, M.W., and Bruford, E.A. 2011. Genenames.org: The HGNC resources in 2011. Nucleic Acids Res. 39:D514‐519. | |
Stenson, P.D., Mort, M., Ball, E.V., Howells, K., Phillips, A.D., Thomas, N.S., and Cooper, D.N. 2009. The Human Gene Mutation Database: 2008 update. Genome Med. 1:13. | |
Sterne‐Weiler, T., Howard, J., Mort, M., Cooper, D.N., and Sanford, J.R. 2011. Loss of exon identity is a common mechanism of human inherited disease. Genome Res. 21:1563‐1571. | |
UniProt Consortium. 2012. Reorganizing the protein space at the Universal Protein Resource (UniProt). Nucleic Acids Res. 40:D71‐D75. | |
Vogt, G., Chapgier, A., Yang, K., Chuzhanova, N., Feinberg, J., Fieschi, C., Boisson‐Dupuis, S., Alcais, A., Filipe‐Santos, O., Bustamante, J., de Beaucoudrey, L., Al‐Mohsen, I., Al‐Hajjar, S., Al‐Ghonaium, A., Adimi, P., Mirsaeidi, M., Khalilzadeh, S., Rosenzweig, S., de la Calle Martin, O., Bauer, T.R., Puck, J.M., Ochs, H.D., Furthner, D., Engelhorn, C., Belohradsky, B., Mansouri, D., Holland, S.M., Schreiber, R.D., Abel, L., Cooper, D.N., Soudais, C., and Casanova, J‐L. 2005. Gains‐of‐glycosylation comprise an unexpectedly large group of pathogenic mutations. Nat. Genet. 37:692‐700. | |
Wheeler, D.A., Srinivasan, M., Egholm, M., Shen, Y., Chen, L., McGuire, A., He, W., Chen, Y.J., Makhijani, V., Roth, G.T., Gomes, X., Tartaro, K., Niazi, F., Turcotte, C.L., Irzyk, G.P., Lupski, J.R., Chinault, C., Song, X.Z., Liu, Y., Yuan, Y., Nazareth, L., Qin, X., Muzny, D.M., Margulies, M., Weinstock, G.M., Gibbs, R.A., and Rothberg, J.M. 2008. The complete genome of an individual by massively parallel DNA sequencing. Nature 452:872‐876. | |
Yan, G., Zhang, G., Fang, X., Zhang, Y., Li, C., Ling, F., Cooper, D.N., Li, Q., Li, Y., van Gool, A.J., Du, H., Chen, J., Chen, R., Zhang, P., Huang, Z., Thompson, J.R., Meng, Y., Bai, Y., Wang, J., Zhuo, M., Wang, T., Huang, Y., Wei, L., Li, J., Wang, Z., Hu, H., Yang, P., Le, L., Stenson, P.D., Li, B., Liu, X., Ball, E.V., An, N., Huang, Q., Zhang, Y., Fan, W., Zhang, X., Li, Y., Wang, W., Katze, M.G., Su, B., Nielsen, R., Yang, H., Wang, J., Wang, X., and Wang, J. 2011. Genome sequencing and comparison of two nonhuman primate animal models, the cynomolgus and Chinese rhesus macaques. Nat. Biotechnol. 29:1019‐1023. |