
Using Model Organism Databases (MODs)


Abstract

 

Model Organism Databases (MODs) represent the union of database technology and biology, and are essential to modern biological and medical research. Research communities are producing floods of new data, of increasingly different types and complexity. MODs assimilate this information from a wide variety of sources, organize it in a comprehensible manner, and make it freely available to the public via the Internet. MODs permit researchers to sort through massive amounts of data, providing access to key information that they might otherwise have overlooked. The protocols in this unit offer a general introduction to different types of data available in the growing number of MODs, and approaches for accessing, browsing, and querying these data. Curr. Protoc. Essential Lab. Tech. 1:11.4.1‐11.4.17. © 2009 by John Wiley & Sons, Inc.

Keywords: Genome project; genetics; DNA sequence; gene model; protein function

     
 
GO TO THE FULL PROTOCOL:
PDF or HTML at Wiley Online Library

Table of Contents

  • Overview and Principles
  • Basic Protocol 1: General Guidelines for Using a Model Organism Database Using the Saccharomyces Genome Database as an Example
  • Basic Protocol 2: Obtaining a Sequence from Gbrowse
  • Basic Protocol 3: Using Textpresso to Search Full Text Papers
  • Commentary
  • Literature Cited
  • Figures
     
 


Figures

  •   Figure 11.4.1 The SGD home page (http://www.yeastgenome.org), like most MOD home pages, is the main point of entry to the Web site. The home page lists news items and announcements, and provides links to different areas and tools provided by SGD. Ovals indicate the database Search box, a link to the Advanced Search tool, and the Google‐based html Web site search.
  •   Figure 11.4.2 A locus summary page in SGD showing the different types of information included on a typical gene page. A portion of the page at the top that contains the SGD toolbar and Search box is not shown. A portion of the page at the bottom that contains a gene summary paragraph and references is not shown.
  •   Figure 11.4.3 Search Results for gene/protein information and functional annotations in SGD from a simple search using the phrase “histone deacetylase.”
  •   Figure 11.4.4 Search Results for Gene Ontology molecular function terms that contain the words “histone deacetylase.”
  •   Figure 11.4.5 The top portion of the Gene Ontology detail page in SGD for the molecular function term “histone deacetylase activity.”
  •   Figure 11.4.6 A view of a portion of S. cerevisiae chromosome II in SGD's version of the GBrowse genome browser. GBrowse allows queries using the Landmark or Region search box, download of DNA sequences using the Reports & Analysis pull‐down, adjustment of the viewing window using the Scroll/Zoom menu, and customization of the tracks shown in the Details field.
  •   Figure 11.4.7 An example of a text search (“histone deacetylase”) in SGD's version of GBrowse that generates multiple results.
  •   Figure 11.4.8 Entry page in SGD for the Textpresso text‐mining system.
  •   Figure 11.4.9 The top portion of a results page in SGD's version of Textpresso for the query “DNA helicase cancer.”

Literature Cited

   Arnaud, M.B., Costanzo, M.C., Skrzypek, M.S., Binkley, G., Lane, C., Miyasato, S.R., and Sherlock, G. 2005. The Candida Genome Database (CGD), a community resource for Candida albicans gene and protein information. Nucleic Acids Res. 33:D358‐D363.
   Arnaud, M.B., Costanzo, M.C., Skrzypek, M.S., Shah, P., Binkley, G., Miyasato, S.R., and Sherlock, G. 2009. Aspergillus Genome Database http://www.aspergillusgenome.org/ (April 8, 2009).
   Blake, J.A., and Harris, M.A. 2008. The Gene Ontology (GO) Project: Structured vocabularies for molecular biology and their application to genome and expression analysis. Curr. Protoc. Bioinform. 23:7.2.1‐7.2.9.
   Blake, J.A., Bult, C.J., Eppig, J.T., Kadin, J.A., Richardson, J.E., and the Mouse Genome Database Group. 2009. The Mouse Genome Database genotypes::phenotypes. Nucleic Acids Res. 37:D712‐D719.
   Cherry, J.M., Adler, C., Ball, C., Chervitz, S.A., Dwight, S.S., Hester, E.T., Jia, Y., Juvik, G., Roe, T., Schroeder, M., Weng, S., and Botstein, D. 1998. SGD: Saccharomyces Genome Database. Nucleic Acids Res. 26:73‐79.
   Donlin, M.J. 2007. Using the Generic Genome Browser (GBrowse). Curr. Protoc. Bioinform. 17:9.9.1‐9.9.24.
   Dwight, S.S., Balakrishnan, R., Christie, K.R., Costanzo, M.C., Dolinski, K., Engel, S.R., Feierbach, B., Fisk, D.G., Hirschman, J., Hong, E.L., Issel‐Tarver, L., Nash, R.S., Sethuraman, A., Starr, B., Theesfeld, C.L., Andrada, R., Binkley, G., Dong, Q., Lane, C., Schroeder, M., Weng, S., Botstein, D., and Cherry, J.M. 2004. Saccharomyces genome database: Underlying principles and organisation. Brief. Bioinform. 5:9‐22.
   Gelbart, W.M., Crosby, M., Matthews, B., Rindone, W.P., Chillemi, J., Russo Twombly, S., Emmert, D., Ashburner, M., Drysdale, R.A., Whitfield, E., Millburn, G.H., de Grey, A., Kaufman, T., Matthews, K., Gilbert, D., Strelets, V., and Tolstoshev, C. 1997. FlyBase: A Drosophila Database. Nucleic Acids Res. 25:63‐66.
   Goffeau, A., Barrell, B.G., Bussey, H., Davis, R.W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J.D., Jacq, C., Johnston, M., Louis, E.J., Mewes, H.W., Murakami, Y., Philippsen, P., Tettelin, H., and Oliver, S.G. 1996. Life with 6000 genes. Science 274:546‐567.
   Harris, M., and the Gene Ontology Consortium. 2008. The Gene Ontology project in 2008. Nucleic Acids Res. 36:D440‐D444.
   Karolchik, D., Hinrichs, A.S., and Kent, W.J. 2007. The UCSC Genome Browser. Curr. Protoc. Bioinform. 17:1.4.1‐1.4.24.
   Kreppel, L., Fey, P., Gaudet, P., Just, E., Kibbe, W.A., Chisholm, R.L., and Kimmel, A.R. 2004. dictyBase: A new Dictyostelium discoideum genome database. Nucleic Acids Res. 32:D332‐D333.
   Müller, H.M., Kenny, E.E., and Sternberg, P.W. 2004. Textpresso: An ontology‐based information retrieval and extraction system for biological literature. PLoS Biol. 2:e309.
   Ono, B.I., Hazu, T., Yoshida, S., Kawato, T., Shinoda, S., Brzywczy, J., and Paszewski, A. 1999. Cysteine biosynthesis in Saccharomyces cerevisiae: A new outlook on pathway and regulation. Yeast 15:1365‐1375.
   Reiser, L. and Rhee, S.Y. 2005. Using the Arabidopsis Information Resource (TAIR) to find information about Arabidopsis genes. Curr. Protoc. Bioinform. 9:1.11.1‐1.11.45.
   Rhee, S.Y. 2000. Bioinformatic resources, challenges, and opportunities using Arabidopsis thaliana as a model organism in post‐genomic era. Plant Physiol. 124:1460‐1464.
   Schwarz, E.M. and Sternberg, P.W. 2006. Searching WormBase for information about Caenorhabditis elegans. Curr. Protoc. Bioinform. 14:1.8.1‐1.8.43.
   Shaw, D. 2009. Searching the Mouse Genome Informatics (MGI) resources for information on mouse biology from genotype to phenotype. Curr. Protoc. Bioinform. 25:1.7.1‐1.7.14.
   Stein, L.D., Sternberg, P., Durbin, R., Thierry‐Mieg, J., and Spieth, J. 2001. WormBase: Network access to the genome and biology of Caenorhabditis elegans. Nucleic Acids Res. 29:82‐86.
   Stein, L.D., Mungall, C., Shu, S., Caudy, M., Mangone, M., Day, A., Nickerson, E., Stajich, J.E., Harris, T.W., Arva, A., and Lewis, S. 2002. The generic genome browser: A building block for a model organism system database. Genome Res. 12:1599‐1610.
   Twigger, S.N., Smith, J.S., Zuniga‐Meyer, A., and Bromberg, S.K. 2006. Exploring phenotypic data at the Rat Genome Database. Curr. Protoc. Bioinform. 14:1.14.1‐1.14.27.
   Twigger, S.N., Shimoyama, M., Bromberg, S., Kwitek, A.E., Jacob, H.J., and the RGD Team. 2007. The Rat Genome Database, update 2007 – Easing the path from disease to data and back again. Nucleic Acids Res. 35:D658‐D662.
Internet Resources
   http://www.agbase.msstate.edu
   Resource for functional analysis of agricultural plant and animal gene products.
   http://www.arabidopsis.org
   The Arabidopsis Information Resource (TAIR): Database of genetic and molecular biology data for the plant Arabidopsis thaliana.
   http://crfb.univ-mrs.fr/aniseed
   Ascidian Network for InSitu Expression and Embryological Data (ANISEED): Database for Ciona intestinalis, C. savignyi, Halocynthia roretzi, and Phallusia mammillata.
   http://agd.vital-it.ch
   Ashbya Genome Database (AGD): Database of gene annotation and microarray data for Ashbya gossypii and Saccharomyces cerevisiae.
   http://www.aspergillusgenome.org
   Aspergillus Genome Database (AspGD): Resource for genomic sequence data and gene and protein information for Aspergilli.
   http://bovinegenome.org
   Database that integrates bovine genomics data with structural and functional annotations of genes and the genome.
   http://www.candidagenome.org
   Database that serves as a resource for genomic sequence data and gene and protein information for Candida albicans.
   http://dictybase.org
   Resource for the biology and genomics of the social amoeba Dictyostelium discoideum.
   http://ecolihub.org
   Centralized resource linking various E. coli online information services, databases, and Web sites.
   http://flybase.org
   Database of Drosophila genes and genomes.
   http://gmod.org
   Generic Model Organism Database (GMOD) project: Collection of open source software tools for creating genome‐scale biological databases.
   http://www.gramene.org
   Data resource for comparative genome analysis in the grasses.
   http://www.beebase.org
   Hymenoptera Genome Database (BeeBase): Database of genes and genomes of Apis mellifera and Nasonia vitripennis.
   http://www.informatics.jax.org
   Mouse Genome Informatics: Resource for the laboratory mouse, providing genetic, genomic, and biological data for the study of human health and disease.
   http://paramecium.cgm.cnrs-gif.fr
   Database of genomic sequence and genetic data for Paramecium tetraurelia.
   http://rgd.mcw.edu
   Rat Genome Database (RGD): Database of laboratory rat genetic and genomic data, including information for quantitative trait loci, mutations, and phenotypes.
   http://www.yeastgenome.org
   Saccharomyces Genome Database (SGD): Scientific database of the molecular biology and genetics of the yeast Saccharomyces cerevisiae.
   http://www.genedb.org/genedb/pombe
   Schizosaccharomyces pombe GeneDB: Database of genetic features, functional annotations, and other information for fission yeast.
   http://smedgd.neuro.utah.edu
   Schmidtea mediterranea Genome Database (SmedGD): Database for information associated with the planarian genome.
   http://www.textpresso.org
   Text‐mining system for scientific literature.
   http://wfleabase.org
   Web service that provides gene and genomic information for species of the genus Daphnia, commonly known as the water flea.
   http://www.wormbase.org
   Biology and genomic information for Caenorhabditis species.
   http://zfin.org
   Zebrafish Information Network: Database for the molecular biology and genetics of zebrafish.
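
As a minimal sketch of how the resource list above might be kept at hand and searched, the following Python snippet stores a few of the listed databases in a dictionary and filters them by a keyword in their description. The names `MODS` and `find_mods` are hypothetical and introduced only for illustration. The URLs and descriptions are copied from the list above, and the code does not contact any database.

```python
# Hypothetical lookup table: a few entries copied from the Internet Resources list.
# Keys are database abbreviations; values are (home page URL, description).
MODS = {
    "SGD": ("http://www.yeastgenome.org",
            "Molecular biology and genetics of the yeast Saccharomyces cerevisiae"),
    "FlyBase": ("http://flybase.org",
                "Drosophila genes and genomes"),
    "WormBase": ("http://www.wormbase.org",
                 "Biology and genomic information for Caenorhabditis species"),
    "ZFIN": ("http://zfin.org",
             "Molecular biology and genetics of zebrafish"),
    "TAIR": ("http://www.arabidopsis.org",
             "Genetic and molecular biology data for the plant Arabidopsis thaliana"),
    "RGD": ("http://rgd.mcw.edu",
            "Laboratory rat genetic and genomic data"),
}


def find_mods(keyword):
    """Return the sorted names of databases whose description contains keyword.

    The match is a plain, case-insensitive substring test on the description.
    """
    needle = keyword.lower()
    return sorted(
        name for name, (_url, description) in MODS.items()
        if needle in description.lower()
    )


if __name__ == "__main__":
    # Print the home page of every listed database that mentions "genetics".
    for name in find_mods("genetics"):
        print(name, MODS[name][0])
```

A table like this is only a local convenience for finding the right home page. The actual searching and browsing still happens on each database's own Web site.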