Generic Model Organism Database

The Generic Model Organism Database (GMOD) project provides biological research communities with a toolkit of open-source software components for visualizing, annotating, managing, and storing biological data. The GMOD project is funded by the United States National Institutes of Health, National Science Foundation and the USDA Agricultural Research Service.

Generic Model Organism Database (GMOD)
Operating systemWindows, Mac OS X
TypeBioinformatics
LicenseGPL v3
Websitegmod.org/wiki/Main_Page Edit this on Wikidata

History

edit

The GMOD project was started in the early 2000s as a collaboration between several model organism databases (MODs) who shared a need to create similar software tools for processing data from sequencing projects. MODs, or organism-specific databases, describe genome and other information about important experimental organisms in the life sciences and capture the large volumes of data and information being generated by modern biology. Rather than each group designing their own software, four major MODs--FlyBase, Saccharomyces Genome Database, Mouse Genome Database, and at or run off a Chado schema database.

Chado database schema

edit

The Chado[1] schema aims to cover many of the classes of data frequently used by modern biologists, from genetic data to phylogenetic trees to publications to organisms to microarray data to IDs to RNA/protein expression. Chado makes extensive use of controlled vocabularies to type all entities in the database; for example: genes, transcripts, exons, transposable elements, etc., are stored in a feature table, with the type provided by Sequence Ontology. When a new type is added to the Sequence Ontology, the feature table requires no modification, only an update of the data in the database. The same is largely true of analysis data that can be stored in Chado as well.

The existing core modules of Chado are:

  • sequence - for sequences/features
  • cv - for controlled-vocabs/ontologies
  • general - currently just dbxrefs
  • organism - taxonomic data
  • pub - publication and references
  • companalysis - augments sequence module with computational analysis data
  • map - non-sequence maps
  • genetic - genetic and phenotypic data
  • expression - gene expression
  • natural diversity - population data

Software

edit

The full list of GMOD software components is found on the GMOD Components page.[2] These components include:

  • GMOD Core (Chado database and tools)
    • Chado: the Chado schema and tools to install it.[3]
    • XORT: a tool for loading and dumping chado-xml[4]
    • GMODTools: extracts data from a Chado database into common genome bulk formats (GFF, Fasta, etc.)[5]
  • MOD website
  • Genome Editing and Visualization
    • Apollo: a Java application for viewing and editing genome annotations[7][8]
    • GBrowse: a CGI application for displaying genome annotations[9][10]
    • JBrowse: a JavaScript application for displaying genome annotations[11]
    • Pathway Tools: a genome browser with a comparative mode
  • Comparative Genomics
    • GBrowse_syn: a GBrowse based synteny viewer[12]
    • CMap: a CGI application for displaying comparative maps[13]
  • Literature curation
    • Textpresso: a text mining system for scientific literature [14]
  • Database querying tools
    • BioMart: a query-oriented data management system
    • InterMine: open source data warehouse system
  • Biological Pathways
    • Pathway Tools: tools for metabolic pathway information, and analysis of high-throughput functional genomics data
  • Regulatory Networks
    • Pathway Tools: supports definition of regulatory interactions and browsing of regulatory networks
  • Analysis

Participating databases

edit

The following organism databases are contributing to and/or adopting GMOD components for model organism databases.

NISEED[17]AntonosporaDB[citation needed]Arabidopsis[18]
BeeBaseBeetleBase[19][20]Bovine genome database (BGD)
BioHealthBase[21]Bovine QTL ViewerCattle EST Gene Family Database
CGDCGLChromDB
Chromosome 7 Annotation ProjectCSHLmpdDatabase of Genomic Variants
DictyBase[22]DroSpeGeEcoCyc
FlyBaseFungal Comparative GenomicsFungal Telomere Browser
Gallus Genome BrowserGeneDBGrainGenes
GrameneHapMapHuman 2q33
Human Genome Segmental Duplication DatabaseIVDBMAGI
Marine Biological Lab Organism DatabasesMouse Genome InformaticsNon-Human Segmental Duplication Database
OMAPOryGenesDBOryza Chromosome 8
Pathway ToolsParameciumDB[23]PeanutMap
PlantsDBPlasmoDBPomBase
PseudoCAPPossumBasePUMAdb
Rat Genome DatabaseSaccharomyces Genome DatabaseSGD Lite
SmedDBSol Genomics NetworkSoybase
Soybean Gbrowse DatabaseT1DBaseThe Arabidopsis Information Resource
TGDThe Genome InstituteThe Institute for Genomic Research
TIGR Rice Genome BrowserToxoDBTriAnnot BAC Viewer
VectorBasewFleaBase[24]WormBase
XanthusBaseXenbase
edit

See also

edit

References

edit
edit