Overview
The mouse is a key model organism for the understanding of mammalian biology because it has been well studied and is genetically and physiologically similar to humans. To utilize mouse data to its fullest, we have developed an integrated database of mouse genetic, genomic and biological data. The Mouse Genome Informatics Database (MGI) is used by the international scientific community as its primary resource for mouse information and as a tool for new biological discovery. The database contains a wide variety of data pertaining to genes, their DNA and protein sequences, and the phenotypes that result from mutations in different genes. The three central components of MGI are the Mouse Genome Database (MGD), an internationally recognized database for the laboratory mouse, the Mouse Tumor Biology (MTB) database, which facilitates the selection of experimental models for cancer research, and the International Mouse Strain Resource (IMSR), a searchable online database cataloging mouse stocks available worldwide. The database continues to expand to keep abreast of new technologies and to grow with our expanding knowledge of how the genetic blueprint of DNA manifests in traits of a living individual.
Research details
Bioinformatics Resources for the Mouse: From Sequence to Models of Human Disease
The mouse is an exceptional mammalian model system for connecting knowledge from sequence to human disease. The mouse has the unique combination of a well-developed genetic map, a sequenced genome, a large collection of inbred and genetically engineered strains, and extensive technologies to manipulate its genome. The Mouse Genome Informatics Database (MGI) integrates genetic, genomic, biological, and phenotypic data about the laboratory mouse to support the understanding of normal development and the mechanisms underlying heritable diseases.
Many diverse types of data, varying in granularity, representation, and organization come together in MGI. These data are integrated through resolution of object identities and application of structured vocabularies (ontologies) for anatomy, gene function, phenotypes, tumor and disease terms; and by adherence to international nomenclature standards. MGI data integration is critical for posing complex questions and enabling retrieval of complete results. Integration encourages data exploration, provides connections between biological data concepts, and can lead to new insights and knowledge discovery. Three components of MGI are outlined below.
Mouse Genome Database (MGD)
MGD (www.informatics.jax.org) is the internationally recognized database for the laboratory mouse. Among the many types of data represented are genes and genomic features; nucleotide and protein sequences; molecular reagents; genomic, genetic, and cytogenetic maps; gene variants, particularly SNPs; orthology relationships between mouse genes and genes in other mammals; gene functions (GO; Gene Ontology); phenotypic descriptions of mouse mutants and inbred strains; and mouse genotypes that model human hereditary diseases. Data in MGD are updated continuously through semi-automated loads, curation of the primary scientific literature, and data submissions from the scientific community. Data exchange and co-curation through collaborations with major resources such as NCBI and UniProt ensure that all of these resources contain the highest quality data. MGD users benefit from links to key information resources such as ENSEMBL, UCSC, InterPro, MGC, OMIM, and others, enabled through shared data element identities. Our web interface allows simple and complex querying and flexible navigation among data pages and graphical displays. The back-end database and supporting software and front-end presentations continue to be improved and expanded to reflect community needs, new research technologies, and the growth and change in the character and types of data being generated.
Mouse Tumor Biology Database (MTB)
MTB (tumor.informatics.jax.org) facilitates the selection of experimental models for cancer research, allows cancer models to be evaluated as they apply to humans, presents mutation patterns for specific mouse cancers, and allows comparison across cancer models. MTB includes data on the organ and tissue of tumor origin, tumor classification, induction treatment, metastases, mouse strains, and specific genes or mutations in the host strain or somatic changes in the tumor.
MTB incorporates data from many laboratories that collect, image, and diagnose mouse tumors. The Mouse Pathology Submission interface allows contributors to create records; generate tumor diagnoses including pathology and treatment descriptions; attach images to diagnoses; and edit and review their submitted data.
The Mouse Tumor Frequency Grid is an interactive visualization tool providing an overview of tumor frequencies and tumor types in different mouse strains (http://tumor.informatics.jax.org/mtbwi/tumorFrequencyGrid.do). Strains are displayed on the vertical axis and organs on the horizontal axis. Color-coded cells in the Grid represent tumor frequencies. Each colored cell is an active link, generating a database query for the underlying data. Users can expand the Grid axes for individual strain families and organ systems, thus "zooming-in" on specific sub-strains and tissues of interest.
International Mouse Strain Resource (IMSR)
IMSR (www.imsr.org) is a searchable online database cataloging mouse stocks available worldwide, including inbred, mutant, and genetically engineered mice and mutant ES cell lines. The goal of the IMSR is to assist the international scientific community in locating and obtaining mouse resources for research. For each strain listed in the IMSR, users can access information about where the strain is available from (repository holder), in what state the strain is available (live, frozen embryo, frozen germplasm, ES cell line), link to descriptive information about the strain, link to data in MGD describing phenotypes of any mutations carried by the strain, and link to a form for contacting the strain holder to order the strain or ask for additional information. The reciprocal links between MGD phenotype records and IMSR strain holder information allow users to find mice based on gene or strain name or phenotype. IMSR currently includes data from 15 repository consortia representing 26 repository sites worldwide.
Accessing the Mouse Genome Informatics (MGI) Resource:
www.informatics.jax.org provides access to all MGI resources.
Web users can use MGI's browsing and searching tools and visual displays to navigate the site. Each web page includes a quick Search Box where a word or partial word can be entered and the type of data to be searched can be specified. For example, a user could enter "pax" into the Search Box and indicate that "Gene symbols/names" are to be searched. Various query forms allow users to ask more complex questions that involve multiple parameters and multiple types of data. For example, using the Phenotypes/Allele query form, one could ask "What mutant alleles located on Chromosome 11 between 40 and 100 Mb were created by gene targeting and are models for cardiomyopathy?"
Computational users can access MGI data in nightly-generated tab-delimited data files containing key data elements downloadable for analyses. On request, we also provide SQL (structured query language) accounts for directly querying the database.
MGI-LIST, an electronic bulletin board for the mouse genomics community has over 2,000 subscribers. To subscribe, visit the website. User Support is available by phone (207-288-6445), fax (207-288-6132), or email to assist with using MGI data resources.
Read about other MGI collaborating projects in the reports by Drs. Blake, Bult, and Ringwald.
Lab staff
| Principal Investigator: Janan T. Eppig, Ph.D. Co-Principal Investigators/Project Leaders: Judith A. Blake, Ph.D. Carol J. Bult, Ph.D. James A. Kadin, Ph.D. Joel E. Richardson, Ph.D. John P. Sundberg, D.V.M, Ph.D., Dipl. A.C.V.P. Research Associate: Igor Mikaelian, D.V.M. Senior Scientific Curators: Anna Anagnostopoulos, Ph.D. Dirck W. Bradt, M.S. Debra M. Krupke, M.S. Lois J. Maltais, B.S. Cynthia L. Smith, Ph.D. Yunxia Zhu, Ph.D. Scientific Curators: Theresa Allio, Ph.D. Susan M. Bello, Ph.D. Howard Dene, Ph.D. Alexander Diehl, Ph.D. Kenneth S. Frazer, M.S. Carroll W. Goldsmith, Ph.D. Ira Lu, M.S. Li Ni, Ph.D. Hiraoki Onda, Ph.D. Tatiparthi B. Reddy, Ph.D. Beverly Richards-Smith, Ph.D. Robert Sinclair, Ph.D. Benjamin A. Taylor, Ph.D. Monika Tomczuk, Ph.D. Linda L. Washburn, M.S. Scientific Curator Assistants: Donna L. Burkart, B.A. Megan A. Cassell, B.S. Deborah J. Reed Monica McAndrews-Hill, B.S. Information Specialists: Nancy E. Butler Deborah J. Reed |
Software Project Manager: Scientific Software Engineers: |
Publication listings
Mouse Phenome Database Integration Consortium. 2007. Integration of mouse phenome data resources. Mamm Genome 18:157-163.
Eppig JT, Blake JA, Bult CJ, Richardson JE, Kadin JA, Ringwald M and the MGI Staff. 2007. Mouse Genome Informatics (MGI) resources for pathology and toxicology. Toxicol Pathol 35:456-7.
Barker JE, Deveau SA, Compton ST, Fancher K, Eppig JT. 2005. High incidence, early onset of histiocytic sarcomas in mice with Hertwig's anemia. Exp Hematol. 33:1118-1129.
Eppig JT, Bult CJ, Kadin JA, Richardson JE, Blake JA and the Mouse Genome Database Group. 2005. The Mouse Genome Database (MGD): From Genes to Mice, A community Resource for Mouse Biology. Nucleic Acids Research 33:D471-5.
Krupke DM, Naf D, Vincent MJ, Allio T, Mikaelian I, Sundberg JP, Bult CJ, Eppig JT. 2005. The mouse tumor biology database: integrated access to mouse cancer biology data. Exp Lung Res 31:259-270.
Smith CL, Goldsmith CA, Eppig JT. 2005. The Mammalian Phenotype Ontology as a tool for annotation, analyzing, and comparing phenotypic information. Genome Biology 6(1):R7.
Strivens M, Eppig JT. 2004. Visualizing the laboratory mouse: capturing phenotypic information. Genetica 122:89-97.
Richardson JE, Kadin JA, Blake JA, Bult CJ, Eppig JT, Ringwald M and the Mouse Genome Informatics Group. 2004. From sipping on a straw to drinking from a fire hose; data integration in a public genome database. Proceedings of the 20th IEEE International Conference on Data Engineering (March) 795-798.
Hill DP, Begley DA, Finger JH, Hayamizu TF, McCrigh IJ, Simth CM, Beal JS, Corbani LE, Blake JA, Eppig JT, Kadin JA, Richardson JE, Ringwald M. 2004. The Mouse Gene Expression Database (GXD): updates and enhancements. Nucl Acids Res 32:D568-D571.
Mikaelian I, Nanney LB, Parman KS, Kusewitt DF, Wart JM, Naf D, Krupke DM, Eppig JT, Bult CJ, Seymour R, Ichiki T and Sundberg JP. 2004. Antibodies that Label Paraffin-Embedded Mouse Tissues: A Collaboratove Endeavor. Toxicol Pathol 32:181-191.
Gene Ontology Consortium. 2004. The Gene Ontology (GO) database and informatics resource. Nucl Acids Res 32:D258-D261.
Bult CJ, Blake JA, Richardson JE, Kadin JA, Eppig JT, and the members of the Mouse Genome Database Group. 2004. The Mouse Genome Database (MGD) Integrating biology with the genome. Nucl Acids Res 32:D476-D481.
Balderelli RM, Hill DP, Blake JA, Adachi J, Furuno M, Bradt D, Corbani LE, Cousins S, Frazer KS, Qi D, Yang L, Ramachandran S, Reed D, Zhu Y, Kasukawa T, Ringwald M, King BL, Maltais LJ, McKenzie LM, Schriml LM, Maglott D, Church DM, Pruitt K, Eppig JT, Richardson JE, Kadin JA, Bult CJ. 2003. Connecting sequence and biology in the laboratory mouse. Genome Res 13:1505-19.
Rowe LB, Barter ME, Kelmenson JA, Eppig JT. 2003. The compre henisve mouse radiationhybrid map densely cross-reference to the rcombination map: A tool to support the sequence assemblies. Genome Res 13:122-133.
Blake JA, Richardson JE, Bult CJ, Kadin JA, Eppig JT. 2003. MGD: The Mouse Genome Database. Jucleic Acids Res 31:193-195
Blake JA, Richardson JE, Bult CJ, Kadin JA, Eppig JT, and the Mouse Genome Database
Group. 2002. The Mouse Genome Database (MGD): The model organism database for the laboratory mouse. Nucleic Acids Res 30:113-115.
Eppig JT, Blake JA, Smith C, Burkart DL, Goldsmith CW, Lutz CM, Smith CL. 2002. Corralling Conditional Mutations: A unified resource for mouse phenotypes. Genesis 32:63-65.
Maltais LJ, Blake JA, Chu T, Lutz CM, Eppig JT, Jackson I. 2002. Rules and guidelines for mouse gene, allele, and mutation nomenclature: A condensed version. Genomics 79: 471-474.
Naf D, Krupke DM, Sundberg JP, Eppig JT, Bult CJ. 2002. The Mouse Tumor Biology
database (MTB): A public resource for cancer genetics and pathology of the mouse. Cancer Res 62:1235-1240.
Twigger S, Lu J, Shimoyama M, Chen D, Pasko D, Long H, Ginster J, Chen C-F, Nigam R, Kwitek-Black A, Eppig J, Maltais L, Maglott D, Schuler G, Jacob H, Tonellato P. 2002. Rat Genome Database (RGD) – mapping disease onto the genome. Nucleic Acids Res 30:125-128.
Books, Book Chapters, and Reviews:
Blake JA, Eppig JT, Bult CJ. 2003. Mouse and Rat Genome Informatics. In: Bioinformatics For Geneticists, Wiley & Sons, Inc., pp. 119-142