A-Z Index | Directory | Careers

DOE JGI Releases a New Version of its Metagenome Data Management & Analysis System

February 7, 2008

For more information, contact
David Gilbert
DOE JGI Public Affairs Manager
(925) 296-5643
degilbert@lbl.gov

WALNUT CREEK, CA--Targeting its ever-expanding user community, the U.S. Department of Energy Joint Genome Institute (DOE JGI) has released an upgraded version of the IMG/M metagenome data management and analysis system, accessible to the public at http://img.jgi.doe.gov/m.

IMG/M provides tools for analyzing the functional capability of microbial communities based on their metagenome DNA sequence in the context of reference isolate genomes. The new version of IMG/M includes five additional metagenome datasets generated from microbial community samples that were the subject of recently published studies. These include the metagenomic and functional analysis of termite hindgut microbiota (Nature 450, 560-565, 22 November 2007) and the single cell genetic analysis of TM7, a rare and uncultivated microbe from the human mouth (PNAS, July 17, 2007, vol. 104, no. 29, 11889-11894).

"IMG/M is a fantastic tool that is incredibly helpful in understanding our data," said Stephen Quake, Co-Chair, Department of Bioengineering at Stanford University, Investigator, Howard Hughes Medical Institute, and senior author on the PNAS study. "We used IMG/M in numerous ways, both to analyze our data and to understand general properties of other relevant bacterial genomes. I look forward to analyzing our new datasets with IMG/M."

IMG/M will be demonstrated at a workshop on March 26 as part of the DOE JGI Third Annual User Meeting. IMG/M contains all isolate genomes in version 2.4 of DOE JGI’s Integrated Microbial Genomes (IMG) system, which represents an increase of 1,339 reference genomes from the previous version of IMG/M. Now, IMG/M contains 2,953 isolate genomes consisting of 819 bacterial, 50 archaeal, 40 eukaryotic genomes, and 2,044 viruses.

IMG/M provides new tools for analyzing metagenome datasets in the context of reference isolate genomes, such as the Reference Genome Context Viewer and Protein Recruitment Plot that allow the examination of metagenomes in the context of individual reference isolate genomes. New Abundance Comparison and Functional Category Comparison tools enable pairwise function analysis (COG, Pfam, Enzyme, TIGRfam) and functional category (e.g., COG category) abundance comparisons, respectively, between a metagenome dataset and one or several reference metagenomes or genomes, and test whether the differences in abundance are statistically significant.

IMG/M has been developed jointly by the DOE JGI’s Genome Biology Program (GBP) and Lawrence Berkeley National Laboratory (LBNL) Biological Data Management and Technology Center (BDMTC). The large-scale pairwise gene similarity computations for all the genomes included in IMG/M have been carried out using ScalaBLAST by the Computational Biology and Bioinformatics Group of the Computational Sciences and Mathematics Division at Pacific Northwest National Laboratory, using the William R. Wiley Environmental Molecular Sciences Laboratory (EMSL) Molecular Sciences Computing Facility supercomputer.

The U.S. Department of Energy Joint Genome Institute, supported by the DOE Office of Science, unites the expertise of five national laboratories -- Lawrence Berkeley, Lawrence Livermore, Los Alamos, Oak Ridge, and Pacific Northwest -- along with the Stanford Human Genome Center to advance genomics in support of the DOE missions related to clean energy generation and environmental characterization and cleanup. DOE JGI’s Walnut Creek, CA, Production Genomics Facility provides integrated high-throughput sequencing and computational analysis that enable systems-based scientific approaches to these challenges.


About Computing Sciences at Berkeley Lab

High performance computing plays a critical role in scientific discovery. Researchers increasingly rely on advances in computer science, mathematics, computational science, data science, and large-scale computing and networking to increase our understanding of ourselves, our planet, and our universe. Berkeley Lab’s Computing Sciences Area researches, develops, and deploys new foundations, tools, and technologies to meet these needs and to advance research across a broad range of scientific disciplines.