A-Z Index | Directory | Careers

New Microbial Metagenome Data Analysis System in Production

May 1, 2006

Image of microbial communities found in termite guts

IMG/M is being used to study microbial communiites, such as these protists found in termite guts.

An experimental metagenomics data management and analysis system co-developed by the Biological Data Management and Technology Center (BDMTC) at Berkeley Lab with the Genome Biology Program and Microbial Ecology Program at DOE’s Joint Genome Institute and released earlier this year is also already helping produce scientific discoveries.

Called IMG/M, the system extends the Integrated Microbial Genomes (IMG) system with the ability to integrate and analyze metagenome data, and has provided immediate support for metagenomics studies at JGI. “IMG/M is the first publicly available metagenome data management and analysis system—the first of its kind,” said Victor Markowitz, head of BDMTC.

“Most research in microbial genome analysis focuses on individual organisms,” said Nikos Kyrpides, head of JGI’s Genome Biology Program. “The application of high throughput sequencing to environmental samples has revealed a new universe of microbial community genomes including mostly microorganisms unknown to science. Microbial community genome analysis, also known as metagenomics, focuses on how the entire community functions together.”

Thus far, IMG/M has been used by Phil Hugenholtz, head of the Microbial Ecology Program, and his colleagues at JGI for completing the analysis of enhanced biological phosphorus removing sludge communities and for studying the metagenomes of several key microbial communities recently sequenced by DOE JGI, including the ligno- cellulose-hydrolyzing communities in termite hindguts.

“IMG/M has proven to be an extremely useful resource and tool for analyzing our metagenomic data,” said Jared R. Leadbetter, associate professor of environmental microbiology at the California Institute of Technology and collaborator on the termite hindgut microbial community for bioenergy project. “Such datasets are large, complex, and potentially unwieldy. Importantly, IMG/M is more than just an excellent tool to analyze data. The manner in which the results of that analysis are organized and made accessible through a user-friendly interface allows the researcher to rapidly move in a number of different intellectual directions. As a result, the user becomes better educated with and gets a real feel for the data in a manner that would not otherwise be possible on such short time scales.”

IMG/M was presented by Markowitz at the recent Keystone Symposium on Microbial Community Genomics in Animals and the Environment, organized by DOE JGI Director Edward M. Rubin and Edward F. DeLong, professor in the Division of Biological Engineering and Department of Civil and Environmental Engineering at the Massachusetts Institute of Technology.

“IMG/M provides an intuitive interface, and nice complement to IMG for comparing gene content and phylogenetic profiles of microbial genomes, and relating them to the large microbial community datasets now accumulating,” DeLong said. “These are great and sorely needed data- exploration tools.”

IMG/M was demonstrated at a workshop on April 1, as part of the DOE JGI First Annual User Meeting. Additionally, BDMTC staff members will be presenting the project this summer at workshops and symposia in England, Brazil and Austria.

“We have an experimental system which has proved to be immediately useful in helping scientists to conduct their studies effectively,” Markowitz said.

IMG/M is accessible to the public.


About Computing Sciences at Berkeley Lab

High performance computing plays a critical role in scientific discovery. Researchers increasingly rely on advances in computer science, mathematics, computational science, data science, and large-scale computing and networking to increase our understanding of ourselves, our planet, and our universe. Berkeley Lab’s Computing Sciences Area researches, develops, and deploys new foundations, tools, and technologies to meet these needs and to advance research across a broad range of scientific disciplines.