Berkeley Lab Researchers Prepare U.S. Climate Community for 100-Gigabit Data Transfers
October 30, 2009
Contacts: Linda Vu, firstname.lastname@example.org, 510-495-2402
As researchers around the world tackle the issue of global climate change, they are both generating and sharing increasingly large amounts of data. This increased collaboration helps climate scientists better understand what is happening and evaluate the effectiveness of possible mitigations.
But sharing these increasingly large datasets requires reliable high-bandwidth networks. To help ensure that the climate research community has the resources necessary to access, transfer and analyze the data, the U.S. Department of Energy has funded several related projects at Lawrence Berkeley National Laboratory.
The newest project, called Climate 100, will help the research community effectively use the planned 100-gigabit-per-second networks. Climate 100, funded with $201,000 under the American Recovery and Reinvestment Act, will bring together middleware and network researchers to develop the needed tools and techniques for moving unprecedented amounts of data.
"Climate 100 is a system that will integrate massive climate datasets, emerging 100 Gbps networks, and state-of-the-art data transport and management technologies to enable realistic at-scale experimentation with climate data management transport and analysis in a 100 Gbps, 100 petabyte world," says Alex Sim, a co-principal investigator of the Climate 100 project. Sim is a member of the Scientific Data Management Group at Berkeley Lab.
As Climate 100 tools are being developed, DOE's Energy Sciences Network (ESnet) will be building a prototype 100 Gbps network, the fastest network to date linking DOE supercomputing centers in California, Illinois and Tennessee. Called the Advanced Networking Initiative, the project will advance the development of 100 Gbps equipment and services to handle the ever increasing flow of critical data between research institutions. The initiative, managed by ESnet at Berkeley Lab, received $62 million in Recovery Act funding.
The Climate 100 project will build on the efforts of DOE's Earth System Grid (ESG), the leading infrastructure for accessing and distributing the impending influx of climate model data.
The ESG was initially developed with funding from DOE's Next Generation Internet Program to address the management and use of extremely large and diverse datasets. It has since become a Center for Enabling Technologies (CET) that provides access to climate data from large scale simulations, metadata information about the models and available datasets, as well as analysis and information tools. The ESG-CET is supported by the DOE's Scientific Discovery through Advanced Computing (SciDAC) program and led by Dean Williams of the Lawrence Livermore National Laboratory. Williams also serves as co-principal investigator on the Climate 100 project.
As an example of the scope of the datasets and user demand, the Program for Climate Model Diagnosis and Intercomparison at Lawrence Livermore National Laboratory has archived climate modeling data from around the world. Known as the World Climate Research Program Coupled Model Intercomparison Project (CMIP3), this open archive now contains more than 35 terabytes of data and is accessed by more than 1,200 users. However, the next-generation archive at Livermore is expected to contain at least 650 terabytes, and the larger distributed worldwide archive will be between 6 to 10 petabytes.
About Computing Sciences at Berkeley Lab
The Lawrence Berkeley National Laboratory (Berkeley Lab) Computing Sciences organization provides the computing and networking resources and expertise critical to advancing the Department of Energy's research missions: developing new energy sources, improving energy efficiency, developing new materials and increasing our understanding of ourselves, our world and our universe.
ESnet, the Energy Sciences Network, provides the high-bandwidth, reliable connections that link scientists at 40 DOE research sites to each other and to experimental facilities and supercomputing centers around the country. The National Energy Research Scientific Computing Center (NERSC) powers the discoveries of 6,000 scientists at national laboratories and universities, including those at Berkeley Lab's Computational Research Division (CRD). CRD conducts research and development in mathematical modeling and simulation, algorithm design, data storage, management and analysis, computer system architecture and high-performance software implementation. NERSC and ESnet are DOE Office of Science User Facilities.
Lawrence Berkeley National Laboratory addresses the world's most urgent scientific challenges by advancing sustainable energy, protecting human health, creating new materials, and revealing the origin and fate of the universe. Founded in 1931, Berkeley Lab's scientific expertise has been recognized with 13 Nobel prizes. The University of California manages Berkeley Lab for the DOE’s Office of Science.
DOE’s Office of Science is the single largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.