Globus Online and ESnet Partner for Scientific Data Management
November 15, 2012
SALT LAKE CITY, UT – Globus Online and the Energy Sciences Network (ESnet) today announced a new collaboration to help scientists better manage the growing amounts of data they need to move, share, and analyze worldwide.
As partners, the organizations plan to develop joint outreach and educational programs designed for small and medium science collaborations that have not traditionally used data transfer tools and advanced networking as part of their workflow, but who will soon require these capabilities as their data volumes increase. ESnet and Globus Online will also pursue coordinated research efforts aimed at better integrating their services to make it easier for scientists to set up efficient end-to-end data transfers.
The collaboration brings together two leading organizations that have successfully tackled the challenge of data transfer and sharing from two different perspectives. ESnet is the Department of Energy’s high performance science network that interconnects the nation’s research laboratories, supercomputing centers and experimental facilities. Built on GridFTP, Globus Online provides an easy-to-use, highly reliable online file transfer and sharing service that researchers can use from their desktop or integrate into their scientific application. Globus Online has become a primary on-ramp for researchers to access high performance networks like ESnet for rapidly sharing data with collaborators or to use remote computing and scientific instrument facilities.
“In this new era of data intensive science impacting nearly all research disciplines, data transfer and sharing tools, and high performance networks are becoming indispensible enablers for scientific discovery,” said Greg Bell, Director of ESnet. “Rather than forcing scientists—especially those without IT expertise—to piece together the myriad tools they need to move their data effectively, we are partnering with Globus Online to provide education on a holistic set of capabilities that can be easily adopted and used right away.”
One ESnet-developed strategy that has been very helpful to scientists coping with rapidly increasing data volumes is the Science DMZ. The Science DMZ calls for a high performance Data Transfer Node, or DTN, which is a dedicated system explicitly built for high performance data transfers. Scientists have achieved incredible data transfer performance when coupling the Globus Online file transfer service with the use of a DTN.
In August 2012, Gary Bates at the National Oceanic and Atmospheric Administration (NOAA) Earth System Research Lab in Boulder, Colo., saw the benefits of a Science DMZ first-hand. Until the Boulder Network Operations Center (BNOC) built a Science DMZ, data trickled through the firewall at 1-2 megabytes per second. Using Globus Online and a newly configured data transfer node, Bates transferred 273 files with a total size of 239.5 gigabytes in just over 10 minutes. “Whoa! Transfer from NERSC to the BNOC data transfer node using Globus is screaming!” Bates wrote. He calculated his transfer rate to be about 395 megabytes per second, or over 3 gigabits per second. “I've never gotten anything close to that before.”
“This example demonstrates the natural synergy between Globus Online and ESnet, highlighting how a high performance, modern network architecture combined with a sophisticated file transfer and sharing service can work together to enhance scientific productivity and make new modes of discovery possible,” said Ian Foster, co-lead of the Globus Online service.
ESnet provides the high-bandwidth, reliable connections that link scientists at national laboratories, universities and other research institutions, enabling them to collaborate on some of the world's most important scientific challenges including energy, climate science, and the origins of the universe. Funded by the U.S. Department of Energy's (DOE) Office of Science and located within the Scientific Networking Division at Lawrence Berkeley National Laboratory, ESnet provides scientists with access to unique DOE research facilities and computing resources.
About Globus Online
Globus Online is software-as-a-service for research data management. Globus Online provides a reliable, high-performance file transfer service for easily and securely moving big data between resources, such as supercomputing facilities, cloud storage systems, campus clusters, lab servers, or personal computers. The service also enables data on existing storage systems to be shared with users at multiple organizations, without specialized software or expensive cloud storage. Designed specifically for researchers, Globus Online is recommended by dozens of institutions and high-performance computing facilities worldwide. Globus Online is an initiative by the Computation Institute at the University of Chicago and Argonne National Laboratory, and is supported in part by funding from the Department of Energy, the National Science Foundation, and the National Institutes of Health. For more information, visitwww.globusonline.org.
About Computing Sciences at Berkeley Lab
The Computing Sciences Area at Lawrence Berkeley National Laboratory(Berkeley Lab) provides the computing and networking resources and expertise critical to advancing Department of Energy Office of Science (DOE-SC) research missions: developing new energy sources, improving energy efficiency, developing new materials, and increasing our understanding of ourselves, our world, and our universe. ESnet, the Energy Sciences Network, provides the high-bandwidth, reliable connections that link scientists at 40 DOE research sites to each other and to experimental facilities and supercomputing centers around the country. The National Energy Research Scientific Computing Center (NERSC) powers the discoveries of 7,000-plus scientists at national laboratories and universities. NERSC and ESnet are both Department of Energy Office of Science National User Facilities. The Computational Research Division (CRD) conducts research and development in mathematical modeling and simulation, algorithm design, data storage, management and analysis, computer system architecture and high-performance software implementation.
Berkeley Lab addresses the world's most urgent scientific challenges by advancing sustainable energy, protecting human health, creating new materials, and revealing the origin and fate of the universe. Founded in 1931, Berkeley Lab's scientific expertise has been recognized with 13 Nobel prizes. The University of California manages Berkeley Lab for the DOE’s Office of Science. The DOE Office of Science is the United States' single largest supporter of basic research in the physical sciences and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.