Lawrence Berkeley, Oak Ridge National Labs Share 2018 ACM Gordon Bell Prize
Computing Science Teams Honored for Advances in Understanding Climate Change and Combating Opioid Addiction
November 20, 2018
A team of computational scientists and engineers from Lawrence Berkeley National Laboratory (Berkeley Lab), Oak Ridge National Laboratory (ORNL) and NVIDIA has been awarded the ACM Gordon Bell Prize for applying an exascale-class deep learning application to extreme climate data and breaking the exaop (1 billion billion calculations) computing barrier for the first time with a deep learning application.
Climate scientists rely on some of the fastest computers in the world to run high-fidelity simulations, but those simulations produce massive amounts of data that create their own computational challenge. The team used the fastest computer in the world, ORNL’s Summit system, to apply deep learning methods to extract detailed information from climate data produced at NERSC. Rather than computing simple quantities such as average global temperature, these methods discover and locate features like hurricanes and atmospheric rivers, which are important in characterizing extreme weather patterns and their impact.
“We are thrilled to be honored with the SC18 Gordon Bell Prize for our efforts in scalability and time to solution,” said Prabhat, who leads the Data & Analytics Services team at NERSC and was a co-author on the Gordon Bell submission. “The team from NERSC, NVIDIA and OLCF put in a tremendous amount of effort over eight months to develop this application from scratch. Achieving an exaop-level performance required significant innovation across the entire stack, and this result is a testament to the quality of the team and the capabilities of Summit.”
Summit is an IBM system powered by more than 9,000 IBM POWER9 CPUs and 27,000 NVIDIA® Tesla® V100 Tensor Core GPUs. By tapping into the specialized NVIDIA Tensor Cores built into the GPUs at scale, the researchers achieved a peak performance of 1.13 exaops and a sustained performance of 0.999 – the fastest deep learning algorithm reported to date.
“We’ve entered a new era in high performance computing, one in which exaflop performance is a reality,” said Michael Houston, a senior distinguished engineer of deep learning at NVIDIA who led NVIDIA’s research team on the climate analytics project with Berkeley Lab. “Partnering with the best minds in high performance computing at Berkeley Lab and ORNL, we demonstrated how tens of thousands of NVIDIA V100 Tensor Core GPUs on Summit can achieve breakthrough, exascale levels of performance to tackle the world’s most pressing problems.”
While deep learning is a well-established tool for problems like image analysis, its use as a tool in scientific discovery, and especially complex data sets like climate data, is relatively new.
“This project was particularly impressive from a software perspective, in that we utilized a high-productivity framework like TensorFlow, to scale deep learning code on up to 4,560 Summit nodes,” said Thorsten Kurth, application performance specialist at NERSC, and lead author on the Gordon Bell paper. “Our performance enhancements are not hardwired to the climate application, but can be generally applied to other scientific deep learning applications that require extreme scaling. Our project has hopefully provided a blueprint for AI+HPC projects for the future."
In addition to Prabhat, Houston and Kurth, the research team included Jack Deslippe, Mayur Mudigonda and Ankur Mahesh of Berkeley Lab; Michael Matheson of ORNL; and Michael Houston, Sean Treichler, Joshua Romero, Nathan Luehr, Everett Phillips and Massimiliano Fatica of NVIDIA.
They share this year’s award with another team led by ORNL that was recognized for its work in leveraging population-scale genomic datasets and algorithmic advances to uncover hidden networks of genes at incredible speeds. This project included contributions from Berkeley Lab’s Kjiersten Fagnan, chief informatics officer for the Joint Genome Institute. This group also did its computing on ORNL’s Summit system at the Oak Ridge Leadership Computing Facility (OLCF).
The award was presented November 15 during the SC18 conference in Dallas, Texas. Established more than three decades ago by the Association for Computing Machinery, the ACM Gordon Bell Prize tracks the progress of parallel computing and rewards innovation in applying high performance computing to challenges in science, engineering, and large- scale data analytics
Both NERSC and OLCF are DOE Office of Science User Facilities.
Read the DOE release about these achievements here.
About Computing Sciences at Berkeley Lab
The Lawrence Berkeley National Laboratory (Berkeley Lab) Computing Sciences organization provides the computing and networking resources and expertise critical to advancing the Department of Energy's research missions: developing new energy sources, improving energy efficiency, developing new materials and increasing our understanding of ourselves, our world and our universe.
ESnet, the Energy Sciences Network, provides the high-bandwidth, reliable connections that link scientists at 40 DOE research sites to each other and to experimental facilities and supercomputing centers around the country. The National Energy Research Scientific Computing Center (NERSC) powers the discoveries of 7,000-plus scientists at national laboratories and universities, including those at Berkeley Lab's Computational Research Division (CRD). CRD conducts research and development in mathematical modeling and simulation, algorithm design, data storage, management and analysis, computer system architecture and high-performance software implementation. NERSC and ESnet are Department of Energy Office of Science User Facilities.
Lawrence Berkeley National Laboratory addresses the world's most urgent scientific challenges by advancing sustainable energy, protecting human health, creating new materials, and revealing the origin and fate of the universe. Founded in 1931, Berkeley Lab's scientific expertise has been recognized with 13 Nobel prizes. The University of California manages Berkeley Lab for the DOE’s Office of Science.
DOE’s Office of Science is the single largest supporter of basic research in the physical sciences in the United States, and is working to address some of the most pressing challenges of our time. For more information, please visit science.energy.gov.