ALBUQUERQUE, NM — To help moderate the energy needs of increasingly power-hungry supercomputers, researchers at Sandia National Laboratories have released an application programming interface (API) with the goal of standardizing measurement and control of power- and energy-relevant features for high performance computing (HPC) systems.
The High Performance Computing — Power API specification, still open to collaborators for future development, is vendor-neutral in that it works on HPC products from any manufacturer, and has been vetted through reviews by other laboratories, universities and commercial collaborators.
“While the bulk of improvements in energy efficiency will no doubt come from hardware advances, software will play a critical role in maximizing the benefits of new hardware capabilities,” said Sandia researcher Jim Laros, who leads the specification development effort.
The specification developed by Laros’ team standardizes relevant measurement and control interfaces for a comprehensive range of HPC roles. These range from the hardware level to application interfaces. For example, standardizing how a work-load manager interfaces with the HPC system will enable such features as energy-aware scheduling. This can minimize power usage during the hours in which utility companies often charge higher rates.
“Because future architectures might not be able to operate all components at full capability for a variety of reasons, including temperature considerations or power delivery limitations,” said Laros, “our specification defines standard interfaces to facilitate appropriate choices in allocating the available power budget among many, sometimes conflicting and often site-specific, considerations.
“The specification we have developed provides multiple levels of abstractions to satisfy the requirements of multiple types of users or roles.”
The Power API specification was developed after study by Sandia and the National Renewable Energy Laboratory (NREL). The work was based on almost a decade of research at Sandia, aided by numerous industry and community relationships that developed over the same period.
A face-to-face review in July included representatives from Cray, Adaptive Computing, Penguin Computing, AMD, IBM, Intel and NREL, Laros said. Other university and laboratory collaborators, including the University of New Mexico, Oak Ridge and Los Alamos national laboratories and the Energy Efficient High Performance Computing working group, provided written feedback.
After the open release of version 1.0, an international audience conducted a review in Denver this fall.
“Feedback from both reviews as well as our continued collaborations with the HPC community will strengthen future releases of the specification,” said Laros.
The first production implementation of the Power API specification will be on the Trinity supercomputer, the first of the new advanced technology platforms funded by the National Nuclear Security Administration’s Advanced Simulation and Computing program.
Laros and his team are available for questions regarding Sandia’s power API at firstname.lastname@example.org. “We welcome feedback and collaboration with anyone who is interested,” he said.
Sandia National Laboratories is a multi-program laboratory operated by Sandia Corporation, a wholly owned subsidiary of Lockheed Martin, for the U.S. Department of Energy’s National Nuclear Security Administration. With main facilities in Albuquerque, NM, and Livermore, CA, Sandia has major R&D responsibilities in national security, energy and environmental technologies and economic competitiveness.