Document Type
Dissertation
Degree
Doctor of Philosophy (PhD)
Major/Program
Computer Science
First Advisor's Name
Jason Liu
First Advisor's Committee Title
Committee Chair
Second Advisor's Name
S. S. Iyengar
Second Advisor's Committee Title
Committee Member
Third Advisor's Name
Deng Pan
Third Advisor's Committee Title
Committee Member
Fourth Advisor's Name
Leonardo Bobadilla
Fourth Advisor's Committee Title
Committee Member
Fifth Advisor's Name
Wujie Wen
Fifth Advisor's Committee Title
Committee Member
Keywords
High-Performance Computing, Interconnection Network, Performance Prediction, Demand Response, Resource Management, Energy Efficiency
Date of Defense
3-22-2018
Abstract
The growing computational demand of scientific applications has greatly motivated the development of large-scale high-performance computing (HPC) systems in the past decade. To accommodate the increasing demand of applications, HPC systems have been going through dramatic architectural changes (e.g., introduction of many-core and multi-core systems, rapid growth of complex interconnection network for efficient communication between thousands of nodes), as well as significant increase in size (e.g., modern supercomputers consist of hundreds of thousands of nodes). With such changes in architecture and size, the energy consumption by these systems has increased significantly. With the advent of exascale supercomputers in the next few years, power consumption of the HPC systems will surely increase; some systems may even consume hundreds of megawatts of electricity. Demand response programs are designed to help the energy service providers to stabilize the power system by reducing the energy consumption of participating systems during the time periods of high demand power usage or temporary shortage in power supply.
This dissertation focuses on developing energy-efficient demand-response models and algorithms to enable HPC system's demand response participation. In the first part, we present interconnection network models for performance prediction of large-scale HPC applications. They are based on interconnected topologies widely used in HPC systems: dragonfly, torus, and fat-tree. Our interconnect models are fully integrated with an implementation of message-passing interface (MPI) that can mimic most of its functions with packet-level accuracy. Extensive experiments show that our integrated models provide good accuracy for predicting the network behavior, while at the same time allowing for good parallel scaling performance. In the second part, we present an energy-efficient demand-response model to reduce HPC systems' energy consumption during demand response periods. We propose HPC job scheduling and resource provisioning schemes to enable HPC system's emergency demand response participation. In the final part, we propose an economic demand-response model to allow both HPC operator and HPC users to jointly reduce HPC system's energy cost. Our proposed model allows the participation of HPC systems in economic demand-response programs through a contract-based rewarding scheme that can incentivize HPC users to participate in demand response.
Identifier
FIDC006527
Recommended Citation
Ahmed, Kishwar, "Energy Demand Response for High-Performance Computing Systems" (2018). FIU Electronic Theses and Dissertations. 3569.
https://digitalcommons.fiu.edu/etd/3569
Included in
Contracts Commons, Digital Communications and Networking Commons, Power and Energy Commons
Rights Statement
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/
This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).