WEKO3
アイテム
Job Classification Through Long-Term Log Analysis Towards Power-Aware HPC System Operation
https://nied-repo.bosai.go.jp/records/6389
https://nied-repo.bosai.go.jp/records/63891ae8621b-1f6b-4c5f-815e-3e28e678ebf4
| Item type | researchmap(1) | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 公開日 | 2023-09-20 | |||||||||||||||
| タイトル | ||||||||||||||||
| 言語 | en | |||||||||||||||
| タイトル | Job Classification Through Long-Term Log Analysis Towards Power-Aware HPC System Operation | |||||||||||||||
| 言語 | ||||||||||||||||
| 言語 | eng | |||||||||||||||
| 著者 |
Yuichi Tsujita
× Yuichi Tsujita
× Atsuya Uno
× Ryuichi Sekizawa
× Keiji Yamamoto
× Fumichika Sueyasu
|
|||||||||||||||
| 抄録 | ||||||||||||||||
| 内容記述タイプ | Other | |||||||||||||||
| 内容記述 | High utilization of HPC system resources under constraints in electric power consumption or I/O workload is one of the primary goals to deal with high demand from application users. Utilization of CPU and memory, which is tightly related to electric power consumption, is counterpart metric of I/O activities in most HPC jobs. Towards higher utilization of HPC systems under restriction in management for electric power consumption and I/O activities, we need to care not to have hot-spots in power consumption or I/O operations because such situation leads to unstable system operation by exceeding capability of electric power supply or the I/O subsystem in such hot-spots. Analysis of a huge scale of log data collected from the K computer has revealed high correlation between I/O activities and CPU and memory utilization in some specific compute node layouts, showing unique characteristics of HPC jobs such as computation intensive or I/O-intensive. It has turned out that classifying jobs in terms of required electric power can divide into two groups, jobs consuming high electric power and I/O-intensive jobs. We have succeeded in job classification by achieving high correctness using machine learning approach, and we have confirmed effectiveness of the classification towards power-aware system operation in our next HPC system, the supercomputer Fugaku. | |||||||||||||||
| 言語 | en | |||||||||||||||
| 書誌情報 |
en : 2021 29TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING (PDP 2021) p. 26-34, 発行日 2021 |
|||||||||||||||
| 出版者 | ||||||||||||||||
| 言語 | en | |||||||||||||||
| 出版者 | IEEE COMPUTER SOC | |||||||||||||||
| ISSN | ||||||||||||||||
| 収録物識別子タイプ | ISSN | |||||||||||||||
| 収録物識別子 | 1066-6192 | |||||||||||||||
| DOI | ||||||||||||||||
| 関連識別子 | 10.1109/PDP52278.2021.00014 | |||||||||||||||