I. Introduction
HPC productivity could be greatly enhanced by more accurate I/O performance prediction. At supercomputing facilities, scientists are allocated limited compute cycles and charged based on their application execution times. They are therefore motivated to limit or reduce the time consumption on I/O. For the facilities, more predictable I/O performance will enable more precise compute-time allocations and further enable additional jobs on the same platform, increasing the system utilization. For example, for many scientists the ideal time spent on writing state snapshots is ~10% of the total application execution times. In practice, however, the cost may span a wide range (e.g., 7%-20% for XGC code [39]) because of lack of guidance on I/O configuration. With accurate write performance prediction, scientists can manage the cost by configuring the write frequencies appropriately (§II-A).