Study of Machine Learning Models for Accelerating Scalability Analysis in the PaScal Suite
Scalability analysis, High-performance computing, Machine learning
Performance and scalability analysis of parallel applications is a fundamental step in high-performance computing software development; however, it requires multiple executions, resulting in high computational, energy, and time costs. In this context, this work proposes two machine learning–based approaches to support scalability analysis by integrating predictive models into the PaScal Analyzer. The proposed approach relies on regression models trained adaptively from real measurements collected during the execution of parallel applications. The workflow is based on cross-validation, enabling the assessment of measurement quality and the automatic selection of model hyperparameters before final training. As a result, execution times for unmeasured configurations can be estimated, reducing the waiting time required to analyze application scalability.
Initial experiments were conducted in a high-performance computing environment, validating the integration of the experimental infrastructure. The results obtained at this stage indicate the feasibility of the proposed approach and provide a foundation for more in-depth comparative analyses between the adopted models, as well as for the expansion of experimental scenarios.