Systems and Means of Informatics
2024, Volume 34, Issue 3, pp 109-122
MODELING OF THE INPUT FLOW OF LANL MUSTANG COMPUTING CLUSTER WORKLOADS
Abstract
Statistical analysis is an indispensable element in the construction of a mathematical model of the object under study. Queuing systems as an object of research have specific features that make it necessary to go beyond the general theory of stochastic processes. The article discusses the construction of the models of the input flow of multiprocessor systems based on the trace of the real workload of the Mustang cluster obtained as a part of the Atlas project (www.project-atlas.org). Mustang data features include a long observation period, an impressive amount of data collected, a wide field of research due to the simplified nature of previous studies and fuzzy conclusions already made, the combination of fragments with different flow intensities, the presence of stationary and nonstationary areas, and the inapplicability of the simple Poisson flow model. As a solution to the problems that arise for stationary data fragments, it is proposed to use the branching Poisson process model. The well-known methods of estimating the model parameters are supplemented by the procedure for refining estimates and formalized methods of confirmatory analysis. Given the large amounts of data being processed, it is important to build effective algorithms for calculating the characteristics of streams and smoothing out sample indicators.
[+] References (6)
- Bhat, U.N., and S. S. Rao. 1987. Statistical analysis of queueing systems. Queueing Syst. 1(3):217{247. doi: 10.1007/BF01149536.
- Talby, D., D. Feitelson, and A. Raveh. 2007. A co-plot analysis of logs and models of parallel workloads. ACM T. Model. Comput. S. 17(3): 12. 27 p. doi: 10.1145/ 1243991.124399.
- Feitelson, D. G., D. Tsafrir, and D. Krakov. 2014. Experience with using the parallel workloads archive. J. Parallel Distr. Com. 74(10):2967{2982. doi: 10.1016/j.jpdc. 2014.06.013.
- Amvrosiadis, G., V. Kuchnik, J.W. Park, C. Cranor, G. R. Ganger, E. Moore, and N. DeBardeleben. 2018. The Atlas cluster trace repository. USENIX Winter 43(4):29{35.
- Amvrosiadis, G., J.W. Park, G. R. Ganger, G. A. Gibson, E. Baseman, and N. De- Bardeleben. 2018. On the diversity of cluster workloads and its impact on research results. USENIX Annual Technical Conference Proceedings. Boston, MA: USENIX Association. 533{546.
- Cox, D. R., and P. A. W. Lewis. 1966. The statistical analysis of series of events. New York, NY: John Wiley. 285 p.
[+] About this article
Title
MODELING OF THE INPUT FLOW OF LANL MUSTANG COMPUTING CLUSTER WORKLOADS
Journal
Systems and Means of Informatics
Volume 34, Issue 3, pp 109-122
Cover Date
2024-10-30
DOI
10.14357/08696527240308
Print ISSN
0869-6527
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
trace repository; LANL Mustang cluster; branching Poisson process (BPP); statistical analysis of series of events
Authors
M. P. Krivenko
Author Affiliations
Federal Research Center "Computer Science and Control", Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
|