Predicting Software System Quality Indicators Using Modifications of the SHAP Method

Authors

  • A. S. Shantyr State University of Information and Communication Technologies, Kyiv

DOI:

https://doi.org/10.31649/1997-9266-2024-177-6-94-102

Keywords:

optimization, deep learning model, Bayesian updating, adaptability

Abstract

This paper proposes five suggestions for improving the practical application of the SHAP method in the context of software system (SS) quality assessment. The study explores the potential for enhancing the SHAP method (Shapley Additive Explanations) in forecasting the quality indicators of software systems. The aim of the research is to increase the accuracy and adaptability of the SHAP method through modifications that account for various SS quality parameters, including performance, reliability, scalability, and usability. The objectives of the study are the following: to conduct a detailed review of the issues associated with the application of the SHAP method in SS quality assessment; to mathematically describe five methods for modifying the SHAP method to improve its accuracy, adaptability, and speed in evaluating SS quality indicators; and to experimentally verify the proposed modifications to assess their effectiveness compared to the original method. The study examines five methods for enhancing the SHAP method: LSTM (Long Short-Term Memory); CNN (Convolutional Neural Networks); adaptive SHAP; MLP (Multilayer Perceptron); and RNN (Recurrent Neural Network, including model ensembles and Bayesian updating). A practical comparison of the results demonstrated that the proposed variations of SHAP could significantly improve accuracy and prediction speed, especially in dynamic conditions and large data volumes. The development and justification of the five optimization modifications of the SHAP method achieved improvements in accuracy and forecasting efficiency for SS quality indicators. The research found that methods using deep neural networks (LSTM, CNN) show higher accuracy and adaptability compared to the original SHAP; however, this comes at the cost of implementation complexity and longer execution times. Adaptive SHAP and Ensemble are optimal in terms of balancing accuracy, adaptability, and interpretability, but require some optimization to improve execution time. The original SHAP showed satisfactory accuracy (MAE 0.84) and interpretability (7/10), but lagged behind modern approaches in adaptability (6/10) and execution time (0.4 hours). SHAP needs further optimization, especially in dynamic environments where rapid adaptation to new data is crucial. Time series (LSTM) provided the highest accuracy among the methods (MAE 0.91) and good adaptability (8/10).

Author Biography

A. S. Shantyr, State University of Information and Communication Technologies, Kyiv

Cand. Sc. (Eng.), Associate Professor of the Chair of Artificial Intelligence

References

A. Altmann, L. Toloşi, O. Sander, and T. Lengauer, “Permutation importance: A corrected feature importance measure,” Bioinformatics, no. 26 (10), pp.1340-1347, 2010. https://doi.org/10.1093/bioinformatics/btq134 .

T. Amoriello, R. Ciccoritti, and P. Ferrante, “Prediction of strawberries’ quality parameters using artificial neural networks,” Agronomy, no. 12 (4), pр. 963, 2022. https://doi.org/10.3390/agronomy12040963 .

L. Antwarg, R. M. Miller, B. Shapira, and L. Rokach, “Explaining anomalies detected by autoencoders using Shapley Additive Explanations,” Expert Systems with Applications, no. 186, pр.115736, 2021. https://doi.org/10.1016/j.eswa.2021.115736 .

K. N. Dheeraj, et al., “Crop quality prediction using ml and neural networks,” International Journal on Cybernetics & Informatics, no. 10 (2), рр. 07-11, 2021. https://doi.org/10.5121/ijci.2021.100202 .

R. EjrnÆs, E. Aude, B. Nygaard, and B. Münier, “Prediction of habitat quality using ordination and neural networks,” Ecological Applications, no. 12 (4), рр.1180-1187, 2002. https://doi.org/10.1890/1051-0761(2002)012[1180:pohquo]2.0.co;2 .

F. Fumagalli, M. Muschalik, E. Hüllermeier, and B. Hammer, “Incremental permutation feature importance (iPFI): Towards online explanations on data streams,” Machine Learning, 2023. https://doi.org/10.1007/s10994-023-06385-y .

H. Ghanmi, A. Ghith, and T. Benameur, “Ring spun yarn quality prediction using hybrid neural networks,” The Journal of the Textile Institute, рр.1-9, 2021. https://doi.org/10.1080/00405000.2021.2022826 .

K. N. Jha, and C. T. Chockalingam, “Prediction of quality performance using artificial neural networks,” Journal of Advances in Management Research, no. 6 (1), pр.70-86, 2009. https://doi.org/10.1108/09727980910972172 .

H. Kaneko, “Cross‐validated permutation feature importance considering correlation between features,” Analytical Science Advances, 2022. https://doi.org/10.1002/ansa.202200018 .

A. Michalski, K. Duraj, and B. Kupcewicz, “Leukocyte deep learning classification assessment using Shapley additive explanations algorithm,” International Journal of Laboratory Hematology, 2023. https://doi.org/10.1111/ijlh.14031 .

R. Miković, B. Arsić, and Đ. Gligorijević, “Importance of social capital for knowledge acquisition – DeepLIFT learning from international development projects,” Information Processing & Management, no. 61(4), pр.103694, 2024. Accessed: 8 Nov. 2024. [Online]. https://doi.org/10.1016/j.ipm.2024.103694 .

A. Movsessian, D. G. Cava, and D. Tcherniak, “Interpretable machine learning in damage detection using shapley additive explanations,” ASCE-ASME J Risk and Uncert in Engrg Sys Part B Mech Engrg, no. 8 (2), 2022. https://doi.org/10.1115/1.4053304 .

H. Ni, and H. Yin, “Exchange rate prediction using hybrid neural networks and trading indicators,” Neurocomputing, no. 72 (13-15), рр. 2815-2823, 2009. https://doi.org/10.1016/j.neucom.2008.09.023 .

S. Nirmalraj, A. S. M. Antony, at al. “Permutation feature importance-based fusion techniques for diabetes prediction,” Soft Computing, 2023. https://doi.org/10.1007/s00500-023-08041-y .

I. Poligné, B. Broyart, G. Trystram, and A. Collignan, “Prediction of mass-transfer kinetics and product quality changes during a dehydration-impregnation-soaking process using artificial neural networks. Application to pork curing,” LWT — Food Science and Technology, no. 35 (8), рр.748-756, 2002. https://doi.org/10.1006/fstl.2002.0939 .

N. H. A. Rahman, M. H. Lee, Suhartono, and M. T. Latif, “Artificial neural networks and fuzzy time series forecasting: An application to air quality,” Quality and Quantity, no. 49 (6), рр. 2633-2647, 2014. https://doi.org/10.1007/s11135-014-0132-6 .

A. M. Schweidtmann, et al., “Graph neural networks for prediction of fuel ignition quality,” Energy & Fuels, no. 34(9), рр.11395-11407. https://doi.org/10.1021/acs.energyfuels.0c01533 .

C. H. Tan, K. M. Yusof, and S. R. W. Alwi, “Quality prediction for polypropylene extrusion based on neural networks,” IOP Conference Series: Materials Science and Engineering, no. 1257 (1), pp. 012034, 2022. https://doi.org/10.1088/1757-899x/1257/1/012034 .

J. S. Tan, “Ablation study on feature group importance for automated essay scoring,” Asia-Pacific Journal of Information Technology and Multimedia, no. 11 (01), рр. 90-101, 2022. https://doi.org/10.17576/apjitm-2022-1101-08 .

M. M. T. Thwin, and T.-S. Quah, “Application of neural networks for software quality prediction using object-oriented metrics,” Journal of Systems and Software, no. 76 (2), рр. 147-156, 2005. https://doi.org/10.1016/j.jss.2004.05.001 .

S. Wang, and Y. Zhang, “Grad-CAM: Understanding AI models,” Computers, Materials & Continua, no. 76 (2), рр.1321-1324, 2023. https://doi.org/10.32604/cmc.2023.041419 .

Downloads

Abstract views: 20

Published

2024-12-27

How to Cite

[1]
A. S. Shantyr, “Predicting Software System Quality Indicators Using Modifications of the SHAP Method”, Вісник ВПІ, no. 6, pp. 94–102, Dec. 2024.

Issue

Section

Information technologies and computer sciences

Metrics

Downloads

Download data is not yet available.