SevPredict: Exploring the Potential of Large Language Models in Software Maintenance

Muhammad Ali Arshad, Adnan Riaz*, Rubia Fatima, Affan Yasin*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Prioritizing bug reports by severity is a crucial part of bug triaging, because it lets maintainers focus on the most critical issues first. Existing methods for assessing bug severity range from manual inspection to machine learning and deep learning techniques. However, manual evaluation is resource-intensive and inefficient, while conventional learning models often lack contextual understanding. This study explores the effectiveness of large language models (LLMs) in predicting bug report severity. We propose SevPredict, a novel approach and accompanying severity prediction tool built on GPT-2, and compare it against state-of-the-art models. The comparative analysis suggests that SevPredict outperforms these models on the performance evaluation metrics considered. Against the best-performing state-of-the-art approach (BERT-SBR), SevPredict achieves 1.72% higher accuracy, 2.18% higher precision, and 4.94% higher Matthews correlation coefficient (MCC). The improvements are more substantial against the approach by Ramay et al.: 10.66% higher accuracy, 10.39% higher precision, 3.29% higher recall, 7.19% higher F1-score, and 41.27% higher MCC. These findings demonstrate the advantage of our GPT-2-based approach in predicting the severity of bug reports and highlight its potential to advance automated bug triaging and software maintenance.
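For readers unfamiliar with the reported metrics, the following is a minimal sketch of how accuracy, precision, recall, F1-score, and MCC are conventionally computed from confusion-matrix counts. It assumes a binary severe/non-severe labelling, which the abstract does not state; the function name `binary_metrics` and the counts in the usage line are hypothetical and are not results from the paper.

```python
import math


def binary_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Standard binary classification metrics from confusion-matrix counts.

    Assumption: "severe" is the positive class. Undefined ratios
    (zero denominators) are reported as 0.0.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    # Matthews correlation coefficient: ranges from -1 to 1 and uses all
    # four cells of the confusion matrix.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical counts, for illustration only.
metrics = binary_metrics(tp=40, fp=10, fn=5, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Because MCC uses true negatives as well as the other three cells, it can shift by a different amount than accuracy or F1-score when two classifiers are compared on the same data.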

Original language: English
Pages (from-to): 2739-2760
Number of pages: 22
Journal: AI (Switzerland)
Volume: 5
Issue number: 4
Publication status: Published - Dec 2024

Keywords

  • large language models
  • mining software repository
  • severity prediction
