Exploring and mining rationale information for low-rating software applications

Tahir Ullah; Javed Ali Khan; Nek Dil Khan; Affan Yasin; Hasna Arshad

doi:10.1007/s00500-023-09054-3

Exploring and mining rationale information for low-rating software applications

Tahir Ullah, Javed Ali Khan^*, Nek Dil Khan, Affan Yasin, Hasna Arshad

^*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

8 Citations (Scopus)

Abstract

Rationale refers to making human judgments, sets of reasons, or intentions to explain a particular decision. Nowadays, crowd-users argue and justify their decisions on social media platforms about market-driven software applications, thus generating a software rationale. Such rationale information can be of pivotal importance for the software and requirements engineers to enhance the performance of existing software applications by revealing end-users tactic knowledge to improve software designing and development decision-making. For this purpose, we proposed an automated approach to capture and analyze end-user reviews containing rationale information, focusing on low-rating applications in the amazon store using Natural Language Processing (NLP) and supervised machine learning (ML) classification methods. In the literature, high-rating applications have been emphasized while ignoring low-rating software application that causes potential biasness. Therefore, we examined 59 comparatively low-ranked market-based software applications from the Amazon app store covering various software categories to capture and identify crowd-users justifications. Next, using a developed grounded theory and content analysis approach, we studied and recorded how crowd-users analyze and explain their rationale based on issues encountered, attacking or supporting arguments registered, and updating or uninstalling software applications. Also, to achieve the best results, an experimental study is conducted by comparing various ML and deep learning (DL) algorithms, i.e., MNB, LR, RF, MLP, KNN, AdaBoost, Voting, LSTM, and BILSTM classifiers on the end-users rationale data set by preprocessing the input data, applying feature engineering, balancing the data set, and then training and testing the ML algorithms with a standard cross-validation approach. We obtained satisfactory results with MLP, voting, LSTM, and RF Classifiers, having 93%, 93%, 91%, and 90% average accuracy, respectively. Also, we plot the ROC curves for the high-performing ML and DL classifiers to identify and capture classifiers yielding the best performance with an under-sampling or oversampling balancing approach. The proposed research approach outerperforms the existing rationale approaches with better Precision, Recall, and F-measure values. Additionally, the paper discusses various aspects of the proposed approach by extending it in multiple directions to be utilized by software developers and vendors to help improve the performance of existing software applications.

Original language	English
Journal	Soft Computing
DOIs	https://doi.org/10.1007/s00500-023-09054-3
Publication status	Accepted/In press - 2023
Externally published	Yes

Keywords

Amazon store
Deep learning
Grounded theory
Machine learning
NLP
Software rationale
User feedback

Access to Document

10.1007/s00500-023-09054-3

Cite this

@article{f7e353fb85824c6b8c5c1fd6cfe1d0e9,

title = "Exploring and mining rationale information for low-rating software applications",

abstract = "Rationale refers to making human judgments, sets of reasons, or intentions to explain a particular decision. Nowadays, crowd-users argue and justify their decisions on social media platforms about market-driven software applications, thus generating a software rationale. Such rationale information can be of pivotal importance for the software and requirements engineers to enhance the performance of existing software applications by revealing end-users tactic knowledge to improve software designing and development decision-making. For this purpose, we proposed an automated approach to capture and analyze end-user reviews containing rationale information, focusing on low-rating applications in the amazon store using Natural Language Processing (NLP) and supervised machine learning (ML) classification methods. In the literature, high-rating applications have been emphasized while ignoring low-rating software application that causes potential biasness. Therefore, we examined 59 comparatively low-ranked market-based software applications from the Amazon app store covering various software categories to capture and identify crowd-users justifications. Next, using a developed grounded theory and content analysis approach, we studied and recorded how crowd-users analyze and explain their rationale based on issues encountered, attacking or supporting arguments registered, and updating or uninstalling software applications. Also, to achieve the best results, an experimental study is conducted by comparing various ML and deep learning (DL) algorithms, i.e., MNB, LR, RF, MLP, KNN, AdaBoost, Voting, LSTM, and BILSTM classifiers on the end-users rationale data set by preprocessing the input data, applying feature engineering, balancing the data set, and then training and testing the ML algorithms with a standard cross-validation approach. We obtained satisfactory results with MLP, voting, LSTM, and RF Classifiers, having 93%, 93%, 91%, and 90% average accuracy, respectively. Also, we plot the ROC curves for the high-performing ML and DL classifiers to identify and capture classifiers yielding the best performance with an under-sampling or oversampling balancing approach. The proposed research approach outerperforms the existing rationale approaches with better Precision, Recall, and F-measure values. Additionally, the paper discusses various aspects of the proposed approach by extending it in multiple directions to be utilized by software developers and vendors to help improve the performance of existing software applications.",

keywords = "Amazon store, Deep learning, Grounded theory, Machine learning, NLP, Software rationale, User feedback",

author = "Tahir Ullah and Khan, {Javed Ali} and Khan, {Nek Dil} and Affan Yasin and Hasna Arshad",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.",

year = "2023",

doi = "10.1007/s00500-023-09054-3",

language = "English",

journal = "Soft Computing",

issn = "1432-7643",

}

TY - JOUR

T1 - Exploring and mining rationale information for low-rating software applications

AU - Ullah, Tahir

AU - Khan, Javed Ali

AU - Khan, Nek Dil

AU - Yasin, Affan

AU - Arshad, Hasna

PY - 2023

Y1 - 2023

N2 - Rationale refers to making human judgments, sets of reasons, or intentions to explain a particular decision. Nowadays, crowd-users argue and justify their decisions on social media platforms about market-driven software applications, thus generating a software rationale. Such rationale information can be of pivotal importance for the software and requirements engineers to enhance the performance of existing software applications by revealing end-users tactic knowledge to improve software designing and development decision-making. For this purpose, we proposed an automated approach to capture and analyze end-user reviews containing rationale information, focusing on low-rating applications in the amazon store using Natural Language Processing (NLP) and supervised machine learning (ML) classification methods. In the literature, high-rating applications have been emphasized while ignoring low-rating software application that causes potential biasness. Therefore, we examined 59 comparatively low-ranked market-based software applications from the Amazon app store covering various software categories to capture and identify crowd-users justifications. Next, using a developed grounded theory and content analysis approach, we studied and recorded how crowd-users analyze and explain their rationale based on issues encountered, attacking or supporting arguments registered, and updating or uninstalling software applications. Also, to achieve the best results, an experimental study is conducted by comparing various ML and deep learning (DL) algorithms, i.e., MNB, LR, RF, MLP, KNN, AdaBoost, Voting, LSTM, and BILSTM classifiers on the end-users rationale data set by preprocessing the input data, applying feature engineering, balancing the data set, and then training and testing the ML algorithms with a standard cross-validation approach. We obtained satisfactory results with MLP, voting, LSTM, and RF Classifiers, having 93%, 93%, 91%, and 90% average accuracy, respectively. Also, we plot the ROC curves for the high-performing ML and DL classifiers to identify and capture classifiers yielding the best performance with an under-sampling or oversampling balancing approach. The proposed research approach outerperforms the existing rationale approaches with better Precision, Recall, and F-measure values. Additionally, the paper discusses various aspects of the proposed approach by extending it in multiple directions to be utilized by software developers and vendors to help improve the performance of existing software applications.

AB - Rationale refers to making human judgments, sets of reasons, or intentions to explain a particular decision. Nowadays, crowd-users argue and justify their decisions on social media platforms about market-driven software applications, thus generating a software rationale. Such rationale information can be of pivotal importance for the software and requirements engineers to enhance the performance of existing software applications by revealing end-users tactic knowledge to improve software designing and development decision-making. For this purpose, we proposed an automated approach to capture and analyze end-user reviews containing rationale information, focusing on low-rating applications in the amazon store using Natural Language Processing (NLP) and supervised machine learning (ML) classification methods. In the literature, high-rating applications have been emphasized while ignoring low-rating software application that causes potential biasness. Therefore, we examined 59 comparatively low-ranked market-based software applications from the Amazon app store covering various software categories to capture and identify crowd-users justifications. Next, using a developed grounded theory and content analysis approach, we studied and recorded how crowd-users analyze and explain their rationale based on issues encountered, attacking or supporting arguments registered, and updating or uninstalling software applications. Also, to achieve the best results, an experimental study is conducted by comparing various ML and deep learning (DL) algorithms, i.e., MNB, LR, RF, MLP, KNN, AdaBoost, Voting, LSTM, and BILSTM classifiers on the end-users rationale data set by preprocessing the input data, applying feature engineering, balancing the data set, and then training and testing the ML algorithms with a standard cross-validation approach. We obtained satisfactory results with MLP, voting, LSTM, and RF Classifiers, having 93%, 93%, 91%, and 90% average accuracy, respectively. Also, we plot the ROC curves for the high-performing ML and DL classifiers to identify and capture classifiers yielding the best performance with an under-sampling or oversampling balancing approach. The proposed research approach outerperforms the existing rationale approaches with better Precision, Recall, and F-measure values. Additionally, the paper discusses various aspects of the proposed approach by extending it in multiple directions to be utilized by software developers and vendors to help improve the performance of existing software applications.

KW - Amazon store

KW - Deep learning

KW - Grounded theory

KW - Machine learning

KW - NLP

KW - Software rationale

KW - User feedback

UR - http://www.scopus.com/inward/record.url?scp=85166902679&partnerID=8YFLogxK

U2 - 10.1007/s00500-023-09054-3

DO - 10.1007/s00500-023-09054-3

M3 - Article

AN - SCOPUS:85166902679

SN - 1432-7643

JO - Soft Computing

JF - Soft Computing

ER -

Exploring and mining rationale information for low-rating software applications

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this