TY - JOUR
T1 - Exploring reddit forum for software evolution as an alternative requirements source
T2 - An end-user discussion dataset on Google maps
AU - Khan, Javed Ali
AU - Khan, Nek Dil
AU - Yaqoob, Muhammad
AU - Yasin, Affan
AU - Alwadain, Ayed
N1 - Publisher Copyright:
© 2024
PY - 2024/12
Y1 - 2024/12
N2 - For software development and evolution, end-user feedback from app stores and the Twitter (X) platform has been intensively used recently. However, Reddit forums that provide an argumentative platform to argue and reason about various software features and issues have been less likely to be explored for software evolution and improvement in the literature. Therefore, this study explores Reddit forums as an alternative source for software evolution compared to App Stores, Twitter (X), and Amazon reviews. For this purpose, a Python script is developed to extract end-user discussions related to the Google Maps (GM) app from Reddit forums using Python Praw API, keep the original argumentative structure in user discussions. In total, 3119 end-user discussions from seven related topics about the GMM app are extracted for software evolution. This dataset includes detailed end-user feedback and associated metadata, including Comments ID, Parent ID, author names, timestamps, and upvotes. This dataset is a crucial and valuable resource for software vendors, developers, researchers, and educationists to improve their understanding of identifying new features to include in upcoming app versions. Also, it is of pivotal importance in better understanding recently occurring issues, unlike app stores where user debate on it and provide their justifications. Moreover, the replication package and process of the dataset can enable software researchers, vendors, and developers to extract data from the Reddit forum and use it for the software evolution and improvement process.
AB - For software development and evolution, end-user feedback from app stores and the Twitter (X) platform has been intensively used recently. However, Reddit forums that provide an argumentative platform to argue and reason about various software features and issues have been less likely to be explored for software evolution and improvement in the literature. Therefore, this study explores Reddit forums as an alternative source for software evolution compared to App Stores, Twitter (X), and Amazon reviews. For this purpose, a Python script is developed to extract end-user discussions related to the Google Maps (GM) app from Reddit forums using Python Praw API, keep the original argumentative structure in user discussions. In total, 3119 end-user discussions from seven related topics about the GMM app are extracted for software evolution. This dataset includes detailed end-user feedback and associated metadata, including Comments ID, Parent ID, author names, timestamps, and upvotes. This dataset is a crucial and valuable resource for software vendors, developers, researchers, and educationists to improve their understanding of identifying new features to include in upcoming app versions. Also, it is of pivotal importance in better understanding recently occurring issues, unlike app stores where user debate on it and provide their justifications. Moreover, the replication package and process of the dataset can enable software researchers, vendors, and developers to extract data from the Reddit forum and use it for the software evolution and improvement process.
KW - Classification
KW - CrowdRE
KW - Google maps feedback
KW - Natural Language Processing (NLP)
KW - Reddit discussions
KW - Requirements dataset
KW - Requirements engineering
KW - Sentiment analysis
UR - http://www.scopus.com/inward/record.url?scp=85206088785&partnerID=8YFLogxK
U2 - 10.1016/j.dib.2024.110993
DO - 10.1016/j.dib.2024.110993
M3 - Article
AN - SCOPUS:85206088785
SN - 2352-3409
VL - 57
JO - Data in Brief
JF - Data in Brief
M1 - 110993
ER -