Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services

Chaomin Wu; Di Wu; Yushan Pan; Hao Wang

doi:10.1007/978-981-97-9431-7_5

Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services

Chaomin Wu, Di Wu, Yushan Pan, Hao Wang^*

^*Corresponding author for this work

Xi'an Jiaotong-Liverpool University

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

Abstract

This paper identifies the limitations of current AI datasets within the public service sector, specifically concerning the human-robot interaction (HRI) context. Existing datasets often lack the necessary interactive features for effective and efficient interactions, hindering the development of customized and emotionally responsive systems. As public service demands become more diverse and complex in HRI, traditional datasets fail to support high-quality interactions, necessitating significant improvements. To address this issue, we introduce a QA dialogue dataset specifically tailored for public service applications, comprising 1208 pairs generated by large language model. This dataset integrates textual and emotional data, providing detailed annotations for interaction quality and emotional accuracy. Our method includes four stages: data generation, annotation, emotion analysis, and performance evaluation. During the data generation stage, GPT-4 is employed to create a diverse set of dialogues. In the annotation stage, these dialogues are meticulously labeled for quality and emotional content. The emotion analysis stage utilizes various recognition algorithms to process the data. Finally, the performance evaluation stage involves experiments to validate the dataset’s effectiveness. Comparative experiments demonstrate the dataset’s efficacy in enhancing the adaptability and performance of public service robots, underscoring its potential for training AI models to effectively handle real-world dialogues.

Original language	English
Title of host publication	Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings
Editors	Derek F. Wong, Zhongyu Wei, Muyun Yang
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	56-68
Number of pages	13
ISBN (Print)	9789819794300
DOIs	https://doi.org/10.1007/978-981-97-9431-7_5
Publication status	Published - 2025
Event	13th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2024 - Hangzhou, China Duration: 1 Nov 2024 → 3 Nov 2024

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	15359 LNAI
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	13th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2024
Country/Territory	China
City	Hangzhou
Period	1/11/24 → 3/11/24

Keywords

Emotion Analysis
Human-Robot Interaction
Public Service
QA Dialogue Datasets

Access to Document

10.1007/978-981-97-9431-7_5

Cite this

Wu, C., Wu, D., Pan, Y., & Wang, H. (2025). Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services. In D. F. Wong, Z. Wei, & M. Yang (Eds.), Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings (pp. 56-68). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15359 LNAI). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-97-9431-7_5

Wu, Chaomin ; Wu, Di ; Pan, Yushan et al. / Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services. Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings. editor / Derek F. Wong ; Zhongyu Wei ; Muyun Yang. Springer Science and Business Media Deutschland GmbH, 2025. pp. 56-68 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{8a75a093383243958e6f1814fed40eea,

title = "Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services",

abstract = "This paper identifies the limitations of current AI datasets within the public service sector, specifically concerning the human-robot interaction (HRI) context. Existing datasets often lack the necessary interactive features for effective and efficient interactions, hindering the development of customized and emotionally responsive systems. As public service demands become more diverse and complex in HRI, traditional datasets fail to support high-quality interactions, necessitating significant improvements. To address this issue, we introduce a QA dialogue dataset specifically tailored for public service applications, comprising 1208 pairs generated by large language model. This dataset integrates textual and emotional data, providing detailed annotations for interaction quality and emotional accuracy. Our method includes four stages: data generation, annotation, emotion analysis, and performance evaluation. During the data generation stage, GPT-4 is employed to create a diverse set of dialogues. In the annotation stage, these dialogues are meticulously labeled for quality and emotional content. The emotion analysis stage utilizes various recognition algorithms to process the data. Finally, the performance evaluation stage involves experiments to validate the dataset{\textquoteright}s effectiveness. Comparative experiments demonstrate the dataset{\textquoteright}s efficacy in enhancing the adaptability and performance of public service robots, underscoring its potential for training AI models to effectively handle real-world dialogues.",

keywords = "Emotion Analysis, Human-Robot Interaction, Public Service, QA Dialogue Datasets",

author = "Chaomin Wu and Di Wu and Yushan Pan and Hao Wang",

note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.; 13th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2024 ; Conference date: 01-11-2024 Through 03-11-2024",

year = "2025",

doi = "10.1007/978-981-97-9431-7_5",

language = "English",

isbn = "9789819794300",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "56--68",

editor = "Wong, {Derek F.} and Zhongyu Wei and Muyun Yang",

booktitle = "Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings",

}

Wu, C, Wu, D, Pan, Y & Wang, H 2025, Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services. in DF Wong, Z Wei & M Yang (eds), Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 15359 LNAI, Springer Science and Business Media Deutschland GmbH, pp. 56-68, 13th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2024, Hangzhou, China, 1/11/24. https://doi.org/10.1007/978-981-97-9431-7_5

Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services. / Wu, Chaomin; Wu, Di; Pan, Yushan et al.
Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings. ed. / Derek F. Wong; Zhongyu Wei; Muyun Yang. Springer Science and Business Media Deutschland GmbH, 2025. p. 56-68 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 15359 LNAI).

Research output: Chapter in Book or Report/Conference proceeding › Conference Proceeding › peer-review

TY - GEN

T1 - Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services

AU - Wu, Chaomin

AU - Wu, Di

AU - Pan, Yushan

AU - Wang, Hao

N1 - Publisher Copyright: © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

PY - 2025

Y1 - 2025

N2 - This paper identifies the limitations of current AI datasets within the public service sector, specifically concerning the human-robot interaction (HRI) context. Existing datasets often lack the necessary interactive features for effective and efficient interactions, hindering the development of customized and emotionally responsive systems. As public service demands become more diverse and complex in HRI, traditional datasets fail to support high-quality interactions, necessitating significant improvements. To address this issue, we introduce a QA dialogue dataset specifically tailored for public service applications, comprising 1208 pairs generated by large language model. This dataset integrates textual and emotional data, providing detailed annotations for interaction quality and emotional accuracy. Our method includes four stages: data generation, annotation, emotion analysis, and performance evaluation. During the data generation stage, GPT-4 is employed to create a diverse set of dialogues. In the annotation stage, these dialogues are meticulously labeled for quality and emotional content. The emotion analysis stage utilizes various recognition algorithms to process the data. Finally, the performance evaluation stage involves experiments to validate the dataset’s effectiveness. Comparative experiments demonstrate the dataset’s efficacy in enhancing the adaptability and performance of public service robots, underscoring its potential for training AI models to effectively handle real-world dialogues.

AB - This paper identifies the limitations of current AI datasets within the public service sector, specifically concerning the human-robot interaction (HRI) context. Existing datasets often lack the necessary interactive features for effective and efficient interactions, hindering the development of customized and emotionally responsive systems. As public service demands become more diverse and complex in HRI, traditional datasets fail to support high-quality interactions, necessitating significant improvements. To address this issue, we introduce a QA dialogue dataset specifically tailored for public service applications, comprising 1208 pairs generated by large language model. This dataset integrates textual and emotional data, providing detailed annotations for interaction quality and emotional accuracy. Our method includes four stages: data generation, annotation, emotion analysis, and performance evaluation. During the data generation stage, GPT-4 is employed to create a diverse set of dialogues. In the annotation stage, these dialogues are meticulously labeled for quality and emotional content. The emotion analysis stage utilizes various recognition algorithms to process the data. Finally, the performance evaluation stage involves experiments to validate the dataset’s effectiveness. Comparative experiments demonstrate the dataset’s efficacy in enhancing the adaptability and performance of public service robots, underscoring its potential for training AI models to effectively handle real-world dialogues.

KW - Emotion Analysis

KW - Human-Robot Interaction

KW - Public Service

KW - QA Dialogue Datasets

UR - http://www.scopus.com/inward/record.url?scp=85209773197&partnerID=8YFLogxK

U2 - 10.1007/978-981-97-9431-7_5

DO - 10.1007/978-981-97-9431-7_5

M3 - Conference Proceeding

AN - SCOPUS:85209773197

SN - 9789819794300

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 56

EP - 68

BT - Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings

A2 - Wong, Derek F.

A2 - Wei, Zhongyu

A2 - Yang, Muyun

PB - Springer Science and Business Media Deutschland GmbH

T2 - 13th CCF International Conference on Natural Language Processing and Chinese Computing, NLPCC 2024

Y2 - 1 November 2024 through 3 November 2024

ER -

Wu C, Wu D, Pan Y, Wang H. Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services. In Wong DF, Wei Z, Yang M, editors, Natural Language Processing and Chinese Computing - 13th National CCF Conference, NLPCC 2024, Proceedings. Springer Science and Business Media Deutschland GmbH. 2025. p. 56-68. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-981-97-9431-7_5

Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

环境赋使场景下的多模态机械臂响应技术研究

Cite this

Leveraging Large Language Models for QA Dialogue Dataset Construction and Analysis in Public Services

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Projects

环境赋使场景下的多模态机械臂响应技术研究

Cite this