TY - GEN
T1 - Improving Biomedical Claim Detection using Prompt Learning Approaches
AU - Chen, Tong
AU - Stefanidis, Angelos
AU - Jiang, Zhengyong
AU - Su, Jionglong
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Biomedical claim detection is an effective method to uncover negative effects arising from the treatment of disease and to detect misinformation about medications on online platforms. Due to the power of pre-trained language models (PLMs), such as BERT, RoBERTa, and T5, fine-tuned PLMs perform exceptionally well in biomedical claim detection. However, in text classification a gap exists between the objective forms used in pre-training and in fine-tuning of PLMs, preventing these models from taking full advantage of the available information for biomedical claim detection. Motivated by the prompt learning approach, we propose a method in which the classification task is transformed into a masked language modeling task that fully utilizes the mask learning capability of PLMs for better biomedical claim detection. In our method, a template with a mask representing the label is first constructed, and the mask is then filled and mapped to the corresponding label. We use three PLMs as backbone models, i.e., BERT, RoBERTa, and T5, with both hard and mixed templates, which are fully and partially predefined templates, respectively. Experimental results on the BioClaim dataset demonstrate the superiority of the prompt learning methods over the BERT and RoBERTa classification baselines. Furthermore, the T5 model with the mixed template consistently outperforms the rest of the models tested and achieves state-of-the-art performance, with an increase of 5.3% in F1-score compared to previous research on this dataset.
AB - Biomedical claim detection is an effective method to uncover negative effects arising from the treatment of disease and to detect misinformation about medications on online platforms. Due to the power of pre-trained language models (PLMs), such as BERT, RoBERTa, and T5, fine-tuned PLMs perform exceptionally well in biomedical claim detection. However, in text classification a gap exists between the objective forms used in pre-training and in fine-tuning of PLMs, preventing these models from taking full advantage of the available information for biomedical claim detection. Motivated by the prompt learning approach, we propose a method in which the classification task is transformed into a masked language modeling task that fully utilizes the mask learning capability of PLMs for better biomedical claim detection. In our method, a template with a mask representing the label is first constructed, and the mask is then filled and mapped to the corresponding label. We use three PLMs as backbone models, i.e., BERT, RoBERTa, and T5, with both hard and mixed templates, which are fully and partially predefined templates, respectively. Experimental results on the BioClaim dataset demonstrate the superiority of the prompt learning methods over the BERT and RoBERTa classification baselines. Furthermore, the T5 model with the mixed template consistently outperforms the rest of the models tested and achieves state-of-the-art performance, with an increase of 5.3% in F1-score compared to previous research on this dataset.
KW - Claim detection
KW - Natural language processing
KW - Pre-trained language models
KW - Prompt learning
UR - http://www.scopus.com/inward/record.url?scp=85182017974&partnerID=8YFLogxK
U2 - 10.1109/PRML59573.2023.10348317
DO - 10.1109/PRML59573.2023.10348317
M3 - Conference Proceeding
AN - SCOPUS:85182017974
T3 - 2023 IEEE 4th International Conference on Pattern Recognition and Machine Learning, PRML 2023
SP - 369
EP - 376
BT - 2023 IEEE 4th International Conference on Pattern Recognition and Machine Learning, PRML 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th IEEE International Conference on Pattern Recognition and Machine Learning, PRML 2023
Y2 - 4 August 2023 through 6 August 2023
ER -