TY - JOUR
T1 - QARR-FSQA: Question-Answer Replacement and Removal Pretraining Framework for Few-Shot Question Answering
AU - Tan, Siao Wah
AU - Lee, Chin Poo
AU - Lim, Kian Ming
AU - Tee, Connie
AU - Alqahtani, Ali
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
AB - In Natural Language Processing, creating training data for question answering (QA) systems typically requires significant effort and expertise. This challenge is amplified in few-shot scenarios, where only a limited number of training samples are available. This paper proposes a novel pretraining framework to enhance few-shot question answering (FSQA) capabilities. It begins with the selection of the Discrete Reasoning Over the Content of Paragraphs (DROP) dataset, designed for English reading comprehension tasks involving various reasoning types. Data preprocessing converts question-answer pairs into a predefined template: the input sequence concatenates the question, a mask token with a prefix, and the context, while the target sequence comprises the question and answer. The Question-Answer Replacement and Removal (QARR) technique augments the dataset by integrating the answer into the question and selectively removing words, and various templates for question-answer pairs are introduced. Models such as BART, T5, and LED are then further pretrained on the augmented dataset with their respective architectures and optimization objectives to evaluate the framework's performance. The study also investigates the impact of different templates on model performance in few-shot QA tasks. Evaluated on three datasets in few-shot scenarios, the QARR-T5 method outperforms state-of-the-art FSQA techniques, achieving the highest F1 scores on the SQuAD dataset: 81.7% in the 16-shot and 32-shot settings, 82.7% in 64-shot, and 84.5% in 128-shot. These results demonstrate the framework's effectiveness in improving models' generalization and performance on new datasets with limited samples, advancing few-shot QA.
KW - few-shot question answering
KW - generative question answering models
KW - natural language processing
KW - pretraining framework
UR - http://www.scopus.com/inward/record.url?scp=85208216890&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3487581
DO - 10.1109/ACCESS.2024.3487581
M3 - Article
AN - SCOPUS:85208216890
SN - 2169-3536
VL - 12
SP - 159280
EP - 159295
JO - IEEE Access
JF - IEEE Access
ER -