TY - JOUR
T1 - Building a corpus of student academic writing in EMI contexts
T2 - Challenges in corpus design and data collection across international higher education settings
AU - Gablasova, Dana
AU - Harding, Luke
AU - Bottini, Raffaella
AU - Brezina, Vaclav
AU - Ren, Haoshan (Sally)
AU - Iamartino, Giovanni
AU - Li, Yingyu
AU - Liu, Tanjun
AU - Poggesi, Laura
AU - Savski, Kristof
AU - Toomaneejinda, Anuchit
AU - Zottola, Angela
N1 - Publisher Copyright:
© 2024
PY - 2024/12
Y1 - 2024/12
N2 - The article discusses methodological procedures and challenges in a project requiring multi-site, transnational data collection for the construction of a corpus of academic writing in EMI higher education contexts. Drawing on our decision-making experiences as a research team, together with empirical data generated through data collection logs recorded by a network of researchers involved in the project, we reflect on key issues in conducting the project and the solutions we found to address specific challenges. After describing the background to the project and the current status of the corpus, we focus on four broad challenges: (1) selecting partners and managing a multi-site project; (2) defining a working construct of academic writing; (3) categorising data according to disciplinary areas; and (4) managing data collection “on the ground”. Throughout, we provide descriptions of our solutions to the challenges identified, and we conclude with a call for further publication of corpus construction records to provide greater transparency and detail around decisions and judgements made at all stages of a corpus construction project.
AB - The article discusses methodological procedures and challenges in a project requiring multi-site, transnational data collection for the construction of a corpus of academic writing in EMI higher education contexts. Drawing on our decision-making experiences as a research team, together with empirical data generated through data collection logs recorded by a network of researchers involved in the project, we reflect on key issues in conducting the project and the solutions we found to address specific challenges. After describing the background to the project and the current status of the corpus, we focus on four broad challenges: (1) selecting partners and managing a multi-site project; (2) defining a working construct of academic writing; (3) categorising data according to disciplinary areas; and (4) managing data collection “on the ground”. Throughout, we provide descriptions of our solutions to the challenges identified, and we conclude with a call for further publication of corpus construction records to provide greater transparency and detail around decisions and judgements made at all stages of a corpus construction project.
KW - Corpus construction
KW - Corpus data collection
KW - Corpus design
KW - EMI
KW - English as a Medium of Instruction
KW - Written academic English
UR - http://www.scopus.com/inward/record.url?scp=85200857180&partnerID=8YFLogxK
U2 - 10.1016/j.rmal.2024.100140
DO - 10.1016/j.rmal.2024.100140
M3 - Article
AN - SCOPUS:85200857180
SN - 2772-7661
VL - 3
JO - Research Methods in Applied Linguistics
JF - Research Methods in Applied Linguistics
IS - 3
M1 - 100140
ER -