Using C-LARA to evaluate GPT-4’s multilingual processing

ChatGPT C-LARA-Instance; Belinda Chiera; Cathy Chua; Chadi Raheb; Manny Rayner; Annika Simonsen; Zhengkang Xiang; Rina Zviel-Girshin

Using C-LARA to evaluate GPT-4’s multilingual processing

ChatGPT C-LARA-Instance, Belinda Chiera, Cathy Chua, Chadi Raheb, Manny Rayner, Annika Simonsen, Zhengkang Xiang, Rina Zviel-Girshin

Abstract

We present a cross-linguistic study in which the open source C-LARA platform was used to evaluate GPT-4’s ability to perform several key tasks relevant to Computer Assisted Language Learning. For each of the languages English, Farsi, Faroese, Mandarin and Russian, we instructed GPT-4, through C-LARA, to write six different texts, using prompts chosen to obtain texts of widely differing character. We then further instructed GPT-4 to annotate each text with segmentation markup, glosses and lemma/part-of-speech information; native speakers hand-corrected the texts and annotations to obtain error rates on the different component tasks. The C-LARA platform makes it easy to combine the results into a single multimodal document, further facilitating checking of their correctness. GPT-4’s performance varied widely across languages and processing tasks, but performance on different text genres was roughly comparable. In some cases, most notably glossing of English text, we found that GPT-4 was consistently able to revise its annotations to improve them.

Anthology ID:: 2023.alta-1.3
Volume:: Proceedings of the 21st Annual Workshop of the Australasian Language Technology Association
Month:: November
Year:: 2023
Address:: Melbourne, Australia
Editors:: Smaranda Muresan, Vivian Chen, Kennington Casey, Vandyke David, Dethlefs Nina, Inoue Koji, Ekstedt Erik, Ultes Stefan
Venue:: ALTA
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 20–29
Language:
URL:: https://aclanthology.org/2023.alta-1.3
DOI:
Bibkey:
Cite (ACL):: ChatGPT C-LARA-Instance, Belinda Chiera, Cathy Chua, Chadi Raheb, Manny Rayner, Annika Simonsen, Zhengkang Xiang, and Rina Zviel-Girshin. 2023. Using C-LARA to evaluate GPT-4’s multilingual processing. In Proceedings of the 21st Annual Workshop of the Australasian Language Technology Association, pages 20–29, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):: Using C-LARA to evaluate GPT-4’s multilingual processing (C-LARA-Instance et al., ALTA 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.alta-1.3.pdf

PDF Cite Search