Enhancing SLM via ChatGPT and Dataset Augmentation

Pieper, Tom; Ballout, Mohamad; Krumnack, Ulf; Heidemann, Gunther; Kühnberger, Kai-Uwe

Abstract: This paper explores the enhancement of small language models (SLMs) through strategic dataset augmentation with ChatGPT-3.5-Turbo in the domain of Natural Language Inference (NLI). By employing knowledge-distillation-based techniques and synthetic dataset augmentation, we aim to bridge the performance gap between large language models (LLMs) and SLMs without the immense cost of human annotation. Our methods use two forms of rationale generation, information extraction and informed reasoning, to enrich the ANLI dataset. We then fine-tune T5-Small on these augmented datasets and evaluate its performance against an established benchmark. Our findings reveal that incorporating synthetic rationales significantly improves the model's ability to comprehend natural language: the two rationale types yield 1.3% and 2.3% higher classification accuracy, respectively, on the ANLI dataset, demonstrating the potential of leveraging LLMs for dataset augmentation. This approach not only enhances the performance of smaller models on complex tasks but also offers a cost-effective method for fine-tuning them. By advancing our understanding of knowledge distillation and fine-tuning strategies, this work contributes to the ongoing effort to create more capable and efficient NLP systems.
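
As a rough illustration of the augmentation idea described in the abstract, the sketch below formats one NLI example, together with a synthetic rationale, as a source/target string pair of the kind a text-to-text model such as T5-Small consumes. The abstract does not specify the prompt format or whether the rationale is placed in the input or the target, so the `NLIExample` class, the `to_text_pair` helper, the `nli premise:` / `explanation:` prefixes, and the choice to append the rationale to the target are all assumptions made for illustration, not the authors' actual setup.

```python
from dataclasses import dataclass
from typing import Tuple

# The three NLI classes used by ANLI.
LABELS = ("entailment", "neutral", "contradiction")


@dataclass
class NLIExample:
    """One NLI item plus a synthetic rationale (hypothetical container)."""
    premise: str
    hypothesis: str
    label: str
    rationale: str  # text generated by a larger model, e.g. via an API call


def to_text_pair(example: NLIExample, with_rationale: bool = True) -> Tuple[str, str]:
    """Format an example as (source, target) strings for a text-to-text model.

    Assumption: the rationale is appended to the target after the label.
    The paper may place it elsewhere or use different prefixes.
    """
    if example.label not in LABELS:
        raise ValueError(f"unknown label: {example.label!r}")
    source = f"nli premise: {example.premise} hypothesis: {example.hypothesis}"
    target = example.label
    if with_rationale:
        target = f"{example.label} explanation: {example.rationale}"
    return source, target
```

Calling `to_text_pair(example, with_rationale=False)` produces the plain label-only target, which gives a simple way to build the non-augmented baseline from the same data.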

Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2409.12599 [cs.CL] (or arXiv:2409.12599v1 [cs.CL] for this version)
DOI: https://doi.org/10.48550/arXiv.2409.12599
