SSRN 4871732
SSRN 4871732
SSRN 4871732
Note: This preprint is not edited enough and requires further refinement.
Development and Evaluation of Myanmar GPT: A Language
Model for Myanmar Natural Language Processing
Abstract
This paper presents the development and evaluation of Myanmar GPT, the first large language
model (LLM) tailored for the Myanmar language. Myanmar GPT was created by Min Si Thu.
Utilizing a diverse corpus and advanced training techniques, Myanmar GPT aims to bridge the
gap in natural language processing for underrepresented languages. The model's performance is
evaluated using standard metrics, demonstrating significant improvements over existing
solutions. This research contributes a valuable tool for the Myanmar NLP community and sets
the stage for future advancements in the field.
4 Soky, K., Mimura, M., Kawahara, T., Li, 9 Htet, A. K. (2024). Building a Dataset and
S., Ding, C., Chu, C., & Sam, S. (2021, Exploring Low-Resource Approaches to
November). Khmer speech translation Natural Language Inference with Myanmar
corpus of the extraordinary chambers in the (Doctoral dissertation, Macquarie
courts of cambodia (eccc). In 2021 24th University).
Conference of the Oriental COCOSDA
International Committee for the 10 Htet, A. K. (2024). Building a Dataset
Co-ordination and Standardisation of and Exploring Low-Resource Approaches to
Speech Databases and Assessment Natural Language Inference with Myanmar
Techniques (O-COCOSDA) (pp. 122-127). (Doctoral dissertation, Macquarie
IEEE. University).