[CITATION][C] Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv

J Devlin, MW Chang, K Lee… - arXiv preprint …, 2019 - Retrieved 2023-01-17, from http …