"Dolma: an Open Corpus of Three Trillion Tokens for Language Model ..."

Luca Soldaini et al. (2024)

Details and statistics

DOI: 10.18653/V1/2024.ACL-LONG.840

access: open

type: Conference or Workshop Paper

metadata version: 2024-12-03