Overview
- Reviews the history of LI research, including the challenges that have renewed interest in researching the topic
- Compares and contrasts the features and methods commonly used for LI, as well as LI performance evaluation methods
- Highlights the applications of language identification and identifies areas for future research in LI
Part of the book series: Synthesis Lectures on Human Language Technologies (SLHLT)
Access this book
Tax calculation will be finalised at checkout
Other ways to access
About this book
Similar content being viewed by others
Keywords
Table of contents (7 chapters)
Authors and Affiliations
About the authors
Marcos Zampieri, Ph.D., is an Assistant Professor at George Mason University. He received his PhD from Saarland University with a thesis on computational modelling of language variation. He has published over 100 peer-reviewed papers on various topics in computational linguistics and NLP such as language and dialect identification, native language identification, machine translation, lexical complexity prediction, and social media mining.
Timothy Baldwin, Ph.D., is the Acting Provost and Chair of the Department of Natural Language Processing at Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) in addition to being a Melbourne Laureate Professor in the School of Computing and Information Systems at The University of Melbourne. Prior to joining The University of Melbourne, he was a Senior Research Engineer at the Center for the Study of Language and Information at Stanford University. He is the author of over 450 peer-reviewed publications across diverse topics in natural language processing and AI, in addition to being an ARC Future Fellow, and the recipient of a number of prestigious awards at top conferences.
Krister Lindén, Ph.D., is the Research Director of Language Technology at the University of Helsinki in addition to the National Coordinator of FIN-CLARIN, the Finnish Node of CLARIN ERIC, which is a European research infrastructure for Social Sciences and the Humanities. Heis the Chair of the CLARIN National Coordinators Forum and a member of CLIC (Committee for Legal and Ethical Issues in CLARIN). He holds a doctoral degree in Language Technology from the University of Helsinki. He is the co-author of more than 160 publications related to language technology and its utilization in digital humanities and language resource processing. He is currently also a deputy team leader in the Centre of Excellence of Ancient Near Eastern Empires.
Bibliographic Information
Book Title: Automatic Language Identification in Texts
Authors: Tommi Jauhiainen, Marcos Zampieri, Timothy Baldwin, Krister Lindén
Series Title: Synthesis Lectures on Human Language Technologies
DOI: https://doi.org/10.1007/978-3-031-45822-4
Publisher: Springer Cham
eBook Packages: Synthesis Collection of Technology (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2024
Hardcover ISBN: 978-3-031-45821-7Published: 03 January 2024
Softcover ISBN: 978-3-031-45824-8Due: 16 January 2025
eBook ISBN: 978-3-031-45822-4Published: 01 January 2024
Series ISSN: 1947-4040
Series E-ISSN: 1947-4059
Edition Number: 1
Number of Pages: XIV, 148
Number of Illustrations: 2 b/w illustrations, 8 illustrations in colour
Topics: Natural Language Processing (NLP), Computational Linguistics, Computer Applications, Statistics, general, Artificial Intelligence, Computer Science, general