A hybrid system for code switch point detection in informal Arabic text

Authors: Heba Elfardy, Mohamed Al-Badrashiny, Mona Diab

XRDS: Crossroads, The ACM Magazine for Students, Volume 21, Issue 1

Pages 52 - 57

https://doi.org/10.1145/2659893

Published: 14 October 2014

Abstract

How to detect the switch between a standard and a dialectal form of a language in written text and why this is important for natural language processing tasks.

Publisher: Association for Computing Machinery, New York, NY, United States
