Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3503162.3505241acmotherconferencesArticle/Chapter ViewAbstractPublication PagesfireConference Proceedingsconference-collections
abstract

UrduThreat@ FIRE2021: Shared Track on Abusive Threat Identification in Urdu

Published: 26 January 2022 Publication History

Abstract

With the growth of spread and importance of social media platforms, the effect of their misuse became more and more impactful. This shared task address the task of abusive and threatening language detection in Urdu language that has more than 230 million speakers worldwide. We presented two datasets: (i) Abusive and Non-Abusive language, (ii) Threatening and Non-Threatening language. The abusive dataset contains 1,187 tweets categorized as Abusive and 1,213 as Non-Abusive and the threatening dataset contains 4,929 tweets categorized as Non-Threatening and 1,071 as Threatening. In this shared task, 21 teams registered for participation from six countries (India, Pakistan, China, Malaysia, United Arab Emirates, Taiwan), 10 teams submitted their runs for Subtask A — Abusive Language Detection, 9 teams submitted their runs for Subtask B — Threatening Language detection, and seven teams submitted their technical reports. We provided one baseline system for Subtask A and three baseline systems for Subtask B. The best performing system achieved an F-score value of 0.88 for Subtask A and 0.545 for Subtask B. For both subtasks, m-Bert based transformer models showed the best performance.

References

[1]
Maaz Amjad, Noman Ashraf, Alisa Zhila, Grigori Sidorov, Arkaitz Zubiaga, and Alexander Gelbukh. 2021. Automatic Abusive Language Detection in Urdu Tweets. Acta Polytechnica Hungarica(2021), 1785–8860.
[2]
Maaz Amjad, Noman Ashraf, Alisa Zhila, Grigori Sidorov, Arkaitz Zubiaga, and Alexander Gelbukh. 2021. Threatening Language Detection and Target Identification in Urdu Tweets. IEEE Access 9(2021), 128302–128313.
[3]
Maaz Amjad, Grigori Sidorov, Alisa Zhila, Helena Gómez-Adorno, Ilia Voronkov, and Alexander Gelbukh. 2020. Bend the Truth: A benchmark dataset for fake news detection in Urdu and its evaluation. Journal of Intelligent & Fuzzy Systems 39, 2 (2020), 2457–2469. https://doi.org/10.3233/JIFS-179905
[4]
Maaz Amjad, Alisa Zhila, Grigori Sidorov, Andrey Labunets, Sabur Butt, Hamza Imam Amjad, Oxana Vitman, and Alexander Gelbukh. 2021. Overview of Abusive and Threatening Language Detection in Urdu at FIRE 2021.” In CEUR Workshop Proceedings. (2021). CEUR Workshop Proceedings.

Cited By

View all
  • (2025)UrduHope: Analysis of hope and hopelessness in Urdu textsKnowledge-Based Systems10.1016/j.knosys.2024.112746308(112746)Online publication date: Jan-2025
  • (2024)Multilingual Detection of Cyberbullying in Mixed Urdu, Roman Urdu, and English Social Media ConversationsIEEE Access10.1109/ACCESS.2024.343290812(105201-105210)Online publication date: 2024
  • (2024)Reading Between the Lines: Machine Learning Ensemble and Deep Learning for Implied Threat Detection in Textual DataInternational Journal of Computational Intelligence Systems10.1007/s44196-024-00580-y17:1Online publication date: 15-Jul-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
FIRE '21: Proceedings of the 13th Annual Meeting of the Forum for Information Retrieval Evaluation
December 2021
113 pages
ISBN:9781450395960
DOI:10.1145/3503162
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 26 January 2022

Check for updates

Author Tags

  1. Abusive languages detection
  2. Urdu language
  3. low resource languages
  4. threatening languages detection

Qualifiers

  • Abstract
  • Research
  • Refereed limited

Conference

FIRE 2021
FIRE 2021: Forum for Information Retrieval Evaluation
December 13 - 17, 2021
Virtual Event, India

Acceptance Rates

Overall Acceptance Rate 19 of 64 submissions, 30%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)UrduHope: Analysis of hope and hopelessness in Urdu textsKnowledge-Based Systems10.1016/j.knosys.2024.112746308(112746)Online publication date: Jan-2025
  • (2024)Multilingual Detection of Cyberbullying in Mixed Urdu, Roman Urdu, and English Social Media ConversationsIEEE Access10.1109/ACCESS.2024.343290812(105201-105210)Online publication date: 2024
  • (2024)Reading Between the Lines: Machine Learning Ensemble and Deep Learning for Implied Threat Detection in Textual DataInternational Journal of Computational Intelligence Systems10.1007/s44196-024-00580-y17:1Online publication date: 15-Jul-2024
  • (2023)Detection of offensive terms in resource-poor language using machine learning algorithmsPeerJ Computer Science10.7717/peerj-cs.15249(e1524)Online publication date: 29-Aug-2023
  • (2023)Hate Speech Detection in Indian Languages: A Brief Survey2023 IEEE 2nd International Conference on Data, Decision and Systems (ICDDS)10.1109/ICDDS59137.2023.10434756(1-5)Online publication date: 1-Dec-2023
  • (2023)Analysis of Fake News Detection MethodsRecent Developments and the New Directions of Research, Foundations, and Applications10.1007/978-3-031-23476-7_13(131-144)Online publication date: 27-Jun-2023
  • (2022)Evaluating the Impact of OCR Quality on Short Texts Classification TaskAdvances in Computational Intelligence10.1007/978-3-031-19496-2_13(163-177)Online publication date: 24-Oct-2022

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media