Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 64 results for author: Ahmad, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10152  [pdf, other

    cs.CL

    Mitigating Translationese in Low-resource Languages: The Storyboard Approach

    Authors: Garry Kuwanto, Eno-Abasi E. Urua, Priscilla Amondi Amuok, Shamsuddeen Hassan Muhammad, Anuoluwapo Aremu, Verrah Otiende, Loice Emma Nanyanga, Teresiah W. Nyoike, Aniefon D. Akpan, Nsima Ab Udouboh, Idongesit Udeme Archibong, Idara Effiong Moses, Ifeoluwatayo A. Ige, Benjamin Ajibade, Olumide Benjamin Awokoya, Idris Abdulmumin, Saminu Mohammad Aliyu, Ruqayya Nasir Iro, Ibrahim Said Ahmad, Deontae Smith, Praise-EL Michaels, David Ifeoluwa Adelani, Derry Tanti Wijaya, Anietie Andy

    Abstract: Low-resource languages often face challenges in acquiring high-quality language data due to the reliance on translation-based methods, which can introduce the translationese effect. This phenomenon results in translated sentences that lack fluency and naturalness in the target language. In this paper, we propose a novel approach for data collection by leveraging storyboards to elicit more fluent a… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: published at LREC-COLING 2024

    ACM Class: I.2.7

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) 11349-11360

  2. arXiv:2407.02631  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Nollywood: Let's Go to the Movies!

    Authors: John E. Ortega, Ibrahim Said Ahmad, William Chen

    Abstract: Nollywood, based on the idea of Bollywood from India, is a series of outstanding movies that originate from Nigeria. Unfortunately, while the movies are in English, they are hard to understand for many native speakers due to the dialect of English that is spoken. In this article, we accomplish two goals: (1) create a phonetic sub-title model that is able to translate Nigerian English speech to Ame… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 8 pages, 4 figures, 2 tables

  3. arXiv:2406.19504  [pdf, other

    cs.CL

    Are Generative Language Models Multicultural? A Study on Hausa Culture and Emotions using ChatGPT

    Authors: Ibrahim Said Ahmad, Shiran Dudy, Resmi Ramachandranpillai, Kenneth Church

    Abstract: Large Language Models (LLMs), such as ChatGPT, are widely used to generate content for various purposes and audiences. However, these models may not reflect the cultural and emotional diversity of their users, especially for low-resource languages. In this paper, we investigate how ChatGPT represents Hausa's culture and emotions. We compare responses generated by ChatGPT with those provided by nat… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2406.03786  [pdf, other

    cs.CR

    Adaptive Lightweight Security for Performance Efficiency in Critical Healthcare Monitoring

    Authors: Ijaz Ahmad, Faheem Shahid, Ijaz Ahmad, Johirul Islam, Kazi Nymul Haque, Erkki Harjula

    Abstract: The healthcare infrastructure requires robust security procedures, technologies, and policies due to its critical nature. Since the Internet of Things (IoT) with its diverse technologies has become an integral component of future healthcare systems, its security requires a thorough analysis due to its inherent security limitations that arise from resource constraints. Existing communication techno… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 6 pages, 7 figures, 3 tables

  5. arXiv:2404.19025  [pdf

    cs.SE cs.CR

    Unsupervised Binary Code Translation with Application to Code Similarity Detection and Vulnerability Discovery

    Authors: Iftakhar Ahmad, Lannan Luo

    Abstract: Binary code analysis has immense importance in the research domain of software security. Today, software is very often compiled for various Instruction Set Architectures (ISAs). As a result, cross-architecture binary code analysis has become an emerging problem. Recently, deep learning-based binary analysis has shown promising success. It is widely known that training a deep learning model require… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: conference

    Journal ref: The 2023 Conference on Empirical Methods in Natural Language Processing. 2023

  6. arXiv:2403.18933  [pdf, other

    cs.CL

    SemEval-2024 Task 1: Semantic Textual Relatedness for African and Asian Languages

    Authors: Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Meriem Beloucif, Christine De Kock, Oumaima Hourrane, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Krishnapriya Vishnubhotla, Seid Muhie Yimam, Saif M. Mohammad

    Abstract: We present the first shared task on Semantic Textual Relatedness (STR). While earlier shared tasks primarily focused on semantic similarity, we instead investigate the broader phenomenon of semantic relatedness across 14 languages: Afrikaans, Algerian Arabic, Amharic, English, Hausa, Hindi, Indonesian, Kinyarwanda, Marathi, Moroccan Arabic, Modern Standard Arabic, Punjabi, Spanish, and Telugu. The… ▽ More

    Submitted 17 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: SemEval 2024 Task Description Paper. arXiv admin note: text overlap with arXiv:2402.08638

  7. arXiv:2403.12980  [pdf, other

    cs.DC

    Containerization in Multi-Cloud Environment: Roles, Strategies, Challenges, and Solutions for Effective Implementation

    Authors: Muhammad Waseem, Aakash Ahmad, Peng Liang, Muhammad Azeem Akbar, Arif Ali Khan, Iftikhar Ahmad, Manu Setälä, Tommi Mikkonen

    Abstract: Containerization in a multi-cloud environment facilitates workload portability and optimized resource utilization. Containerization in multi-cloud environments has received significant attention in recent years both from academic research and industrial development perspectives. However, there exists no effort to systematically investigate the state of research on this topic. The aim of this resea… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 59 pages, 4 images, 16 tables, Manuscript submitted to a Journal (2024)

  8. arXiv:2403.01100  [pdf, other

    cs.CR

    Adaptive Security in 6G for Sustainable Healthcare

    Authors: Ijaz Ahmad, Ijaz Ahmad, Erkki Harjula

    Abstract: 6G will fulfill the requirements of future digital healthcare systems through emerging decentralized computing and secure communications technologies. Digital healthcare solutions employ numerous low-power and resource-constrained connected things, such as the Internet of Medical Things (IoMT). However, the current digital healthcare solutions will face two major challenges. First, the proposed so… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 9 pages,2 figures, accepted by NCDHWS, to be published by Springer

  9. arXiv:2402.08638  [pdf, other

    cs.CL

    SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages

    Authors: Nedjma Ousidhoum, Shamsuddeen Hassan Muhammad, Mohamed Abdalla, Idris Abdulmumin, Ibrahim Said Ahmad, Sanchit Ahuja, Alham Fikri Aji, Vladimir Araujo, Abinew Ali Ayele, Pavan Baswani, Meriem Beloucif, Chris Biemann, Sofia Bourhim, Christine De Kock, Genet Shanko Dekebo, Oumaima Hourrane, Gopichand Kanumolu, Lokesh Madasu, Samuel Rutunda, Manish Shrivastava, Thamar Solorio, Nirmal Surange, Hailegnaw Getaneh Tilaye, Krishnapriya Vishnubhotla, Genta Winata , et al. (2 additional authors not shown)

    Abstract: Exploring and quantifying semantic relatedness is central to representing language and holds significant implications across various NLP tasks. While earlier NLP research primarily focused on semantic similarity, often within the English language context, we instead investigate the broader phenomenon of semantic relatedness. In this paper, we present \textit{SemRel}, a new semantic relatedness dat… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted to the Findings of ACL 2024

  10. arXiv:2402.04141  [pdf, other

    cs.SE cs.AI

    Multi-line AI-assisted Code Authoring

    Authors: Omer Dunay, Daniel Cheng, Adam Tait, Parth Thakkar, Peter C Rigby, Andy Chiu, Imad Ahmad, Arun Ganesan, Chandra Maddila, Vijayaraghavan Murali, Ali Tayyebi, Nachiappan Nagappan

    Abstract: CodeCompose is an AI-assisted code authoring tool powered by large language models (LLMs) that provides inline suggestions to 10's of thousands of developers at Meta. In this paper, we present how we scaled the product from displaying single-line suggestions to multi-line suggestions. This evolution required us to overcome several unique challenges in improving the usability of these suggestions f… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  11. arXiv:2401.13133  [pdf, other

    cs.CL cs.SI

    Analyzing COVID-19 Vaccination Sentiments in Nigerian Cyberspace: Insights from a Manually Annotated Twitter Dataset

    Authors: Ibrahim Said Ahmad, Lukman Jibril Aliyu, Abubakar Auwal Khalid, Saminu Muhammad Aliyu, Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Bala Mairiga Abduljalil, Bello Shehu Bello, Amina Imam Abubakar

    Abstract: Numerous successes have been achieved in combating the COVID-19 pandemic, initially using various precautionary measures like lockdowns, social distancing, and the use of face masks. More recently, various vaccinations have been developed to aid in the prevention or reduction of the severity of the COVID-19 infection. Despite the effectiveness of the precautionary measures and the vaccines, there… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

  12. arXiv:2312.16104  [pdf, other

    cs.CL cs.AI

    Dotless Representation of Arabic Text: Analysis and Modeling

    Authors: Maged S. Al-Shaibani, Irfan Ahmad

    Abstract: This paper presents a novel dotless representation of Arabic text as an alternative to the standard Arabic text representation. We delve into its implications through comprehensive analysis across five diverse corpora and four different tokenization techniques. We explore the impact of dotless representation on the relationships between tokenization granularity and vocabulary size and compare them… ▽ More

    Submitted 26 December, 2023; originally announced December 2023.

  13. arXiv:2312.10321  [pdf, other

    cs.DB cs.CL

    LLM-SQL-Solver: Can LLMs Determine SQL Equivalence?

    Authors: Fuheng Zhao, Lawrence Lim, Ishtiyaque Ahmad, Divyakant Agrawal, Amr El Abbadi

    Abstract: Judging the equivalence between two SQL queries is a fundamental problem with many practical applications in data management and SQL generation (i.e., evaluating the quality of generated SQL queries in text-to-SQL task). While the research community has reasoned about SQL equivalence for decades, it poses considerable difficulties and no complete solutions exist. Recently, Large Language Models (L… ▽ More

    Submitted 19 June, 2024; v1 submitted 16 December, 2023; originally announced December 2023.

  14. arXiv:2311.12179  [pdf, other

    cs.CL

    Leveraging Closed-Access Multilingual Embedding for Automatic Sentence Alignment in Low Resource Languages

    Authors: Idris Abdulmumin, Auwal Abubakar Khalid, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Lukman Jibril Aliyu, Babangida Sani, Bala Mairiga Abduljalil, Sani Ahmad Hassan

    Abstract: The importance of qualitative parallel data in machine translation has long been determined but it has always been very difficult to obtain such in sufficient quantity for the majority of world languages, mainly because of the associated cost and also the lack of accessibility to these languages. Despite the potential for obtaining parallel datasets from online articles using automatic approaches,… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: To appear in the proceedings of ICCAIT 2023. 6 pages, 2 figures

  15. arXiv:2310.09016  [pdf

    cs.CV cs.AI

    A Spatial-Temporal Dual-Mode Mixed Flow Network for Panoramic Video Salient Object Detection

    Authors: Xiaolei Chen, Pengcheng Zhang, Zelong Du, Ishfaq Ahmad

    Abstract: Salient object detection (SOD) in panoramic video is still in the initial exploration stage. The indirect application of 2D video SOD method to the detection of salient objects in panoramic video has many unmet challenges, such as low detection accuracy, high model complexity, and poor generalization performance. To overcome these hurdles, we design an Inter-Layer Attention (ILA) module, an Inter-… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  16. A three in one bottom-up framework for simultaneous semantic segmentation, instance segmentation and classification of multi-organ nuclei in digital cancer histology

    Authors: Ibtihaj Ahmad, Syed Muhammad Israr, Zain Ul Islam

    Abstract: Simultaneous segmentation and classification of nuclei in digital histology play an essential role in computer-assisted cancer diagnosis; however, it remains challenging. The highest achieved binary and multi-class Panoptic Quality (PQ) remains as low as 0.68 bPQ and 0.49 mPQ, respectively. It is due to the higher staining variability, variability across the tissue, rough clinical conditions, over… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  17. arXiv:2306.07888  [pdf, other

    cs.PF cs.SE eess.SY

    CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems

    Authors: Md Shahriar Iqbal, Ziyuan Zhong, Iftakhar Ahmad, Baishakhi Ray, Pooyan Jamshidi

    Abstract: Modern computer systems are highly configurable, with hundreds of configuration options that interact, resulting in an enormous configuration space. As a result, optimizing performance goals (e.g., latency) in such systems is challenging due to frequent uncertainties in their environments (e.g., workload fluctuations). Recently, transfer learning has been applied to address this problem by reusing… ▽ More

    Submitted 3 October, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  18. arXiv:2305.17690  [pdf, other

    cs.CL

    HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language

    Authors: Shantipriya Parida, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Aneesh Bose, Guneet Singh Kohli, Ibrahim Said Ahmad, Ketan Kotwal, Sayan Deb Sarkar, Ondřej Bojar, Habeebah Adamu Kakudi

    Abstract: This paper presents HaVQA, the first multimodal dataset for visual question-answering (VQA) tasks in the Hausa language. The dataset was created by manually translating 6,022 English question-answer pairs, which are associated with 1,555 unique images from the Visual Genome dataset. As a result, the dataset provides 12,044 gold standard English-Hausa parallel sentences that were translated in a fa… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 as a long paper (Findings)

  19. arXiv:2305.12050  [pdf, other

    cs.SE cs.AI

    AI-assisted Code Authoring at Scale: Fine-tuning, deploying, and mixed methods evaluation

    Authors: Vijayaraghavan Murali, Chandra Maddila, Imad Ahmad, Michael Bolin, Daniel Cheng, Negar Ghorbani, Renuka Fernandez, Nachiappan Nagappan, Peter C. Rigby

    Abstract: Generative LLMs have been shown to effectively power AI-based code authoring tools that can suggest entire statements or blocks of code during code authoring. In this paper we present CodeCompose, an AI-assisted code authoring tool developed and deployed at Meta internally. CodeCompose is based on the InCoder LLM that merges generative capabilities with bi-directionality. We have scaled up CodeCom… ▽ More

    Submitted 16 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  20. arXiv:2305.06897  [pdf, other

    cs.CL cs.AI cs.IR

    AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages

    Authors: Odunayo Ogundepo, Tajuddeen R. Gwadabe, Clara E. Rivera, Jonathan H. Clark, Sebastian Ruder, David Ifeoluwa Adelani, Bonaventure F. P. Dossou, Abdou Aziz DIOP, Claytone Sikasote, Gilles Hacheme, Happy Buzaaba, Ignatius Ezeani, Rooweither Mabuya, Salomey Osei, Chris Emezue, Albert Njoroge Kahira, Shamsuddeen H. Muhammad, Akintunde Oladipo, Abraham Toluwase Owodunni, Atnafu Lambebo Tonja, Iyanuoluwa Shode, Akari Asai, Tunde Oluwaseyi Ajayi, Clemencia Siro, Steven Arthur , et al. (27 additional authors not shown)

    Abstract: African languages have far less in-language content available digitally, making it challenging for question answering systems to satisfy the information needs of users. Cross-lingual open-retrieval question answering (XOR QA) systems -- those that retrieve answer content from other languages while serving people in their native language -- offer a means of filling this gap. To this end, we create… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  21. arXiv:2305.00076  [pdf, other

    cs.CL

    HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-Information for Multi-Level Sexism Classification

    Authors: Saminu Mohammad Aliyu, Idris Abdulmumin, Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Saheed Abdullahi Salahudeen, Aliyu Yusuf, Falalu Ibrahim Lawan

    Abstract: We present the findings of our participation in the SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS) task, a shared task on offensive language (sexism) detection on English Gab and Reddit dataset. We investigated the effects of transferring two language models: XLM-T (sentiment classification) and HateBERT (same domain -- Reddit) for multi-level classification into Sexist or not… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures

  22. arXiv:2304.06845  [pdf, other

    cs.CL

    SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)

    Authors: Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Seid Muhie Yimam, David Ifeoluwa Adelani, Ibrahim Sa'id Ahmad, Nedjma Ousidhoum, Abinew Ayele, Saif M. Mohammad, Meriem Beloucif, Sebastian Ruder

    Abstract: We present the first Africentric SemEval Shared task, Sentiment Analysis for African Languages (AfriSenti-SemEval) - The dataset is available at https://github.com/afrisenti-semeval/afrisent-semeval-2023. AfriSenti-SemEval is a sentiment classification challenge in 14 African languages: Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese, Nigerian Pidgin, Oro… ▽ More

    Submitted 1 May, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 19 pages, 5 figures, 6 tables

  23. arXiv:2303.14720  [pdf, other

    eess.SP cs.LG eess.SY

    Driver Profiling and Bayesian Workload Estimation Using Naturalistic Peripheral Detection Study Data

    Authors: Nermin Caber, Bashar I. Ahmad, Jiaming Liang, Simon Godsill, Alexandra Bremers, Philip Thomas, David Oxtoby, Lee Skrypchuk

    Abstract: Monitoring drivers' mental workload facilitates initiating and maintaining safe interactions with in-vehicle information systems, and thus delivers adaptive human machine interaction with reduced impact on the primary task of driving. In this paper, we tackle the problem of workload estimation from driving performance data. First, we present a novel on-road study for collecting subjective workload… ▽ More

    Submitted 8 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

    Comments: Accepted for IEEE Transactions on Intelligent Vehicles

  24. arXiv:2303.05465  [pdf, other

    cs.NI eess.SY

    3D UAV Trajectory Design for Fair and Energy-Efficient Communication: A Deep Reinforcement Learning Technique

    Authors: Shahid Rasool, Irfan Ullah, Abid Ali, Ishtiaq Ahmad

    Abstract: In different situations, like disaster communication and network connectivity for rural locations, unmanned aerial vehicles (UAVs) could indeed be utilized as airborne base stations to improve both the functionality and coverage of communication networks. Ground users can employ mobile UAVs to establish communication channels and deliver packages. UAVs, on the other hand, have restricted transmiss… ▽ More

    Submitted 27 January, 2023; originally announced March 2023.

  25. arXiv:2302.08956  [pdf, other

    cs.CL

    AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

    Authors: Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, Nedjma Ousidhoum, David Ifeoluwa Adelani, Seid Muhie Yimam, Ibrahim Sa'id Ahmad, Meriem Beloucif, Saif M. Mohammad, Sebastian Ruder, Oumaima Hourrane, Pavel Brazdil, Felermino Dário Mário António Ali, Davis David, Salomey Osei, Bello Shehu Bello, Falalu Ibrahim, Tajuddeen Gwadabe, Samuel Rutunda, Tadesse Belay, Wendimu Baye Messelle, Hailu Beshada Balcha, Sisay Adugna Chala, Hagos Tesfahun Gebremichael, Bernard Opoku , et al. (1 additional authors not shown)

    Abstract: Africa is home to over 2,000 languages from more than six language families and has the highest linguistic diversity among all continents. These include 75 languages with at least one million speakers each. Yet, there is little NLP research conducted on African languages. Crucial to enabling such research is the availability of high-quality annotated datasets. In this paper, we introduce AfriSenti… ▽ More

    Submitted 4 November, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 14 pages, 3 Figures, 10 Tables

  26. arXiv:2301.10177  [pdf, other

    eess.SP cs.IT eess.SY

    Co-channel Interference Management for the Next-Generation Heterogeneous Networks using Deep Leaning

    Authors: Ishtiaq Ahmad, Aftab Hussain

    Abstract: The connectivity of public-safety mobile users (MU) in the co-existence of a public-safety network (PSN), unmanned aerial vehicles (UAVs), and LTE-based railway networks (LRN) needs a thorough investigation. UAVs are deployed as mobile base stations (BSs) for cell-edge coverage enhancement for MU. The co-existence of heterogeneous networks gives rise to the issue of co-channel interference due to… ▽ More

    Submitted 6 January, 2023; originally announced January 2023.

  27. arXiv:2211.15262  [pdf, other

    cs.CL

    HERDPhobia: A Dataset for Hate Speech against Fulani in Nigeria

    Authors: Saminu Mohammad Aliyu, Gregory Maksha Wajiga, Muhammad Murtala, Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Ibrahim Said Ahmad

    Abstract: Social media platforms allow users to freely share their opinions about issues or anything they feel like. However, they also make it easier to spread hate and abusive content. The Fulani ethnic group has been the victim of this unfortunate phenomenon. This paper introduces the HERDPhobia - the first annotated hate speech dataset on Fulani herders in Nigeria - in three languages: English, Nigerian… ▽ More

    Submitted 28 November, 2022; originally announced November 2022.

    Comments: To appear in the Proceedings of the Sixth Workshop on Widening Natural Language Processing at EMNLP2022

  28. arXiv:2210.10903  [pdf, other

    cs.AI

    Machine and Deep Learning Methods with Manual and Automatic Labelling for News Classification in Bangla Language

    Authors: Istiak Ahmad, Fahad AlQurashi, Rashid Mehmood

    Abstract: Research in Natural Language Processing (NLP) has increasingly become important due to applications such as text classification, text mining, sentiment analysis, POS tagging, named entity recognition, textual entailment, and many others. This paper introduces several machine and deep learning methods with manual and automatic labelling for news classification in the Bangla language. We implemented… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 29 pages, 30 figures

  29. arXiv:2210.09389  [pdf, other

    cs.CL cs.AI

    Potrika: Raw and Balanced Newspaper Datasets in the Bangla Language with Eight Topics and Five Attributes

    Authors: Istiak Ahmad, Fahad AlQurashi, Rashid Mehmood

    Abstract: Knowledge is central to human and scientific developments. Natural Language Processing (NLP) allows automated analysis and creation of knowledge. Data is a crucial NLP and machine learning ingredient. The scarcity of open datasets is a well-known problem in machine and deep learning research. This is very much the case for textual NLP datasets in English and other major world languages. For the Ba… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: 10 pages, 5 figures

  30. arXiv:2209.14292   

    cs.AI cs.HC

    Proceedings of the AI-HRI Symposium at AAAI-FSS 2022

    Authors: Zhao Han, Emmanuel Senft, Muneeb I. Ahmad, Shelly Bagchi, Amir Yazdani, Jason R. Wilson, Boyoung Kim, Ruchen Wen, Justin W. Hart, Daniel Hernández García, Matteo Leonetti, Ross Mead, Reuth Mirsky, Ahalya Prabhakar, Megan L. Zimmerman

    Abstract: The Artificial Intelligence (AI) for Human-Robot Interaction (HRI) Symposium has been a successful venue of discussion and collaboration on AI theory and methods aimed at HRI since 2014. This year, after a review of the achievements of the AI-HRI community over the last decade in 2021, we are focusing on a visionary theme: exploring the future of AI-HRI. Accordingly, we added a Blue Sky Ideas trac… ▽ More

    Submitted 28 November, 2022; v1 submitted 28 September, 2022; originally announced September 2022.

  31. arXiv:2208.04351  [pdf, other

    cs.SE cs.LG

    Learning to Learn to Predict Performance Regressions in Production at Meta

    Authors: Moritz Beller, Hongyu Li, Vivek Nair, Vijayaraghavan Murali, Imad Ahmad, Jürgen Cito, Drew Carlson, Ari Aye, Wes Dyer

    Abstract: Catching and attributing code change-induced performance regressions in production is hard; predicting them beforehand, even harder. A primer on automatically learning to predict performance regressions in software, this article gives an account of the experiences we gained when researching and deploying an ML-based regression prediction pipeline at Meta. In this paper, we report on a comparative… ▽ More

    Submitted 22 May, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

  32. arXiv:2205.01133  [pdf, other

    cs.CL cs.CV cs.LG

    Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation

    Authors: Idris Abdulmumin, Satya Ranjan Dash, Musa Abdullahi Dawud, Shantipriya Parida, Shamsuddeen Hassan Muhammad, Ibrahim Sa'id Ahmad, Subhadarshi Panda, Ondřej Bojar, Bashir Shehu Galadanci, Bello Shehu Bello

    Abstract: Multi-modal Machine Translation (MMT) enables the use of visual information to enhance the quality of translations. The visual information can serve as a valuable piece of context information to decrease the ambiguity of input sentences. Despite the increasing popularity of such a technique, good and sizeable datasets are scarce, limiting the full extent of their potential. Hausa, a Chadic languag… ▽ More

    Submitted 6 May, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: Accepted at Language Resources and Evaluation Conference 2022 (LREC2022)

  33. arXiv:2204.03944  [pdf

    cs.LG cs.NI

    Channel model for end-to-end learning of communications systems: A survey

    Authors: Ijaz Ahmad, Seokjoo Shin

    Abstract: The traditional communication model based on chain of multiple independent processing blocks is constraint to efficiency and introduces artificial barriers. Thus, each individually optimized block does not guarantee end-to-end performance of the system. Recently, end-to-end learning of communications systems through machine learning (ML) have been proposed to optimize the system metrics jointly ov… ▽ More

    Submitted 8 April, 2022; originally announced April 2022.

    Comments: in the proceedings of Korean Institute of Smart Media (KISM) Spring conference 2020

  34. arXiv:2204.03155  [pdf

    cs.CV

    Just-Noticeable-Difference Based Edge Map Quality Measure

    Authors: Ijaz Ahmad, Seokjoo Shin

    Abstract: The performance of an edge detector can be improved when assisted with an effective edge map quality measure. Several evaluation methods have been proposed resulting in different performance score for the same candidate edge map. However, an effective measure is the one that can be automated and which correlates with human judgement perceived quality of the edge map. Distance-based edge map measur… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: In proceedings of the 4th International Conference on Next Generation Computing (ICNGC) 2018

  35. arXiv:2203.16780  [pdf

    cs.CR cs.AI cs.MM

    A Pixel-based Encryption Method for Privacy-Preserving Deep Learning Models

    Authors: Ijaz Ahmad, Seokjoo Shin

    Abstract: In the recent years, pixel-based perceptual algorithms have been successfully applied for privacy-preserving deep learning (DL) based applications. However, their security has been broken in subsequent works by demonstrating a chosen-plaintext attack. In this paper, we propose an efficient pixel-based perceptual encryption method. The method provides a necessary level of security while preserving… ▽ More

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: in the proceedings of the Korean Institute of Communications and Information Sciences (KICS) Winter Conference, Pyeongchang, Korea, Feb 2022

  36. arXiv:2203.12091  [pdf, other

    cs.NE cs.AI cs.AR cs.CV

    FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation

    Authors: Ahmad Shawahna, Sadiq M. Sait, Aiman El-Maleh, Irfan Ahmad

    Abstract: Deep neural networks (DNNs) have demonstrated their effectiveness in a wide range of computer vision tasks, with the state-of-the-art results obtained through complex and deep structures that require intensive computation and memory. Now-a-days, efficient model inference is crucial for consumer applications on resource-constrained platforms. As a result, there is much interest in the research and… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 30 pages, 12 figures, 5 tables. in IEEE Access, 2022

    Report number: Electronic ISSN: 2169-3536 MSC Class: 68T01 ACM Class: I.2.0; I.4.0; I.5.0; C.1.0

  37. arXiv:2202.09918  [pdf, other

    cs.CV cs.LG

    SRL-SOA: Self-Representation Learning with Sparse 1D-Operational Autoencoder for Hyperspectral Image Band Selection

    Authors: Mete Ahishali, Serkan Kiranyaz, Iftikhar Ahmad, Moncef Gabbouj

    Abstract: The band selection in the hyperspectral image (HSI) data processing is an important task considering its effect on the computational complexity and accuracy. In this work, we propose a novel framework for the band selection problem: Self-Representation Learning (SRL) with Sparse 1D-Operational Autoencoder (SOA). The proposed SLR-SOA approach introduces a novel autoencoder model, SOA, that is desig… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

  38. arXiv:2201.08277  [pdf, other

    cs.CL cs.AI

    NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis

    Authors: Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin, Bello Shehu Bello, Monojit Choudhury, Chris Chinenye Emezue, Saheed Salahudeen Abdullahi, Anuoluwapo Aremu, Alipio Jeorge, Pavel Brazdil

    Abstract: Sentiment analysis is one of the most widely studied applications in NLP, but most work focuses on languages with large amounts of data. We introduce the first large-scale human-annotated Twitter sentiment dataset for the four most widely spoken languages in Nigeria (Hausa, Igbo, Nigerian-Pidgin, and Yorùbá ) consisting of around 30,000 annotated tweets per language (and 14,000 for Nigerian-Pidgin… ▽ More

    Submitted 18 June, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: Submitted to LREC 2022, 13 pages, 2 figures

  39. arXiv:2106.07540  [pdf, other

    cs.CL cs.LG

    Evaluating Various Tokenizers for Arabic Text Classification

    Authors: Zaid Alyafeai, Maged S. Al-shaibani, Mustafa Ghaleb, Irfan Ahmad

    Abstract: The first step in any NLP pipeline is to split the text into individual tokens. The most obvious and straightforward approach is to use words as tokens. However, given a large text corpus, representing all the words is not efficient in terms of vocabulary size. In the literature, many tokenization algorithms have emerged to tackle this problem by creating subwords which in turn limits the vocabula… ▽ More

    Submitted 28 September, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

  40. arXiv:2010.13830   

    cs.RO cs.AI cs.HC

    Proceedings of the AI-HRI Symposium at AAAI-FSS 2020

    Authors: Shelly Bagchi, Jason R. Wilson, Muneeb I. Ahmad, Christian Dondrup, Zhao Han, Justin W. Hart, Matteo Leonetti, Katrin Lohan, Ross Mead, Emmanuel Senft, Jivko Sinapov, Megan L. Zimmerman

    Abstract: The Artificial Intelligence (AI) for Human-Robot Interaction (HRI) Symposium has been a successful venue of discussion and collaboration since 2014. In that time, the related topic of trust in robotics has been rapidly growing, with major research efforts at universities and laboratories across the world. Indeed, many of the past participants in AI-HRI have been or are now involved with research i… ▽ More

    Submitted 14 December, 2020; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: Symposium proceedings

  41. Knowledge Distillation in Deep Learning and its Applications

    Authors: Abdolmaged Alkhulaifi, Fahad Alsahli, Irfan Ahmad

    Abstract: Deep learning based models are relatively large, and it is hard to deploy such models on resource-limited devices such as mobile phones and embedded devices. One possible solution is knowledge distillation whereby a smaller model (student model) is trained by utilizing the information from a larger model (teacher model). In this paper, we present a survey of knowledge distillation techniques appli… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 9 pages

    Journal ref: PeerJ Comput.Sci. 7:e474 (2021)

  42. arXiv:2007.08871  [pdf, other

    cs.NI cs.CR

    Overview of Security of Virtual Mobile Networks

    Authors: Ijaz Ahmad, Ilkka Harjula, Jarno Pinola

    Abstract: 5G is enabling different services over the same physical infrastructure through the concepts and technologies of virtualization, softwarization, slicing and cloud computing. Virtual Mobile Networks (VMNs), using these concepts, provide an opportunity to share the same physical infrastructure among multiple operators. Each VMN Operator (VMNO) can have own distinct operating and support systems. How… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 6 pages, 1 figure, 3 tables

  43. arXiv:2007.05296  [pdf, other

    cs.NI cs.CR

    Improving Software Defined Cognitive and Secure Networking

    Authors: Ijaz Ahmad

    Abstract: Traditional communication networks consist of large sets of vendor-specific manually configurable devices which are hardwired with specific control logic or algorithms. The resulting networks comprise distributed control plane architectures that are complex in nature, difficult to integrate and operate, and are least efficient in terms of resource usage. However, the rapid increase in data traffic… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: 85 pages, 12 figures, PhD thesis

  44. arXiv:2007.04705  [pdf, other

    cs.NI cs.AI cs.LG

    Challenges of AI in Wireless Networks for IoT

    Authors: Ijaz Ahmad, Shahriar Shahabuddin, Tanesh Kumar, Erkki Harjula, Marcus Meisel, Markku Juntti, Thilo Sauter, Mika Ylianttila

    Abstract: The Internet of Things (IoT), hailed as the enabler of the next industrial revolution, will require ubiquitous connectivity, context-aware and dynamic service mobility, and extreme security through the wireless network infrastructure. Artificial Intelligence (AI), thus, will play a major role in the underlying network infrastructure. However, a number of challenges will surface while using the con… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

  45. arXiv:2007.04239  [pdf, other

    cs.CL cs.LG stat.ML

    A Survey on Transfer Learning in Natural Language Processing

    Authors: Zaid Alyafeai, Maged Saeed AlShaibani, Irfan Ahmad

    Abstract: Deep learning models usually require a huge amount of data. However, these large datasets are not always attainable. This is common in many challenging NLP tasks. Consider Neural Machine Translation, for instance, where curating such large datasets may not be possible specially for low resource languages. Another limitation of deep learning models is the demand for huge computing resources. These… ▽ More

    Submitted 31 May, 2020; originally announced July 2020.

  46. arXiv:2004.13875  [pdf, other

    cs.IT eess.SP

    6G White Paper on Machine Learning in Wireless Communication Networks

    Authors: Samad Ali, Walid Saad, Nandana Rajatheva, Kapseok Chang, Daniel Steinbach, Benjamin Sliwa, Christian Wietfeld, Kai Mei, Hamid Shiri, Hans-Jürgen Zepernick, Thi My Chinh Chu, Ijaz Ahmad, Jyrki Huusko, Jaakko Suutala, Shubhangi Bhadauria, Vimal Bhatia, Rangeet Mitra, Saidhiraj Amuru, Robert Abbas, Baohua Shao, Michele Capobianco, Guanghui Yu, Maelick Claes, Teemu Karvonen, Mingzhe Chen , et al. (2 additional authors not shown)

    Abstract: The focus of this white paper is on machine learning (ML) in wireless communications. 6G wireless communication networks will be the backbone of the digital transformation of societies by providing ubiquitous, reliable, and near-instant wireless connectivity for humans and machines. Recent advances in ML research has led enable a wide range of novel technologies such as self-driving vehicles and v… ▽ More

    Submitted 28 April, 2020; originally announced April 2020.

  47. arXiv:2004.00689  [pdf, other

    cs.CY cs.AI cs.HC cs.RO

    Robots in the Danger Zone: Exploring Public Perception through Engagement

    Authors: David A. Robb, Muneeb I. Ahmad, Carlo Tiseo, Simona Aracri, Alistair C. McConnell, Vincent Page, Christian Dondrup, Francisco J. Chiyah Garcia, Hai-Nguyen Nguyen, Èric Pairet, Paola Ardón Ramírez, Tushar Semwal, Hazel M. Taylor, Lindsay J. Wilson, David Lane, Helen Hastie, Katrin Lohan

    Abstract: Public perceptions of Robotics and Artificial Intelligence (RAI) are important in the acceptance, uptake, government regulation and research funding of this technology. Recent research has shown that the public's understanding of RAI can be negative or inaccurate. We believe effective public engagement can help ensure that public opinion is better informed. In this paper, we describe our first ite… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: Accepted in HRI 2020, Keywords: Human robot interaction, robotics, artificial intelligence, public engagement, public perceptions of robots, robotics and society

    ACM Class: K.4.m; I.2.9

    Journal ref: In Human-Robot Interaction HRI 2020, ACM, NY, USA, 10 pages

  48. arXiv:1909.05160  [pdf, other

    cs.HC cs.AI cs.RO

    Trust and Cognitive Load During Human-Robot Interaction

    Authors: Muneeb Imtiaz Ahmad, Jasmin Bernotat, Katrin Lohan, Friederike Eyssel

    Abstract: This paper presents an exploratory study to understand the relationship between a humans' cognitive load, trust, and anthropomorphism during human-robot interaction. To understand the relationship, we created a \say{Matching the Pair} game that participants could play collaboratively with one of two robot types, Husky or Pepper. The goal was to understand if humans would trust the robot as a teamm… ▽ More

    Submitted 11 September, 2019; originally announced September 2019.

    Comments: 10 Pages, 5 figures, AAAI Symposium on Artificial Intelligence for Human-Robot Interaction, 7th-9th November, 2019

    Report number: AI-HRI/2019/06

  49. arXiv:1906.10557  [pdf, other

    cs.HC

    Multi-Modal Measurements of Mental Load

    Authors: Ingo Keller, Muneeb Imtiaz Ahmad, Katrin Lohan

    Abstract: This position paper describes an experiment conducted to understand the relationships between different physiological measures including pupil Diameter, Blinking Rate, Heart Rate, and Heart Rate Variability in order to develop an estimation of users' mental load in real-time (see Sidebar 1). Our experiment involved performing a task to spot a correct or an incorrect word or sentence with different… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: CHI Conference-W12, April 2019, Glasgow, United Kingdom

  50. Orchestrating Service Migration for Low Power MEC-Enabled IoT Devices

    Authors: Jude Okwuibe, Juuso Haavisto, Erkki Harjula, Ijaz Ahmad, Mika Ylianttila

    Abstract: Multi-Access Edge Computing (MEC) is a key enabling technology for Fifth Generation (5G) mobile networks. MEC facilitates distributed cloud computing capabilities and information technology service environment for applications and services at the edges of mobile networks. This architectural modification serves to reduce congestion, latency, and improve the performance of such edge colocated applic… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.