Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–29 of 29 results for author: Faisal, F

.
  1. arXiv:2404.08092  [pdf, ps, other

    cs.CL cs.AI

    Data-Augmentation-Based Dialectal Adaptation for LLMs

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: This report presents GMUNLP's participation to the Dialect-Copa shared task at VarDial 2024, which focuses on evaluating the commonsense reasoning capabilities of large language models (LLMs) on South Slavic micro-dialects. The task aims to assess how well LLMs can handle non-standard dialectal varieties, as their performance on standard languages is already well-established. We propose an approac… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  2. arXiv:2403.20088  [pdf, other

    cs.CL

    An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: The capacity and effectiveness of pre-trained multilingual models (MLMs) for zero-shot cross-lingual transfer is well established. However, phenomena of positive or negative transfer, and the effect of language choice still need to be fully understood, especially in the complex setting of massively multilingual LMs. We propose an \textit{efficient} method to study transfer language influence in ze… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

  3. arXiv:2403.11009  [pdf, other

    cs.CL cs.AI

    DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

    Authors: Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

    Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever l… ▽ More

    Submitted 7 July, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Fahim Faisal, Orevaoghene Ahia

  4. arXiv:2310.08078  [pdf, other

    cs.CL cs.LG

    To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer

    Authors: Md Mushfiqur Rahman, Fardin Ahsan Sakib, Fahim Faisal, Antonios Anastasopoulos

    Abstract: Choosing an appropriate tokenization scheme is often a bottleneck in low-resource cross-lingual transfer. To understand the downstream implications of text representation choices, we perform a comparative analysis on language models having diverse text representation modalities including 2 segmentation-based models (\texttt{BERT}, \texttt{mBERT}), 1 image-based model (\texttt{PIXEL}), and 1 charac… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at 3RD MULTILINGUAL REPRESENTATION LEARNING (MRL) WORKSHOP, 2023

  5. arXiv:2309.00949  [pdf, ps, other

    cs.CL

    Multilingual Text Representation

    Authors: Fahim Faisal

    Abstract: Modern NLP breakthrough includes large multilingual models capable of performing tasks across more than 100 languages. State-of-the-art language models came a long way, starting from the simple one-hot representation of words capable of performing tasks like natural language understanding, common-sense reasoning, or question-answering, thus capturing both the syntax and semantics of texts. At the… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: PhD Comprehensive exam report

  6. arXiv:2308.01932  [pdf, other

    cond-mat.supr-con cs.LG

    Investigation on Machine Learning Based Approaches for Estimating the Critical Temperature of Superconductors

    Authors: Fatin Abrar Shams, Rashed Hasan Ratul, Ahnaf Islam Naf, Syed Shaek Hossain Samir, Mirza Muntasir Nishat, Fahim Faisal, Md. Ashraful Hoque

    Abstract: Superconductors have been among the most fascinating substances, as the fundamental concept of superconductivity as well as the correlation of critical temperature and superconductive materials have been the focus of extensive investigation since their discovery. However, superconductors at normal temperatures have yet to be identified. Additionally, there are still many unknown factors and gaps o… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: Accepted for publication on IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, ROBOTICS,SIGNAL, AND IMAGE PROCESSING (AIRoSIP 2023)

  7. arXiv:2305.14716  [pdf, other

    cs.CL

    GlobalBench: A Benchmark for Global Progress in Natural Language Processing

    Authors: Yueqi Song, Catherine Cui, Simran Khanuja, Pengfei Liu, Fahim Faisal, Alissa Ostapenko, Genta Indra Winata, Alham Fikri Aji, Samuel Cahyawijaya, Yulia Tsvetkov, Antonios Anastasopoulos, Graham Neubig

    Abstract: Despite the major advances in NLP, significant disparities in NLP system performance across languages still exist. Arguably, these are due to uneven resource allocation and sub-optimal incentives to work on less resourced languages. To track and further incentivize the global development of equitable language technology, we introduce GlobalBench. Prior multilingual benchmarks are static and have f… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Preprint, 9 pages

  8. arXiv:2304.12979  [pdf, other

    cs.CL cs.LG

    GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters

    Authors: Md Mahfuz Ibn Alam, Ruoyu Xie, Fahim Faisal, Antonios Anastasopoulos

    Abstract: This report describes GMU's sentiment analysis system for the SemEval-2023 shared task AfriSenti-SemEval. We participated in all three sub-tasks: Monolingual, Multilingual, and Zero-Shot. Our approach uses models initialized with AfroXLMR-large, a pre-trained multilingual language model trained on African languages and fine-tuned correspondingly. We also introduce augmented training data along wit… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at SemEval Workshop at ACL 2023

  9. arXiv:2212.10408  [pdf, other

    cs.CL

    Geographic and Geopolitical Biases of Language Models

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: Pretrained language models (PLMs) often fail to fairly represent target users from certain world regions because of the under-representation of those regions in training datasets. With recent PLMs trained on enormous data sources, quantifying their potential biases is difficult, due to their black-box nature and the sheer scale of the data sources. In this work, we devise an approach to study the… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

  10. arXiv:2205.09634  [pdf, other

    cs.CL

    Phylogeny-Inspired Adaptation of Multilingual Models to New Languages

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: Large pretrained multilingual models, trained on dozens of languages, have delivered promising results due to cross-lingual learning capabilities on variety of language tasks. Further adapting these models to specific languages, especially ones unseen during pre-training, is an important goal towards expanding the coverage of language technologies. In this study, we show how we can use language ph… ▽ More

    Submitted 22 November, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: accepted in AACL 2022 Main Conference

  11. Survival Prediction of Children Undergoing Hematopoietic Stem Cell Transplantation Using Different Machine Learning Classifiers by Performing Chi-squared Test and Hyper-parameter Optimization: A Retrospective Analysis

    Authors: Ishrak Jahan Ratul, Ummay Habiba Wani, Mirza Muntasir Nishat, Abdullah Al-Monsur, Abrar Mohammad Ar-Rafi, Fahim Faisal, Mohammad Ridwan Kabir

    Abstract: Bone Marrow Transplant, a gradational rescue for a wide range of disorders emanating from the bone marrow, is an efficacious surgical treatment. Several risk factors, such as post-transplant illnesses, new malignancies, and even organ damage, can impair long-term survival. Therefore, technologies like Machine Learning are deployed for investigating the survival prediction of BMT receivers along wi… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 25 pages, 14 figures, 38 tables

    Report number: 9391136 ACM Class: J.3

  12. arXiv:2112.03497  [pdf, other

    cs.CL

    Dataset Geography: Mapping Language Data to Language Users

    Authors: Fahim Faisal, Yinkai Wang, Antonios Anastasopoulos

    Abstract: As language technologies become more ubiquitous, there are increasing efforts towards expanding the language diversity and coverage of natural language processing (NLP) systems. Arguably, the most important factor influencing the quality of modern NLP systems is data availability. In this work, we study the geographical representativeness of NLP datasets, aiming to quantify if and by how much do N… ▽ More

    Submitted 23 March, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: ACL 2022

  13. arXiv:2110.08480  [pdf, other

    cs.AI

    Learning Cooperation and Online Planning Through Simulation and Graph Convolutional Network

    Authors: Rafid Ameer Mahmud, Fahim Faisal, Saaduddin Mahmud, Md. Mosaddek Khan

    Abstract: Multi-agent Markov Decision Process (MMDP) has been an effective way of modelling sequential decision making algorithms for multi-agent cooperative environments. A number of algorithms based on centralized and decentralized planning have been developed in this domain. However, dynamically changing environment, coupled with exponential size of the state and joint action space, make it difficult for… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

  14. arXiv:2109.12072  [pdf

    cs.CL

    SD-QA: Spoken Dialectal Question Answering for the Real World

    Authors: Fahim Faisal, Sharlina Keshava, Md Mahfuz ibn Alam, Antonios Anastasopoulos

    Abstract: Question answering (QA) systems are now available through numerous commercial applications for a wide variety of domains, serving millions of users that interact with them via speech interfaces. However, current benchmarks in QA research do not account for the errors that speech recognition models might introduce, nor do they consider the language variations (dialects) of the users. To address thi… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 Findings

  15. arXiv:2109.12028  [pdf

    cs.CL

    Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering

    Authors: Fahim Faisal, Antonios Anastasopoulos

    Abstract: Human knowledge is collectively encoded in the roughly 6500 languages spoken around the world, but it is not distributed equally across languages. Hence, for information-seeking question answering (QA) systems to adequately serve speakers of all languages, they need to operate cross-lingually. In this work we investigate the capabilities of multilingually pre-trained language models on cross-lingu… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Comments: Accepted at MRQA Workshop 2021

  16. arXiv:2109.09297  [pdf

    eess.SY

    Design, Simulation and Feasibility Analysis of Bifacial Solar PV System in Marine Drive Road, Cox's Bazar

    Authors: Abdullah Al Mehadi, Mirza Muntasir Nishat, Fahim Faisal, Ahmed Raza Hasan Bhuiyan, Mohyeu Hussain, Md Ashraful Hoque

    Abstract: This paper proposes a design and simulation based investigative analysis of a vertically mounted bifacial solar photovoltaic model in Marine Drive Road, Cox's Bazar. Cox's bazar is a famous tourist destination which seems to be a flexible site for implementing such energy harvesting system without affecting the nearby eco-system and solves the existing land shortage problem. Moreover, the infrastr… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted and presented in 2021 International Conference on Science & Contemporary Technologies (ICSCT). Will be published in IEEE Xplore

  17. arXiv:2106.08415  [pdf, other

    cs.SE cs.AI cs.CL cs.LG

    Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors

    Authors: Junayed Mahmud, Fahim Faisal, Raihan Islam Arnob, Antonios Anastasopoulos, Kevin Moran

    Abstract: Automated source code summarization is a popular software engineering research topic wherein machine translation models are employed to "translate" code snippets into relevant natural language descriptions. Most evaluations of such models are conducted using automatic reference-based metrics. However, given the relatively large semantic gap between programming languages and natural language, we ar… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted to the 2021 NLP4Prog Workshop co-located with The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)

  18. arXiv:1907.09395  [pdf, other

    cs.SI cs.LG

    Mining Temporal Evolution of Knowledge Graph and Genealogical Features for Literature-based Discovery Prediction

    Authors: Nazim Choudhury, Fahim Faisal, Matloob Khushi

    Abstract: Literature-based knowledge discovery process identifies the important but implicit relations among information embedded in published literature. Existing techniques from Information Retrieval and Natural Language Processing attempt to identify the hidden or unpublished connections between information concepts within published literature, however, these techniques undermine the concept of predictin… ▽ More

    Submitted 10 November, 2019; v1 submitted 22 July, 2019; originally announced July 2019.

  19. arXiv:1905.01987   

    cs.IR cs.CL

    Disease Identification From Unstructured User Input

    Authors: Fahim Faisal, Shafkat Ahmed Bhuiyan, Dr. Abu Raihan Mostofa Kamal

    Abstract: A method to identify probable diseases from the unstructured textual input (eg, health forum posts) by incorporating a lexicographic and semantic feature based two-phase text classification module and a symptom-disease correlation-based similarity measurement module. One notable aspect of my approach was to develop a competent algorithm to extract all inherent features from the data source to make… ▽ More

    Submitted 10 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: This was an undergraduate research. The hypotheses it proposes is based on a small number of samples and thus, can not be declared significant. To declare it significant, a large number of sample testing is needed. After that, it can be put through

  20. arXiv:1704.01221  [pdf, other

    q-bio.MN

    From homogeneous to heterogeneous network alignment via colored graphlets

    Authors: Shawn Gu, John Johnson, Fazle E. Faisal, Tijana Milenkovic

    Abstract: Network alignment (NA) compares networks with the goal of finding a node mapping that uncovers highly similar (conserved) network regions. Existing NA methods are homogeneous, i.e., they can deal only with networks containing nodes and edges of one type. Due to increasing amounts of heterogeneous network data with nodes or edges of different types, we extend three recent state-of-the-art homogeneo… ▽ More

    Submitted 28 March, 2018; v1 submitted 4 April, 2017; originally announced April 2017.

  21. Strong-Field S-Matrix Theory With Coulomb-Volkov Final State in All Orders

    Authors: F. H. M. Faisal

    Abstract: Despite its long standing usefulness for the analysis of various processes in intense laser fields, it is well-known that the so-called strong-field KFR or SFA ansatz does not account for the final-state Coulomb interaction. Due to its importance for the ubiquitous ionisation process, numerous heuristic attempts have been made during the last several decades to account for the final state Coulomb… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

    Journal ref: Phys. Rev. A 94, 031401 (2016)

  22. arXiv:1605.07247  [pdf, ps, other

    q-bio.MN

    Network approach integrates 3D structural and sequence data to improve protein structural comparison

    Authors: Fazle E. Faisal, Julie L. Chaney, Khalique Newaz, Jun Li, Scott J. Emrich, Patricia L. Clark, Tijana Milenkovic

    Abstract: Initial protein structural comparisons were sequence-based. Since amino acids that are distant in the sequence can be close in the 3-dimensional (3D) structure, 3D contact approaches can complement sequence approaches. Traditional 3D contact approaches study 3D structures directly. Instead, 3D structures can be modeled as protein structure networks (PSNs). Then, network approaches can compare prot… ▽ More

    Submitted 27 February, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

  23. arXiv:1307.3388  [pdf, ps, other

    cs.CE q-bio.MN

    Dynamic networks reveal key players in aging

    Authors: Fazle Elahi Faisal, Tijana Milenkovic

    Abstract: Motivation: Since susceptibility to diseases increases with age, studying aging gains importance. Analyses of gene expression or sequence data, which have been indispensable for investigating aging, have been limited to studying genes and their protein products in isolation, ignoring their connectivities. However, proteins function by interacting with other proteins, and this is exactly what biolo… ▽ More

    Submitted 12 July, 2013; originally announced July 2013.

  24. arXiv:1102.1677  [pdf, other

    cond-mat.mes-hall cond-mat.other

    Adiabatic solution of Dirac equation of "graphinos" in an intense electromagnetic field and emission of high order harmonics near the Dirac points

    Authors: F. H. M. Faisal

    Abstract: We obtain a class of adiabatic solutions of Dirac equation for the charged massless relativistic quasi-particles that arise from the low-energy excitations \cite{foot-1} in a 2D graphene sheet, interacting with an electromagnetic field. The analytic solutions obtained are useful for {\it non-perturbative} investigation of processes in intense laser fields. As a first example we employ them to pred… ▽ More

    Submitted 1 March, 2011; v1 submitted 8 February, 2011; originally announced February 2011.

    Comments: 6 pages, 3 figures, additional Fig. 3 and preliminary comparison with graphene, additional citations

  25. A 4-Component Dirac Theory of Ionization of Hydrogen Molecular Ion in a Super-Intense Laser Field

    Authors: F. H. M. Faisal

    Abstract: In this paper a 4-component Dirac theory of ionization of hydrogen molecular ion in a super-intense laser field is developed. Simple analytic expressions for the spin specific as well as the total ionization currents emitted from the ground state of the ion are derived. The results are given for all polarization and finite propagation vectors of the field. They apply for inner-shell ionization o… ▽ More

    Submitted 11 July, 2009; v1 submitted 10 July, 2009; originally announced July 2009.

    Comments: 7 pages, no figures

  26. arXiv:0810.0788  [pdf, ps, other

    physics.atom-ph physics.atm-clus

    Equivalence of the velocity and length gauge perturbation series

    Authors: F. H. M. Faisal

    Abstract: We derive a "master" perturbation expansion for the quantum transition amplitude in a light field between the field-free initial and final atomic states in the minimal-coupling (MC) "velocity" gauge. The result is used to prove that the traditional "velocity" and "length" gauge perturbation series are equivalent infinite series representations or branches of the same amplitude function, that are… ▽ More

    Submitted 4 October, 2008; originally announced October 2008.

    Comments: 4 pages, 37 equations, 8 references (with 2 foot notes)

  27. A theory of intense-field dynamic alignment and high harmonic generation from coherently rotating molecules and interpretation of intense-field ultrafast pump-probe experiments

    Authors: A. Abdurrouf, F. H. M. Faisal

    Abstract: A theory of ultra-fast pump-probe experiments proposed by us earlier [F.H.M. Faisal et al., Phys. Rev. Lett. 98, 143001 (2007) and F.H.M. Faisal and A. Abdurrouf, Phys. Rev. Lett. 100, 123005 (2008)] is developed here fully and applied to investigate the phenomena of dynamic alignment and high harmonic generation (HHG) from coherently rotating linear molecules. The theory provides essentially an… ▽ More

    Submitted 8 October, 2008; v1 submitted 26 September, 2008; originally announced September 2008.

    Comments: 31 pages, 22 figures, and 140 equations

  28. Interplay of polarization geometry and rotational dynamics in high harmonic generation from coherently rotating linear molecule

    Authors: F. H. M. Faisal, A. Abdurrouf

    Abstract: Recent reports on intense-field pump-probe experiments for high harmonic generation from coherently rotating linear molecules, have revealed remarkable characteristic effects of the simultaneous variation of the polarization geometry and the time delay on the high harmonic signals. We analyze the effects and give a unified theoretical account of the experimental observations

    Submitted 15 July, 2007; originally announced July 2007.

    Comments: 4 pages, 5 figures

  29. Microwave-induced control of Free Electron Laser radiation

    Authors: A. J. Blasco, L. Plaja, L. Roso, F. H. M. Faisal

    Abstract: The dynamical response of a relativistic bunch of electrons injected in a planar magnetic undulator and interacting with a counterpropagating electromagnetic wave is studied. We demonstrate a resonance condition for which the free electron laser (FEL) dynamics is strongly influenced by the presence of the external field. It opens up the possibility of control of short wavelength FEL emission cha… ▽ More

    Submitted 28 May, 2001; originally announced May 2001.

    Comments: 14 pages, 5 figures, accepted for publication in Phys. Rev. E