Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 105 results for author: Diab, M

.
  1. SpannerLib: Embedding Declarative Information Extraction in an Imperative Workflow

    Authors: Dean Light, Ahmad Aiashy, Mahmoud Diab, Daniel Nachmias, Stijn Vansummeren, Benny Kimelfeld

    Abstract: Document spanners have been proposed as a formal framework for declarative Information Extraction (IE) from text, following IE products from the industry and academia. Over the past decade, the framework has been studied thoroughly in terms of expressive power, complexity, and the ability to naturally combine text analysis with relational querying. This demonstration presents SpannerLib a library… ▽ More

    Submitted 3 September, 2024; originally announced September 2024.

    Comments: 4 pages

    MSC Class: H.4

  2. arXiv:2408.12493  [pdf, other

    astro-ph.IM physics.optics

    Performance estimation of photonic integrated wavefront corrector for single-mode fiber coupling

    Authors: Dhwanil Patel, Momen Diab, Ross Cheriton, Jacob Taylor, Libertad Rojas, Suresh Sivanandam

    Abstract: Many modern astronomical instruments rely on the optimal coupling of starlight into single-mode fibers (SMFs). For ground-based telescopes, this coupling is limited by atmospheric turbulence. We propose an integrated wavefront corrector based on silicon-on-insulator (SOI) photonics, which samples the aberrated wavefront via a microlens array (MLA). The MLA focuses the sampled wavefront onto an arr… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 8 pages, 6 figures, submitted to SPIE Adaptive Optics Systems IX

  3. arXiv:2408.07735  [pdf, other

    astro-ph.IM physics.optics

    Experimental demonstration of photonic phase correctors based on grating coupler arrays and thermo-optic shifters

    Authors: Momen Diab, Ross Cheriton, Jacob Taylor, Dhwanil Patel, Libertad Rojas, Mark Barnet, Polina Zavyalova, Dan-Xia Xu, Pavel Cheben, Siegfried Janz, Jens H. Schmid, Suresh Sivanandam

    Abstract: In ground-based astronomy, the ability to couple light into single-mode fibers (SMFs) is limited by atmospheric turbulence, which prohibits the use of many astrophotonic instruments. We propose a silicon-on-insulator photonic chip capable of coherently coupling the out-of-phase beamlets from the subapertures of a telescope pupil into an SMF. The photonic integrated circuit (PIC) consists of an arr… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 13 pages, 11 figures, 2 tables, submitted to SPIE Adaptive Optics Systems IX

  4. arXiv:2407.18147  [pdf, other

    cs.CL

    The FIGNEWS Shared Task on News Media Narratives

    Authors: Wajdi Zaghouani, Mustafa Jarrar, Nizar Habash, Houda Bouamor, Imed Zitouni, Mona Diab, Samhaa R. El-Beltagy, Muhammed AbuOdeh

    Abstract: We present an overview of the FIGNEWS shared task, organized as part of the ArabicNLP 2024 conference co-located with ACL 2024. The shared task addresses bias and propaganda annotation in multilingual news posts. We focus on the early days of the Israel War on Gaza as a case study. The task aims to foster collaboration in developing annotation guidelines for subjective tasks by creating frameworks… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 18 pages, 10 tables, 1 figure, accepted to ArabicNLP 2024 co-located with ACL 2024

  5. arXiv:2407.11171  [pdf, other

    physics.optics astro-ph.IM

    End-to-end simulations of photonic phase correctors for adaptive optics systems

    Authors: Dhwanil Patel, Momen Diab, Ross Cheriton, Jacob Taylor, Libertad Rojas, Martin Vachon, Dan-Xia Xu, Jens H. Schmid, Pavel Cheben, Siegfried Janz, Suresh Sivanandam

    Abstract: Optical beams and starlight distorted by atmospheric turbulence can be corrected with adaptive optics systems to enable efficient coupling into single-mode fibers. Deformable mirrors, used to flatten the wavefront in astronomical telescopes, are costly, sensitive, and complex mechanical components that require careful calibration to enable high-quality imaging in astronomy, microscopy, and vision… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 14 pages, 8 figures, 1 table, to be published in Optics Express. Corresponding author: Momen Diab (momen.diab@utoronto.ca)

  6. arXiv:2406.17660  [pdf, other

    cs.LG

    Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients

    Authors: Aashiq Muhamed, Oscar Li, David Woodruff, Mona Diab, Virginia Smith

    Abstract: Large language model (LLM) training and finetuning are often bottlenecked by limited GPU memory. While existing projection-based optimization methods address this by projecting gradients into a lower-dimensional subspace to reduce optimizer state memory, they typically rely on dense projection matrices, which can introduce computational and memory overheads. In this work, we propose Grass (GRAdien… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  7. arXiv:2405.20253  [pdf, other

    cs.CL

    Evaluating Large Language Model Biases in Persona-Steered Generation

    Authors: Andy Liu, Mona Diab, Daniel Fried

    Abstract: The task of persona-steered text generation requires large language models (LLMs) to generate text that reflects the distribution of views that an individual fitting a persona could have. People have multifaceted personas, but prior work on bias in LLM-generated opinions has only explored multiple-choice settings or one-dimensional personas. We define an incongruous persona as a persona with multi… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to Findings of ACL 2024. Code and data available at https://github.com/andyjliu/persona-steered-generation-bias

  8. arXiv:2405.06258  [pdf, other

    cs.CL

    Automatic Generation of Model and Data Cards: A Step Towards Responsible AI

    Authors: Jiarui Liu, Wenkai Li, Zhijing Jin, Mona Diab

    Abstract: In an era of model and data proliferation in machine learning/AI especially marked by the rapid advancement of open-sourced technologies, there arises a critical need for standardized consistent documentation. Our work addresses the information incompleteness in current human-generated model and data cards. We propose an automated generation approach using Large Language Models (LLMs). Our key con… ▽ More

    Submitted 18 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 (Oral)

  9. arXiv:2405.01502  [pdf, other

    cs.CL cs.AI cs.LG

    Analyzing the Role of Semantic Representations in the Era of Large Language Models

    Authors: Zhijing Jin, Yuen Chen, Fernando Gonzalez, Jiarui Liu, Jiayi Zhang, Julian Michael, Bernhard Schölkopf, Mona Diab

    Abstract: Traditionally, natural language processing (NLP) models often use a rich set of features created by linguistic expertise, such as semantic representations. However, in the era of large language models (LLMs), more and more tasks are turned into generic, end-to-end sequence generation problems. In this paper, we investigate the question: what is the role of semantic representations in the era of LL… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: NAACL 2024

  10. arXiv:2404.13531  [pdf, other

    math.NA

    Splitting Techniques for DAEs with port-Hamiltonian Applications

    Authors: Andreas Bartel, Malak Diab, Andreas Frommer, Michael Günther, Nicole Marheineke

    Abstract: In the simulation of differential-algebraic equations (DAEs), it is essential to employ numerical schemes that take into account the inherent structure and maintain explicit or hidden algebraic constraints without altering them. This paper focuses on operator-splitting techniques for coupled systems and aims at preserving the structure in the port-Hamiltonian framework. The study explores two deco… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 31 pages, 22 figures

    MSC Class: 65L05; 65L20; 65L80; 97N40

  11. Non-extensive Effects on the QCD Equation of State and Fluctuations of Conserved Charges within Polyakov Quark Meson Model

    Authors: Abdel Magied Diab

    Abstract: The influence of non-extensive Tsallis statistics on the hadron phase structure has been investigated using the Polyakov-quark-meson (PQM) model. The analysis examines the non-extensive effects on the temperature dependence of PQM order parameters, thermodynamic quantities related to the QCD equation of state, and fluctuations of conserved charges at varying chemical potentials. The results show t… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 19 pages, 10 figures with 118 references. Accepted paper to publish in Journal of Physics G: Nuclear and Particle Physics

  12. arXiv:2404.00756  [pdf, other

    cs.AI cs.LG cs.LO cs.RO

    Recover: A Neuro-Symbolic Framework for Failure Detection and Recovery

    Authors: Cristina Cornelio, Mohammed Diab

    Abstract: Recognizing failures during task execution and implementing recovery procedures is challenging in robotics. Traditional approaches rely on the availability of extensive data or a tight set of constraints, while more recent approaches leverage large language models (LLMs) to verify task steps and replan accordingly. However, these methods often operate offline, necessitating scene resets and incurr… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  13. arXiv:2402.18424  [pdf, other

    cs.CL cs.AI cs.LG

    Emotion Classification in Low and Moderate Resource Languages

    Authors: Shabnam Tafreshi, Shubham Vatsal, Mona Diab

    Abstract: It is important to be able to analyze the emotional state of people around the globe. There are 7100+ active languages spoken around the world and building emotion classification for each language is labor intensive. Particularly for low-resource and endangered languages, building emotion classification can be quite challenging. We present a cross-lingual emotion classifier, where we train an emot… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  14. arXiv:2402.13231  [pdf, other

    cs.CL cs.CY

    Investigating Cultural Alignment of Large Language Models

    Authors: Badr AlKhamissi, Muhammad ElNokrashy, Mai AlKhamissi, Mona Diab

    Abstract: The intricate relationship between language and culture has long been a subject of exploration within the realm of linguistic anthropology. Large Language Models (LLMs), promoted as repositories of collective human knowledge, raise a pivotal question: do these models genuinely encapsulate the diverse knowledge adopted by different cultures? Our study reveals that these models demonstrate greater c… ▽ More

    Submitted 6 July, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: ACL 2024 (Main)

  15. arXiv:2402.11710  [pdf, other

    cs.CL

    A Note on Bias to Complete

    Authors: Jia Xu, Mona Diab

    Abstract: Minimizing social bias strengthens societal bonds, promoting shared understanding and better decision-making. We revisit the definition of bias by discovering new bias types (e.g., societal status) in dynamic environments and describe them relative to context, such as culture, region, time, and personal background. Our framework includes eight hypotheses about bias and a minimizing bias strategy f… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  16. arXiv:2311.00615  [pdf

    astro-ph.IM physics.ins-det

    2023 Astrophotonics Roadmap: pathways to realizing multi-functional integrated astrophotonic instruments

    Authors: Nemanja Jovanovic, Pradip Gatkine, Narsireddy Anugu, Rodrigo Amezcua-Correa, Ritoban Basu Thakur, Charles Beichman, Chad Bender, Jean-Philippe Berger, Azzurra Bigioli, Joss Bland-Hawthorn, Guillaume Bourdarot, Charles M. Bradford, Ronald Broeke, Julia Bryant, Kevin Bundy, Ross Cheriton, Nick Cvetojevic, Momen Diab, Scott A. Diddams, Aline N. Dinkelaker, Jeroen Duis, Stephen Eikenberry, Simon Ellis, Akira Endo, Donald F. Figer , et al. (55 additional authors not shown)

    Abstract: Photonics offer numerous functionalities that can be used to realize astrophotonic instruments. The most spectacular example to date is the ESO Gravity instrument at the Very Large Telescope in Chile. Integrated astrophotonic devices stand to offer critical advantages for instrument development, including extreme miniaturization, as well as integration, superior thermal and mechanical stabilizatio… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 191 pages, 47 figures. This is the version of the article before peer review or editing, as submitted by an author to J. Phys. Photonics. IOP Publishing Ltd is not responsible for any errors or omissions in this version of the manuscript or any version derived from it. The Version of Record is available online at https://iopscience.iop.org/article/10.1088/2515-7647/ace869/meta

    Journal ref: J. Phys. Photonics 5 042501 (2023)

  17. arXiv:2308.16736  [pdf, other

    math.DS

    Operator splitting for semi-explicit differential-algebraic equations and port-Hamiltonian DAEs

    Authors: Andreas Bartel, Malak Diab, Andreas Frommer, Michael Günther

    Abstract: Operator splitting methods allow to split the operator describing a complex dynamical system into a sequence of simpler subsystems and treat each part independently. In the modeling of dynamical problems, systems of (possibly coupled) differential-algebraic equations (DAEs) arise. This motivates the application of operator splittings which are aware of the various structural forms of DAEs. Here, w… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  18. arXiv:2306.05836  [pdf, other

    cs.CL cs.AI cs.LG

    Can Large Language Models Infer Causation from Correlation?

    Authors: Zhijing Jin, Jiarui Liu, Zhiheng Lyu, Spencer Poff, Mrinmaya Sachan, Rada Mihalcea, Mona Diab, Bernhard Schölkopf

    Abstract: Causal inference is one of the hallmarks of human intelligence. While the field of CausalNLP has attracted much interest in the recent years, existing causal inference datasets in NLP primarily rely on discovering causality from empirical knowledge (e.g., commonsense knowledge). In this work, we propose the first benchmark dataset to test the pure causal inference skills of large language models (… ▽ More

    Submitted 17 April, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: ICLR 2024

  19. OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models

    Authors: Badr AlKhamissi, Siddharth Verma, Ping Yu, Zhijing Jin, Asli Celikyilmaz, Mona Diab

    Abstract: In this paper, we conduct a thorough investigation into the reasoning capabilities of Large Language Models (LLMs), focusing specifically on the Open Pretrained Transformers (OPT) models as a representative of such models. Our study entails finetuning three different sizes of OPT on a carefully curated reasoning corpus, resulting in two sets of finetuned models: OPT-R, finetuned without explanatio… ▽ More

    Submitted 24 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Proceedings of the 1st Workshop on Natural Language Reasoning and Structured Explanations (NLRSE) at ACL 2023

  20. arXiv:2212.14208  [pdf, other

    math.NA

    A flexible short recurrence Krylov subspace method for matrices arising in the time integration of port Hamiltonian systems and ODEs/DAEs with a dissipative Hamiltonian

    Authors: Malak Diab, Andreas Frommer, Karsten Kahl

    Abstract: For several classes of mathematical models that yield linear systems, the splitting of the matrix into its Hermitian and skew Hermitian parts is naturally related to properties of the underlying model. This is particularly so for discretizations of dissipative Hamiltonian ODEs, DAEs and port Hamiltonian systems where, in addition, the Hermitian part is positive definite or semi-definite. It is the… ▽ More

    Submitted 29 December, 2022; originally announced December 2022.

    MSC Class: 65F08

  21. arXiv:2212.08286  [pdf, other

    cs.CL

    ALERT: Adapting Language Models to Reasoning Tasks

    Authors: Ping Yu, Tianlu Wang, Olga Golovneva, Badr AlKhamissi, Siddharth Verma, Zhijing Jin, Gargi Ghosh, Mona Diab, Asli Celikyilmaz

    Abstract: Current large language models can perform reasonably well on complex tasks that require step-by-step reasoning with few-shot learning. Are these models applying reasoning skills they have learnt during pre-training and reason outside of their training context, or are they simply memorizing their training corpus at finer granularity and have learnt to better understand their context? To tease apart… ▽ More

    Submitted 7 July, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  22. Artificial Potential Field-Based Path Planning for Cluttered Environments

    Authors: Mosab Diab, Mostafa Mohammadkarimi, Raj Thilak Rajan

    Abstract: In this paper, we study path planning algorithms of resource constrained mobile agents in unknown cluttered environments, which include but are not limited to various terrestrial missions e.g., search and rescue missions by drones in jungles, and space missions e.g., navigation of rovers on the Moon. In particular, we focus our attention on artificial potential field (APF) based methods, in which… ▽ More

    Submitted 29 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 8 pages, 8 figures, 2023 IEEE Aerospace Conference Proceedings

    Journal ref: 2023 IEEE Aerospace Conference

  23. arXiv:2210.07652  [pdf, other

    cs.CL cs.AI

    Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values

    Authors: Yejin Bang, Tiezheng Yu, Andrea Madotto, Zhaojiang Lin, Mona Diab, Pascale Fung

    Abstract: Many NLP classification tasks, such as sexism/racism detection or toxicity detection, are based on human values. Yet, human values can vary under diverse cultural conditions. Therefore, we introduce a framework for value-aligned classification that performs prediction based on explicitly written human values in the command. Along with the task, we propose a practical approach that distills value-a… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  24. arXiv:2210.01734  [pdf, other

    cs.CL cs.LG

    Text Characterization Toolkit

    Authors: Daniel Simig, Tianlu Wang, Verna Dankers, Peter Henderson, Khuyagbaatar Batsuren, Dieuwke Hupkes, Mona Diab

    Abstract: In NLP, models are usually evaluated by reporting single-number performance scores on a number of readily available benchmarks, without much deeper analysis. Here, we argue that - especially given the well-known fact that benchmarks often contain biases, artefacts, and spurious correlations - deeper results analysis should become the de-facto standard when presenting new models or benchmarks. We p… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  25. arXiv:2209.15168  [pdf, other

    cs.CL cs.LG

    Depth-Wise Attention (DWAtt): A Layer Fusion Method for Data-Efficient Classification

    Authors: Muhammad ElNokrashy, Badr AlKhamissi, Mona Diab

    Abstract: Language Models pretrained on large textual data have been shown to encode different types of knowledge simultaneously. Traditionally, only the features from the last layer are used when adapting to new tasks or data. We put forward that, when using or finetuning deep pretrained models, intermediate layer features that may be relevant to the downstream task are buried too deep to be used efficient… ▽ More

    Submitted 7 May, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted Oral Presentation at LREC-COLING 2024; 10 pages, 9 figures

  26. arXiv:2205.12495  [pdf, other

    cs.CL

    ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

    Authors: Badr AlKhamissi, Faisal Ladhak, Srini Iyer, Ves Stoyanov, Zornitsa Kozareva, Xian Li, Pascale Fung, Lambert Mathias, Asli Celikyilmaz, Mona Diab

    Abstract: Hate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its "constituent" parts… ▽ More

    Submitted 20 May, 2023; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted at EMNLP 2022

    Journal ref: In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 2109-2120, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics

  27. arXiv:2205.12484  [pdf, other

    cs.CL cs.AI

    GisPy: A Tool for Measuring Gist Inference Score in Text

    Authors: Pedram Hosseini, Christopher R. Wolfe, Mona Diab, David A. Broniatowski

    Abstract: Decision making theories such as Fuzzy-Trace Theory (FTT) suggest that individuals tend to rely on gist, or bottom-line meaning, in the text when making decisions. In this work, we delineate the process of developing GisPy, an open-source tool in Python for measuring the Gist Inference Score (GIS) in text. Evaluation of GisPy on documents in three benchmarks from the news and scientific text domai… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: Accepted to the 4th Workshop on Narrative Understanding @ NAACL 2022

  28. arXiv:2205.08533  [pdf, ps, other

    cs.CL

    Consistent Human Evaluation of Machine Translation across Language Pairs

    Authors: Daniel Licht, Cynthia Gao, Janice Lam, Francisco Guzman, Mona Diab, Philipp Koehn

    Abstract: Obtaining meaningful quality scores for machine translation systems through human evaluation remains a challenge given the high variability between human evaluators, partly due to subjective expectations for translation quality for different language pairs. We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more cons… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 10 pages

  29. arXiv:2205.07960  [pdf, other

    cs.CL

    Meta AI at Arabic Hate Speech 2022: MultiTask Learning with Self-Correction for Hate Speech Classification

    Authors: Badr AlKhamissi, Mona Diab

    Abstract: In this paper, we tackle the Arabic Fine-Grained Hate Speech Detection shared task and demonstrate significant improvements over reported baselines for its three subtasks. The tasks are to predict if a tweet contains (1) Offensive language; and whether it is considered (2) Hate Speech or not and if so, then predict the (3) Fine-Grained Hate Speech label from one of six categories. Our final soluti… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: Accepted at the 5th Workshop on Open-Source Arabic Corpora and Processing Tools (OSACT5/LREC 2022)

  30. arXiv:2205.01068  [pdf, other

    cs.CL cs.LG

    OPT: Open Pre-trained Transformer Language Models

    Authors: Susan Zhang, Stephen Roller, Naman Goyal, Mikel Artetxe, Moya Chen, Shuohui Chen, Christopher Dewan, Mona Diab, Xian Li, Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, Luke Zettlemoyer

    Abstract: Large language models, which are often trained for hundreds of thousands of compute days, have shown remarkable capabilities for zero- and few-shot learning. Given their computational cost, these models are difficult to replicate without significant capital. For the few that are available through APIs, no access is granted to the full model weights, making them difficult to study. We present Open… ▽ More

    Submitted 21 June, 2022; v1 submitted 2 May, 2022; originally announced May 2022.

  31. arXiv:2204.06031  [pdf, other

    cs.CL cs.AI

    A Review on Language Models as Knowledge Bases

    Authors: Badr AlKhamissi, Millicent Li, Asli Celikyilmaz, Mona Diab, Marjan Ghazvininejad

    Abstract: Recently, there has been a surge of interest in the NLP community on the use of pretrained Language Models (LMs) as Knowledge Bases (KBs). Researchers have shown that LMs trained on a sufficiently large (web) corpus will encode a significant amount of knowledge implicitly in its parameters. The resulting LM can be probed for different kinds of knowledge and thus acting as a KB. This has a major ad… ▽ More

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Preprint

  32. arXiv:2203.09597  [pdf, other

    cs.CL cs.CY

    Towards Responsible Natural Language Annotation for the Varieties of Arabic

    Authors: A. Stevie Bergman, Mona T. Diab

    Abstract: When building NLP models, there is a tendency to aim for broader coverage, often overlooking cultural and (socio)linguistic nuance. In this position paper, we make the case for care and attention to such nuances, particularly in dataset annotation, as well as the inclusion of cultural and linguistic expertise in the process. We present a playbook for responsible dataset creation for polyglossic, m… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: ACL 2022 Findings

  33. arXiv:2202.09625  [pdf, other

    cs.CL

    CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

    Authors: Shuguang Chen, Gustavo Aguilar, Anirudh Srinivasan, Mona Diab, Thamar Solorio

    Abstract: To date, efforts in the code-switching literature have focused for the most part on language identification, POS, NER, and syntactic parsing. In this paper, we address machine translation for code-switched social media data. We create a community shared task. We provide two modalities for participation: supervised and unsupervised. For the supervised setting, participants are challenged to transla… ▽ More

    Submitted 19 February, 2022; originally announced February 2022.

  34. arXiv:2201.10430  [pdf, other

    cs.CL

    A Quantitative and Qualitative Analysis of Schizophrenia Language

    Authors: Amal Alqahtani, Efsun Sarioglu Kay, Sardar Hamidian, Michael Compton, Mona Diab

    Abstract: Schizophrenia is one of the most disabling mental health conditions to live with. Approximately one percent of the population has schizophrenia which makes it fairly common, and it affects many people and their families. Patients with schizophrenia suffer different symptoms: formal thought disorder (FTD), delusions, and emotional flatness. In this paper, we quantitatively and qualitatively analyze… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

  35. arXiv:2112.10684  [pdf, other

    cs.CL cs.AI cs.LG

    Efficient Large Scale Language Modeling with Mixtures of Experts

    Authors: Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

    Abstract: Mixture of Experts layers (MoEs) enable efficient scaling of language models through conditional computation. This paper presents a detailed empirical study of how autoregressive MoE language models scale in comparison with dense models in a wide range of settings: in- and out-of-domain language modeling, zero- and few-shot priming, and full-shot fine-tuning. With the exception of fine-tuning, we… ▽ More

    Submitted 26 October, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: EMNLP 2022

  36. arXiv:2112.10668  [pdf, other

    cs.CL cs.AI

    Few-shot Learning with Multilingual Language Models

    Authors: Xi Victoria Lin, Todor Mihaylov, Mikel Artetxe, Tianlu Wang, Shuohui Chen, Daniel Simig, Myle Ott, Naman Goyal, Shruti Bhosale, Jingfei Du, Ramakanth Pasunuru, Sam Shleifer, Punit Singh Koura, Vishrav Chaudhary, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Zornitsa Kozareva, Mona Diab, Veselin Stoyanov, Xian Li

    Abstract: Large-scale generative language models such as GPT-3 are competitive few-shot learners. While these models are known to be able to jointly represent many different languages, their training data is dominated by English, potentially limiting their cross-lingual generalization. In this work, we train multilingual generative language models on a corpus covering a diverse set of languages, and study t… ▽ More

    Submitted 10 November, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: Accepted to EMNLP 2022; 34 pages

  37. arXiv:2112.08615  [pdf, other

    cs.CL

    Knowledge-Augmented Language Models for Cause-Effect Relation Classification

    Authors: Pedram Hosseini, David A. Broniatowski, Mona Diab

    Abstract: Previous studies have shown the efficacy of knowledge augmentation methods in pretrained language models. However, these methods behave differently across domains and downstream tasks. In this work, we investigate the augmentation of pretrained language models with commonsense knowledge in the cause-effect relation classification and commonsense causal reasoning tasks. After automatically verbaliz… ▽ More

    Submitted 1 June, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: Accepted to Commonsense Representation and Reasoning (CSRR) @ ACL 2022

  38. arXiv:2111.13654  [pdf, other

    cs.CL cs.AI cs.LG

    Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

    Authors: Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, Srinivasan Iyer

    Abstract: Do language models have beliefs about the world? Dennett (1995) famously argues that even thermostats have beliefs, on the view that a belief is simply an informational state decoupled from any motivational state. In this paper, we discuss approaches to detecting when models have beliefs about the world, and we improve on methods for updating model beliefs to be more truthful, with a focus on meth… ▽ More

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: 19 pages

  39. arXiv:2111.06474  [pdf, other

    cs.CL

    AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization

    Authors: Alexander R. Fabbri, Xiaojian Wu, Srini Iyer, Haoran Li, Mona Diab

    Abstract: Community Question Answering (CQA) fora such as Stack Overflow and Yahoo! Answers contain a rich resource of answers to a wide range of community-based questions. Each question thread can receive a large number of answers with different perspectives. One goal of answer summarization is to produce a summary that reflects the range of answer perspectives. A major obstacle for this task is the absenc… ▽ More

    Submitted 29 April, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

    Comments: NAACL 2022; arXiv admin note: substantial text overlap with arXiv:2104.08536

  40. Chiral magnetic properties of QCD phase-diagram

    Authors: Abdel Nasser Tawfik, Abdel Magied Diab

    Abstract: The QCD phase diagram is studied, at finite magnetic field. Our calculations are based on the QCD effective model, the SU($3$) Polyakov linear sigma model (PLSM), in which the chiral symmetry is integrated in the hadron phase and in the parton phase, the up-, down- and strange-quark degrees of freedom are incorporated besides the inclusion of Polyakov loop potentials in the pure gauge limit, which… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: 16 pages, 11 figures, the paper is accepted for publication in The European Physical Journal A (EPJA)

    Journal ref: Eur. Phys. J. A (2021) 57: 200

  41. arXiv:2106.00934  [pdf, other

    cs.CL

    Discrete Cosine Transform as Universal Sentence Encoder

    Authors: Nada Almarwani, Mona Diab

    Abstract: Modern sentence encoders are used to generate dense vector representations that capture the underlying linguistic characteristics for a sequence of words, including phrases, sentences, or paragraphs. These kinds of representations are ideal for training a classifier for an end task such as sentiment analysis, question answering and text classification. Different models have been proposed to effici… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: to be published in ACL-IJCNLP 2021

  42. arXiv:2106.00169  [pdf, other

    cs.CL

    Gender Bias Amplification During Speed-Quality Optimization in Neural Machine Translation

    Authors: Adithya Renduchintala, Denise Diaz, Kenneth Heafield, Xian Li, Mona Diab

    Abstract: Is bias amplified when neural machine translation (NMT) models are optimized for speed and evaluated on generic test sets using BLEU? We investigate architectures and techniques commonly used to speed up decoding in Transformer-based models, such as greedy search, quantization, average attention networks (AANs) and shallow decoder models and show their effect on gendered noun translation. We const… ▽ More

    Submitted 31 May, 2021; originally announced June 2021.

    Comments: Accepted at ACL 2021

  43. arXiv:2105.15071  [pdf, other

    cs.CL

    Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data

    Authors: Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona Diab

    Abstract: The scarcity of parallel data is a major obstacle for training high-quality machine translation systems for low-resource languages. Fortunately, some low-resource languages are linguistically related or similar to high-resource languages; these related languages may share many lexical or syntactic structures. In this work, we exploit this linguistic overlap to facilitate translating to and from a… ▽ More

    Submitted 1 June, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: ACL 2021

  44. arXiv:2104.09354  [pdf

    physics.optics astro-ph.IM

    Optimal SMF packing in photonic lanterns: comparing theoretical topology to practical packing arrangements

    Authors: John J. Davenport, Momen Diab, Kalaga Madhav, Martin M. Roth

    Abstract: Photonic lanterns rely on a close packed arrangement of single mode fibers, which are tapered and fused into one multi-mode core. Topologically optimal circle packing arrangements have been well studied. Using this, we fabricate PLs with 19 and 37 SMFs showing tightly packed, ordered arrangements with packing densities of 95 % and 99 % of theoretically achievable values, with mean adjacent core se… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: 10 pages, 10 figures, 2 tables

    Journal ref: Journal of the Optical Society of America B, Vol. 38, Issue 7, pp. A7-A14 (2021)

  45. arXiv:2104.08536  [pdf, other

    cs.CL

    Multi-Perspective Abstractive Answer Summarization

    Authors: Alexander R. Fabbri, Xiaojian Wu, Srini Iyer, Mona Diab

    Abstract: Community Question Answering (CQA) forums such as Stack Overflow and Yahoo! Answers contain a rich resource of answers to a wide range of questions. Each question thread can receive a large number of answers with different perspectives. The goal of multi-perspective answer summarization is to produce a summary that includes all perspectives of the answer. A major obstacle for multi-perspective, ab… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  46. arXiv:2103.14047  [pdf, other

    astro-ph.IM physics.optics

    Simulations of mode-selective photonic lanterns for efficient coupling of starlight into the single-mode regime

    Authors: Momen Diab, Aashana Tripathi, John Davenport, Aline N. Dinkelaker, Kalaga Madhav, Martin M. Roth

    Abstract: In ground-based astronomy, starlight distorted by the atmosphere couples poorly into single-mode waveguides but a correction by adaptive optics, even if only partial, can boost coupling into the few-mode regime allowing the use of photonic lanterns to convert into multiple single-mode beams. Corrected wavefronts result in focal patterns that couple mostly with the circularly symmetric waveguide mo… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

    Comments: 6 pages, 5 figures, accepted for publication in Applied Optics

    Journal ref: Appl. Opt. 60(19), D9-D14 (2021)

  47. arXiv:2103.13606  [pdf, other

    cs.CL cs.AI

    Predicting Directionality in Causal Relations in Text

    Authors: Pedram Hosseini, David A. Broniatowski, Mona Diab

    Abstract: In this work, we test the performance of two bidirectional transformer-based language models, BERT and SpanBERT, on predicting directionality in causal pairs in the textual content. Our preliminary results show that predicting direction for inter-sentence and implicit causal relations is more challenging. And, SpanBERT performs better than BERT on causal samples with longer span length. We also in… ▽ More

    Submitted 25 March, 2021; originally announced March 2021.

  48. arXiv:2101.10894  [pdf

    cs.CV cs.CY

    White Paper: Challenges and Considerations for the Creation of a Large Labelled Repository of Online Videos with Questionable Content

    Authors: Thamar Solorio, Mahsa Shafaei, Christos Smailis, Mona Diab, Theodore Giannakopoulos, Heng Ji, Yang Liu, Rada Mihalcea, Smaranda Muresan, Ioannis Kakadiaris

    Abstract: This white paper presents a summary of the discussions regarding critical considerations to develop an extensive repository of online videos annotated with labels indicating questionable content. The main discussion points include: 1) the type of appropriate labels that will result in a valuable repository for the larger AI community; 2) how to design the collection and annotation process, as well… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  49. arXiv:2011.13423  [pdf, other

    astro-ph.IM physics.optics

    Starlight coupling through atmospheric turbulence into few-mode fibers and photonic lanterns in the presence of partial adaptive optics correction

    Authors: Momen Diab, Aline N. Dinkelaker, John Davenport, Kalaga Madhav, Martin M. Roth

    Abstract: Starlight corrupted by atmospheric turbulence cannot couple efficiently into astronomical instruments based on integrated optics as they require light of high spatial coherence to couple into their single-mode waveguides. Low-order adaptive optics in combination with photonic lanterns offer a practical approach to achieve efficient coupling into multiplexed astrophotonic devices. We investigate, a… ▽ More

    Submitted 26 November, 2020; originally announced November 2020.

    Comments: 12 pages, 10 figures

    Journal ref: MNRAS 501, 1557-1567 (2021)

  50. A minimal Length Uncertainty Approach to Cosmological Constant Problem

    Authors: Abdel Magied Diab, Abdel Nasser Tawfik

    Abstract: Based on quantum mechanical framework for the minimal length uncertainty, we demonstrate that the generalized uncertainty principle (GUP) parameter could be best constrained by recent gravitational waves observations on one hand. On other hand this suggests modified dispersion relations (MDRs) enabling an estimation for the difference between the group velocity of gravitons and that of photons. Ut… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: 5 pages, 0 figures, an invited talk given at the int. Workshop on Astronomy and Relativistic Astrophysics (IWARA $2020$ Video Conference), accepted for publication in Astronomische Nachrichten

    Report number: ECTP-2020-14-14, WLCAPP-2020-14-14

    Journal ref: Astronomische Nachrichten 2021, 1-5