Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–28 of 28 results for author: Rambow, O

.
  1. arXiv:2406.12131  [pdf, other

    cs.CL

    Gram2Vec: An Interpretable Document Vectorizer

    Authors: Peter Zeng, Eric Sclafani, Owen Rambow

    Abstract: We present Gram2Vec, a grammatical style embedding algorithm that embeds documents into a higher dimensional space by extracting the normalized relative frequencies of grammatical features present in the text. Compared to neural approaches, Gram2Vec offers inherent interpretability based on how the feature vectors are generated. In our demo, we present a way to visualize a mapping of authors to do… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 6 pages, 2 figures

  2. arXiv:2406.10786  [pdf, other

    cs.AI cs.CL

    Evaluating LLMs with Multiple Problems at once: A New Paradigm for Probing LLM Capabilities

    Authors: Zhengxiang Wang, Jordan Kodner, Owen Rambow

    Abstract: Current LLM evaluation predominantly performs evaluation with prompts comprising single problems. We propose multi-problem evaluation as an additional approach to study the multiple problem handling capabilities of LLMs. We present a systematic study in this regard by comprehensively examining 7 LLMs on 4 related types of tasks constructed from 6 classification benchmarks. The 4 task types include… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 20 pages, 15 figures, 9 tables

  3. arXiv:2406.07466  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Multimodal Belief Prediction

    Authors: John Murzaku, Adil Soubki, Owen Rambow

    Abstract: Recognizing a speaker's level of commitment to a belief is a difficult task; humans do not only interpret the meaning of the words in context, but also understand cues from intonation and other aspects of the audio signal. Many papers and corpora in the NLP community have approached the belief prediction task using text-only approaches. We are the first to frame and present results on the multimod… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: John Murzaku and Adil Soubki contributed equally to this work

    Journal ref: Interspeech 2024

  4. arXiv:2406.04109  [pdf, other

    cs.CL

    Intention and Face in Dialog

    Authors: Adil Soubki, Owen Rambow

    Abstract: The notion of face described by Brown and Levinson (1987) has been studied in great detail, but a critical aspect of the framework, that which focuses on how intentions mediate the planning of turns which impose upon face, has received far less attention. We present an analysis of three computational systems trained for classifying both intention and politeness, focusing on how the former influenc… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Journal ref: May 2024. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 9143-9153, Torino, Italia. ELRA and ICCL

  5. arXiv:2403.02451  [pdf, other

    cs.CL

    Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground

    Authors: Adil Soubki, John Murzaku, Arash Yousefi Jordehi, Peter Zeng, Magdalena Markowska, Seyed Abolghasem Mirroshandel, Owen Rambow

    Abstract: Evaluating the theory of mind (ToM) capabilities of language models (LMs) has recently received a great deal of attention. However, many existing benchmarks rely on synthetic data, which risks misaligning the resulting experiments with human behavior. We introduce the first ToM dataset based on naturally occurring spoken dialogs, Common-ToM, and show that LMs struggle to demonstrate ToM. We then s… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Journal ref: ACL 2024 Findings

  6. arXiv:2402.17151  [pdf, other

    cs.CL

    Clustering Document Parts: Detecting and Characterizing Influence Campaigns from Documents

    Authors: Zhengxiang Wang, Owen Rambow

    Abstract: We propose a novel clustering pipeline to detect and characterize influence campaigns from documents. This approach clusters parts of document, detects clusters that likely reflect an influence campaign, and then identifies documents linked to an influence campaign via their association with the high-influence clusters. Our approach outperforms both the direct document-level classification and the… ▽ More

    Submitted 26 April, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures, 5 tables

  7. arXiv:2311.01273  [pdf, other

    cs.CL

    Finding Common Ground: Annotating and Predicting Common Ground in Spoken Conversations

    Authors: Magdalena Markowska, Mohammad Taghizadeh, Adil Soubki, Seyed Abolghasem Mirroshandel, Owen Rambow

    Abstract: When we communicate with other humans, we do not simply generate a sequence of words. Rather, we use our cognitive state (beliefs, desires, intentions) and our model of the audience's cognitive state to create utterances that affect the audience's cognitive state in the intended manner. An important part of cognitive state is the common ground, which is the content the speaker believes, and the sp… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Journal ref: Findings of EMNLP 2023

  8. arXiv:2210.08604  [pdf, other

    cs.CL cs.AI

    NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-Fly

    Authors: Yi R. Fung, Tuhin Chakraborty, Hao Guo, Owen Rambow, Smaranda Muresan, Heng Ji

    Abstract: Norm discovery is important for understanding and reasoning about the acceptable behaviors and potential violations in human communication and interactions. We introduce NormSage, a framework for addressing the novel task of conversation-grounded multi-lingual, multi-cultural norm discovery, based on language model prompting and self-verification. NormSAGE leverages the expressiveness and implicit… ▽ More

    Submitted 13 January, 2024; v1 submitted 16 October, 2022; originally announced October 2022.

  9. arXiv:2203.10659  [pdf, other

    cs.CL cs.AI

    From Stance to Concern: Adaptation of Propositional Analysis to New Tasks and Domains

    Authors: Brodie Mather, Bonnie J Dorr, Adam Dalton, William de Beaumont, Owen Rambow, Sonja M. Schmer-Galunder

    Abstract: We present a generalized paradigm for adaptation of propositional analysis (predicate-argument pairs) to new tasks and domains. We leverage an analogy between stances (belief-driven sentiment) and concerns (topical issues with moral dimensions/endorsements) to produce an explanatory representation. A key contribution is the combination of semi-automatic resource building for extraction of domain-d… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of the Association for Computational Linguistics, 2022

    MSC Class: 68T50 ACM Class: I.2.7

  10. arXiv:2010.10998  [pdf, other

    cs.CL cs.AI

    Open-Domain Frame Semantic Parsing Using Transformers

    Authors: Aditya Kalyanpur, Or Biran, Tom Breloff, Jennifer Chu-Carroll, Ariel Diertani, Owen Rambow, Mark Sammons

    Abstract: Frame semantic parsing is a complex problem which includes multiple underlying subtasks. Recent approaches have employed joint learning of subtasks (such as predicate and argument detection), and multi-task learning of related tasks (such as syntactic and semantic parsing). In this paper, we explore multi-task learning of all subtasks with transformer-based models. We show that a purely generative… ▽ More

    Submitted 23 October, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: 11 pages

  11. arXiv:2005.01525  [pdf, other

    cs.CL cs.AI

    To Test Machine Comprehension, Start by Defining Comprehension

    Authors: Jesse Dunietz, Gregory Burnham, Akash Bharadwaj, Owen Rambow, Jennifer Chu-Carroll, David Ferrucci

    Abstract: Many tasks aim to measure machine reading comprehension (MRC), often focusing on question types presumed to be difficult. Rarely, however, do task designers start by considering what systems should in fact comprehend. In this paper we make two key contributions. First, we argue that existing approaches do not adequately define comprehension; they are too unsystematic about what content is tested.… ▽ More

    Submitted 11 May, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Camera-ready ACL 2020 paper (Theme track). 9 pages; 3 figures; 1 table

  12. arXiv:1903.05260  [pdf, other

    cs.CL

    Syntax-aware Neural Semantic Role Labeling with Supertags

    Authors: Jungo Kasai, Dan Friedman, Robert Frank, Dragomir Radev, Owen Rambow

    Abstract: We introduce a new syntax-aware model for dependency-based semantic role labeling that outperforms syntax-agnostic models for English and Spanish. We use a BiLSTM to tag the text with supertags extracted from dependency parses, and we feed these supertags, along with words and parts of speech, into a deep highway BiLSTM for semantic role labeling. Our model combines the strengths of earlier models… ▽ More

    Submitted 3 April, 2019; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: NAACL 2019, Added Spanish ELMo results

  13. arXiv:1805.06016  [pdf, other

    cs.CL

    Author Commitment and Social Power: Automatic Belief Tagging to Infer the Social Context of Interactions

    Authors: Vinodkumar Prabhakaran, Premkumar Ganeshkumar, Owen Rambow

    Abstract: Understanding how social power structures affect the way we interact with one another is of great interest to social scientists who want to answer fundamental questions about human behavior, as well as to computer scientists who want to build automatic methods to infer the social contexts of interactions. In this paper, we employ advancements in extra-propositional semantics extraction within NLP… ▽ More

    Submitted 15 May, 2018; originally announced May 2018.

    Comments: NAACL 2018 long paper. 9 pages plus references

    Journal ref: North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT). 2018

  14. arXiv:1804.06610  [pdf, other

    cs.CL

    End-to-end Graph-based TAG Parsing with Neural Networks

    Authors: Jungo Kasai, Robert Frank, Pauli Xu, William Merrill, Owen Rambow

    Abstract: We present a graph-based Tree Adjoining Grammar (TAG) parser that uses BiLSTMs, highway connections, and character-level CNNs. Our best end-to-end parser, which jointly performs supertagging, POS tagging, and parsing, outperforms the previously reported best results by more than 2.2 LAS and UAS points. The graph-based parsing architecture allows for global inference and rich feature representation… ▽ More

    Submitted 27 April, 2018; v1 submitted 18 April, 2018; originally announced April 2018.

    Comments: NAACL 2018

  15. arXiv:1708.03940  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Leveraging Sparse and Dense Feature Combinations for Sentiment Classification

    Authors: Tao Yu, Christopher Hidey, Owen Rambow, Kathleen McKeown

    Abstract: Neural networks are one of the most popular approaches for many natural language processing tasks such as sentiment analysis. They often outperform traditional machine learning models and achieve the state-of-art results on most tasks. However, many existing deep learning models are complex, difficult to train and provide a limited improvement over simpler methods. We propose a simple, robust and… ▽ More

    Submitted 13 August, 2017; originally announced August 2017.

    Comments: 4 pages

  16. Dialog Structure Through the Lens of Gender, Gender Environment, and Power

    Authors: Vinodkumar Prabhakaran, Owen Rambow

    Abstract: Understanding how the social context of an interaction affects our dialog behavior is of great interest to social scientists who study human behavior, as well as to computer scientists who build automatic methods to infer those social contexts. In this paper, we study the interaction of power, gender, and dialog behavior in organizational interactions. In order to perform this study, we first cons… ▽ More

    Submitted 11 June, 2017; originally announced June 2017.

    Journal ref: Journal for Dialogue & Discourse 8(2) (2017) 21-55

  17. arXiv:1609.08779  [pdf

    cs.CY cs.CL

    Using Natural Language Processing and Qualitative Analysis to Intervene in Gang Violence: A Collaboration Between Social Work Researchers and Data Scientists

    Authors: Desmond Upton Patton, Kathleen McKeown, Owen Rambow, Jamie Macbeth

    Abstract: The U.S. has the highest rate of firearm-related deaths when compared to other industrialized countries. Violence particularly affects low-income, urban neighborhoods in cities like Chicago, which saw a 40% increase in firearm violence from 2014 to 2015 to more than 3,000 shooting victims. While recent studies have found that urban, gang-involved individuals curate a unique and complex communicati… ▽ More

    Submitted 28 September, 2016; originally announced September 2016.

    Comments: Presented at the Data For Good Exchange 2016

  18. arXiv:1503.01190  [pdf, other

    cs.CL cs.LG stat.ML

    Statistical modality tagging from rule-based annotations and crowdsourcing

    Authors: Vinodkumar Prabhakaran, Michael Bloodgood, Mona Diab, Bonnie Dorr, Lori Levin, Christine D. Piatko, Owen Rambow, Benjamin Van Durme

    Abstract: We explore training an automatic modality tagger. Modality is the attitude that a speaker might have toward an event or state. One of the main hurdles for training a linguistic tagger is gathering training data. This is particularly problematic for training a tagger for modality because modality triggers are sparse for the overwhelming majority of sentences. We investigate an approach to automatic… ▽ More

    Submitted 3 March, 2015; originally announced March 2015.

    Comments: 8 pages, 6 tables; appeared in Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, July 2012; In Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, pages 57-64, Jeju, Republic of Korea, July 2012. Association for Computational Linguistics

    ACM Class: I.2.7; I.2.6; I.5.1; I.5.4

    Journal ref: In Proceedings of the Workshop on Extra-Propositional Aspects of Meaning in Computational Linguistics, pages 57-64, Jeju, Republic of Korea, July 2012. Association for Computational Linguistics

  19. arXiv:1309.5652  [pdf

    cs.CL

    LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual

    Authors: Mona Diab, Nizar Habash, Owen Rambow, Ryan Roth

    Abstract: The Linguistic Data Consortium (LDC) has developed hundreds of data corpora for natural language processing (NLP) research. Among these are a number of annotated treebank corpora for Arabic. Typically, these corpora consist of a single collection of annotated documents. NLP research, however, usually requires multiple data sets for the purposes of training models, developing techniques, and final… ▽ More

    Submitted 22 September, 2013; originally announced September 2013.

    Comments: 14 pages; one cover

    Report number: CLCSL-0S7--1031-02

  20. arXiv:cond-mat/0404590  [pdf, ps, other

    cond-mat.stat-mech

    Orthogonality Catastrophe in Bose-Einstein Condensates

    Authors: Jun Sun, Olen Rambow, Qimiao Si

    Abstract: Orthogonality catastrophe in fermionic systems is well known: in the thermodynamic limit, the overlap between the ground state wavefunctions with and without a single local scattering potential approaches zero algebraically as a function of the particle number $N$. Here we examine the analogous problem for bosonic systems. In the homogeneous case, we find that ideal bosons display an orthogonali… ▽ More

    Submitted 6 May, 2004; v1 submitted 26 April, 2004; originally announced April 2004.

    Comments: 5 pages; 2 figures

  21. Synchronous Models of Language

    Authors: Owen Rambow, Giorgio Satta

    Abstract: In synchronous rewriting, the productions of two rewriting systems are paired and applied synchronously in the derivation of a pair of strings. We present a new synchronous rewriting system and argue that it can handle certain phenomena that are not covered by existing synchronous systems. We also prove some interesting formal/computational properties of our system.

    Submitted 27 May, 1996; originally announced May 1996.

    Comments: 8 pages uuencoded gzipped ps file

  22. D-Tree Grammars

    Authors: Owen Rambow, K. Vijay-Shanker, David Weir

    Abstract: DTG are designed to share some of the advantages of TAG while overcoming some of its limitations. DTG involve two composition operations called subsertion and sister-adjunction. The most distinctive feature of DTG is that, unlike TAG, there is complete uniformity in the way that the two DTG operations relate lexical items: subsertion always corresponds to complementation and sister-adjunction to… ▽ More

    Submitted 12 May, 1995; originally announced May 1995.

    Comments: Latex source, needs aclap.sty, 8 pages, to appear in ACL-95

  23. arXiv:cmp-lg/9504011  [pdf, ps

    cs.CL

    A Processing Model for Free Word Order Languages

    Authors: Owen Rambow, Aravind K. Joshi

    Abstract: Like many verb-final languages, Germn displays considerable word-order freedom: there is no syntactic constraint on the ordering of the nominal arguments of a verb, as long as the verb remains in final position. This effect is referred to as ``scrambling'', and is interpreted in transformational frameworks as leftward movement of the arguments. Furthermore, arguments from an embedded clause may… ▽ More

    Submitted 15 April, 1995; originally announced April 1995.

    Comments: 23 pages, uuencoded compressed ps file. In {\em Perspectives on Sentence Processing}, C. Clifton, Jr., L. Frazier and K. Rayner, editors. Lawrence Erlbaum Associates, 1994

  24. arXiv:cmp-lg/9411008  [pdf, ps

    cs.CL

    Parsing Free Word-Order Languages in Polynomial Time

    Authors: Tilman Becker, Owen Rambow

    Abstract: We present a parsing algorithm with polynomial time complexity for a large subset of V-TAG languages. V-TAG, a variant of multi-component TAG, can handle free-word order phenomena which are beyond the class LCFRS (which includes regular TAG). Our algorithm is based on a CYK-style parser for TAGs.

    Submitted 3 November, 1994; originally announced November 1994.

    Comments: 4 pages, uuencoded compressed ps file

    Report number: TALANA-RT-94-01, TALANA, Universite' Paris 7, 1994

    Journal ref: In {\em 3e Colloque International sur les Grammaires d'Arbres Adjoints (TAG+3)}

  25. arXiv:cmp-lg/9411007  [pdf, ps

    cs.CL

    The Linguistic Relevance of Quasi-Trees

    Authors: Anthony Kroch, Owen Rambow

    Abstract: We discuss two constructions (long scrambling and ECM verbs) which challenge most syntactic theories (including traditional TAG approaches) since they seem to require exceptional mechanisms and postulates. We argue that these constructions should in fact be analyzed in a similar manner, namely as involving a verb which selects for a ``defective'' complement. These complements are defective in th… ▽ More

    Submitted 3 November, 1994; originally announced November 1994.

    Comments: 4 pages, uuencoded compressed ps file

    Report number: Report TALANA-RT-94-01, TALANA, Universit{\'e} Paris 7, 1994

    Journal ref: In {\em 3e Colloque International sur les Grammaires d'Arbres Adjoints (TAG+3)}

  26. arXiv:cmp-lg/9410007  [pdf, ps

    cs.CL

    A Formal Look at Dependency Grammars and Phrase-Structure Grammars, with Special Consideration of Word-Order Phenomena

    Authors: Owen Rambow, Aravind Joshi

    Abstract: The central role of the lexicon in Meaning-Text Theory (MTT) and other dependency-based linguistic theories cannot be replicated in linguistic theories based on context-free grammars (CFGs). We describe Tree Adjoining Grammar (TAG) as a system that arises naturally in the process of lexicalizing CFGs. A TAG grammar can therefore be compared directly to an Meaning-Text Model (MTM). We illustrate… ▽ More

    Submitted 18 October, 1994; originally announced October 1994.

    Comments: uuencoded compressed ps file, 20 pages

  27. arXiv:cmp-lg/9407016  [pdf, ps

    cs.CL

    The Role of Cognitive Modeling in Achieving Communicative Intentions

    Authors: Marilyn Walker, Owen Rambow

    Abstract: A discourse planner for (task-oriented) dialogue must be able to make choices about whether relevant, but optional information (for example, the "satellites" in an RST-based planner) should be communicated. We claim that effective text planners must explicitly model aspects of the Hearer's cognitive state, such as what the hearer is attending to and what inferences the hearer can draw, in order… ▽ More

    Submitted 20 July, 1994; v1 submitted 19 July, 1994; originally announced July 1994.

    Comments: 10 pages, uuencoded compressed ps file

  28. arXiv:cmp-lg/9406009  [pdf, ps

    cs.CL

    Multiset-Valued Linear Index Grammars: Imposing Dominance Constraints on Derivations

    Authors: Owen Rambow

    Abstract: This paper defines multiset-valued linear index grammar and unordered vector grammar with dominance links. The former models certain uses of multiset-valued feature structures in unification-based formalisms, while the latter is motivated by word order variation and by ``quasi-trees'', a generalization of trees. The two formalisms are weakly equivalent, and an important subset is at most context… ▽ More

    Submitted 2 June, 1994; originally announced June 1994.

    Comments: 8 pages, uuencoded compressed ps file

    Journal ref: Proc ACL 94