
Showing 1–10 of 10 results for author: Malon, C

  1. arXiv:2403.00696  [pdf, other]

    cs.CL

    Self-Consistent Decoding for More Factual Open Responses

    Authors: Christopher Malon, Xiaodan Zhu

    Abstract: Self-consistency has emerged as a powerful method for improving the accuracy of short answers generated by large language models. As previously defined, it only concerns the accuracy of a final answer parsed from generated text. In this work, we extend the idea to open response generation, by integrating voting into the decoding method. Each output sentence is selected from among multiple samples,…

    Submitted 1 March, 2024; originally announced March 2024.

  2. arXiv:2307.14305  [pdf, other]

    cs.CL

    Automatically Evaluating Opinion Prevalence in Opinion Summarization

    Authors: Christopher Malon

    Abstract: When faced with a large number of product reviews, it is not clear that a human can remember all of them and weight opinions representatively to write a good reference summary. We propose an automatic metric to test the prevalence of the opinions that a summary expresses, based on counting the number of reviews that are consistent with each statement in the summary, while discrediting trivial or r…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: The 6th Workshop on e-Commerce and NLP (KDD 2023)

  3. arXiv:2208.11024  [pdf, other]

    cs.AI

    KGxBoard: Explainable and Interactive Leaderboard for Evaluation of Knowledge Graph Completion Models

    Authors: Haris Widjaja, Kiril Gashteovski, Wiem Ben Rim, Pengfei Liu, Christopher Malon, Daniel Ruffinelli, Carolin Lawrence, Graham Neubig

    Abstract: Knowledge Graphs (KGs) store information in the form of (head, predicate, tail)-triples. To augment KGs with new knowledge, researchers proposed models for KG Completion (KGC) tasks such as link prediction; i.e., answering (h; p; ?) or (?; p; t) queries. Such models are usually evaluated with averaged metrics on a held-out test set. While useful for tracking progress, averaged single-score metrics…

    Submitted 23 August, 2022; originally announced August 2022.

  4. arXiv:2204.06555  [pdf, other]

    cs.CL

    Fast Few-shot Debugging for NLU Test Suites

    Authors: Christopher Malon, Kai Li, Erik Kruus

    Abstract: We study few-shot debugging of transformer based natural language understanding models, using recently popularized test suites to not just diagnose but correct a problem. Given a few debugging examples of a certain phenomenon, and a held-out test set of the same phenomenon, we aim to maximize accuracy on the phenomenon at a minimal cost of accuracy on the original test set. We examine several meth…

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: To appear at ACL 2022 Deep Learning Inside Out (DeeLIO) workshop

  5. arXiv:2103.03842  [pdf, ps, other]

    cs.CL

    Overcoming Poor Word Embeddings with Word Definitions

    Authors: Christopher Malon

    Abstract: Modern natural language understanding models depend on pretrained subword embeddings, but applications may need to reason about words that were never or rarely seen during pretraining. We show that examples that depend critically on a rarer word are more challenging for natural language inference models. Then we explore how a model could learn to use definitions, provided in natural text, to overc…

    Submitted 5 March, 2021; originally announced March 2021.

  6. arXiv:2101.11766  [pdf, other]

    cs.LG

    Improving Neural Network Robustness through Neighborhood Preserving Layers

    Authors: Bingyuan Liu, Christopher Malon, Lingzhou Xue, Erik Kruus

    Abstract: Robustness against adversarial attack in neural networks is an important research topic in the machine learning community. We observe one major source of vulnerability of neural nets is from overparameterized fully-connected layers. In this paper, we propose a new neighborhood preserving layer which can replace these fully connected layers to improve the network robustness. We demonstrate a novel…

    Submitted 29 January, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

    Comments: An earlier short version of this paper without proof is presented at the 25th International Conference on Pattern Recognition (ICPR), Manifold Learning from Euclid to Riemann workshop

  7. arXiv:2006.00693  [pdf, other]

    cs.LG stat.ML

    Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

    Authors: Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

    Abstract: Learning disentangled representations of natural language is essential for many NLP tasks, e.g., conditional text generation, style transfer, personalized dialogue systems, etc. Similar problems have been studied extensively for other forms of data, such as images and videos. However, the discrete nature of natural language makes the disentangling of textual representations more challenging (e.g.,…

    Submitted 12 January, 2022; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: Accepted by the 58th Annual Meeting of the Association for Computational Linguistics (ACL2020)

  8. arXiv:2002.12344  [pdf, other]

    cs.CL

    Generating Followup Questions for Interpretable Multi-hop Question Answering

    Authors: Christopher Malon, Bing Bai

    Abstract: We propose a framework for answering open domain multi-hop questions in which partial information is read and used to generate followup questions, to finally be answered by a pretrained single-hop answer extractor. This framework makes each hop interpretable, and makes the retrieval associated with later hops as flexible and specific as for the first hop. As a first instantiation of this framework…

    Submitted 27 February, 2020; originally announced February 2020.

  9. arXiv:1901.02534  [pdf, ps, other]

    cs.CL

    Team Papelo: Transformer Networks at FEVER

    Authors: Christopher Malon

    Abstract: We develop a system for the FEVER fact extraction and verification challenge that uses a high precision entailment classifier based on transformer networks pretrained with language modeling, to classify a broad set of potential evidence. The precision of the entailment classifier allows us to enhance recall by considering every statement from several articles to decide upon each claim. We include…

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: Appeared at EMNLP 2018 First Workshop on Fact Extraction and Verification (FEVER)

  10. arXiv:1810.11067  [pdf, ps, other]

    cs.CL

    Teaching Syntax by Adversarial Distraction

    Authors: Juho Kim, Christopher Malon, Asim Kadav

    Abstract: Existing entailment datasets mainly pose problems which can be answered without attention to grammar or word order. Learning syntax requires comparing examples where different grammar and word order change the desired classification. We introduce several datasets based on synthetic transformations of natural entailment examples in SNLI or FEVER, to teach aspects of grammar and word order. We show…

    Submitted 25 October, 2018; originally announced October 2018.

    Comments: To appear at the EMNLP 2018 First Workshop on Fact Extraction and Verification (FEVER)

    Journal ref: Juho Kim, Christopher Malon, and Asim Kadav. 2018. "Teaching Syntax by Adversarial Distraction." Proceedings of the EMNLP First Workshop on Fact Extraction and Verification