
Showing 1–11 of 11 results for author: Orr, L

Searching in archive cs.
  1. arXiv:2211.09110  [pdf, other]

    cs.CL cs.AI cs.LG

    Holistic Evaluation of Language Models

    Authors: Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, et al. (25 additional authors not shown)

    Abstract: Language models (LMs) are becoming the foundation for almost all major language technologies, but their capabilities, limitations, and risks are not well understood. We present Holistic Evaluation of Language Models (HELM) to improve the transparency of language models. First, we taxonomize the vast space of potential scenarios (i.e. use cases) and metrics (i.e. desiderata) that are of interest fo…

    Submitted 1 October, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Project page: https://crfm.stanford.edu/helm/v1.0

    Journal ref: Published in Transactions on Machine Learning Research (TMLR), 2023

  2. arXiv:2210.02441  [pdf, other]

    cs.CL

    Ask Me Anything: A simple strategy for prompting language models

    Authors: Simran Arora, Avanika Narayan, Mayee F. Chen, Laurel Orr, Neel Guha, Kush Bhatia, Ines Chami, Frederic Sala, Christopher Ré

    Abstract: Large language models (LLMs) transfer well to new tasks out-of-the-box simply given a natural language prompt that demonstrates how to perform the task and no additional training. Prompting is a brittle process wherein small modifications to the prompt can cause large variations in the model predictions, and therefore significant effort is dedicated towards designing a painstakingly "perfect promp…

    Submitted 19 November, 2022; v1 submitted 5 October, 2022; originally announced October 2022.

  3. arXiv:2205.09911  [pdf, other]

    cs.LG cs.AI cs.DB

    Can Foundation Models Wrangle Your Data?

    Authors: Avanika Narayan, Ines Chami, Laurel Orr, Simran Arora, Christopher Ré

    Abstract: Foundation Models (FMs) are models trained on large corpora of data that, at very large scale, can generalize to new tasks without any task-specific finetuning. As these models continue to grow in size, innovations continue to push the boundaries of what these models can do on language and image tasks. This paper aims to understand an underexplored area of FMs: classical data tasks like cleaning a…

    Submitted 24 December, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: 12 pages, 5 figures; additional experiments, typo corrections, modifications to Section 5 (Research Agenda)

  4. arXiv:2110.08228  [pdf, other]

    cs.CL cs.AI

    Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text

    Authors: Maya Varma, Laurel Orr, Sen Wu, Megan Leszczynski, Xiao Ling, Christopher Ré

    Abstract: Named entity disambiguation (NED), which involves mapping textual mentions to structured entities, is particularly challenging in the medical domain due to the presence of rare entities. Existing approaches are limited by the presence of coarse-grained structural resources in biomedical knowledge bases as well as the use of training datasets that provide low coverage over uncommon resources. In th…

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: Accepted to Findings of EMNLP 2021

  5. arXiv:2108.07258  [pdf, other]

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap…

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  6. arXiv:2108.05053  [pdf, other]

    cs.LG cs.DB

    Managing ML Pipelines: Feature Stores and the Coming Wave of Embedding Ecosystems

    Authors: Laurel Orr, Atindriyo Sanyal, Xiao Ling, Karan Goel, Megan Leszczynski

    Abstract: The industrial machine learning pipeline requires iterating on model features, training and deploying models, and monitoring deployed models at scale. Feature stores were developed to manage and standardize the engineer's workflow in this end-to-end pipeline, focusing on traditional tabular feature data. In recent years, however, model development has shifted towards using self-supervised pretrain…

    Submitted 11 August, 2021; originally announced August 2021.

    Journal ref: VLDB 2021

  7. arXiv:2010.10363  [pdf, other]

    cs.CL cs.AI cs.LG

    Bootleg: Chasing the Tail with Self-Supervised Named Entity Disambiguation

    Authors: Laurel Orr, Megan Leszczynski, Simran Arora, Sen Wu, Neel Guha, Xiao Ling, Christopher Ré

    Abstract: A challenge for named entity disambiguation (NED), the task of mapping textual mentions to entities in a knowledge base, is how to disambiguate entities that appear rarely in the training data, termed tail entities. Humans use subtle reasoning patterns based on knowledge of entity facts, relations, and types to disambiguate unfamiliar entities. Inspired by these patterns, we introduce Bootleg, a s…

    Submitted 23 October, 2020; v1 submitted 20 October, 2020; originally announced October 2020.

  8. arXiv:2002.09799  [pdf, other]

    cs.DB

    Sample Debiasing in the Themis Open World Database System (Extended Version)

    Authors: Laurel Orr, Magda Balazinska, Dan Suciu

    Abstract: Open world database management systems assume tuples not in the database still exist and are becoming an increasingly important area of research. We present Themis, the first open world database that automatically rebalances arbitrarily biased samples to approximately answer queries as if they were issued over the entire population. We leverage apriori population aggregate information to develop a…

    Submitted 29 February, 2020; v1 submitted 22 February, 2020; originally announced February 2020.

    Comments: SIGMOD 2020

  9. arXiv:1912.07777  [pdf, other]

    cs.DB cs.LG

    Mosaic: A Sample-Based Database System for Open World Query Processing

    Authors: Laurel Orr, Samuel Ainsworth, Walter Cai, Kevin Jamieson, Magda Balazinska, Dan Suciu

    Abstract: Data scientists have relied on samples to analyze populations of interest for decades. Recently, with the increase in the number of public data repositories, sample data has become easier to access. It has not, however, become easier to analyze. This sample data is arbitrarily biased with an unknown sampling probability, meaning data scientists must manually debias the sample with custom technique…

    Submitted 10 January, 2020; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: CIDR 2020

  10. arXiv:1911.04948  [pdf, other]

    cs.DB

    EntropyDB: A Probabilistic Approach to Approximate Query Processing

    Authors: Laurel Orr, Magdalena Balazinska, Dan Suciu

    Abstract: We present EntropyDB, an interactive data exploration system that uses a probabilistic approach to generate a small, query-able summary of a dataset. Departing from traditional summarization techniques, we use the Principle of Maximum Entropy to generate a probabilistic representation of the data that can be used to give approximate query answers. We develop the theoretical framework and formulati…

    Submitted 9 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1703.03856

    Journal ref: VLDB Journal 2019

  11. arXiv:1703.03856  [pdf, other]

    cs.DB

    Probabilistic Database Summarization for Interactive Data Exploration

    Authors: Laurel Orr, Magda Balazinska, Dan Suciu

    Abstract: We present a probabilistic approach to generate a small, query-able summary of a dataset for interactive data exploration. Departing from traditional summarization techniques, we use the Principle of Maximum Entropy to generate a probabilistic representation of the data that can be used to give approximate query answers. We develop the theoretical framework and formulation of our probabilistic rep…

    Submitted 23 May, 2017; v1 submitted 10 March, 2017; originally announced March 2017.

    Comments: To appear VLDB 2017