Showing 1–6 of 6 results for author: Gale, A

Searching in archive cs.
  1. arXiv:2312.10879  [pdf]

    cs.LG cs.AI

    Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models

    Authors: Reek Majumder, Jacquan Pollard, M Sabbir Salek, David Werth, Gurcan Comert, Adrian Gale, Sakib Mahmud Khan, Samuel Darko, Mashrur Chowdhury

    Abstract: The environmental impacts of global warming driven by methane (CH4) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH4. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH4 and its related intensity in the affected areas. Various meteorological charact…

    Submitted 17 December, 2023; originally announced December 2023.

  2. arXiv:2307.14366  [pdf, other]

    cs.LG cs.DB

    Explainable Disparity Compensation for Efficient Fair Ranking

    Authors: Abraham Gale, Amélie Marian

    Abstract: Ranking functions that are used in decision systems often produce disparate results for different populations because of bias in the underlying data. Addressing, and compensating for, these disparate outcomes is a critical problem for fair decision-making. Recent compensatory measures have mostly focused on opaque transformations of the ranking functions to satisfy fairness guarantees or on the us…

    Submitted 19 April, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: 22 pages, 5 figures

  3. arXiv:2306.06528  [pdf, other]

    cs.LG cs.AI cs.PL

    Push: Concurrent Probabilistic Programming for Bayesian Deep Learning

    Authors: Daniel Huang, Chris Camaño, Jonathan Tsegaye, Jonathan Austin Gale

    Abstract: We introduce a library called Push that takes a probabilistic programming approach to Bayesian deep learning (BDL). This library enables concurrent execution of BDL inference algorithms on multi-GPU hardware for neural network (NN) models. To accomplish this, Push introduces an abstraction that represents an input NN as a particle. Push enables easy creation of particles so that an input NN can be…

    Submitted 29 September, 2023; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: preprint

  4. arXiv:1809.01604  [pdf, other]

    cs.LG cs.AI stat.ML

    Merging datasets through deep learning

    Authors: Kavitha Srinivas, Abraham Gale, Julian Dolby

    Abstract: Merging datasets is a key operation for data analytics. A frequent requirement for merging is joining across columns that have different surface forms for the same entity (e.g., the name of a person might be represented as "Douglas Adams" or "Adams, Douglas"). Similarly, ontology alignment can require recognizing distinct surface forms of the same entity, especially when ontologies are independent…

    Submitted 5 September, 2018; originally announced September 2018.

  5. Tagging French Without Lexical Probabilities -- Combining Linguistic Knowledge And Statistical Learning

    Authors: Evelyne Tzoukermann, Dragomir R. Radev, William A. Gale

    Abstract: This paper explores morpho-syntactic ambiguities for French to develop a strategy for part-of-speech disambiguation that a) reflects the complexity of French as an inflected language, b) optimizes the estimation of probabilities, c) allows the user flexibility in choosing a tagset. The problem in extracting lexical probabilities from a limited training corpus is that the statistical model may no…

    Submitted 10 October, 1997; originally announced October 1997.

    Comments: uses ypsfig

  6. arXiv:cmp-lg/9407020  [pdf, ps]

    cs.CL

    A Sequential Algorithm for Training Text Classifiers

    Authors: David D. Lewis, William A. Gale

    Abstract: The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was developed and tested on a newswire text categorization task. This method, which we call uncertain…

    Submitted 24 July, 1994; v1 submitted 24 July, 1994; originally announced July 1994.

    Comments: 10 pages, uuencoded, compressed PostScript; Proc. SIGIR-94 LaTex available from lewis@research.att.com