Showing 1–32 of 32 results for author: Andersen, R

Searching in archive cs. Search in all archives.
  1. arXiv:2408.12270  [pdf, other]

    cs.LG cs.AI

    Variance reduction of diffusion model's gradients with Taylor approximation-based control variate

    Authors: Paul Jeha, Will Grathwohl, Michael Riis Andersen, Carl Henrik Ek, Jes Frellsen

    Abstract: Score-based models, trained with denoising score matching, are remarkably effective in generating high dimensional data. However, the high variance of their training objective hinders optimisation. We attempt to reduce it with a control variate, derived via a $k$-th order Taylor expansion on the training objective and its gradient. We prove an equivalence between the two and demonstrate empiricall…

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 14 pages, ICML Structured Probabilistic Inference & Generative Modeling 2024

  2. arXiv:2404.04268  [pdf]

    cs.IR cs.AI cs.CY cs.SI

    The Use of Generative Search Engines for Knowledge Work and Complex Tasks

    Authors: Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang

    Abstract: Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine.…

    Submitted 19 March, 2024; originally announced April 2024.

    Comments: 32 pages, 3 figures, 4 tables

    ACM Class: J.4

  3. arXiv:2403.12388  [pdf, other]

    cs.IR cs.AI

    Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models

    Authors: Ying-Chun Lin, Jennifer Neville, Jack W. Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan

    Abstract: Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featur…

    Submitted 8 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  4. arXiv:2403.12173  [pdf, other]

    cs.CL cs.AI cs.IR

    TnT-LLM: Text Mining at Scale with Large Language Models

    Authors: Mengting Wan, Tara Safavi, Sujay Kumar Jauhar, Yujin Kim, Scott Counts, Jennifer Neville, Siddharth Suri, Chirag Shah, Ryen W White, Longqi Yang, Reid Andersen, Georg Buscher, Dhruv Joshi, Nagu Rangan

    Abstract: Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. Thi…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: 9 pages main content, 8 pages references and appendix

  5. arXiv:2312.08805  [pdf, other]

    cs.RO cs.CV

    Zoom in on the Plant: Fine-grained Analysis of Leaf, Stem and Vein Instances

    Authors: Ronja Güldenring, Rasmus Eckholdt Andersen, Lazaros Nalpantidis

    Abstract: Robot perception is far from what humans are capable of. Humans do not only have a complex semantic scene understanding but also extract fine-grained intra-object properties for the salient ones. When humans look at plants, they naturally perceive the plant architecture with its individual leaves and branching system. In this work, we want to advance the granularity in plant understanding for agri…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: Accepted at Robotics and Automation Letters (RA-L)

  6. arXiv:2311.09389  [pdf, other]

    cs.CL cs.LG

    Neural machine translation for automated feedback on children's early-stage writing

    Authors: Jonas Vestergaard Jensen, Mikkel Jordahn, Michael Riis Andersen

    Abstract: In this work, we address the problem of assessing and constructing feedback for early-stage writing automatically using machine learning. Early-stage writing is typically vastly different from conventional writing due to phonetic spelling and lack of proper grammar, punctuation, spacing etc. Consequently, early-stage writing is highly non-trivial to analyze using common linguistic metrics. We prop…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 9 pages, 1 figure, 1 table, to be published in the proceedings of the Northern Lights Deep Learning Conference 2024

    ACM Class: I.2.7

  7. arXiv:2309.13063  [pdf, other]

    cs.IR cs.AI cs.CL

    Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies

    Authors: Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Scott Counts, Sarkar Snigdha Sarathi Das, Ali Montazer, Sathish Manivannan, Jennifer Neville, Xiaochuan Ni, Nagu Rangan, Tara Safavi, Siddharth Suri, Mengting Wan, Leijie Wang, Longqi Yang

    Abstract: Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics.…

    Submitted 9 May, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Report number: MSR-TR-2023-32

  8. arXiv:2309.08827  [pdf, other]

    cs.CL cs.AI

    S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs

    Authors: Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi

    Abstract: The traditional Dialogue State Tracking (DST) problem aims to track user preferences and intents in user-agent conversations. While sufficient for task-oriented dialogue systems supporting narrow domain applications, the advent of Large Language Model (LLM)-based chat systems has introduced many real-world intricacies in open-domain dialogues. These intricacies manifest in the form of increased co…

    Submitted 15 September, 2023; originally announced September 2023.

  9. arXiv:2309.04607  [pdf]

    cs.CL cs.AI

    Linking Symptom Inventories using Semantic Textual Similarity

    Authors: Eamonn Kennedy, Shashank Vadlamani, Hannah M Lindsey, Kelly S Peterson, Kristen Dams OConnor, Kenton Murray, Ronak Agarwal, Houshang H Amiri, Raeda K Andersen, Talin Babikian, David A Baron, Erin D Bigler, Karen Caeyenberghs, Lisa Delano-Wood, Seth G Disner, Ekaterina Dobryakova, Blessen C Eapen, Rachel M Edelstein, Carrie Esopenko, Helen M Genova, Elbert Geuze, Naomi J Goodrich-Hunsaker, Jordan Grafman, Asta K Haberg, Cooper B Hodges, et al. (57 additional authors not shown)

    Abstract: An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores…

    Submitted 8 September, 2023; originally announced September 2023.

  10. arXiv:2304.04048  [pdf, other]

    cs.CV cs.LG

    Polygonizer: An auto-regressive building delineator

    Authors: Maxim Khomiakov, Michael Riis Andersen, Jes Frellsen

    Abstract: In geospatial planning, it is often essential to represent objects in a vectorized format, as this format easily translates to downstream tasks such as web development, graphics, or design. While these problems are frequently addressed using semantic segmentation, which requires additional post-processing to vectorize objects in a non-trivial way, we present an Image-to-Sequence model that allows…

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: ICLR 2023 Workshop on Machine Learning in Remote Sensing

  11. arXiv:2303.11215  [pdf, other]

    cs.CV cs.LG

    Learning to Generate 3D Representations of Building Roofs Using Single-View Aerial Imagery

    Authors: Maxim Khomiakov, Alejandro Valverde Mahou, Alba Reinders Sánchez, Jes Frellsen, Michael Riis Andersen

    Abstract: We present a novel pipeline for learning the conditional distribution of a building roof mesh given pixels from an aerial image, under the assumption that roof geometry follows a set of regular patterns. Unlike alternative methods that require multiple images of the same object, our approach enables estimating 3D roof meshes using only a single image for predictions. The approach employs the PolyG…

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  12. arXiv:2301.05983  [pdf, other]

    stat.ML cs.LG

    On the role of Model Uncertainties in Bayesian Optimization

    Authors: Jonathan Foldager, Mikkel Jordahn, Lars Kai Hansen, Michael Riis Andersen

    Abstract: Bayesian optimization (BO) is a popular method for black-box optimization, which relies on uncertainty as part of its decision-making process when deciding which experiment to perform next. However, not much work has addressed the effect of uncertainty on the performance of the BO algorithm and to what extent calibrated uncertainties improve the ability to find the global optimum. In this work, we…

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 14 pages, 4 figures, 2 tables

  13. arXiv:2212.01260  [pdf, other]

    cs.CV cs.LG

    SolarDK: A high-resolution urban solar panel image classification and localization dataset

    Authors: Maxim Khomiakov, Julius Holbech Radzikowski, Carl Anton Schmidt, Mathias Bonde Sørensen, Mads Andersen, Michael Riis Andersen, Jes Frellsen

    Abstract: The body of research on classification of solar panel arrays from aerial imagery is increasing, yet there are still not many public benchmark datasets. This paper introduces two novel benchmark datasets for classifying and localizing solar panel arrays in Denmark: A human annotated dataset for classification and segmentation, as well as a classification dataset acquired using self-reported data fr…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 7 pages, 2 figures, to access the dataset, see https://osf.io/aj539/

  14. arXiv:2203.15945  [pdf, other]

    stat.ML cs.LG stat.ME

    A Framework for Improving the Reliability of Black-box Variational Inference

    Authors: Manushi Welandawe, Michael Riis Andersen, Aki Vehtari, Jonathan H. Huggins

    Abstract: Black-box variational inference (BBVI) now sees widespread use in machine learning and statistics as a fast yet flexible alternative to Markov chain Monte Carlo methods for approximate Bayesian inference. However, stochastic optimization methods for BBVI remain unreliable and require substantial expertise and hand-tuning to apply effectively. In this paper, we propose Robust and Automated Black-bo…

    Submitted 16 May, 2024; v1 submitted 29 March, 2022; originally announced March 2022.

  15. arXiv:2202.03268  [pdf, other]

    eess.SP cs.AI stat.AP

    Cyber-resilience for marine navigation by information fusion and change detection

    Authors: Dimitrios Dagdilelis, Mogens Blanke, Rasmus Hjorth Andersen, Roberto Galeazzi

    Abstract: Cyber-resilience is an increasing concern in developing autonomous navigation solutions for marine vessels. This paper scrutinizes cyber-resilience properties of marine navigation through a prism with three edges: multiple sensor information fusion, diagnosis of not-normal behaviours, and change detection. It proposes a two-stage estimator for diagnosis and mitigation of sensor signals used for co…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: 18 pages, 21 figures

    ACM Class: G.3; I.2; I.4; I.5

  16. arXiv:2103.01085  [pdf, other]

    cs.LG stat.ME stat.ML

    Challenges and Opportunities in High-dimensional Variational Inference

    Authors: Akash Kumar Dhaka, Alejandro Catalina, Manushi Welandawe, Michael Riis Andersen, Jonathan Huggins, Aki Vehtari

    Abstract: Current black-box variational inference (BBVI) methods require the user to make numerous design choices -- such as the selection of variational objective and approximating family -- yet there is little principled guidance on how to do so. We develop a conceptual framework and set of experimental tools to understand the effects of these choices, which we leverage to propose best practices for maxim…

    Submitted 30 June, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

  17. arXiv:2009.00666  [pdf, other]

    cs.LG stat.ME stat.ML

    Robust, Accurate Stochastic Optimization for Variational Inference

    Authors: Akash Kumar Dhaka, Alejandro Catalina, Michael Riis Andersen, Måns Magnusson, Jonathan H. Huggins, Aki Vehtari

    Abstract: We consider the problem of fitting variational posterior approximations using stochastic optimization methods. The performance of these approximations depends on (1) how well the variational family matches the true posterior distribution, (2) the choice of divergence, and (3) the optimization of the variational objective. We show that even in the best-case scenario when the exact posterior belongs…

    Submitted 3 September, 2020; v1 submitted 1 September, 2020; originally announced September 2020.

    Journal ref: NeurIPS 2020

  18. arXiv:2007.05994  [pdf, other]

    stat.ML cs.LG

    State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes

    Authors: William J. Wilkinson, Paul E. Chang, Michael Riis Andersen, Arno Solin

    Abstract: We formulate approximate Bayesian inference in non-conjugate temporal and spatio-temporal Gaussian process models as a simple parameter update rule applied during Kalman smoothing. This viewpoint encompasses most inference schemes, including expectation propagation (EP), the classical (Extended, Unscented, etc.) Kalman smoothers, and variational inference. We provide a unifying perspective on thes…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2020

  19. arXiv:2003.11435  [pdf, other]

    cs.LG stat.ML

    Preferential Batch Bayesian Optimization

    Authors: Eero Siivola, Akash Kumar Dhaka, Michael Riis Andersen, Javier Gonzalez, Pablo Garcia Moreno, Aki Vehtari

    Abstract: Most research in Bayesian optimization (BO) has focused on \emph{direct feedback} scenarios, where one has access to exact values of some expensive-to-evaluate objective. This direction has been mainly driven by the use of BO in machine learning hyper-parameter configuration problems. However, in domains such as modelling human preferences, A/B tests, or recommender systems, there is a need for me…

    Submitted 31 August, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: 6 pages + 7 pages in supplementary material

  20. arXiv:1904.10679  [pdf, other]

    stat.ML cs.LG

    Bayesian leave-one-out cross-validation for large data

    Authors: Måns Magnusson, Michael Riis Andersen, Johan Jonasson, Aki Vehtari

    Abstract: Model inference, such as model comparison, model checking, and model selection, is an important part of model development. Leave-one-out cross-validation (LOO) is a general approach for assessing the generalizability of a model, but unfortunately, LOO does not scale well to large datasets. We propose a combination of using approximate inference techniques and probability-proportional-to-size-sampl…

    Submitted 24 April, 2019; originally announced April 2019.

    Comments: Accepted to ICML 2019. This version is the submitted paper

    Journal ref: Thirty-sixth International Conference on Machine Learning, PMLR 97:4244-4253, 2019

  21. arXiv:1901.11436  [pdf, other]

    stat.ML cs.LG cs.SD eess.AS eess.SP

    End-to-End Probabilistic Inference for Nonstationary Audio Analysis

    Authors: William J. Wilkinson, Michael Riis Andersen, Joshua D. Reiss, Dan Stowell, Arno Solin

    Abstract: A typical audio signal processing pipeline includes multiple disjoint analysis stages, including calculation of a time-frequency representation followed by spectrogram-based feature analysis. We show how time-frequency analysis and nonnegative matrix factorisation can be jointly formulated as a spectral mixture Gaussian process model with nonstationary priors over the amplitude variance parameters…

    Submitted 27 April, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

    Comments: Accepted to the Thirty-sixth International Conference on Machine Learning (ICML) 2019

  22. arXiv:1811.02489  [pdf, other]

    eess.SP cs.LG cs.SD eess.AS stat.ML

    Unifying Probabilistic Models for Time-Frequency Analysis

    Authors: William J. Wilkinson, Michael Riis Andersen, Joshua D. Reiss, Dan Stowell, Arno Solin

    Abstract: In audio signal processing, probabilistic time-frequency models have many benefits over their non-probabilistic counterparts. They adapt to the incoming signal, quantify uncertainty, and measure correlation between the signal's amplitude and phase information, making time domain resynthesis straightforward. However, these models are still not widely used since they come at a high computational cos…

    Submitted 12 February, 2019; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019

  23. Interactive Cost Configuration Over Decision Diagrams

    Authors: Henrik Reif Andersen, Tarik Hadzic, David Pisinger

    Abstract: In many AI domains such as product configuration, a user should interactively specify a solution that must satisfy a set of constraints. In such scenarios, offline compilation of feasible solutions into a tractable representation is an important approach to delivering efficient backtrack-free user interaction online. In particular, binary decision diagrams (BDDs) have been successfully used as a…

    Submitted 15 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 37, pages 99-139, 2010

  24. arXiv:0907.3631  [pdf, ps, other]

    cs.DS cs.DM

    Interchanging distance and capacity in probabilistic mappings

    Authors: Reid Andersen, Uriel Feige

    Abstract: Harald Racke [STOC 2008] described a new method to obtain hierarchical decompositions of networks in a way that minimizes the congestion. Racke's approach is based on an equivalence that he discovered between minimizing congestion and minimizing stretch (in a certain setting). Here we present Racke's equivalence in an abstract setting that is more general than the one described in Racke's work,…

    Submitted 21 July, 2009; originally announced July 2009.

    Comments: 16 pages, no figures

    ACM Class: F.2.2; G.2.2

  25. arXiv:0811.3779  [pdf, ps, other]

    cs.DS

    Finding Sparse Cuts Locally Using Evolving Sets

    Authors: Reid Andersen, Yuval Peres

    Abstract: A {\em local graph partitioning algorithm} finds a set of vertices with small conductance (i.e. a sparse cut) by adaptively exploring part of a large graph $G$, starting from a specified vertex. For the algorithm to be local, its complexity must be bounded in terms of the size of the set that it outputs, with at most a weak dependence on the number $n$ of vertices in $G$. Previous local partitio…

    Submitted 23 November, 2008; originally announced November 2008.

    Comments: 20 pages, no figures

    ACM Class: F.2.2

  26. arXiv:0705.4604  [pdf, other]

    cs.LO

    Temporal Runtime Verification using Monadic Difference Logic

    Authors: Henrik Reif Andersen, Kaare J. Kristoffersen

    Abstract: In this paper we present an algorithm for performing runtime verification of a bounded temporal logic over timed runs. The algorithm consists of three elements. First, the bounded temporal formula to be verified is translated into a monadic first-order logic over difference inequalities, which we call monadic difference logic. Second, at each step of the timed run, the monadic difference formula…

    Submitted 31 May, 2007; originally announced May 2007.

    ACM Class: D.2.4; D.2.5

  27. arXiv:0704.1394  [pdf, ps, other]

    cs.AI

    Calculating Valid Domains for BDD-Based Interactive Configuration

    Authors: Tarik Hadzic, Rune Moller Jensen, Henrik Reif Andersen

    Abstract: In these notes we formally describe the functionality of Calculating Valid Domains from the BDD representing the solution space of valid configurations. The formalization is largely based on the CLab configuration framework.

    Submitted 11 April, 2007; originally announced April 2007.

  28. arXiv:cs/0702170  [pdf, ps, other]

    cs.AI

    Generic Global Constraints based on MDDs

    Authors: Peter Tiedemann, Henrik Reif Andersen, Rasmus Pagh

    Abstract: Constraint Programming (CP) has been successfully applied to both constraint satisfaction and constraint optimization problems. A wide variety of specialized global constraints provide critical assistance in achieving a good model that can take advantage of the structure of the problem in the search for a solution. However, a key outstanding issue is the representation of 'ad-hoc' constraints th…

    Submitted 28 February, 2007; originally announced February 2007.

    Comments: Preliminary 15 pages version of the tech-report cs.AI/0611141

  29. arXiv:cs/0702078  [pdf, ps, other]

    cs.DS cs.CC

    A Local Algorithm for Finding Dense Subgraphs

    Authors: Reid Andersen

    Abstract: We present a local algorithm for finding dense subgraphs of bipartite graphs, according to the definition of density proposed by Kannan and Vinay. Our algorithm takes as input a bipartite graph with a specified starting vertex, and attempts to find a dense subgraph near that vertex. We prove that for any subgraph S with k vertices and density theta, there are a significant number of starting ver…

    Submitted 13 February, 2007; originally announced February 2007.

    Comments: 14 pages, no figures

    ACM Class: F.2.2; G.2.2

  30. arXiv:cs/0702032  [pdf, ps, other]

    cs.DS

    Finding large and small dense subgraphs

    Authors: Reid Andersen

    Abstract: We consider two optimization problems related to finding dense subgraphs. The densest at-least-k-subgraph problem (DalkS) is to find an induced subgraph of highest average degree among all subgraphs with at least k vertices, and the densest at-most-k-subgraph problem (DamkS) is defined similarly. These problems are related to the well-known densest k-subgraph problem (DkS), which is to find the…

    Submitted 5 February, 2007; originally announced February 2007.

    Comments: 12 pages, no figures

    ACM Class: F.2.2; G.2.2

  31. arXiv:cs/0612068  [pdf, ps, other]

    cs.AI

    Interactive Configuration by Regular String Constraints

    Authors: Esben Rune Hansen, Henrik Reif Andersen

    Abstract: A product configurator which is complete, backtrack free and able to compute the valid domains at any state of the configuration can be constructed by building a Binary Decision Diagram (BDD). Despite the fact that the size of the BDD is exponential in the number of variables in the worst case, BDDs have proved to work very well in practice. Current BDD-based techniques can only handle interacti…

    Submitted 12 December, 2006; originally announced December 2006.

    Comments: Tech Report

  32. arXiv:cs/0611141  [pdf, ps, other]

    cs.AI

    A Generic Global Constraint based on MDDs

    Authors: Peter Tiedemann, Henrik Reif Andersen, Rasmus Pagh

    Abstract: The paper suggests the use of Multi-Valued Decision Diagrams (MDDs) as the supporting data structure for a generic global constraint. We give an algorithm for maintaining generalized arc consistency (GAC) on this constraint that amortizes the cost of the GAC computation over a root-to-terminal path in the search tree. The technique used is an extension of the GAC algorithm for the regular langua…

    Submitted 28 November, 2006; originally announced November 2006.

    Comments: Tech report, 31 pages, 3 figures