Showing 1–50 of 72 results for author: Gray, A

Searching in archive cs.
  1. arXiv:2406.16106  [pdf, other]

    cs.IR cs.AI

    Evaluating Ensemble Methods for News Recommender Systems

    Authors: Alexander Gray, Noorhan Abbas

    Abstract: News recommendation is crucial for facilitating individuals' access to articles, particularly amid the increasingly digital landscape of news consumption. Consequently, extensive research is dedicated to News Recommender Systems (NRS) with increasingly sophisticated algorithms. Despite this sustained scholarly inquiry, there exists a notable research gap regarding the potential synergy achievable…

    Submitted 23 June, 2024; originally announced June 2024.

  2. arXiv:2406.14483  [pdf, other]

    cs.LG

    Valid Error Bars for Neural Weather Models using Conformal Prediction

    Authors: Vignesh Gopakumar, Joel Oskarrson, Ander Gray, Lorenzo Zanisi, Stanislas Pamela, Daniel Giles, Matt Kusner, Marc Deisenroth

    Abstract: Neural weather models have shown immense potential as inexpensive and accurate alternatives to physics-based models. However, most models trained to perform weather forecasting do not quantify the uncertainty associated with their forecasts. This limits the trust in the model and the usefulness of the forecasts. In this work we construct and formalise a conformal prediction framework as a post-pro…

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2405.02350  [pdf, ps, other]

    cs.LG cs.AI

    What makes Models Compositional? A Theoretical View: With Supplement

    Authors: Parikshit Ram, Tim Klinger, Alexander G. Gray

    Abstract: Compositionality is thought to be a key component of language, and various compositional benchmarks have been developed to empirically probe the compositional generalization of existing sequence processing models. These benchmarks often highlight failures of existing models, but it is not clear why these models fail in this way. In this paper, we seek to theoretically understand the role the compo…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Extended version of the original IJCAI 2024 paper with detailed supplementary materials (27 pages, 7 figures)

  4. arXiv:2403.16887  [pdf]

    cs.DL

    ChatGPT "contamination": estimating the prevalence of LLMs in the scholarly literature

    Authors: Andrew Gray

    Abstract: The use of ChatGPT and similar Large Language Model (LLM) tools in scholarly communication and academic publishing has been widely discussed since they became easily accessible to a general audience in late 2022. This study uses keywords known to be disproportionately present in LLM-generated text to provide an overall estimate for the prevalence of LLM-assisted writing in the scholarly literature…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 12 pages, 6 figures

  5. arXiv:2402.13440  [pdf, other]

    cs.AI cs.NE

    A Neuro-Symbolic Approach to Multi-Agent RL for Interpretability and Probabilistic Decision Making

    Authors: Chitra Subramanian, Miao Liu, Naweed Khan, Jonathan Lenchner, Aporva Amarnath, Sarathkrishna Swaminathan, Ryan Riegel, Alexander Gray

    Abstract: Multi-agent reinforcement learning (MARL) is well-suited for runtime decision-making in optimizing the performance of systems where multiple agents coexist and compete for shared resources. However, applying common deep learning-based MARL solutions to real-world problems suffers from issues of interpretability, sample efficiency, partial observability, etc. To address these challenges, we present…

    Submitted 20 February, 2024; originally announced February 2024.

    ACM Class: I.2.6

  6. arXiv:2311.05967  [pdf, other]

    physics.plasm-ph cs.LG

    Plasma Surrogate Modelling using Fourier Neural Operators

    Authors: Vignesh Gopakumar, Stanislas Pamela, Lorenzo Zanisi, Zongyi Li, Ander Gray, Daniel Brennand, Nitesh Bhatia, Gregory Stathopoulos, Matt Kusner, Marc Peter Deisenroth, Anima Anandkumar, JOREK Team, MAST Team

    Abstract: Predicting plasma evolution within a Tokamak reactor is crucial to realizing the goal of sustainable fusion. Capabilities in forecasting the spatio-temporal evolution of plasma rapidly and accurately allow us to quickly iterate over design and control strategies on current Tokamak devices and future reactors. Modelling plasma evolution using numerical solvers is often expensive, consuming many hou…

    Submitted 18 June, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Journal ref: Nucl. Fusion 64 056025 (2024)

  7. arXiv:2309.16467  [pdf, other]

    cs.LG

    Compositional Program Generation for Few-Shot Systematic Generalization

    Authors: Tim Klinger, Luke Liu, Soham Dan, Maxwell Crouse, Parikshit Ram, Alexander Gray

    Abstract: Compositional generalization is a key ability of humans that enables us to learn new concepts from only a handful of examples. Neural machine learning models, including the now ubiquitous Transformers, struggle to generalize in this way, and typically require thousands of examples of a concept during training in order to generalize meaningfully. This difference in ability between humans and artificia…

    Submitted 18 January, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: 7 pages of text with 1 page of references

  8. arXiv:2308.13292  [pdf, other]

    cs.LG cs.CY cs.IR

    A Bayesian Active Learning Approach to Comparative Judgement

    Authors: Andy Gray, Alma Rahat, Tom Crick, Stephen Lindsay

    Abstract: Assessment is a crucial part of education. Traditional marking is a source of inconsistencies and unconscious bias, placing a high cognitive load on the assessors. An approach to address these issues is comparative judgement (CJ). In CJ, the assessor is presented with a pair of items and is asked to select the better one. Following a series of comparisons, a rank is derived using a ranking model,…

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: 16 pages

  9. arXiv:2307.02689  [pdf, other]

    cs.CL

    Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning

    Authors: Subhajit Chaudhury, Sarathkrishna Swaminathan, Daiki Kimura, Prithviraj Sen, Keerthiram Murugesan, Rosario Uceda-Sosa, Michiaki Tatsubori, Achille Fokoue, Pavan Kapanipathi, Asim Munawar, Alexander Gray

    Abstract: Text-based reinforcement learning agents have predominantly been neural network-based models with embeddings-based representation, learning uninterpretable policies that often do not generalize well to unseen games. On the other hand, neuro-symbolic methods, specifically those that leverage an intermediate formal representation, are gaining significant attention in language understanding tasks. Th…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: ACL 2023

  10. arXiv:2306.15041  [pdf]

    q-bio.QM cs.DB

    A Comparison of Neuroelectrophysiology Databases

    Authors: Priyanka Subash, Alex Gray, Misque Boswell, Samantha L. Cohen, Rachael Garner, Sana Salehi, Calvary Fisher, Samuel Hobel, Satrajit Ghosh, Yaroslav Halchenko, Benjamin Dichter, Russell A. Poldrack, Chris Markiewicz, Dora Hermes, Arnaud Delorme, Scott Makeig, Brendan Behan, Alana Sparks, Stephen R Arnott, Zhengjia Wang, John Magnotti, Michael S. Beauchamp, Nader Pouratian, Arthur W. Toga, Dominique Duncan

    Abstract: As data sharing has become more prevalent, three pillars - archives, standards, and analysis tools - have emerged as critical components in facilitating effective data sharing and collaboration. This paper compares four freely available intracranial neuroelectrophysiology data repositories: Data Archive for the BRAIN Initiative (DABI), Distributed Archives for Neurophysiology Data Integration (DAN…

    Submitted 30 August, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 22 pages, 6 figures, 5 tables

  11. Can GPT-4 Support Analysis of Textual Data in Tasks Requiring Highly Specialized Domain Expertise?

    Authors: Jaromir Savelka, Kevin D. Ashley, Morgan A Gray, Hannes Westermann, Huihui Xu

    Abstract: We evaluated the capability of generative pre-trained transformers (GPT-4) in analysis of textual data in tasks that require highly specialized domain expertise. Specifically, we focused on the task of analyzing court opinions to interpret legal concepts. We found that GPT-4, prompted with annotation guidelines, performs on par with well-trained law student annotators. We observed that, with a rel…

    Submitted 24 June, 2023; originally announced June 2023.

    Journal ref: ITiCSE 2023: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1. June 2023. Pages 117 - 123

  12. arXiv:2306.10452  [pdf, other]

    cs.CL

    MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

    Authors: Keerthiram Murugesan, Sarathkrishna Swaminathan, Soham Dan, Subhajit Chaudhury, Chulaka Gunasekara, Maxwell Crouse, Diwakar Mahajan, Ibrahim Abdelaziz, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: With the growing interest in large language models, the need for evaluating the quality of machine text compared to reference (typically human-generated) text has become focal attention. Most recent works focus either on task-specific evaluation metrics or study the properties of machine-generated text captured by the existing metrics. In this work, we propose a new evaluation scheme to model huma…

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL 2023 (ACL Findings Long)

  13. arXiv:2306.09525  [pdf, other]

    cs.CL cs.AI

    Explaining Legal Concepts with Augmented Large Language Models (GPT-4)

    Authors: Jaromir Savelka, Kevin D. Ashley, Morgan A. Gray, Hannes Westermann, Huihui Xu

    Abstract: Interpreting the meaning of legal open-textured terms is a key task of legal professionals. An important source for this interpretation is how the term was applied in previous court cases. In this paper, we evaluate the performance of GPT-4 in generating factually accurate, clear and relevant explanations of terms in legislation. We compare the performance of a baseline setup, where GPT-4 is direc…

    Submitted 22 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  14. arXiv:2305.20018  [pdf, other]

    cs.CL cs.AI

    Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency

    Authors: Maxwell Crouse, Ramon Astudillo, Tahira Naseem, Subhajit Chaudhury, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: We introduce Logical Offline Cycle Consistency Optimization (LOCCO), a scalable, semi-supervised method for training a neural semantic parser. Conceptually, LOCCO can be viewed as a form of self-learning where the semantic parser being trained is used to generate annotations for unlabeled text that are then used as new supervision. To increase the quality of annotations, our method utilizes a coun…

    Submitted 31 May, 2023; originally announced May 2023.

  15. arXiv:2305.15022  [pdf, other]

    stat.ML cs.LG

    Hierarchical clustering with dot products recovers hidden tree structure

    Authors: Annie Gray, Alexander Modell, Patrick Rubin-Delanchy, Nick Whiteley

    Abstract: In this paper we offer a new perspective on the well established agglomerative clustering algorithm, focusing on recovery of hierarchical structure. We recommend a simple variant of the standard algorithm, in which clusters are merged by maximum average dot product and not, for example, by minimum distance or within-cluster variance. We demonstrate that the tree output by this algorithm provides a…

    Submitted 1 March, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

  16. arXiv:2301.10414  [pdf, other]

    cs.IT cs.LO

    Towards a Unification of Logic and Information Theory

    Authors: Luis A. Lastras, Barry Trager, Jonathan Lenchner, Wojtek Szpankowski, Chai Wah Wu, Mark Squillante, Alex Gray

    Abstract: This article introduces a theory of communication that covers the following generic scenario: Alice knows more than Bob about a certain set of logic propositions and Alice and Bob wish to communicate as efficiently as possible with the shared goal that, following their communication, Bob should be able to deduce a particular logic proposition that Alice knows to be true. We assume that our logic…

    Submitted 16 April, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

  17. arXiv:2301.05131  [pdf, other]

    cs.LG

    Toward Theoretical Guidance for Two Common Questions in Practical Cross-Validation based Hyperparameter Selection

    Authors: Parikshit Ram, Alexander G. Gray, Horst C. Samulowitz, Gregory Bramble

    Abstract: We show, to our knowledge, the first theoretical treatments of two common questions in cross-validation based hyperparameter selection: (1) After selecting the best hyperparameter using a held-out set, we train the final model using all of the training data -- since this may or may not improve future generalization error, should one do this? (2) During optimization such as via SGD (stochasti…

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Extended version of the paper appearing at the SIAM International Conference on Data Mining 2023 (SDM23)

  18. arXiv:2208.11665  [pdf, other]

    stat.ME cs.LG stat.ML

    Statistical exploration of the Manifold Hypothesis

    Authors: Nick Whiteley, Annie Gray, Patrick Rubin-Delanchy

    Abstract: The Manifold Hypothesis is a widely accepted tenet of Machine Learning which asserts that nominally high-dimensional data are in fact concentrated near a low-dimensional manifold, embedded in high-dimensional space. This phenomenon is observed empirically in many real world situations, has led to development of a wide range of statistical methods in the last few decades, and has been suggested as…

    Submitted 9 February, 2024; v1 submitted 24 August, 2022; originally announced August 2022.

    MSC Class: 62R20; 62R40; 62G05; 62G20; 62R07; 62-08; 62H25; 62H30

  19. arXiv:2204.01805  [pdf]

    cs.HC

    Using Elo Rating as a Metric for Comparative Judgement in Educational Assessment

    Authors: Andy Gray, Alma Rahat, Tom Crick, Stephen Lindsay, Darren Wallace

    Abstract: Marking and feedback are essential features of teaching and learning, across the overwhelming majority of educational settings and contexts. However, it can take a great deal of time and effort for teachers to mark assessments, and to provide useful feedback to the students. Furthermore, it also creates a significant cognitive load on the assessors, especially in ensuring fairness and equity. Ther…

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: 12 pages, 4 figures, one table, pre-review version

  20. arXiv:2201.05793  [pdf, other]

    cs.CL cs.AI

    A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most existing KBQA datasets focus primarily on generic multi-hop reasoning over explicit facts, largely ignoring other reasoning types such as temporal, spatial, and taxonomic reasoning. In this paper, we present a benchmark dataset for temporal reasoning, TempQA-…

    Submitted 15 January, 2022; originally announced January 2022.

    Comments: 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430

  21. A Simple Standard for Sharing Ontological Mappings (SSSOM)

    Authors: Nicolas Matentzoglu, James P. Balhoff, Susan M. Bello, Chris Bizon, Matthew Brush, Tiffany J. Callahan, Christopher G Chute, William D. Duncan, Chris T. Evelo, Davera Gabriel, John Graybeal, Alasdair Gray, Benjamin M. Gyori, Melissa Haendel, Henriette Harmse, Nomi L. Harris, Ian Harrow, Harshad Hegde, Amelia L. Hoyt, Charles T. Hoyt, Dazhi Jiao, Ernesto Jiménez-Ruiz, Simon Jupp, Hyeongsik Kim, Sebastian Koehler, et al. (19 additional authors not shown)

    Abstract: Despite progress in the development of standards for describing and exchanging scientific information, the lack of easy-to-use standards for mapping between different representations of the same or similar objects in different databases poses a major impediment to data integration and interoperability. Mappings often lack the metadata needed to be correctly interpreted and applied. For example, ar…

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: Corresponding author: Christopher J. Mungall <cjmungall@lbl.gov>

  22. arXiv:2112.03324  [pdf, other]

    cs.AI cs.LG cs.LO cs.SC

    Neuro-Symbolic Inductive Logic Programming with Logical Neural Networks

    Authors: Prithviraj Sen, Breno W. S. R. de Carvalho, Ryan Riegel, Alexander Gray

    Abstract: Recent work on neuro-symbolic inductive logic programming has led to promising approaches that can learn explanatory rules from noisy, real-world data. While some proposals approximate logical operators with differentiable operators from fuzzy or real-valued logic that are parameter-free thus diminishing their capacity to fit the data, other approaches are only loosely based on logic making it dif…

    Submitted 6 December, 2021; originally announced December 2021.

  23. arXiv:2110.10973  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    LOA: Logical Optimal Actions for Text-based Interaction Games

    Authors: Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray

    Abstract: We present Logical Optimal Actions (LOA), an action decision architecture of reinforcement learning applications with a neuro-symbolic framework which is a combination of neural network and symbolic knowledge acquisition approach for natural language interaction games. The demonstration for LOA experiments consists of a web-based interactive platform for text-based games and visualization for acqu…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: ACL-IJCNLP 2021 (demo paper)

  24. arXiv:2110.10963  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    Neuro-Symbolic Reinforcement Learning with First-Order Logic

    Authors: Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray

    Abstract: Deep reinforcement learning (RL) methods often require many trials before convergence, and no direct interpretability of trained policies is provided. In order to achieve fast convergence and interpretability for the policy in RL, we propose a novel RL method for text-based games with a recent neuro-symbolic framework called Logical Neural Network, which can learn symbolic and interpretable rules…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 (main conference)

  25. arXiv:2110.01295  [pdf, other]

    cs.CL

    SPaR.txt, a cheap Shallow Parsing approach for Regulatory texts

    Authors: Ruben Kruiper, Ioannis Konstas, Alasdair Gray, Farhad Sadeghineko, Richard Watson, Bimal Kumar

    Abstract: Automated Compliance Checking (ACC) systems aim to semantically parse building regulations to a set of rules. However, semantic parsing is known to be hard and requires large amounts of training data. The complexity of creating such training data has led to research that focuses on small sub-tasks, such as shallow parsing or the extraction of a limited subset of rules. This study introduces a shal…

    Submitted 4 October, 2021; originally announced October 2021.

    Comments: To be published in the NLLP workshop at EMNLP 2021, 9 pages (15 including reference and appendices). For the ScotReg corpus, SPaR.txt dataset and code see: http://github.com/rubenkruiper/SPaR.txt

  26. arXiv:2109.13430  [pdf, other]

    cs.CL cs.AI

    SYGMA: System for Generalizable Modular Question Answering Over Knowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most KBQA systems struggle with generalizability, particularly on two dimensions: (a) across multiple reasoning types where both datasets and systems have primarily focused on multi-hop reasoning, and (b) across multiple knowledge bases, where KBQA approaches are…

    Submitted 27 September, 2021; originally announced September 2021.

  27. arXiv:2109.12240  [pdf, other]

    cs.AI cs.LO

    Logical Credal Networks

    Authors: Haifeng Qian, Radu Marinescu, Alexander Gray, Debarun Bhattacharjya, Francisco Barahona, Tian Gao, Ryan Riegel, Pravinda Sahu

    Abstract: This paper introduces Logical Credal Networks, an expressive probabilistic logic that generalizes many prior models that combine logic and probability. Given imprecise information represented by probability bounds and conditional probability bounds of logic formulas, this logic specifies a set of probability distributions over all interpretations. On the one hand, our approach allows propositional…

    Submitted 24 September, 2021; originally announced September 2021.

  28. arXiv:2109.09566  [pdf, other]

    cs.AI cs.LG cs.LO

    Combining Rules and Embeddings via Neuro-Symbolic AI for Knowledge Base Completion

    Authors: Prithviraj Sen, Breno W. S. R. Carvalho, Ibrahim Abdelaziz, Pavan Kapanipathi, Francois Luus, Salim Roukos, Alexander Gray

    Abstract: Recent interest in Knowledge Base Completion (KBC) has led to a plethora of approaches based on reinforcement learning, inductive logic programming and graph embeddings. In particular, rule-based KBC has led to interpretable rules while being comparable in performance with graph embeddings. Even within rule-based KBC, there exist different approaches that lead to rules of varying quality and previ…

    Submitted 16 September, 2021; originally announced September 2021.

  29. arXiv:2106.13367  [pdf, other]

    cs.AI cs.DB

    SeaNet -- Towards A Knowledge Graph Based Autonomic Management of Software Defined Networks

    Authors: Qianru Zhou, Alasdair J. G. Gray, Stephen McLaughlin

    Abstract: Automatic network management driven by Artificial Intelligence technologies has been heatedly discussed over decades. However, current reports mainly focus on theoretic proposals and architecture designs; works on practical implementations on real-life networks are yet to appear. This paper proposes our effort toward the implementation of knowledge graph driven approach for autonomic network manage…

    Submitted 27 May, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

  30. arXiv:2106.09795  [pdf, other]

    cs.CL cs.AI cs.SC

    LNN-EL: A Neuro-Symbolic Approach to Short-text Entity Linking

    Authors: Hang Jiang, Sairam Gurajada, Qiuhao Lu, Sumit Neelam, Lucian Popa, Prithviraj Sen, Yunyao Li, Alexander Gray

    Abstract: Entity linking (EL), the task of disambiguating mentions in text by linking them to entities in a knowledge graph, is crucial for text understanding, question answering or conversational systems. Entity linking on short text (e.g., single sentence or question) poses particular challenges due to limited context. While prior approaches use either heuristics or black-box neural methods, here we propo…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL 2021

  31. arXiv:2106.01260  [pdf, other]

    stat.ML cs.LG

    Matrix factorisation and the interpretation of geodesic distance

    Authors: Nick Whiteley, Annie Gray, Patrick Rubin-Delanchy

    Abstract: Given a graph or similarity matrix, we consider the problem of recovering a notion of true distance between the nodes, and so their true positions. We show that this can be accomplished in two steps: matrix factorisation, followed by nonlinear dimension reduction. This combination is effective because the point cloud obtained in the first step lives close to a manifold in which latent distance is…

    Submitted 22 September, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    MSC Class: 62G05; 62H20; 62H12; 62H30

  32. arXiv:2103.02363  [pdf, other]

    cs.AI

    Reinforcement Learning with External Knowledge by using Logical Neural Networks

    Authors: Daiki Kimura, Subhajit Chaudhury, Akifumi Wachi, Ryosuke Kohita, Asim Munawar, Michiaki Tatsubori, Alexander Gray

    Abstract: Conventional deep reinforcement learning methods are sample-inefficient and usually require a large number of training trials before convergence. Since such methods operate on an unconstrained action set, they can lead to useless actions. A recent neuro-symbolic framework called the Logical Neural Networks (LNNs) can simultaneously provide key-properties of both neural networks and symbolic logic.…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: KBRL Workshop at IJCAI-PRICAI 2020

  33. arXiv:2103.00418  [pdf, other]

    cs.AI cs.DB cs.LG cs.LO

    Logic Embeddings for Complex Query Answering

    Authors: Francois Luus, Prithviraj Sen, Pavan Kapanipathi, Ryan Riegel, Ndivhuwo Makondo, Thabang Lebese, Alexander Gray

    Abstract: Answering logical queries over incomplete knowledge bases is challenging because: 1) it calls for implicit link prediction, and 2) brute force answering of existential first-order logic queries is exponential in the number of existential variables. Recent work of query embeddings provides fast querying, but most approaches model set logic with closed regions, so lack negation. Query embeddings tha…

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: IBM Research

  34. arXiv:2012.01707  [pdf, other]

    cs.CL cs.AI

    Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

    Authors: Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Reddy, et al. (5 additional authors not shown)

    Abstract: Knowledge base question answering (KBQA) is an important task in Natural Language Processing. Existing approaches face significant challenges including complex question understanding, necessity for reasoning, and lack of large end-to-end training datasets. In this work, we propose Neuro-Symbolic Question Answering (NSQA), a modular KBQA system, that leverages (1) Abstract Meaning Representation (AM…

    Submitted 2 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to Findings of ACL

  35. arXiv:2011.05624  [pdf, other]

    cs.NI

    SARA -- A Semantic Access Point Resource Allocation Service for Heterogenous Wireless Networks

    Authors: Qianru Zhou, Alasdair J. G. Gray, Dimitrios Pezaros, Stephen McLaughlin

    Abstract: In this paper, we present SARA, a Semantic Access point Resource Allocation service for heterogenous wireless networks with various wireless access technologies existing together. By automatically reasoning on the knowledge base of the full system provided by a knowledge based autonomic network management system -- SEANET, SARA selects the access point providing the best quality of service among t…

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: 2019 IEEE Wireless Day

  36. Leveraging Semantic Parsing for Relation Linking over Knowledge Bases

    Authors: Nandana Mihindukulasooriya, Gaetano Rossiello, Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Mo Yu, Alfio Gliozzo, Salim Roukos, Alexander Gray

    Abstract: Knowledgebase question answering systems are heavily dependent on relation extraction and linking modules. However, the task of extracting and linking relations from text to knowledgebases faces two primary challenges; the ambiguity of natural language and lack of training data. To overcome these challenges, we present SLING, a relation linking framework which leverages semantic parsing using Abst…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: Accepted at the 19th International Semantic Web Conference (ISWC 2020)

    MSC Class: 68T35

    ACM Class: I.2.7; I.2.4

  37. arXiv:2008.02429  [pdf, ps, other]

    cs.LO cs.AI

    Foundations of Reasoning with Uncertainty via Real-valued Logics

    Authors: Ronald Fagin, Ryan Riegel, Alexander Gray

    Abstract: Real-valued logics underlie an increasing number of neuro-symbolic approaches, though typically their logical inference capabilities are characterized only qualitatively. We provide foundations for establishing the correctness and power of such systems. We give a sound and strongly complete axiomatization that can be parametrized to cover essentially every real-valued logic, including all the comm…

    Submitted 30 August, 2022; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: 12 pages (incl. references). To be submitted to PNAS

  38. arXiv:2006.13155  [pdf, other]

    cs.AI cs.LG cs.LO

    Logical Neural Networks

    Authors: Ryan Riegel, Alexander Gray, Francois Luus, Naweed Khan, Ndivhuwo Makondo, Ismail Yunus Akhalwaya, Haifeng Qian, Ronald Fagin, Francisco Barahona, Udit Sharma, Shajith Ikbal, Hima Karanam, Sumit Neelam, Ankita Likhyani, Santosh Srivastava

    Abstract: We propose a novel framework seamlessly providing key properties of both neural nets (learning) and symbolic logic (knowledge and reasoning). Every neuron has a meaning as a component of a formula in a weighted real-valued logic, yielding a highly interpretable disentangled representation. Inference is omnidirectional rather than focused on predefined target variables, and corresponds to logical re…

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: 10 pages (incl. references), 38 pages supplementary, 7 figures, 9 tables, 6 algorithms. In submission to NeurIPS 2020

  39. arXiv:2006.09635  [pdf, other]

    cs.LG math.OC stat.ML

    Solving Constrained CASH Problems with ADMM

    Authors: Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

    Abstract: The CASH problem has been widely studied in the context of automated configurations of machine learning (ML) pipelines and various solvers and toolkits are available. However, CASH solvers do not directly handle black-box constraints such as fairness, robustness or other domain-specific custom constraints. We present our recent approach [Liu, et al., 2020] that leverages the ADMM optimization fram…

    Submitted 10 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 7th ICML Workshop on Automated Machine Learning (2020)

  40. arXiv:2006.09167  [pdf, other]

    physics.comp-ph cs.DC cs.DS cs.PF

    Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS

    Authors: Szilárd Páll, Artem Zhmurov, Paul Bauer, Mark Abraham, Magnus Lundborg, Alan Gray, Berk Hess, Erik Lindahl

    Abstract: The introduction of accelerator devices such as graphics processing units (GPUs) has had profound impact on molecular dynamics simulations and has enabled order-of-magnitude performance advances using commodity hardware. To fully reap these benefits, it has been necessary to reformulate some of the most fundamental algorithms, including the Verlet list, pair searching and cut-offs. Here, we presen…

    Submitted 7 September, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: The following article has been submitted to the Journal of Chemical Physics

    ACM Class: J.2; I.6.3

  41. arXiv:1912.06723  [pdf, other]

    cs.LG cs.HC stat.ML

    AutoAIViz: Opening the Blackbox of Automated Artificial Intelligence with Conditional Parallel Coordinates

    Authors: Daniel Karl I. Weidele, Justin D. Weisz, Eno Oduor, Michael Muller, Josh Andres, Alexander Gray, Dakuo Wang

    Abstract: Artificial Intelligence (AI) can now automate the algorithm selection, feature engineering, and hyperparameter tuning steps in a machine learning workflow. Commonly known as AutoML or AutoAI, these technologies aim to relieve data scientists from the tedious manual work. However, today's AutoAI systems often present only limited to no information about the process of how they select and generate m…

    Submitted 17 January, 2020; v1 submitted 13 December, 2019; originally announced December 2019.

    Comments: 5 pages, 1 figure, IUI2020

  42. arXiv:1910.14436  [pdf, other]

    cs.AI cs.LG

    How can AI Automate End-to-End Data Science?

    Authors: Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray

    Abstract: Data science is labor-intensive and human experts are scarce but heavily involved in every aspect of it. This makes data science time consuming and restricted to experts with the resulting quality heavily dependent on their experience and skills. To make data science more accessible and scalable, we need its democratization. Automated Data Science (AutoDS) is aimed towards that goal and is emergin…

    Submitted 22 October, 2019; originally announced October 2019.

  43. arXiv:1909.02309  [pdf, other]

    cs.HC cs.AI cs.LG

    Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI

    Authors: Dakuo Wang, Justin D. Weisz, Michael Muller, Parikshit Ram, Werner Geyer, Casey Dugan, Yla Tausczik, Horst Samulowitz, Alexander Gray

    Abstract: The rapid advancement of artificial intelligence (AI) is changing our lives in many ways. One application domain is data science. New techniques in automating the creation of AI, known as AutoAI or AutoML, aim to automate the work practices of data scientists. AutoAI systems are capable of autonomously ingesting and pre-processing data, engineering new features, and creating and scoring models bas…

    Submitted 5 September, 2019; originally announced September 2019.

  44. arXiv:1908.06097  [pdf]

    cs.DC

    Performance report and optimized implementations of Weather & Climate dwarfs on multi-node systems

    Authors: Louis Douriez, Alan Gray, David Guibert, Peter Messmer, Erwan Raffin

    Abstract: This document is one of the deliverable reports created for the ESCAPE project. ESCAPE stands for Energy-efficient Scalable Algorithms for Weather Prediction at Exascale. The project develops world-class, extreme-scale computing capabilities for European operational numerical weather prediction and future climate models. This is done by identifying Weather & Climate dwarfs which are key patterns i…

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: 35 pages, 22 figures

    ACM Class: D.2.8; G.1.8; G.4

  45. arXiv:1908.06096  [pdf]

    cs.DC

    Performance report and optimized implementation of Weather & Climate Dwarfs on GPU, MIC and Optalysys Optical Processor

    Authors: Cyril Mazauric, Erwan Raffin, Xavier Vigouroux, David Guibert, Alex Macfaden, Jacob Poulsen, Per Berg, Alan Gray, Peter Messmer

    Abstract: This document is one of the deliverable reports created for the ESCAPE project. ESCAPE stands for Energy-efficient Scalable Algorithms for Weather Prediction at Exascale. The project develops world-class, extreme-scale computing capabilities for European operational numerical weather prediction and future climate models. This is done by identifying Weather & Climate dwarfs which are key patterns i…

    Submitted 16 August, 2019; originally announced August 2019.

    Comments: 75 pages, 33 figures

    ACM Class: D.2.8; G.1.8; G.4

  46. arXiv:1905.00424  [pdf, other]

    cs.LG stat.ML

    An ADMM Based Framework for AutoML Pipeline Configuration

    Authors: Sijia Liu, Parikshit Ram, Deepak Vijaykeerthy, Djallel Bouneffouf, Gregory Bramble, Horst Samulowitz, Dakuo Wang, Andrew Conn, Alexander Gray

    Abstract: We study the AutoML problem of automatically configuring machine learning pipelines by jointly selecting algorithms and their appropriate hyper-parameters for all steps in supervised learning pipelines. This black-box (gradient-free) optimization with mixed integer & continuous variables is a challenging problem. We propose a novel AutoML scheme by leveraging the alternating direction method of mu…

    Submitted 6 December, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Journal ref: published at AAAI 2020

  47. arXiv:1903.05372  [pdf, other]

    cs.CY cs.NI

    Lost Silence: An emergency response early detection service through continuous processing of telecommunication data streams

    Authors: Qianru Zhou, Stephen McLaughlin, Alasdair J. G. Gray, Shangbin Wu, Chengxiang Wang

    Abstract: Early detection of significant traumatic events, e.g. a terrorist attack or a ship capsizing, is important to ensure that a prompt emergency response can occur. In the modern world telecommunication systems could play a key role in ensuring a successful emergency response by detecting such incidents through significant changes in calls and access to the networks. In this paper a methodology is ill…

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: 15 pages, 4 figures, WSP ISWC 2017 conference

    Journal ref: ISWC WSP 2017, pp. 33--47

  48. arXiv:1902.09944  [pdf, ps, other]

    cs.HC

    Automated Screening for Distress: A Perspective for the Future

    Authors: Rajib Rana, Siddique Latif, Raj Gururajan, Anthony Gray, Geraldine Mackenzie, Gerald Humphris, Jeff Dunn

    Abstract: Distress is a complex condition which affects a significant percentage of cancer patients and may lead to depression, anxiety, sadness, suicide and other forms of psychological morbidity. Compelling evidence supports screening for distress as a means of facilitating early intervention and subsequent improvements in psychological well-being and overall quality of life. Nevertheless, despite the exi…

    Submitted 27 July, 2020; v1 submitted 22 February, 2019; originally announced February 2019.

    Comments: Accepted in European Journal of Cancer Care

  49. arXiv:1609.01479  [pdf, other]

    cs.DC hep-lat

    A Lightweight Approach to Performance Portability with targetDP

    Authors: Alan Gray, Kevin Stratford

    Abstract: Leading HPC systems achieve their status through use of highly parallel devices such as NVIDIA GPUs or Intel Xeon Phi many-core CPUs. The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer. In this paper we describe targetDP, a lightweight abstraction layer which allows grid-based applications to target data parallel h…

    Submitted 9 November, 2016; v1 submitted 6 September, 2016; originally announced September 2016.

    Comments: 11 pages, 5 figures, accepted to the International Journal of High Performance Computing Applications (IJHPCA), acceptance date 27th October 2016

  50. arXiv:1507.07260  [pdf, other]

    stat.ML cs.LG

    Reduced-Set Kernel Principal Components Analysis for Improving the Training and Execution Speed of Kernel Machines

    Authors: Hassan A. Kingravi, Patricio A. Vela, Alexandar Gray

    Abstract: This paper presents a practical, and theoretically well-founded, approach to improve the speed of kernel manifold learning algorithms relying on spectral decomposition. Utilizing recent insights in kernel smoothing and learning with integral operators, we propose Reduced Set KPCA (RSKPCA), which also suggests an easy-to-implement method to remove or replace samples with minimal effect on the empir…

    Submitted 26 July, 2015; originally announced July 2015.