Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–21 of 21 results for author: Jennings, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.11214  [pdf, ps, other

    cs.AI cs.CL cs.LG cs.LO cs.PL

    PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

    Authors: George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri

    Abstract: We present PutnamBench, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PutnamBench consists of 1697 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the theorems have formalization… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  2. arXiv:2407.06380  [pdf, other

    cs.CL

    Data, Data Everywhere: A Guide for Pretraining Dataset Construction

    Authors: Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro

    Abstract: The impressive capabilities of recent language models can be largely attributed to the multi-trillion token pretraining datasets that they are trained on. However, model developers fail to disclose their construction methodology which has lead to a lack of open information on how to develop effective pretraining sets. To address this issue, we perform the first systematic study across the entire p… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Preprint. Under review

  3. arXiv:2406.11704  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 340B Technical Report

    Authors: Nvidia, :, Bo Adler, Niket Agarwal, Ashwath Aithal, Dong H. Anh, Pallab Bhattacharya, Annika Brundyn, Jared Casper, Bryan Catanzaro, Sharon Clay, Jonathan Cohen, Sirshak Das, Ayush Dattagupta, Olivier Delalleau, Leon Derczynski, Yi Dong, Daniel Egert, Ellie Evans, Aleksander Ficek, Denys Fridman, Shaona Ghosh, Boris Ginsburg, Igor Gitman, Tomasz Grzegorzek , et al. (58 additional authors not shown)

    Abstract: We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distribution, modification, and use of the models and its outputs. These models perform competitively to open access models on a wide range of evaluation be… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2404.06969  [pdf, other

    cs.LG stat.ML

    FiP: a Fixed-Point Approach for Causal Generative Modeling

    Authors: Meyer Scetbon, Joel Jennings, Agrin Hilmkil, Cheng Zhang, Chao Ma

    Abstract: Modeling true world data-generating processes lies at the heart of empirical science. Structural Causal Models (SCMs) and their associated Directed Acyclic Graphs (DAGs) provide an increasingly popular answer to such problems by defining the causal generative process that transforms random noise into observations. However, learning them from observational data poses an ill-posed and NP-hard invers… ▽ More

    Submitted 14 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  5. arXiv:2402.16819  [pdf, other

    cs.CL cs.AI cs.LG

    Nemotron-4 15B Technical Report

    Authors: Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Mostofa Patwary, Sandeep Subramanian, Dan Su, Chen Zhu, Deepak Narayanan, Aastha Jhunjhunwala, Ayush Dattagupta, Vibhu Jawa, Jiwei Liu, Ameya Mahabaleshwarkar, Osvald Nitski, Annika Brundyn, James Maki, Miguel Martinez, Jiaxuan You, John Kamalu, Patrick LeGresley, Denys Fridman, Jared Casper, Ashwath Aithal, Oleksii Kuchaiev, Mohammad Shoeybi , et al. (2 additional authors not shown)

    Abstract: We introduce Nemotron-4 15B, a 15-billion-parameter large multilingual language model trained on 8 trillion text tokens. Nemotron-4 15B demonstrates strong performance when assessed on English, multilingual, and coding tasks: it outperforms all existing similarly-sized open models on 4 out of 7 downstream evaluation areas and achieves competitive performance to the leading open models in the remai… ▽ More

    Submitted 27 February, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2311.03309  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Structure Learning with Stochastic Differential Equations

    Authors: Benjie Wang, Joel Jennings, Wenbo Gong

    Abstract: Discovering the underlying relationships among variables from temporal observations has been a longstanding challenge in numerous scientific disciplines, including biology, finance, and climate science. The dynamics of such systems are often best described using continuous-time stochastic processes. Unfortunately, most existing structure learning approaches assume that the underlying process evolv… ▽ More

    Submitted 5 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: ICLR 2024

  7. arXiv:2310.00809  [pdf, other

    cs.LG cs.AI stat.ME stat.ML

    Towards Causal Foundation Model: on Duality between Causal Inference and Attention

    Authors: Jiaqi Zhang, Joel Jennings, Agrin Hilmkil, Nick Pawlowski, Cheng Zhang, Chao Ma

    Abstract: Foundation models have brought changes to the landscape of machine learning, demonstrating sparks of human-level intelligence across a diverse array of tasks. However, a gap persists in complex tasks such as causal inference, primarily due to challenges associated with intricate reasoning steps and high numerical precision requirements. In this work, we take a first step towards building causally-… ▽ More

    Submitted 3 June, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

  8. arXiv:2307.13917  [pdf, other

    cs.LG stat.ME

    BayesDAG: Gradient-Based Posterior Inference for Causal Discovery

    Authors: Yashas Annadani, Nick Pawlowski, Joel Jennings, Stefan Bauer, Cheng Zhang, Wenbo Gong

    Abstract: Bayesian causal discovery aims to infer the posterior distribution over causal models from observed data, quantifying epistemic uncertainty and benefiting downstream tasks. However, computational challenges arise due to joint inference over combinatorial space of Directed Acyclic Graphs (DAGs) and nonlinear functions. Despite recent progress towards efficient posterior inference over DAGs, existin… ▽ More

    Submitted 8 December, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  9. arXiv:2304.05524  [pdf, other

    cs.LG cs.CL

    Understanding Causality with Large Language Models: Feasibility and Opportunities

    Authors: Cheng Zhang, Stefan Bauer, Paul Bennett, Jiangfeng Gao, Wenbo Gong, Agrin Hilmkil, Joel Jennings, Chao Ma, Tom Minka, Nick Pawlowski, James Vaughan

    Abstract: We assess the ability of large language models (LLMs) to answer causal questions by analyzing their strengths and weaknesses against three types of causal question. We believe that current LLMs can answer causal questions with existing causal knowledge as combined domain experts. However, they are not yet able to provide satisfactory answers for discovering new knowledge or for high-stakes decisio… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  10. arXiv:2303.12703  [pdf, other

    cs.LG stat.ME

    Causal Reasoning in the Presence of Latent Confounders via Neural ADMG Learning

    Authors: Matthew Ashman, Chao Ma, Agrin Hilmkil, Joel Jennings, Cheng Zhang

    Abstract: Latent confounding has been a long-standing obstacle for causal reasoning from observational data. One popular approach is to model the data using acyclic directed mixed graphs (ADMGs), which describe ancestral relations between variables using directed and bidirected edges. However, existing methods using ADMGs are based on either linear functional assumptions or a discrete search that is complic… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Camera ready version for ICLR 2023

  11. arXiv:2302.14015  [pdf, other

    stat.ML cs.AI cs.LG stat.CO

    CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

    Authors: Desi R. Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster

    Abstract: We formalize the problem of contextual optimization through the lens of Bayesian experimental design and propose CO-BED -- a general, model-agnostic framework for designing contextual experiments using information-theoretic principles. After formulating a suitable information-based objective, we employ black-box variational methods to simultaneously estimate it and optimize the designs in a single… ▽ More

    Submitted 13 July, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning (ICML 2023); 9 pages, 7 figures

  12. arXiv:2210.14706  [pdf, other

    cs.LG cs.AI stat.ML

    Rhino: Deep Causal Temporal Relationship Learning With History-dependent Noise

    Authors: Wenbo Gong, Joel Jennings, Cheng Zhang, Nick Pawlowski

    Abstract: Discovering causal relationships between different variables from time series data has been a long-standing challenge for many domains such as climate science, finance, and healthcare. Given the complexity of real-world relationships and the nature of observations in discrete time, causal discovery methods need to consider non-linear relations between variables, instantaneous effects and history-d… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 28 pages, 8 figures, 5 tables

  13. arXiv:2208.12610  [pdf, ps, other

    cs.CY cs.AI cs.LG

    NeurIPS Competition Instructions and Guide: Causal Insights for Learning Paths in Education

    Authors: Wenbo Gong, Digory Smith, Zichao Wang, Craig Barton, Simon Woodhead, Nick Pawlowski, Joel Jennings, Cheng Zhang

    Abstract: In this competition, participants will address two fundamental causal challenges in machine learning in the context of education using time-series data. The first is to identify the causal relationships between different constructs, where a construct is defined as the smallest element of learning. The second challenge is to predict the impact of learning one construct on the ability to answer ques… ▽ More

    Submitted 31 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: 19 pages, NeurIPS 2022 Competition Track

  14. arXiv:2207.05250  [pdf, other

    stat.ML cs.AI cs.LG stat.CO stat.ME

    Efficient Real-world Testing of Causal Decision Making via Bayesian Experimental Design for Contextual Optimisation

    Authors: Desi R. Ivanova, Joel Jennings, Cheng Zhang, Adam Foster

    Abstract: The real-world testing of decisions made using causal machine learning models is an essential prerequisite for their successful application. We focus on evaluating and improving contextual treatment assignment decisions: these are personalised treatments applied to e.g. customers, each with their own contextual information, with the aim of maximising a reward. In this paper we introduce a model-ag… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022 Workshop on Adaptive Experimental Design and Active Learning in the Real World. 16 pages, 5 figures

  15. arXiv:2110.14468  [pdf, other

    cs.LG

    DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

    Authors: David Mguni, Usman Islam, Yaqi Sun, Xiuling Zhang, Joel Jennings, Aivar Sootla, Changmin Yu, Ziyan Wang, Jun Wang, Yaodong Yang

    Abstract: Reinforcement learning (RL) involves performing exploratory actions in an unknown system. This can place a learning agent in dangerous and potentially catastrophic system states. Current approaches for tackling safe learning in RL simultaneously trade-off safe exploration and task fulfillment. In this paper, we introduce a new generation of RL solvers that learn to minimise safety violations while… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2103.09159

  16. arXiv:2104.09693  [pdf, other

    cs.SE

    Demystifying Regular Expression Bugs: A comprehensive study on regular expression bug causes, fixes, and testing

    Authors: Peipei Wang, Chris Brown, Jamie A. Jennings, Kathryn T. Stolee

    Abstract: Regular expressions cause string-related bugs and open security vulnerabilities for DOS attacks. However, beyond ReDoS (Regular expression Denial of Service), little is known about the extent to which regular expression issues affect software development and how these issues are addressed in practice. We conduct an empirical study of 356 merged regex-related pull request bugs from Apache, Mozilla,… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

  17. arXiv:2103.09284  [pdf, other

    cs.MA

    Learning in Nonzero-Sum Stochastic Games with Potentials

    Authors: David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang

    Abstract: Multi-agent reinforcement learning (MARL) has become effective in tackling discrete cooperative game scenarios. However, MARL has yet to penetrate settings beyond those modelled by team and zero-sum games, confining it to a small subset of multi-agent systems. In this paper, we introduce a new generation of MARL learners that can handle nonzero-sum payoff structures and continuous settings. In par… ▽ More

    Submitted 15 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: ICML 2021

  18. arXiv:2012.00874  [pdf, other

    cs.CY

    "A cold, technical decision-maker": Can AI provide explainability, negotiability, and humanity?

    Authors: Allison Woodruff, Yasmin Asare Anderson, Katherine Jameson Armstrong, Marina Gkiza, Jay Jennings, Christopher Moessner, Fernanda Viegas, Martin Wattenberg, and Lynette Webb, Fabian Wrede, Patrick Gage Kelley

    Abstract: Algorithmic systems are increasingly deployed to make decisions in many areas of people's lives. The shift from human to algorithmic decision-making has been accompanied by concern about potentially opaque decisions that are not aligned with social values, as well as proposed remedies such as explainability. We present results of a qualitative study of algorithmic decision-making, comprised of fiv… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

    Comments: 23 pages, 1 appendix, 4 tables

    ACM Class: K.4; K.3.2; I.2

  19. BioDynaMo: a general platform for scalable agent-based simulation

    Authors: Lukas Breitwieser, Ahmad Hesam, Jean de Montigny, Vasileios Vavourakis, Alexandros Iosif, Jack Jennings, Marcus Kaiser, Marco Manca, Alberto Di Meglio, Zaid Al-Ars, Fons Rademakers, Onur Mutlu, Roman Bauer

    Abstract: Motivation: Agent-based modeling is an indispensable tool for studying complex biological systems. However, existing simulators do not always take full advantage of modern hardware and often have a field-specific software design. Results: We present a novel simulation platform called BioDynaMo that alleviates both of these problems. BioDynaMo features a general-purpose and high-performance simul… ▽ More

    Submitted 5 February, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 8 pages, 6 figures

  20. arXiv:1901.10923  [pdf, other

    cs.MA cs.GT

    Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems

    Authors: David Mguni, Joel Jennings, Sergio Valcarcel Macua, Emilio Sison, Sofia Ceppi, Enrique Munoz de Cote

    Abstract: Many real-world systems such as taxi systems, traffic networks and smart grids involve self-interested actors that perform individual tasks in a shared environment. However, in such systems, the self-interested behaviour of agents produces welfare inefficient and globally suboptimal outcomes that are detrimental to all - some common examples are congestion in traffic networks, demand spikes for re… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.

  21. arXiv:1803.05028  [pdf, other

    cs.MA

    Decentralised Learning in Systems with Many, Many Strategic Agents

    Authors: David Mguni, Joel Jennings, Enrique Munoz de Cote

    Abstract: Although multi-agent reinforcement learning can tackle systems of strategically interacting entities, it currently fails in scalability and lacks rigorous convergence guarantees. Crucially, learning in multi-agent systems can become intractable due to the explosion in the size of the state-action space as the number of agents increases. In this paper, we propose a method for computing closed-loop… ▽ More

    Submitted 13 March, 2018; originally announced March 2018.