Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–30 of 30 results for author: Weber, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16820  [pdf, other

    cs.LG cs.AI cs.CY cs.HC

    Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings

    Authors: Robert Wolfe, Isaac Slaughter, Bin Han, Bingbing Wen, Yiwei Yang, Lucas Rosenblatt, Bernease Herman, Eva Brown, Zening Qu, Nic Weber, Bill Howe

    Abstract: The rapid proliferation of generative AI has raised questions about the competitiveness of lower-parameter, locally tunable, open-weight models relative to high-parameter, API-guarded, closed-weight models in terms of performance, domain adaptation, cost, and generalization. Centering under-resourced yet risk-intolerant settings in government, research, and healthcare, we see for-profit closed-wei… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted at the ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2024

  2. arXiv:2405.14253  [pdf, other

    cs.LG physics.comp-ph

    Higher-Rank Irreducible Cartesian Tensors for Equivariant Message Passing

    Authors: Viktor Zaverkin, Francesco Alesiani, Takashi Maruyama, Federico Errica, Henrik Christiansen, Makoto Takamoto, Nicolas Weber, Mathias Niepert

    Abstract: The ability to perform fast and accurate atomistic simulations is crucial for advancing the chemical sciences. By learning from high-quality data, machine-learned interatomic potentials achieve accuracy on par with ab initio and first-principles methods at a fraction of their computational cost. The success of machine-learned interatomic potentials arises from integrating inductive biases such as… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2307.16807  [pdf, other

    nlin.AO cs.AI q-bio.NC

    On the use of associative memory in Hopfield networks designed to solve propositional satisfiability problems

    Authors: Natalya Weber, Werner Koch, Ozan Erdem, Tom Froese

    Abstract: Hopfield networks are an attractive choice for solving many types of computational problems because they provide a biologically plausible mechanism. The Self-Optimization (SO) model adds to the Hopfield network by using a biologically founded Hebbian learning rule, in combination with repeated network resets to arbitrary initial states, for optimizing its own behavior towards some desirable goal s… ▽ More

    Submitted 4 March, 2024; v1 submitted 31 July, 2023; originally announced July 2023.

    Comments: 7 pages, 4 figures

    Journal ref: 2023 IEEE Symposium Series on Computational Intelligence (SSCI) 1352-1358

  4. arXiv:2302.14177  [pdf, other

    cs.DL cs.SE

    Soft-Search: Two Datasets to Study the Identification and Production of Research Software

    Authors: Eva Maxfield Brown, Lindsey Schwartz, Richard Lewei Huang, Nicholas Weber

    Abstract: Software is an important tool for scholarly work, but software produced for research is in many cases not easily identifiable or discoverable. A potential first step in linking research and software is software identification. In this paper we present two datasets to study the identification and production of research software. The first dataset contains almost 1000 human labeled annotations of so… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  5. arXiv:2211.01698  [pdf, other

    nlin.AO cs.AI q-bio.NC

    Scaling up the self-optimization model by means of on-the-fly computation of weights

    Authors: Natalya Weber, Werner Koch, Tom Froese

    Abstract: The Self-Optimization (SO) model is a useful computational model for investigating self-organization in "soft" Artificial life (ALife) as it has been shown to be general enough to model various complex adaptive systems. So far, existing work has been done on relatively small network sizes, precluding the investigation of novel phenomena that might emerge from the complexity arising from large numb… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: 7 pages, 7 figures

    Journal ref: 2022 IEEE Symposium Series on Computational Intelligence (SSCI), Singapore, Singapore, 2022, pp. 1276-1282

  6. arXiv:2209.04147  [pdf, other

    cs.LG cs.IR

    Extending Open Bandit Pipeline to Simulate Industry Challenges

    Authors: Bram van den Akker, Niklas Weber, Felipe Moraes, Dmitri Goldenberg

    Abstract: Bandit algorithms are often used in the e-commerce industry to train Machine Learning (ML) systems when pre-labeled data is unavailable. However, the industry setting poses various challenges that make implementing bandit algorithms in practice non-trivial. In this paper, we elaborate on the challenges of off-policy optimisation, delayed reward, concept drift, reward design, and business rules con… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: Published at the CONSEQUENCES+REVEAL '22 workshop @ Recsys 2022, Seattle, WA, USA, Sept. 22nd-23rd 2022

    ACM Class: I.2.6; I.6.8

  7. arXiv:2206.05367  [pdf, other

    cs.DL

    Research Software Publication Policy Case Study

    Authors: Nic Weber

    Abstract: Research software is increasingly recognized as a vital component of the scholarly record. Journals offer authors the opportunity to publish research software papers, but often have different requirements for how these publications should be structured and how code should be verified. In this short case study we gather data from 20 Physical Science journals to trace the frequency, quality control,… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  8. arXiv:2205.11267  [pdf, other

    cs.LG

    Fed-DART and FACT: A solution for Federated Learning in a production environment

    Authors: Nico Weber, Patrick Holzer, Tania Jacob, Enislay Ramentol

    Abstract: Federated Learning as a decentralized artificial intelligence (AI) solution solves a variety of problems in industrial applications. It enables a continuously self-improving AI, which can be deployed everywhere at the edge. However, bringing AI to production for generating a real business impact is a challenging task. Especially in the case of Federated Learning, expertise and resources from multi… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  9. arXiv:2205.10402  [pdf

    cs.CY cs.DL

    Ethics of Open Data

    Authors: Nic Weber, Brandon Locke

    Abstract: This chapter addresses emergent ethical issues in producing, using, curating, and providing services for open data. Our goal is to provide an introduction to how ethical topics in open data manifest in practical dilemmas for scholarly communications and some approaches to understanding and working through them. We begin with a brief overview of what can be thought of as three basic theories of eth… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: Chapter accepted for publication in ACRL's 'Scholarly Communication and Open Culture'

  10. arXiv:2205.10357  [pdf, other

    cs.LG cs.PF

    SOL: Reducing the Maintenance Overhead for Integrating Hardware Support into AI Frameworks

    Authors: Nicolas Weber

    Abstract: The increased interest in Artificial Intelligence (AI) raised the need for highly optimized and sophisticated AI frameworks. Starting with the Lua-based Torch many frameworks have emerged over time, such as Theano, Caffe, Chainer, CNTK, MxNet, PyTorch, DL4J, or TensorFlow. All of these provide a high level scripting API that allows users to easily design neural networks and run these on various ki… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  11. arXiv:2204.09110  [pdf

    cs.DL

    Councils in Action: Automating the Curation of Municipal Governance Data for Research

    Authors: Eva Maxfield Brown, Nicholas Weber

    Abstract: Large scale comparative research into municipal governance is often prohibitively difficult due to a lack of high-quality data. But, recent advances in speech-to-text algorithms and natural language processing has made it possible to more easily collect and analyze data about municipal governments. In this paper, we introduce an open-source platform, the Council Data Project (CDP), to curate novel… ▽ More

    Submitted 31 August, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: Keywords: public interest technology; municipal governance; data curation; computational data access; natural language processing To Be Published with 2022 ASIS&T Annual Meeting (https://www.asist.org/am22/)

  12. Toward Unsupervised Test Scenario Extraction for Automated Driving Systems from Urban Naturalistic Road Traffic Data

    Authors: Nico Weber, Christoph Thiem, Ulrich Konigorski

    Abstract: Scenario-based testing is a promising approach to solve the challenge of proving the safe behavior of vehicles equipped with automated driving systems. Since an infinite number of concrete scenarios can theoretically occur in real-world road traffic, the extraction of scenarios relevant in terms of the safety-related behavior of these systems is a key aspect for their successful verification and v… ▽ More

    Submitted 21 April, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 16 pages, 9 figures

    Report number: SAE 12-06-03-0017 ACM Class: D.2; I.2

    Journal ref: SAE Intl. J CAV 6(3):2023

  13. arXiv:2111.02930  [pdf, ps, other

    q-bio.BM cs.LG

    Decoupled coordinates for machine learning-based molecular fragment linking

    Authors: Markus Fleck, Noah Weber, Christopher Trummer

    Abstract: Recent developments in machine-learning based molecular fragment linking have demonstrated the importance of informing the generation process with structural information specifying the relative orientation of the fragments to be linked. However, such structural information has not yet been provided in the form of a complete relative coordinate system. Mathematical details for a decoupled set of bo… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

    Comments: 16 pages, 5 Figures

    ACM Class: I.2

  14. arXiv:2109.03648  [pdf

    cs.SE

    A Needle in a Haystack -- How to Derive Relevant Scenarios for Testing Automated Driving Systems in Urban Areas

    Authors: Nico Weber, Christoph Thiem, Ulrich Konigorski

    Abstract: While there was great progress regarding the technology and its implementation for vehicles equipped with automated driving systems (ADS), the problem of how to proof their safety as a necessary precondition prior to market launch remains unsolved. One promising solution are scenario-based test approaches; however, there is no commonly accepted way of how to systematically generate and extract the… ▽ More

    Submitted 8 September, 2021; originally announced September 2021.

    Comments: 37 pages, 15 figures, Preprint

  15. Toward Generating Sufficiently Valid Test Case Results: A Method for Systematically Assigning Test Cases to Test Bench Configurations in a Scenario-Based Test Approach for Automated Vehicles

    Authors: Markus Steimle, Nico Weber, Markus Maurer

    Abstract: To successfully launch automated vehicles into the consumer market, there must be credible proof that the vehicles will operate safely. However, finding a method to validate the vehicles' safe operation is a challenging problem. While scenario-based test approaches seem to be possible solutions, they require execution of a large number of test cases. Several test benches, ranging from actual test… ▽ More

    Submitted 20 January, 2022; v1 submitted 7 September, 2021; originally announced September 2021.

    Comments: Published in IEEE Access, vol. 10, pp. 6260-6285, 2022, doi: 10.1109/ACCESS.2022.3141198

  16. HALF: Holistic Auto Machine Learning for FPGAs

    Authors: Jonas Ney, Dominik Loroch, Vladimir Rybalkin, Nico Weber, Jens KrĂĽger, Norbert Wehn

    Abstract: Deep Neural Networks (DNNs) are capable of solving complex problems in domains related to embedded systems, such as image and natural language processing. To efficiently implement DNNs on a specific FPGA platform for a given cost criterion, e.g. energy efficiency, an enormous amount of design parameters has to be considered from the topology down to the final hardware implementation. Interdependen… ▽ More

    Submitted 20 October, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: 2021 31st International Conference on Field-Programmable Logic and Applications (FPL). IEEE, 2021

  17. arXiv:2104.08811  [pdf, other

    cs.CL cs.AI

    Human Schema Curation via Causal Association Rule Mining

    Authors: Noah Weber, Anton Belyy, Nils Holzenberger, Rachel Rudinger, Benjamin Van Durme

    Abstract: Event schemas are structured knowledge sources defining typical real-world scenarios (e.g., going to an airport). We present a framework for efficient human-in-the-loop construction of a schema library, based on a novel script induction system and a well-crafted interface that allows non-experts to "program" complex event structures. Associated with this work we release a schema library: a machine… ▽ More

    Submitted 23 May, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: 12 pages, 6 figures, 6 tables

  18. Addressing Research Software Sustainability via Institutes

    Authors: Daniel S. Katz, Jeffrey C. Carver, Neil P. Chue Hong, Sandra Gesing, Simon Hettrick, Tom Honeyman, Karthik Ram, Nicholas Weber

    Abstract: Research software is essential to modern research, but it requires ongoing human effort to sustain: to continually adapt to changes in dependencies, to fix bugs, and to add new features. Software sustainability institutes, amongst others, develop, maintain, and disseminate best practices for research software sustainability, and build community around them. These practices can both reduce the amou… ▽ More

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: accepted by ICSE 2021 BokSS Workshop (https://bokss.github.io/bokss2021/)

  19. arXiv:2004.03762  [pdf, other

    cs.CL cs.AI

    Generating Narrative Text in a Switching Dynamical System

    Authors: Noah Weber, Leena Shekhar, Heeyoung Kwon, Niranjan Balasubramanian, Nathanael Chambers

    Abstract: Early work on narrative modeling used explicit plans and goals to generate stories, but the language generation itself was restricted and inflexible. Modern methods use language models for more robust generation, but often lack an explicit representation of the scaffolding and dynamics that guide a coherent narrative. This paper introduces a new model that integrates explicit narrative structure w… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  20. arXiv:2004.01174  [pdf, other

    cs.CL

    Causal Inference of Script Knowledge

    Authors: Noah Weber, Rachel Rudinger, Benjamin Van Durme

    Abstract: When does a sequence of events define an everyday scenario and how can this knowledge be induced from text? Prior works in inducing such scripts have relied on, in one form or another, measures of correlation between instances of events in a corpus. We argue from both a conceptual and practical sense that a purely correlation-based approach is insufficient, and instead propose an approach to scrip… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: Pre-Print

  21. arXiv:2003.10688  [pdf, other

    cs.DC cs.LG

    SOL: Effortless Device Support for AI Frameworks without Source Code Changes

    Authors: Nicolas Weber, Felipe Huici

    Abstract: Modern high performance computing clusters heavily rely on accelerators to overcome the limited compute power of CPUs. These supercomputers run various applications from different domains such as simulations, numerical applications or artificial intelligence (AI). As a result, vendors need to be able to efficiently run a wide variety of workloads on their hardware. In the AI domain this is in part… ▽ More

    Submitted 24 March, 2020; originally announced March 2020.

    Comments: HPML Workshop 2020

  22. arXiv:1808.09542  [pdf, other

    cs.CL

    Hierarchical Quantized Representations for Script Generation

    Authors: Noah Weber, Leena Shekhar, Niranjan Balasubramanian, Nathanael Chambers

    Abstract: Scripts define knowledge about how everyday scenarios (such as going to a restaurant) are expected to unfold. One of the challenges to learning scripts is the hierarchical nature of the knowledge. For example, a suspect arrested might plead innocent or guilty, and a very different track of events is then expected to happen. To capture this type of information, we propose an autoencoder model with… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018

  23. arXiv:1805.01445  [pdf, other

    cs.CL

    The Fine Line between Linguistic Generalization and Failure in Seq2Seq-Attention Models

    Authors: Noah Weber, Leena Shekhar, Niranjan Balasubramanian

    Abstract: Seq2Seq based neural architectures have become the go-to architecture to apply to sequence to sequence language tasks. Despite their excellent performance on these tasks, recent work has noted that these models usually do not fully capture the linguistic structure required to generalize beyond the dense sections of the data distribution \cite{ettinger2017towards}, and as such, are likely to fail o… ▽ More

    Submitted 8 May, 2018; v1 submitted 3 May, 2018; originally announced May 2018.

    Comments: Workshop on New Forms of Generalization in Deep Learning and NLP (NAACL 2018), revised to update some references

  24. arXiv:1804.08378  [pdf, other

    cs.DC cs.AI cs.CV cs.NE cs.PF

    BrainSlug: Transparent Acceleration of Deep Learning Through Depth-First Parallelism

    Authors: Nicolas Weber, Florian Schmidt, Mathias Niepert, Felipe Huici

    Abstract: Neural network frameworks such as PyTorch and TensorFlow are the workhorses of numerous machine learning applications ranging from object recognition to machine translation. While these frameworks are versatile and straightforward to use, the training of and inference in deep neural networks is resource (energy, compute, and memory) intensive. In contrast to recent works focusing on algorithmic en… ▽ More

    Submitted 23 April, 2018; originally announced April 2018.

    Comments: Technical Report, 13 pages

  25. arXiv:1804.04076  [pdf, other

    cs.CV cs.LG

    Detail-Preserving Pooling in Deep Networks

    Authors: Faraz Saeedan, Nicolas Weber, Michael Goesele, Stefan Roth

    Abstract: Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain distortions, and increase the receptive field size. Since pooling by nature is a lossy process, it is crucial that each such layer maintains the portion of the activation… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

    Comments: To appear at CVPR 2018

  26. arXiv:1803.07038  [pdf, other

    cs.CL

    Controlling Decoding for More Abstractive Summaries with Copy-Based Networks

    Authors: Noah Weber, Leena Shekhar, Niranjan Balasubramanian, Kyunghyun Cho

    Abstract: Attention-based neural abstractive summarization systems equipped with copy mechanisms have shown promising results. Despite this success, it has been noticed that such a system generates a summary by mostly, if not entirely, copying over phrases, sentences, and sometimes multiple consecutive sentences from an input paragraph, effectively performing extractive summarization. In this paper, we veri… ▽ More

    Submitted 19 March, 2018; v1 submitted 19 March, 2018; originally announced March 2018.

  27. Mining Open Government Data Used in Scientific Research

    Authors: An Yan, Nicholas Weber

    Abstract: In the following paper, we describe results from mining citations, mentions, and links to open government data (OGD) in peer-reviewed literature. We inductively develop a method for categorizing how OGD are used by different research communities, and provide descriptive statistics about the publication years, publication outlets, and OGD sources. Our results demonstrate that, 1. The use of OGD in… ▽ More

    Submitted 24 March, 2018; v1 submitted 8 February, 2018; originally announced February 2018.

    Comments: Accepted to iConference 2018

    Journal ref: Transforming Digital Worlds. iConference 2018. Lecture Notes in Computer Science, vol 10766. Springer, Cham

  28. arXiv:1711.07611  [pdf, other

    cs.CL

    Event Representations with Tensor-based Compositions

    Authors: Noah Weber, Niranjan Balasubramanian, Nathanael Chambers

    Abstract: Robust and flexible event representations are important to many core areas in language understanding. Scripts were proposed early on as a way of representing sequences of events for such understanding, and has recently attracted renewed attention. However, obtaining effective representations for modeling script-like event sequences is challenging. It requires representations that can capture event… ▽ More

    Submitted 20 November, 2017; originally announced November 2017.

    Comments: Accepted at AAAI 2018

  29. Report on the Third Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE3)

    Authors: Daniel S. Katz, Sou-Cheng T. Choi, Kyle E. Niemeyer, James Hetherington, Frank Löffler, Dan Gunter, Ray Idaszak, Steven R. Brandt, Mark A. Miller, Sandra Gesing, Nick D. Jones, Nic Weber, Suresh Marru, Gabrielle Allen, Birgit Penzenstadler, Colin C. Venters, Ethan Davis, Lorraine Hwang, Ilian Todorov, Abani Patra, Miguel de Val-Borro

    Abstract: This report records and discusses the Third Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE3). The report includes a description of the keynote presentation of the workshop, which served as an overview of sustainable scientific software. It also summarizes a set of lightning talks in which speakers highlighted to-the-point lessons and challenges pertaining to sustain… ▽ More

    Submitted 6 February, 2016; originally announced February 2016.

  30. arXiv:1309.1810  [pdf

    cs.SE cs.CY

    Niche Modeling: Ecological Metaphors for Sustainable Software in Science

    Authors: Nicholas Weber, Andrea Thomer, Michael Twidale

    Abstract: This position paper is aimed at providing some history and provocations for the use of an ecological metaphor to describe software development environments. We do not claim that the ecological metaphor is the best or only way of looking at software - rather we want to ask if it can indeed be a productive and thought provoking one.

    Submitted 6 September, 2013; originally announced September 2013.

    Comments: Position paper submitted to: Workshop on Sustainable Software for Science: Practice and Experiences (WSSSPE) SC13, Sunday, 17 November 2013, Denver, CO, USA