Showing 1–11 of 11 results for author: Schulz, J

Searching in archive cs.
  1. arXiv:2312.06681  [pdf, other]

    cs.CL cs.AI cs.LG

    Steering Llama 2 via Contrastive Activation Addition

    Authors: Nina Panickssery, Nick Gabrieli, Julian Schulz, Meg Tong, Evan Hubinger, Alexander Matt Turner

    Abstract: We introduce Contrastive Activation Addition (CAA), an innovative method for steering language models by modifying their activations during forward passes. CAA computes "steering vectors" by averaging the difference in residual stream activations between pairs of positive and negative examples of a particular behavior, such as factual versus hallucinatory responses. During inference, these steerin…

    Submitted 5 July, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

  2. arXiv:2207.02523  [pdf, other]

    cs.SI physics.soc-ph

    Modeling Node Exposure for Community Detection in Networks

    Authors: Sameh Othman, Johannes Schulz, Marco Baity-Jesi, Caterina De Bacco

    Abstract: In community detection, datasets often suffer a sampling bias for which nodes which would normally have a high affinity appear to have zero affinity. This happens for example when two affine users of a social network were not exposed to one another. Community detection on this kind of data suffers then from considering affine nodes as not affine. To solve this problem, we explicitly model the (non…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: 13 pages, 4 figures

  3. arXiv:2206.06456  [pdf, other]

    cs.IT q-bio.NC

    A comparison of partial information decompositions using data from real and simulated layer 5b pyramidal cells

    Authors: Jim W. Kay, Jan M. Schulz, W. A. Phillips

    Abstract: Partial information decomposition allows the joint mutual information between an output and a set of inputs to be divided into components that are synergistic or shared or unique to each input. We consider five different decompositions and compare their results on data from layer 5b pyramidal cells in two different studies. The first study was of the amplification of somatic action potential outpu…

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 27 pages, 11 figures

    Journal ref: Published in Entropy, 24th July, 2022, 24(8), 1021

  4. arXiv:2206.02912  [pdf]

    cs.CV physics.med-ph

    Learning Image Representations for Content Based Image Retrieval of Radiotherapy Treatment Plans

    Authors: Charles Huang, Varun Vasudevan, Oscar Pastor-Serrano, Md Tauhidul Islam, Yusuke Nomura, Piotr Dubrowski, Jen-Yeu Wang, Joseph B. Schulz, Yong Yang, Lei Xing

    Abstract: Objective: Knowledge based planning (KBP) typically involves training an end-to-end deep learning model to predict dose distributions. However, training end-to-end methods may be associated with practical limitations due to the limited size of medical datasets that are often used. To address these limitations, we propose a content based image retrieval (CBIR) method for retrieving dose distributio…

    Submitted 23 August, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

  5. arXiv:2111.09121  [pdf, other]

    cs.LG cs.AI stat.ML

    Uncertainty Quantification of Surrogate Explanations: an Ordinal Consensus Approach

    Authors: Jonas Schulz, Rafael Poyiadzi, Raul Santos-Rodriguez

    Abstract: Explainability of black-box machine learning models is crucial, in particular when deployed in critical applications such as medicine or autonomous cars. Existing approaches produce explanations for the predictions of models, however, how to assess the quality and reliability of such explanations remains an open question. In this paper we take a step further in order to provide the practitioner wi…

    Submitted 17 November, 2021; originally announced November 2021.

  6. arXiv:2109.03027  [pdf, other]

    stat.ME cs.CV q-bio.NC stat.OT

    Statistical analysis of locally parameterized shapes

    Authors: Mohsen Taheri, Jörn Schulz

    Abstract: The alignment of shapes has been a crucial step in statistical shape analysis, for example, in calculating mean shape, detecting locational differences between two shape populations, and classification. Procrustes alignment is the most commonly used method and state of the art. In this work, we uncover that alignment might seriously affect the statistical analysis. For example, alignment can induc…

    Submitted 18 August, 2021; originally announced September 2021.

    Comments: 25 pages, 20 figures

  7. arXiv:2109.02230  [pdf, other]

    stat.ML cs.LG

    Non-Euclidean Analysis of Joint Variations in Multi-Object Shapes

    Authors: Zhiyuan Liu, Jörn Schulz, Mohsen Taheri, Martin Styner, James Damon, Stephen Pizer, J. S. Marron

    Abstract: This paper considers joint analysis of multiple functionally related structures in classification tasks. In particular, our method developed is driven by how functionally correlated brain structures vary together between autism and control groups. To do so, we devised a method based on a novel combination of (1) non-Euclidean statistics that can faithfully represent non-Euclidean data in Euclidean…

    Submitted 7 September, 2021; v1 submitted 5 September, 2021; originally announced September 2021.

  8. arXiv:1906.05264  [pdf, other]

    cs.LG stat.ML

    GluonTS: Probabilistic Time Series Models in Python

    Authors: Alexander Alexandrov, Konstantinos Benidis, Michael Bohlke-Schneider, Valentin Flunkert, Jan Gasthaus, Tim Januschowski, Danielle C. Maddix, Syama Rangapuram, David Salinas, Jasper Schulz, Lorenzo Stella, Ali Caner Türkmen, Yuyang Wang

    Abstract: We introduce Gluon Time Series (GluonTS, available at https://gluon-ts.mxnet.io), a library for deep-learning-based time series modeling. GluonTS simplifies the development of and experimentation with time series models for common tasks such as forecasting or anomaly detection. It provides all necessary components and tools that scientists need for quickly building new models, for efficiently runn…

    Submitted 14 June, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: ICML Time Series Workshop 2019

  9. arXiv:1901.01291  [pdf, other]

    cs.RO cs.LG stat.ML

    On the Utility of Model Learning in HRI

    Authors: Gokul Swamy, Jens Schulz, Rohan Choudhury, Dylan Hadfield-Menell, Anca Dragan

    Abstract: Fundamental to robotics is the debate between model-based and model-free learning: should the robot build an explicit model of the world, or learn a policy directly? In the context of HRI, part of the world to be modeled is the human. One option is for the robot to treat the human as a black box and learn a policy for how they act directly. But it can also model the human as an agent, and rely on…

    Submitted 21 May, 2020; v1 submitted 4 January, 2019; originally announced January 2019.

  10. arXiv:1804.10467  [pdf, other]

    cs.RO cs.AI

    Interaction-Aware Probabilistic Behavior Prediction in Urban Environments

    Authors: Jens Schulz, Constantin Hubmann, Julian Löchner, Darius Burschka

    Abstract: Planning for autonomous driving in complex, urban scenarios requires accurate prediction of the trajectories of surrounding traffic participants. Their future behavior depends on their route intentions, the road-geometry, traffic rules and mutual interaction, resulting in interdependencies between their trajectories. We present a probabilistic prediction framework based on a dynamic Bayesian netwo…

    Submitted 28 August, 2018; v1 submitted 27 April, 2018; originally announced April 2018.

    Comments: Accepted paper at IEEE IROS 2018. © 2018 IEEE

  11. arXiv:1102.3643  [pdf, ps, other]

    cs.DS

    A Constant Factor Approximation Algorithm for Unsplittable Flow on Paths

    Authors: Paul Bonsma, Jens Schulz, Andreas Wiese

    Abstract: In the unsplittable flow problem on a path, we are given a capacitated path $P$ and $n$ tasks, each task having a demand, a profit, and start and end vertices. The goal is to compute a maximum profit set of tasks, such that for each edge $e$ of $P$, the total demand of selected tasks that use $e$ does not exceed the capacity of $e$. This is a well-studied problem that has been studied under altern…

    Submitted 19 March, 2012; v1 submitted 17 February, 2011; originally announced February 2011.

    Comments: 37 pages, 5 figures. Version 2 contains the same results as version 1, but the presentation has been greatly revised and improved. References have been added.