-
SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection
Authors:
Daniel Jakab,
Alexander Braun,
Cathaoir Agnew,
Reenu Mohandas,
Brian Michael Deegan,
Dara Molloy,
Enda Ward,
Tony Scanlan,
Ciarán Eising
Abstract:
Automotive simulation can potentially compensate for a lack of training data in computer vision applications. However, there has been little to no image quality evaluation of automotive simulation and the impact of optical degradations on simulation is little explored. In this work, we investigate Virtual KITTI and the impact of applying variations of Gaussian blur on image sharpness. Furthermore,…
▽ More
Automotive simulation can potentially compensate for a lack of training data in computer vision applications. However, there has been little to no image quality evaluation of automotive simulation and the impact of optical degradations on simulation is little explored. In this work, we investigate Virtual KITTI and the impact of applying variations of Gaussian blur on image sharpness. Furthermore, we consider object detection, a common computer vision application on three different state-of-the-art models, thus allowing us to characterize the relationship between object detection and sharpness. It was found that while image sharpness (MTF50) degrades from an average of 0.245cy/px to approximately 0.119cy/px; object detection performance stays largely robust within 0.58\%(Faster RCNN), 1.45\%(YOLOF) and 1.93\%(DETR) across all respective held-out test sets.
△ Less
Submitted 22 July, 2024;
originally announced July 2024.
-
Testing a non-local 1-equation turbulent convection model: A solar model
Authors:
T. A. M. Braun,
F. Ahlborn,
A. Weiss
Abstract:
Turbulent convection models treat stellar convection more physically than standard mixing-length theory by including non-local effects. We recently successfully applied the Kuhfuss version to convective cores in main sequence stars. Its usefulness for convective envelopes remains to be tested. The solar convective envelope constitutes a viable test bed for investigating the usefulness of the 1-equ…
▽ More
Turbulent convection models treat stellar convection more physically than standard mixing-length theory by including non-local effects. We recently successfully applied the Kuhfuss version to convective cores in main sequence stars. Its usefulness for convective envelopes remains to be tested. The solar convective envelope constitutes a viable test bed for investigating the usefulness of the 1-equation Kuhfuss turbulent convection model. We used the one-dimensional stellar evolution code GARSTEC to calculate a standard solar model with the 1-equation Kuhfuss turbulent convection model, and compared it to helioseismic measurements and a solar model using standard mixing-length theory. Additionally, we investigated the influence of the additional free parameters of the convection model on the solar structure. The 1-equation Kuhfuss model reproduces the sound-speed profile and the lower boundary of the convective region less well than the mixing-length model, because the inherent non-local effects overestimate the amount of convective penetration below the Schwarzschild boundary. We trace this back to the coupling of the temperature gradient to the convective flux in the 1-equation version of the Kuhfuss theory. The temperature stratification of the solar convective envelope is not well modelled by the 1-equation Kuhfuss turbulent convection model, and the more complex 3-equation version is needed to improve the modelling of convection in the envelopes of 1D stellar evolution models.
△ Less
Submitted 4 July, 2024;
originally announced July 2024.
-
Approximating Optimum Online for Capacitated Resource Allocation
Authors:
Alexander Braun,
Thomas Kesselheim,
Tristan Pollner,
Amin Saberi
Abstract:
We study online capacitated resource allocation, a natural generalization of online stochastic max-weight bipartite matching. This problem is motivated by ride-sharing and Internet advertising applications, where online arrivals may have the capacity to serve multiple offline users.
Our main result is a polynomial-time online algorithm which is $(1/2 + κ)$-approximate to the optimal online algor…
▽ More
We study online capacitated resource allocation, a natural generalization of online stochastic max-weight bipartite matching. This problem is motivated by ride-sharing and Internet advertising applications, where online arrivals may have the capacity to serve multiple offline users.
Our main result is a polynomial-time online algorithm which is $(1/2 + κ)$-approximate to the optimal online algorithm for $κ= 0.0115$. This can be contrasted to the (tight) $1/2$-competitive algorithms to the optimum offline benchmark from the prophet inequality literature. Optimum online is a recently popular benchmark for online Bayesian problems which can use unbounded computation, but not "prophetic" knowledge of future inputs.
Our algorithm (which also works for the case of stochastic rewards) rounds a generalized LP relaxation from the unit-capacity case via a two-proposal algorithm, as in previous works in the online matching literature. A key technical challenge in deriving our guarantee is bounding the positive correlation among users introduced when rounding our LP relaxation online. Unlike in the case of unit capacities, this positive correlation is unavoidable for guarantees beyond $1/2$. Conceptually, our results show that the study of optimum online as a benchmark can reveal problem-specific insights that are irrelevant to competitive analysis.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Decoupling of neural network calibration measures
Authors:
Dominik Werner Wolf,
Prasannavenkatesh Balaji,
Alexander Braun,
Markus Ulrich
Abstract:
A lot of effort is currently invested in safeguarding autonomous driving systems, which heavily rely on deep neural networks for computer vision. We investigate the coupling of different neural network calibration measures with a special focus on the Area Under the Sparsification Error curve (AUSE) metric. We elaborate on the well-known inconsistency in determining optimal calibration using the Ex…
▽ More
A lot of effort is currently invested in safeguarding autonomous driving systems, which heavily rely on deep neural networks for computer vision. We investigate the coupling of different neural network calibration measures with a special focus on the Area Under the Sparsification Error curve (AUSE) metric. We elaborate on the well-known inconsistency in determining optimal calibration using the Expected Calibration Error (ECE) and we demonstrate similar issues for the AUSE, the Uncertainty Calibration Score (UCS), as well as the Uncertainty Calibration Error (UCE). We conclude that the current methodologies leave a degree of freedom, which prevents a unique model calibration for the homologation of safety-critical functionalities. Furthermore, we propose the AUSE as an indirect measure for the residual uncertainty, which is irreducible for a fixed network architecture and is driven by the stochasticity in the underlying data generation process (aleatoric contribution) as well as the limitation in the hypothesis space (epistemic contribution).
△ Less
Submitted 19 July, 2024; v1 submitted 4 June, 2024;
originally announced June 2024.
-
A Point-Based Approach to Efficient LiDAR Multi-Task Perception
Authors:
Christopher Lang,
Alexander Braun,
Lars Schillingmann,
Abhinav Valada
Abstract:
Multi-task networks can potentially improve performance and computational efficiency compared to single-task networks, facilitating online deployment. However, current multi-task architectures in point cloud perception combine multiple task-specific point cloud representations, each requiring a separate feature encoder and making the network structures bulky and slow. We propose PAttFormer, an eff…
▽ More
Multi-task networks can potentially improve performance and computational efficiency compared to single-task networks, facilitating online deployment. However, current multi-task architectures in point cloud perception combine multiple task-specific point cloud representations, each requiring a separate feature encoder and making the network structures bulky and slow. We propose PAttFormer, an efficient multi-task architecture for joint semantic segmentation and object detection in point clouds that only relies on a point-based representation. The network builds on transformer-based feature encoders using neighborhood attention and grid-pooling and a query-based detection decoder using a novel 3D deformable-attention detection head design. Unlike other LiDAR-based multi-task architectures, our proposed PAttFormer does not require separate feature encoders for multiple task-specific point cloud representations, resulting in a network that is 3x smaller and 1.4x faster while achieving competitive performance on the nuScenes and KITTI benchmarks for autonomous driving perception. Our extensive evaluations show substantial gains from multi-task learning, improving LiDAR semantic segmentation by +1.7% in mIou and 3D object detection by +1.7% in mAP on the nuScenes benchmark compared to the single-task models.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
Combine Influences of Nanoparticulate Hematite Thin Film Thickness, Roughness, and Weight on Its Photoelectrochemical Performance and Viscous/ Thermal Characteristics of Source Precursor
Authors:
Romy Loehnert,
Artur Braun,
Debajeet K. Bora
Abstract:
The objective of this work was to investigate the photoelectrochemical (PEC) performance of nanoparticulate hematite thin film photoelectrodes prepared by a soft-chemistry route. Two cost-effective thin film fabrication techniques were employed to deposit the hematite film. First, the film was deposited on conducting glass substrates by dip coating of the organic precursor containing fatty acid de…
▽ More
The objective of this work was to investigate the photoelectrochemical (PEC) performance of nanoparticulate hematite thin film photoelectrodes prepared by a soft-chemistry route. Two cost-effective thin film fabrication techniques were employed to deposit the hematite film. First, the film was deposited on conducting glass substrates by dip coating of the organic precursor containing fatty acid derivatives of iron salts. Process parameters such as the concentration of iron oleic acid derivative precursor solution, the thickness of the organic film, before annealing and the number of deposited layers along with their weight and roughness were studied. In the second approach, the influence of the spin coating process on film formation and respective photoelectrochemical (PEC) performance have been discussed. It was found that the PEC performance of spin-coated samples was lower than that of dip coated samples due to the effect of films rough and smooth characteristics. It is found that the rough surface of the photoelectrode is a prerequisite to achieving good photocurrent values. Here, three-layer samples with roughness between 600nm to 800nm and bulk thickness up to 700nm provided photocurrent densities of 0.6mA/cm2. In part 2 of the manuscript, we have elaborately discussed the roughness and smooth behavior of thin films using X-ray reflectometry technique. Followed by this, a detailed account of the viscous and thermal properties of the fatty acid derivatives of iron precursor have been discussed.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
X-ray reflectometric studies of nanoparticulate hematite films to decouple the rough and smooth behaviors of it and crystallographic and morphological properties concerning fatty acid chain length
Authors:
Debajeet K. Bora,
Romy Loehnart,
Artur Braun
Abstract:
In this study, the use of X-Ray reflectometry technique signifies the types of rough and smooth surfaces of hematite film prepared from different fatty acid derivatives of the iron salt. Followed by this, the film morphology and crystallographic properties concerning different fatty acid chain length have been discussed.
In this study, the use of X-Ray reflectometry technique signifies the types of rough and smooth surfaces of hematite film prepared from different fatty acid derivatives of the iron salt. Followed by this, the film morphology and crystallographic properties concerning different fatty acid chain length have been discussed.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
More on $G$-flux and General Hodge Cycles on the Fermat Sextic
Authors:
Andreas P. Braun,
Hugo Fortin,
Daniel Lopez Garcia,
Roberto Villaflor Loyola
Abstract:
We study M-Theory solutions with $G$-flux on the Fermat sextic Calabi-Yau fourfold, focussing on the relationship between the number of stabilized complex structure moduli and the tadpole contribution of the flux. We use two alternative approaches to define the fluxes: algebraic cycles and (appropriately quantized) Griffiths residues. In both cases, we collect evidence for the non-existence of sol…
▽ More
We study M-Theory solutions with $G$-flux on the Fermat sextic Calabi-Yau fourfold, focussing on the relationship between the number of stabilized complex structure moduli and the tadpole contribution of the flux. We use two alternative approaches to define the fluxes: algebraic cycles and (appropriately quantized) Griffiths residues. In both cases, we collect evidence for the non-existence of solutions which stabilize all moduli and stay within the tadpole bound
△ Less
Submitted 26 February, 2024; v1 submitted 31 December, 2023;
originally announced January 2024.
-
$G_2$ Mirrors from Calabi-Yau Mirrors
Authors:
Andreas P. Braun,
Richie Dadhley
Abstract:
We study the worldsheet CFTs of type II strings on compact $G_2$ orbifolds obtained as quotients of a product of a Calabi-Yau threefold and a circle. For such models, we argue that the Calabi-Yau mirror map implies a mirror map for the associated $G_2$ varieties by examining how anti-holomorphic involutions behave under Calabi-Yau mirror symmetry. The mirror geometries identified by the worldsheet…
▽ More
We study the worldsheet CFTs of type II strings on compact $G_2$ orbifolds obtained as quotients of a product of a Calabi-Yau threefold and a circle. For such models, we argue that the Calabi-Yau mirror map implies a mirror map for the associated $G_2$ varieties by examining how anti-holomorphic involutions behave under Calabi-Yau mirror symmetry. The mirror geometries identified by the worldsheet CFT are consistent with earlier proposals for twisted connected sum $G_2$ manifolds.
△ Less
Submitted 14 February, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
The reggeon model with the pomeron and odderon: renormalization group approach
Authors:
M. A. Braun,
E. M. Kuzminskii,
M. I. Vyazovsky
Abstract:
The Regge-Gribov model of the pomeron and odderon in the non-trivial transverse space is studied by the renormalization group technique. The single loop approximation is adopted.
Five real fixed points are found and the high-energy behaviour of the propagators is correspondingly calculated. As without odderon, the asymptotic is modulated by logarithms of energy in certain rational powers. Moveme…
▽ More
The Regge-Gribov model of the pomeron and odderon in the non-trivial transverse space is studied by the renormalization group technique. The single loop approximation is adopted.
Five real fixed points are found and the high-energy behaviour of the propagators is correspondingly calculated. As without odderon, the asymptotic is modulated by logarithms of energy in certain rational powers. Movement of coupling constants away from the fixed points is investigated both analytically (close to the fixed points) and numerically (far away). In the former case attraction occurs only in restricted domains of initial coupling constants. More generally in one third of the cases the coupling constants instead grow large indicating the breakdown of the single loop approximation.
△ Less
Submitted 26 June, 2024; v1 submitted 23 November, 2023;
originally announced November 2023.
-
Automatic Bat Call Classification using Transformer Networks
Authors:
Frank Fundel,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
Automatically identifying bat species from their echolocation calls is a difficult but important task for monitoring bats and the ecosystem they live in. Major challenges in automatic bat call identification are high call variability, similarities between species, interfering calls and lack of annotated data. Many currently available models suffer from relatively poor performance on real-life data…
▽ More
Automatically identifying bat species from their echolocation calls is a difficult but important task for monitoring bats and the ecosystem they live in. Major challenges in automatic bat call identification are high call variability, similarities between species, interfering calls and lack of annotated data. Many currently available models suffer from relatively poor performance on real-life data due to being trained on single call datasets and, moreover, are often too slow for real-time classification. Here, we propose a Transformer architecture for multi-label classification with potential applications in real-time classification scenarios. We train our model on synthetically generated multi-species recordings by merging multiple bats calls into a single recording with multiple simultaneous calls. Our approach achieves a single species accuracy of 88.92% (F1-score of 84.23%) and a multi species macro F1-score of 74.40% on our test set. In comparison to three other tools on the independent and publicly available dataset ChiroVox, our model achieves at least 25.82% better accuracy for single species classification and at least 6.9% better macro F1-score for multi species classification.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Finite groups, smooth invariants, and isolated quotient singularities
Authors:
Amiram Braun
Abstract:
Let G < SL(V) be a finite group, V is finite dimensional over a field F, p=char F and S(V) is the symmetric algebra of V. We determine when the subring of G-invariants S(V)^G is a polynomial ring. As a consequence, we classify, if F is algebraically closed, all S(V)^G which are isolated singularities. We show that the completion of S(V)^G, at its unique graded maximal ideal, is isomorphic to the c…
▽ More
Let G < SL(V) be a finite group, V is finite dimensional over a field F, p=char F and S(V) is the symmetric algebra of V. We determine when the subring of G-invariants S(V)^G is a polynomial ring. As a consequence, we classify, if F is algebraically closed, all S(V)^G which are isolated singularities. We show that the completion of S(V)^G, at its unique graded maximal ideal, is isomorphic to the completion of S(W)^H, where (H,W) is a reduction mod p of a member of the Zassenhaus-Vincent-Wolf list of complex isolated quotient singularities.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Classification robustness to common optical aberrations
Authors:
Patrick Müller,
Alexander Braun,
Margret Keuper
Abstract:
Computer vision using deep neural networks (DNNs) has brought about seminal changes in people's lives. Applications range from automotive, face recognition in the security industry, to industrial process monitoring. In some cases, DNNs infer even in safety-critical situations. Therefore, for practical applications, DNNs have to behave in a robust way to disturbances such as noise, pixelation, or b…
▽ More
Computer vision using deep neural networks (DNNs) has brought about seminal changes in people's lives. Applications range from automotive, face recognition in the security industry, to industrial process monitoring. In some cases, DNNs infer even in safety-critical situations. Therefore, for practical applications, DNNs have to behave in a robust way to disturbances such as noise, pixelation, or blur. Blur directly impacts the performance of DNNs, which are often approximated as a disk-shaped kernel to model defocus. However, optics suggests that there are different kernel shapes depending on wavelength and location caused by optical aberrations. In practice, as the optical quality of a lens decreases, such aberrations increase. This paper proposes OpticsBench, a benchmark for investigating robustness to realistic, practically relevant optical blur effects. Each corruption represents an optical aberration (coma, astigmatism, spherical, trefoil) derived from Zernike Polynomials. Experiments on ImageNet show that for a variety of different pre-trained DNNs, the performance varies strongly compared to disk-shaped kernels, indicating the necessity of considering realistic image degradations. In addition, we show on ImageNet-100 with OpticsAugment that robustness can be increased by using optical kernels as data augmentation. Compared to a conventionally trained ResNeXt50, training with OpticsAugment achieves an average performance gain of 21.7% points on OpticsBench and 6.8% points on 2D common corruptions.
△ Less
Submitted 29 August, 2023;
originally announced August 2023.
-
Occupancy Grid Map to Pose Graph-based Map: Robust BIM-based 2D-LiDAR Localization for Lifelong Indoor Navigation in Changing and Dynamic Environments
Authors:
Miguel Arturo Vega Torres,
Alexander Braun,
André Borrmann
Abstract:
Several studies rely on the de facto standard Adaptive Monte Carlo Localization (AMCL) method to localize a robot in an Occupancy Grid Map (OGM) extracted from a building information model (BIM model). However, most of these studies assume that the BIM model precisely represents the real world, which is rarely true. Discrepancies between the reference BIM model and the real world (Scan-BIM deviati…
▽ More
Several studies rely on the de facto standard Adaptive Monte Carlo Localization (AMCL) method to localize a robot in an Occupancy Grid Map (OGM) extracted from a building information model (BIM model). However, most of these studies assume that the BIM model precisely represents the real world, which is rarely true. Discrepancies between the reference BIM model and the real world (Scan-BIM deviations) are not only due to furniture or clutter but also the usual as-planned and as-built deviations that exist with any model created in the design phase. These deviations affect the accuracy of AMCL drastically. This paper proposes an open-source method to generate appropriate Pose Graph-based maps from BIM models for robust 2D-LiDAR localization in changing and dynamic environments. First, 2D OGMs are automatically generated from complex BIM models. These OGMs only represent structural elements allowing indoor autonomous robot navigation. Then, an efficient technique converts these 2D OGMs into Pose Graph-based maps enabling more accurate robot pose tracking. Finally, we leverage the different map representations for accurate, robust localization with a combination of state-of-the-art algorithms. Moreover, we provide a quantitative comparison of various state-of-the-art localization algorithms in three simulated scenarios with varying levels of Scan-BIM deviations and dynamic agents. More precisely, we compare two Particle Filter (PF) algorithms: AMCL and General Monte Carlo Localization (GMCL); and two Graph-based Localization (GBL) methods: Google's Cartographer and SLAM Toolbox, solving the global localization and pose tracking problems. The numerous experiments demonstrate that the proposed method contributes to a robust localization with an as-designed BIM model or a sparse OGM in changing and dynamic environments, outperforming the conventional AMCL in accuracy and robustness.
△ Less
Submitted 10 August, 2023;
originally announced August 2023.
-
ECS -- an Interactive Tool for Data Quality Assurance
Authors:
Christian Sieberichs,
Simon Geerkens,
Alexander Braun,
Thomas Waschulzik
Abstract:
With the increasing capabilities of machine learning systems and their potential use in safety-critical systems, ensuring high-quality data is becoming increasingly important. In this paper we present a novel approach for the assurance of data quality. For this purpose, the mathematical basics are first discussed and the approach is presented using multiple examples. This results in the detection…
▽ More
With the increasing capabilities of machine learning systems and their potential use in safety-critical systems, ensuring high-quality data is becoming increasingly important. In this paper we present a novel approach for the assurance of data quality. For this purpose, the mathematical basics are first discussed and the approach is presented using multiple examples. This results in the detection of data points with potentially harmful properties for the use in safety-critical systems.
△ Less
Submitted 17 July, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
QI2 -- an Interactive Tool for Data Quality Assurance
Authors:
Simon Geerkens,
Christian Sieberichs,
Alexander Braun,
Thomas Waschulzik
Abstract:
The importance of high data quality is increasing with the growing impact and distribution of ML systems and big data. Also the planned AI Act from the European commission defines challenging legal requirements for data quality especially for the market introduction of safety relevant ML systems. In this paper we introduce a novel approach that supports the data quality assurance process of multip…
▽ More
The importance of high data quality is increasing with the growing impact and distribution of ML systems and big data. Also the planned AI Act from the European commission defines challenging legal requirements for data quality especially for the market introduction of safety relevant ML systems. In this paper we introduce a novel approach that supports the data quality assurance process of multiple data quality aspects. This approach enables the verification of quantitative data quality requirements. The concept and benefits are introduced and explained on small example data sets. How the method is applied is demonstrated on the well known MNIST data set based an handwritten digits.
△ Less
Submitted 10 July, 2023; v1 submitted 7 July, 2023;
originally announced July 2023.
-
Windscreen Optical Quality for AI Algorithms: Refractive Power and MTF not Sufficient
Authors:
Dominik Werner Wolf,
Markus Ulrich,
Alexander Braun
Abstract:
Windscreen optical quality is an important aspect of any advanced driver assistance system, and also for future autonomous driving, as today at least some cameras of the sensor suite are situated behind the windscreen. Automotive mass production processes require measurement systems that characterize the optical quality of the windscreens in a meaningful way, which for modern perception stacks imp…
▽ More
Windscreen optical quality is an important aspect of any advanced driver assistance system, and also for future autonomous driving, as today at least some cameras of the sensor suite are situated behind the windscreen. Automotive mass production processes require measurement systems that characterize the optical quality of the windscreens in a meaningful way, which for modern perception stacks implies meaningful for artificial intelligence (AI) algorithms. The measured optical quality needs to be linked to the performance of these algorithms, such that performance limits - and thus production tolerance limits - can be defined. In this article we demonstrate that the main metric established in the industry - refractive power - is fundamentally not capable of capturing relevant optical properties of windscreens. Further, as the industry is moving towards the modulation transfer function (MTF) as an alternative, we mathematically show that this metric cannot be used on windscreens alone, but that the windscreen forms a novel optical system together with the optics of the camera system. Hence, the required goal of a qualification system that is installed at the windscreen supplier and independently measures the optical quality cannot be achieved using MTF. We propose a novel concept to determine the optical quality of windscreens and to use simulation to link this optical quality to the performance of AI algorithms, which can hopefully lead to novel inspection systems.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
Low-period spacing core-helium burning giants: `hot subdwarf analogues'?
Authors:
S. Hekker,
Y. Elsworth,
T. A. M. Braun,
S. Basu
Abstract:
Global stellar oscillations probe the internal structure of stars. In low- to intermediate-mass red giants, these oscillations provide signatures from both the outer regions of the star as well as from the core. These signatures are imprinted in e.g. the frequency of maximum oscillation power, and in the differences in periods of non-radial oscillations (period spacings), respectively. In core hel…
▽ More
Global stellar oscillations probe the internal structure of stars. In low- to intermediate-mass red giants, these oscillations provide signatures from both the outer regions of the star as well as from the core. These signatures are imprinted in e.g. the frequency of maximum oscillation power, and in the differences in periods of non-radial oscillations (period spacings), respectively. In core helium burning giants with masses below about 1.7 solar masses, i.e. stars that have gone through a helium flash, the asymptotic period spacings take values of about 220 -350 s at frequency of maximum oscillation power of $\sim$30-50 $μ$Hz. A set of stars with asymptotic period spacings lower than about 200 s at similar frequencies separations has recently been discovered by Elsworth and collaborators. In this work, we present a hypothesis for the formation scenario of these stars. We find that these stars can be the result of a mass-loss event at the end of the red-giant branch phase of stars massive enough to not have a degenerate core, i.e. one of the scenarios to form hot subdwarf stars. Therefore, these stars can be classified as `hot subdwarf analogues'. Interestingly, if mass loss continues gradually during the core helium burning phase, these stars turn hotter and denser, and could, therefore, be hot subdwarf progenitors as they shed more of their envelope.
△ Less
Submitted 7 August, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Self-Supervised Multi-Object Tracking For Autonomous Driving From Consistency Across Timescales
Authors:
Christopher Lang,
Alexander Braun,
Lars Schillingmann,
Abhinav Valada
Abstract:
Self-supervised multi-object trackers have tremendous potential as they enable learning from raw domain-specific data. However, their re-identification accuracy still falls short compared to their supervised counterparts. We hypothesize that this drawback results from formulating self-supervised objectives that are limited to single frames or frame pairs. Such formulations do not capture sufficien…
▽ More
Self-supervised multi-object trackers have tremendous potential as they enable learning from raw domain-specific data. However, their re-identification accuracy still falls short compared to their supervised counterparts. We hypothesize that this drawback results from formulating self-supervised objectives that are limited to single frames or frame pairs. Such formulations do not capture sufficient visual appearance variations to facilitate learning consistent re-identification features for autonomous driving when the frame rate is low or object dynamics are high. In this work, we propose a training objective that enables self-supervised learning of re-identification features from multiple sequential frames by enforcing consistent association scores across short and long timescales. We perform extensive evaluations demonstrating that re-identification features trained from longer sequences significantly reduce ID switches on standard autonomous driving datasets compared to existing self-supervised learning methods, which are limited to training on frame pairs. Using our proposed SubCo loss function, we set the new state-of-the-art among self-supervised methods and even perform on par with fully supervised learning methods.
△ Less
Submitted 21 September, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Tadpoles and Gauge Symmetries
Authors:
Andreas P. Braun,
Bernardo Fraiman,
Mariana Graña,
Severin Lüst,
Héctor Parra de Freitas
Abstract:
The tadpole conjecture proposes that complex structure moduli stabilisation by fluxes that have low tadpole charge can be realised only at special points in moduli space, leading generically to (large) gauge symmetries. Here we provide an exhaustive survey of the gauge symmetries arising in F-theory flux compactifications on products of attractive $\mbox{K3}$ surfaces, with complex structure modul…
▽ More
The tadpole conjecture proposes that complex structure moduli stabilisation by fluxes that have low tadpole charge can be realised only at special points in moduli space, leading generically to (large) gauge symmetries. Here we provide an exhaustive survey of the gauge symmetries arising in F-theory flux compactifications on products of attractive $\mbox{K3}$ surfaces, with complex structure moduli fully stabilised. We compute the minimal rank of the left-over non-abelian gauge group for all flux configurations within the tadpole bound, finding that it is always non-zero. It decreases in a roughly linear fashion with the tadpole charge, reaching zero at charge 30. By working out possible gauge algebras for different values of the tadpole, we find that all simple ADE Lie algebras of rank $\le 18$ appear.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
$G_2$-Manifolds from 4d N=1 Theories, Part I: Domain Walls
Authors:
Andreas P. Braun,
Evyatar Sabag,
Matteo Sacchi,
Sakura Schafer-Nameki
Abstract:
We propose new $G_2$-holonomy manifolds, which geometrize the Gaiotto-Kim 4d N=1 duality domain walls of 5d N=1 theories. These domain walls interpolate between different extended Coulomb branch phases of a given 5d superconformal field theory. Our starting point is the geometric realization of such a 5d superconformal field theory and its extended Coulomb branch in terms of M-theory on a non-comp…
▽ More
We propose new $G_2$-holonomy manifolds, which geometrize the Gaiotto-Kim 4d N=1 duality domain walls of 5d N=1 theories. These domain walls interpolate between different extended Coulomb branch phases of a given 5d superconformal field theory. Our starting point is the geometric realization of such a 5d superconformal field theory and its extended Coulomb branch in terms of M-theory on a non-compact singular Calabi-Yau three-fold and its Kähler cone. We construct the 7-manifold that realizes the domain wall in M-theory by fibering the Calabi-Yau three-fold over a real line, whilst varying its Kähler parameters as prescribed by the domain wall construction. In particular this requires the Calabi-Yau fiber to pass through a canonical singularity at the locus of the domain wall. Due to the 4d N=1 supersymmetry that is preserved on the domain wall, we expect the resulting 7-manifold to have holonomy $G_2$. Indeed, for simple domain wall theories, this construction results in 7-manifolds, which are known to admit torsion-free $G_2$-holonomy metrics. We develop several generalizations to new 7-manifolds, which realize domain walls in 5d SQCD theories and walls between 5d theories which are UV-dual.
△ Less
Submitted 3 April, 2023;
originally announced April 2023.
-
Production of cumulative pions and percolation of strings
Authors:
M. A. Braun
Abstract:
Production of pions in high-energy collisions with nuclei in the kinematics prohibited for free nucleons ("cumulative pions") is studied in the fusing color string model.The model describes the so-called direct mechanism for cumulative production. The other, spectator mechanism dominates in production of cumulative protons but is suppressed for pions. In the model cumulative pions are generated by…
▽ More
Production of pions in high-energy collisions with nuclei in the kinematics prohibited for free nucleons ("cumulative pions") is studied in the fusing color string model.The model describes the so-called direct mechanism for cumulative production. The other, spectator mechanism dominates in production of cumulative protons but is suppressed for pions. In the model cumulative pions are generated by string fusion which raises the maximal energy of produced partons above the level of the free nucleon kinematics. Momentum and multiplicity sum rules are used to determine the spectra in the deep fragmentation region. Predicted spectra of cumulative pions exponentially fall with the scaling variable $x$ in the interval $1<x<3$ with a slope of the order 5$÷$5.6, which agrees well with the raw data obtained in the recent experiment at RHIC with Cu-Au collisioins. However the agreement is worse for the so-called unfolded data, presumably taking into account corrections due to the expermental set-up and having rather a power-like form.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences
Authors:
Christopher Lang,
Alexander Braun,
Lars Schillingmann,
Karsten Haug,
Abhinav Valada
Abstract:
Self-supervised feature learning enables perception systems to benefit from the vast raw data recorded by vehicle fleets worldwide. While video-level self-supervised learning approaches have shown strong generalizability on classification tasks, the potential to learn dense representations from sequential data has been relatively unexplored. In this work, we propose TempO, a temporal ordering pret…
▽ More
Self-supervised feature learning enables perception systems to benefit from the vast raw data recorded by vehicle fleets worldwide. While video-level self-supervised learning approaches have shown strong generalizability on classification tasks, the potential to learn dense representations from sequential data has been relatively unexplored. In this work, we propose TempO, a temporal ordering pretext task for pre-training region-level feature representations for perception tasks. We embed each frame by an unordered set of proposal feature vectors, a representation that is natural for object detection or tracking systems, and formulate the sequential ordering by predicting frame transition probabilities in a transformer-based multi-frame architecture whose complexity scales less than quadratic with respect to the sequence length. Extensive evaluations on the BDD100K, nuImages, and MOT17 datasets show that our TempO pre-training approach outperforms single-frame self-supervised learning methods as well as supervised transfer learning initialization strategies, achieving an improvement of +0.7% in mAP for object detection and +2.0% in the HOTA score for multi-object tracking.
△ Less
Submitted 8 November, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Hierarchically Structured Task-Agnostic Continual Learning
Authors:
Heinke Hihn,
Daniel A. Braun
Abstract:
One notable weakness of current machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning paradigm has emerged as a protocol to systematically investigate settings where the model sequentially observes samples generated by a series of tasks. In this work, we take a task-agnostic view of continual learnin…
▽ More
One notable weakness of current machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning paradigm has emerged as a protocol to systematically investigate settings where the model sequentially observes samples generated by a series of tasks. In this work, we take a task-agnostic view of continual learning and develop a hierarchical information-theoretic optimality principle that facilitates a trade-off between learning and forgetting. We derive this principle from a Bayesian perspective and show its connections to previous approaches to continual learning. Based on this principle, we propose a neural network layer, called the Mixture-of-Variational-Experts layer, that alleviates forgetting by creating a set of information processing paths through the network which is governed by a gating policy. Equipped with a diverse and specialized set of parameters, each path can be regarded as a distinct sub-network that learns to solve tasks. To improve expert allocation, we introduce diversity objectives, which we evaluate in additional ablation studies. Importantly, our approach can operate in a task-agnostic way, i.e., it does not require task-specific knowledge, as is the case with many existing continual learning algorithms. Due to the general formulation based on generic utility functions, we can apply this optimality principle to a large variety of learning problems, including supervised learning, reinforcement learning, and generative modeling. We demonstrate the competitive performance of our method on continual reinforcement learning and variants of the MNIST, CIFAR-10, and CIFAR-100 datasets.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Simplified Prophet Inequalities for Combinatorial Auctions
Authors:
Alexander Braun,
Thomas Kesselheim
Abstract:
We consider prophet inequalities for XOS and MPH-$k$ combinatorial auctions and give a simplified proof for the existence of static and anonymous item prices which recover the state-of-the-art competitive ratios.
Our proofs make use of a linear programming formulation which has a non-negative objective value if there are prices which admit a given competitive ratio $α\geq 1$. Changing our perspe…
▽ More
We consider prophet inequalities for XOS and MPH-$k$ combinatorial auctions and give a simplified proof for the existence of static and anonymous item prices which recover the state-of-the-art competitive ratios.
Our proofs make use of a linear programming formulation which has a non-negative objective value if there are prices which admit a given competitive ratio $α\geq 1$. Changing our perspective to dual space by an application of strong LP duality, we use an interpretation of the dual variables as probabilities to directly obtain our result. In contrast to previous work, our proofs do not require to argue about specific values of buyers for bundles, but only about the presence or absence of items.
As a side remark, for any $k \geq 2$, this simplification also leads to a tiny improvement in the best competitive ratio for MPH-$k$ combinatorial auctions from $4k-2$ to $2k + 2 \sqrt{k(k-1)} -1$.
△ Less
Submitted 1 November, 2022;
originally announced November 2022.
-
Change Detection for Local Explainability in Evolving Data Streams
Authors:
Johannes Haug,
Alexander Braun,
Stefan Zürn,
Gjergji Kasneci
Abstract:
As complex machine learning models are increasingly used in sensitive applications like banking, trading or credit scoring, there is a growing demand for reliable explanation mechanisms. Local feature attribution methods have become a popular technique for post-hoc and model-agnostic explanations. However, attribution methods typically assume a stationary environment in which the predictive model…
▽ More
As complex machine learning models are increasingly used in sensitive applications like banking, trading or credit scoring, there is a growing demand for reliable explanation mechanisms. Local feature attribution methods have become a popular technique for post-hoc and model-agnostic explanations. However, attribution methods typically assume a stationary environment in which the predictive model has been trained and remains stable. As a result, it is often unclear how local attributions behave in realistic, constantly evolving settings such as streaming and online applications. In this paper, we discuss the impact of temporal change on local feature attributions. In particular, we show that local attributions can become obsolete each time the predictive model is updated or concept drift alters the data generating distribution. Consequently, local feature attributions in data streams provide high explanatory power only when combined with a mechanism that allows us to detect and respond to local changes over time. To this end, we present CDLEEDS, a flexible and model-agnostic framework for detecting local change and concept drift. CDLEEDS serves as an intuitive extension of attribution-based explanation techniques to identify outdated local attributions and enable more targeted recalculations. In experiments, we also show that the proposed framework can reliably detect both local and global concept drift. Accordingly, our work contributes to a more meaningful and robust explainability in online machine learning.
△ Less
Submitted 6 September, 2022;
originally announced September 2022.
-
Thermodynamic fluctuation theorems govern human sensorimotor learning
Authors:
Pedro Hack,
Cecilia Lindig-Leon,
Sebastian Gottwald,
Daniel A. Braun
Abstract:
The application of thermodynamic reasoning in the study of learning systems has a long tradition. Recently, new tools relating perfect thermodynamic adaptation to the adaptation process have been developed. These results, known as fluctuation theorems, have been tested experimentally in several physical scenarios and, moreover, they have been shown to be valid under broad mathematical conditions.…
▽ More
The application of thermodynamic reasoning in the study of learning systems has a long tradition. Recently, new tools relating perfect thermodynamic adaptation to the adaptation process have been developed. These results, known as fluctuation theorems, have been tested experimentally in several physical scenarios and, moreover, they have been shown to be valid under broad mathematical conditions. Hence, although not experimentally challenged yet, they are presumed to apply to learning systems as well. Here we address this challenge by testing the applicability of fluctuation theorems in learning systems, more specifically, in human sensorimotor learning. In particular, we relate adaptive movement trajectories in a changing visuomotor rotation task to fully adapted steady-state behavior of individual participants. We find that human adaptive behavior in our task is generally consistent with fluctuation theorem predictions and discuss the merits and limitations of the approach.
△ Less
Submitted 3 January, 2023; v1 submitted 2 September, 2022;
originally announced September 2022.
-
Majorization requires infinitely many second laws
Authors:
Pedro Hack,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
Majorization is a fundamental model of uncertainty with several applications in areas ranging from thermodynamics to entanglement theory, and constitutes one of the pillars of the resource-theoretic approach to physics. Here, we improve on its relation to measurement apparatuses. In particular, after discussing what the proper notion of second law in this scenario is, we show that, for a sufficien…
▽ More
Majorization is a fundamental model of uncertainty with several applications in areas ranging from thermodynamics to entanglement theory, and constitutes one of the pillars of the resource-theoretic approach to physics. Here, we improve on its relation to measurement apparatuses. In particular, after discussing what the proper notion of second law in this scenario is, we show that, for a sufficiently large state space, any family of entropy-like functions constituting a second law must be countably infinite. Moreover, we provide an analogous result for a variation of majorization known as thermo-majorization which, in fact, does not require any constraint on the state space provided the equilibrium distribution is not uniform. Lastly, we discuss the applicability of our results to molecular diffusion and catalytic majorization. In this regard, we consider a variation of majorization used in plasma physics as a model of molecular diffusion and show that no finite family of entropy-like functions constituting a second law of molecular diffusion exists. Moreover, we show how our results are useful when dealing with a conjecture regarding catalytic majorization (i.e. trumping). In particular, we show that the sort of characterizations of trumping that have been considered before require an infinite family of real-valued functions.
△ Less
Submitted 14 June, 2024; v1 submitted 22 July, 2022;
originally announced July 2022.
-
Countability constraints in order-theoretic approaches to computability
Authors:
Pedro Hack,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
Computability on uncountable sets has no standard formalization, unlike that on countable sets, which is given by Turing machines. Some of the approaches to define computability in these sets rely on order-theoretic structures to translate such notions from Turing machines to uncountable spaces. Since these machines are used as a baseline for computability in these approaches, countability restric…
▽ More
Computability on uncountable sets has no standard formalization, unlike that on countable sets, which is given by Turing machines. Some of the approaches to define computability in these sets rely on order-theoretic structures to translate such notions from Turing machines to uncountable spaces. Since these machines are used as a baseline for computability in these approaches, countability restrictions on the ordered structures are fundamental. Here, we show several relations between the usual countability restrictions in order-theoretic theories of computability and some more common order-theoretic countability constraints, like order density properties and functional characterizations of the order structure in terms of multi-utilities. As a result, we show how computability can be introduced in some order structures via countability order density and multi-utility constraints.
△ Less
Submitted 28 May, 2024; v1 submitted 29 June, 2022;
originally announced June 2022.
-
Computation as uncertainty reduction: a simplified order-theoretic framework
Authors:
Pedro Hack,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
Although there is a somewhat standard formalization of computability on countable sets given by Turing machines, the same cannot be said about uncountable sets. Among the approaches to define computability in these sets, order-theoretic structures have proven to be useful. Here, we discuss the mathematical structure needed to define computability using order-theoretic concepts. In particular, we i…
▽ More
Although there is a somewhat standard formalization of computability on countable sets given by Turing machines, the same cannot be said about uncountable sets. Among the approaches to define computability in these sets, order-theoretic structures have proven to be useful. Here, we discuss the mathematical structure needed to define computability using order-theoretic concepts. In particular, we introduce a more general framework and discuss its limitations compared to the previous one in domain theory. We expose four features in which the stronger requirements in the domain-theoretic structure allow to improve upon the more general framework: computable elements, computable functions, model dependence of computability and complexity theory. Crucially, we show computability of elements in uncountable spaces can be defined in this new setup, and argue why this is not the case for computable functions. Moreover, we show the stronger setup diminishes the dependence of computability on the chosen order-theoretic structure and that, although a suitable complexity theory can be defined in the stronger framework and the more general one posesses a notion of computable elements, there appears to be no proper notion of element complexity in the latter.
△ Less
Submitted 6 September, 2022; v1 submitted 28 June, 2022;
originally announced June 2022.
-
On a geometrical notion of dimension for partially ordered sets
Authors:
Pedro Hack,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
The well-known notion of dimension for partial orders by Dushnik and Miller allows to quantify the degree of incomparability and, thus, is regarded as a measure of complexity for partial orders. However, despite its usefulness, its definition is somewhat disconnected from the geometrical idea of dimension, where, essentially, the number of dimensions indicates how many real lines are required to r…
▽ More
The well-known notion of dimension for partial orders by Dushnik and Miller allows to quantify the degree of incomparability and, thus, is regarded as a measure of complexity for partial orders. However, despite its usefulness, its definition is somewhat disconnected from the geometrical idea of dimension, where, essentially, the number of dimensions indicates how many real lines are required to represent the underlying partially ordered set.
Here, we introduce a variation of the Dushnik-Miller notion of dimension that is closer to geometry, the Debreu dimension, and show the following main results: (i) how to construct its building blocks under some countability restrictions, (ii) its relation to other notions of dimension in the literature, and (iii), as an application of the above, we improve on the classification of preordered spaces through real-valued monotones.
△ Less
Submitted 2 September, 2022; v1 submitted 30 March, 2022;
originally announced March 2022.
-
Local one-dimensional reggeon model of the interaction of several pomerons
Authors:
M. A. Braun,
E. M. Kuzminskii,
M. I. Vyazovsky
Abstract:
We consider the one-dimensional local reggeon theory describing the leading pomeron with the conformal spin $l=0$ and two subdominant pomerons with $l=\pm 2$. The dependence of the propagators of pomerons and the $hA$ amplitude on rapidity are found numerically by integrating the evolution equation.
We consider the one-dimensional local reggeon theory describing the leading pomeron with the conformal spin $l=0$ and two subdominant pomerons with $l=\pm 2$. The dependence of the propagators of pomerons and the $hA$ amplitude on rapidity are found numerically by integrating the evolution equation.
△ Less
Submitted 3 May, 2022; v1 submitted 27 March, 2022;
originally announced March 2022.
-
On Hyperbolic Embeddings in 2D Object Detection
Authors:
Christopher Lang,
Alexander Braun,
Abhinav Valada
Abstract:
Object detection, for the most part, has been formulated in the euclidean space, where euclidean or spherical geodesic distances measure the similarity of an image region to an object class prototype. In this work, we study whether a hyperbolic geometry better matches the underlying structure of the object classification space. We incorporate a hyperbolic classifier in two-stage, keypoint-based, a…
▽ More
Object detection, for the most part, has been formulated in the euclidean space, where euclidean or spherical geodesic distances measure the similarity of an image region to an object class prototype. In this work, we study whether a hyperbolic geometry better matches the underlying structure of the object classification space. We incorporate a hyperbolic classifier in two-stage, keypoint-based, and transformer-based object detection architectures and evaluate them on large-scale, long-tailed, and zero-shot object detection benchmarks. In our extensive experimental evaluations, we observe categorical class hierarchies emerging in the structure of the classification space, resulting in lower classification errors and boosting the overall object detection performance.
△ Less
Submitted 18 March, 2022; v1 submitted 15 March, 2022;
originally announced March 2022.
-
The classification of preordered spaces in terms of monotones: complexity and optimization
Authors:
Pedro Hack,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
The study of complexity and optimization in decision theory involves both partial and complete characterizations of preferences over decision spaces in terms of real-valued monotones. With this motivation, and following the recent introduction of new classes of monotones, like injective monotones or strict monotone multi-utilities, we present the classification of preordered spaces in terms of bot…
▽ More
The study of complexity and optimization in decision theory involves both partial and complete characterizations of preferences over decision spaces in terms of real-valued monotones. With this motivation, and following the recent introduction of new classes of monotones, like injective monotones or strict monotone multi-utilities, we present the classification of preordered spaces in terms of both the existence and cardinality of real-valued monotones and the cardinality of the quotient space. In particular, we take advantage of a characterization of real-valued monotones in terms of separating families of increasing sets in order to obtain a more complete classification consisting of classes that are strictly different from each other. As a result, we gain new insight into both complexity and optimization, and clarify their interplay in preordered spaces.
△ Less
Submitted 14 August, 2022; v1 submitted 24 February, 2022;
originally announced February 2022.
-
Jarzyski's equality and Crooks' fluctuation theorem for general Markov chains with application to decision-making systems
Authors:
Pedro Hack,
Sebastian Gottwald,
Daniel A. Braun
Abstract:
We define common thermodynamic concepts purely within the framework of general Markov chains and derive Jarzynski's equality and Crooks' fluctuation theorem in this setup. In particular, we regard the discrete time case that leads to an asymmetry in the definition of work that appears in the usual formulation of Crooks' fluctuation theorem. We show how this asymmetry can be avoided with an additio…
▽ More
We define common thermodynamic concepts purely within the framework of general Markov chains and derive Jarzynski's equality and Crooks' fluctuation theorem in this setup. In particular, we regard the discrete time case that leads to an asymmetry in the definition of work that appears in the usual formulation of Crooks' fluctuation theorem. We show how this asymmetry can be avoided with an additional condition regarding the energy protocol. The general formulation in terms of Markov chains allows transferring the results to other application areas outside of physics. Here, we discuss how this framework can be applied in the context of decision-making. This involves the definition of the relevant quantities, the assumptions that need to be made for the different fluctuation theorems to hold, as well as the consideration of discrete trajectories instead of the continuous trajectories, which are relevant in physics.
△ Less
Submitted 25 November, 2022; v1 submitted 11 February, 2022;
originally announced February 2022.
-
ExAID: A Multimodal Explanation Framework for Computer-Aided Diagnosis of Skin Lesions
Authors:
Adriano Lucieri,
Muhammad Naseer Bajwa,
Stephan Alexander Braun,
Muhammad Imran Malik,
Andreas Dengel,
Sheraz Ahmed
Abstract:
One principal impediment in the successful deployment of AI-based Computer-Aided Diagnosis (CAD) systems in clinical workflows is their lack of transparent decision making. Although commonly used eXplainable AI methods provide some insight into opaque algorithms, such explanations are usually convoluted and not readily comprehensible except by highly trained experts. The explanation of decisions r…
▽ More
One principal impediment in the successful deployment of AI-based Computer-Aided Diagnosis (CAD) systems in clinical workflows is their lack of transparent decision making. Although commonly used eXplainable AI methods provide some insight into opaque algorithms, such explanations are usually convoluted and not readily comprehensible except by highly trained experts. The explanation of decisions regarding the malignancy of skin lesions from dermoscopic images demands particular clarity, as the underlying medical problem definition is itself ambiguous. This work presents ExAID (Explainable AI for Dermatology), a novel framework for biomedical image analysis, providing multi-modal concept-based explanations consisting of easy-to-understand textual explanations supplemented by visual maps justifying the predictions. ExAID relies on Concept Activation Vectors to map human concepts to those learnt by arbitrary Deep Learning models in latent space, and Concept Localization Maps to highlight concepts in the input space. This identification of relevant concepts is then used to construct fine-grained textual explanations supplemented by concept-wise location information to provide comprehensive and coherent multi-modal explanations. All information is comprehensively presented in a diagnostic interface for use in clinical routines. An educational mode provides dataset-level explanation statistics and tools for data and model exploration to aid medical research and education. Through rigorous quantitative and qualitative evaluation of ExAID, we show the utility of multi-modal explanations for CAD-assisted scenarios even in case of wrong predictions. We believe that ExAID will provide dermatologists an effective screening tool that they both understand and trust. Moreover, it will be the basis for similar applications in other biomedical imaging fields.
△ Less
Submitted 4 January, 2022;
originally announced January 2022.
-
Contrastive Object Detection Using Knowledge Graph Embeddings
Authors:
Christopher Lang,
Alexander Braun,
Abhinav Valada
Abstract:
Object recognition for the most part has been approached as a one-hot problem that treats classes to be discrete and unrelated. Each image region has to be assigned to one member of a set of objects, including a background class, disregarding any similarities in the object types. In this work, we compare the error statistics of the class embeddings learned from a one-hot approach with semantically…
▽ More
Object recognition for the most part has been approached as a one-hot problem that treats classes to be discrete and unrelated. Each image region has to be assigned to one member of a set of objects, including a background class, disregarding any similarities in the object types. In this work, we compare the error statistics of the class embeddings learned from a one-hot approach with semantically structured embeddings from natural language processing or knowledge graphs that are widely applied in open world object detection. Extensive experimental results on multiple knowledge-embeddings as well as distance metrics indicate that knowledge-based class representations result in more semantically grounded misclassifications while performing on par compared to one-hot methods on the challenging COCO and Cityscapes object detection benchmarks. We generalize our findings to multiple object detection architectures by proposing a knowledge-embedded design for keypoint-based and transformer-based object detection architectures.
△ Less
Submitted 21 December, 2021;
originally announced December 2021.
-
3N potentials in the Faddeev coordinate space approach to Nd scattering
Authors:
M. A. Braun,
V. M. Suslov,
I. Filikhin,
B. Vlahovic
Abstract:
In the last decade, for studying 3$N$ bound states and $Nd$ scattering the Tucson-Melbourne (TM) and Urbana 3$N$ force derived from the chiral EFT have been applied. We plan to use the TM 3$N$ force for studying the $Nd$ scattering on the basis of the Faddeev equations in configuration space. In the given paper, we present our final formulas for components of the TM 3$N$ potential obtained in the…
▽ More
In the last decade, for studying 3$N$ bound states and $Nd$ scattering the Tucson-Melbourne (TM) and Urbana 3$N$ force derived from the chiral EFT have been applied. We plan to use the TM 3$N$ force for studying the $Nd$ scattering on the basis of the Faddeev equations in configuration space. In the given paper, we present our final formulas for components of the TM 3$N$ potential obtained in the coordinate space.
△ Less
Submitted 18 November, 2021;
originally announced November 2021.
-
Mixture-of-Variational-Experts for Continual Learning
Authors:
Heinke Hihn,
Daniel A. Braun
Abstract:
One weakness of machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning (CL) paradigm has emerged as a protocol to systematically investigate settings where the model sequentially observes samples generated by a series of tasks. In this work, we take a task-agnostic view of continual learning and devel…
▽ More
One weakness of machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning (CL) paradigm has emerged as a protocol to systematically investigate settings where the model sequentially observes samples generated by a series of tasks. In this work, we take a task-agnostic view of continual learning and develop a hierarchical information-theoretic optimality principle that facilitates a trade-off between learning and forgetting. We discuss this principle from a Bayesian perspective and show its connections to previous approaches to CL. Based on this principle, we propose a neural network layer, called the Mixture-of-Variational-Experts layer, that alleviates forgetting by creating a set of information processing paths through the network which is governed by a gating policy. Due to the general formulation based on generic utility functions, we can apply this optimality principle to a large variety of learning problems, including supervised learning, reinforcement learning, and generative modeling. We demonstrate the competitive performance of our method in continual supervised learning and in continual reinforcement learning.
△ Less
Submitted 1 March, 2022; v1 submitted 25 October, 2021;
originally announced October 2021.
-
Four-pomeron vertex
Authors:
M. A. Braun
Abstract:
The four-pomeron vertex is studied in the perturbative QCD. Its dominating terms of the leading (zeroth and first) orders in the coupling constant and subdominant in the number of colors are constructed. The vertex consists of two terms, one with a derivative in rapidity $\pd_y$ and the other with the BFKL interaction between pomerons. The corresponding part of the action and equations of motion a…
▽ More
The four-pomeron vertex is studied in the perturbative QCD. Its dominating terms of the leading (zeroth and first) orders in the coupling constant and subdominant in the number of colors are constructed. The vertex consists of two terms, one with a derivative in rapidity $\pd_y$ and the other with the BFKL interaction between pomerons. The corresponding part of the action and equations of motion are found. The iterative solution of the latter is possible only for rapidities smaller than 2 and quite large coupling constant $α_s$, of the order or greater than unity, when the quadruple pomeron interaction is relatively small. Also iteration of the part with $\pd_y$ is unstable in the infrared region and compels to introduce an infrared cut. The variational approach with simple trying functions allows to find the minimum of the action at $α_s$ of the order 0.2 and rapidities up to 25. Numerical estimates for O-O collisions show that actually the influence of the quadruple pomeron interaction turns out to be rather small.
△ Less
Submitted 24 September, 2021;
originally announced September 2021.
-
Estimating fractal dimensions: a comparative review and open source implementations
Authors:
George Datseris,
Inga Kottlarz,
Anton P. Braun,
Ulrich Parlitz
Abstract:
The fractal dimension is a central quantity in nonlinear dynamics and can be estimated via several different numerical techniques. In this review paper we present a self-contained and comprehensive introduction to the fractal dimension. We collect and present various numerical estimators and focus on the three most promising ones: generalized entropy, correlation sum, and extreme value theory. We…
▽ More
The fractal dimension is a central quantity in nonlinear dynamics and can be estimated via several different numerical techniques. In this review paper we present a self-contained and comprehensive introduction to the fractal dimension. We collect and present various numerical estimators and focus on the three most promising ones: generalized entropy, correlation sum, and extreme value theory. We then perform an extensive quantitative evaluation of these estimators, comparing their performance and precision using different datasets and comparing the impact of features like length, noise, embedding dimension, falsify-ability, among many others. Our analysis shows that for synthetic noiseless data the correlation sum is the best estimator with extreme value theory following closely. For real experimental data we found the correlation sum to be more strongly affected by noise versus the entropy and extreme value theory. The recent extreme value theory estimator seems powerful as it has some of the advantages of both alternative methods. However, using four different ways for checking for significance, we found that the method yielded ``significant' low-dimensional results for inappropriate data like stock market timeseries. This fact, combined with some ambiguities we found in the literature of the method applications, have implications for both previous and future real world applications using the extreme value theory approach, as, for example, the argument for small effective dimensionality in the data cannot come from the method itself. All algorithms discussed are implemented as performant and easy to use open source code via the DynamicalSystems.jl library.
△ Less
Submitted 23 September, 2023; v1 submitted 13 September, 2021;
originally announced September 2021.
-
Representing preorders with injective monotones
Authors:
Pedro Hack,
Daniel A. Braun,
Sebastian Gottwald
Abstract:
We introduce a new class of real-valued monotones in preordered spaces, injective monotones. We show that the class of preorders for which they exist lies in between the class of preorders with strict monotones and preorders with countable multi-utilities, improving upon the known classification of preordered spaces through real-valued monotones. We extend several well-known results for strict mon…
▽ More
We introduce a new class of real-valued monotones in preordered spaces, injective monotones. We show that the class of preorders for which they exist lies in between the class of preorders with strict monotones and preorders with countable multi-utilities, improving upon the known classification of preordered spaces through real-valued monotones. We extend several well-known results for strict monotones (Richter-Peleg functions) to injective monotones, we provide a construction of injective monotones from countable multi-utilities, and relate injective monotones to classic results concerning Debreu denseness and order separability. Along the way, we connect our results to Shannon entropy and the uncertainty preorder, obtaining new insights into how they are related. In particular, we show how injective montones can be used to generalize some appealing properties of Jaynes' maximum entropy principle, which is considered a basis for statistical inference and serves as a justification for many regularization techniques that appear throughout machine learning and decision theory.
△ Less
Submitted 24 November, 2021; v1 submitted 30 July, 2021;
originally announced July 2021.
-
Convergence rates for shallow neural networks learned by gradient descent
Authors:
Alina Braun,
Michael Kohler,
Sophie Langer,
Harro Walk
Abstract:
In this paper we analyze the $L_2$ error of neural network regression estimates with one hidden layer. Under the assumption that the Fourier transform of the regression function decays suitably fast, we show that an estimate, where all initial weights are chosen according to proper uniform distributions and where the weights are learned by gradient descent, achieves a rate of convergence of…
▽ More
In this paper we analyze the $L_2$ error of neural network regression estimates with one hidden layer. Under the assumption that the Fourier transform of the regression function decays suitably fast, we show that an estimate, where all initial weights are chosen according to proper uniform distributions and where the weights are learned by gradient descent, achieves a rate of convergence of $1/\sqrt{n}$ (up to a logarithmic factor). Our statistical analysis implies that the key aspect behind this result is the proper choice of the initial inner weights and the adjustment of the outer weights via gradient descent. This indicates that we can also simply use linear least squares to choose the outer weights. We prove a corresponding theoretical result and compare our new linear least squares neural network estimate with standard neural network estimates via simulated data. Our simulations show that our theoretical considerations lead to an estimate with an improved performance in many cases.
△ Less
Submitted 18 August, 2023; v1 submitted 20 July, 2021;
originally announced July 2021.
-
A Comparison of Methods for OOV-word Recognition on a New Public Dataset
Authors:
Rudolf A. Braun,
Srikanth Madikeri,
Petr Motlicek
Abstract:
A common problem for automatic speech recognition systems is how to recognize words that they did not see during training. Currently there is no established method of evaluating different techniques for tackling this problem. We propose using the CommonVoice dataset to create test sets for multiple languages which have a high out-of-vocabulary (OOV) ratio relative to a training set and release a n…
▽ More
A common problem for automatic speech recognition systems is how to recognize words that they did not see during training. Currently there is no established method of evaluating different techniques for tackling this problem. We propose using the CommonVoice dataset to create test sets for multiple languages which have a high out-of-vocabulary (OOV) ratio relative to a training set and release a new tool for calculating relevant performance metrics. We then evaluate, within the context of a hybrid ASR system, how much better subword models are at recognizing OOVs, and how much benefit one can get from incorporating OOV-word information into an existing system by modifying WFSTs. Additionally, we propose a new method for modifying a subword-based language model so as to better recognize OOV-words. We showcase very large improvements in OOV-word recognition and make both the data and code available.
△ Less
Submitted 16 July, 2021;
originally announced July 2021.
-
Asymptotically Optimal Welfare of Posted Pricing for Multiple Items with MHR Distributions
Authors:
Alexander Braun,
Matthias Buttkus,
Thomas Kesselheim
Abstract:
We consider the problem of posting prices for unit-demand buyers if all $n$ buyers have identically distributed valuations drawn from a distribution with monotone hazard rate. We show that even with multiple items asymptotically optimal welfare can be guaranteed.
Our main results apply to the case that either a buyer's value for different items are independent or that they are perfectly correlat…
▽ More
We consider the problem of posting prices for unit-demand buyers if all $n$ buyers have identically distributed valuations drawn from a distribution with monotone hazard rate. We show that even with multiple items asymptotically optimal welfare can be guaranteed.
Our main results apply to the case that either a buyer's value for different items are independent or that they are perfectly correlated. We give mechanisms using dynamic prices that obtain a $1 - Θ\left( \frac{1}{\log n}\right)$-fraction of the optimal social welfare in expectation. Furthermore, we devise mechanisms that only use static item prices and are $1 - Θ\left( \frac{\log\log\log n}{\log n}\right)$-competitive compared to the optimal social welfare. As we show, both guarantees are asymptotically optimal, even for a single item and exponential distributions.
△ Less
Submitted 1 July, 2021;
originally announced July 2021.
-
Gauged 2-form Symmetries in 6D SCFTs Coupled to Gravity
Authors:
Andreas P. Braun,
Magdalena Larfors,
Paul-Konstantin Oehlmann
Abstract:
We study six dimensional supergravity theories with superconformal sectors (SCFTs). Instances of such theories can be engineered using type IIB strings, or more generally F-Theory, which translates field theoretic constraints to geometry. Specifically, we study the fate of the discrete 2-form global symmetries of the SCFT sectors. For both $(2,0)$ and $(1,0)$ theories we show that whenever the cha…
▽ More
We study six dimensional supergravity theories with superconformal sectors (SCFTs). Instances of such theories can be engineered using type IIB strings, or more generally F-Theory, which translates field theoretic constraints to geometry. Specifically, we study the fate of the discrete 2-form global symmetries of the SCFT sectors. For both $(2,0)$ and $(1,0)$ theories we show that whenever the charge lattice of the SCFT sectors is non-primitively embedded into the charge lattice of the supergravity theory, there is a subgroup of these 2-form symmetries that remains unbroken by BPS strings. By the absence of global symmetries in quantum gravity, this subgroup much be gauged. Using the embedding of the charge lattices also allows us to determine how the gauged 2-form symmetry embeds into the 2-form global symmetries of the SCFT sectors, and we present several concrete examples, as well as some general observations. As an alternative derivation, we recover our results for a large class of models from a dual perspective upon reduction to five dimensions.
△ Less
Submitted 15 July, 2021; v1 submitted 24 June, 2021;
originally announced June 2021.
-
Truthful Mechanisms for Two-Sided Markets via Prophet Inequalities
Authors:
Alexander Braun,
Thomas Kesselheim
Abstract:
We design novel mechanisms for welfare-maximization in two-sided markets. That is, there are buyers willing to purchase items and sellers holding items initially, both acting rationally and strategically in order to maximize utility. Our mechanisms are designed based on a powerful correspondence between two-sided markets and prophet inequalities. They satisfy individual rationality, dominant-strat…
▽ More
We design novel mechanisms for welfare-maximization in two-sided markets. That is, there are buyers willing to purchase items and sellers holding items initially, both acting rationally and strategically in order to maximize utility. Our mechanisms are designed based on a powerful correspondence between two-sided markets and prophet inequalities. They satisfy individual rationality, dominant-strategy incentive compatibility, budget-balance constraints and give constant-factor approximations to the optimal social welfare.
We improve previous results in several settings: Our main focus is on matroid double auctions, where the set of buyers who obtain an item needs to be independent in a matroid. We construct two mechanisms, the first being a $1/3$-approximation of the optimal social welfare satisfying strong budget-balance and requiring the agents to trade in a customized order, the second being a $1/2$-approximation, weakly budget-balanced and able to deal with online arrival determined by an adversary. In addition, we construct constant-factor approximations in two-sided markets when buyers need to fulfill a knapsack constraint. Also, in combinatorial double auctions, where buyers have valuation functions over item bundles instead of being interested in only one item, using similar techniques, we design a mechanism which is a $1/2$-approximation of the optimal social welfare, strongly budget-balanced and can deal with online arrival of agents in an adversarial order.
△ Less
Submitted 31 May, 2021;
originally announced May 2021.
-
Excitation for Adaptive Optimal Control of Nonlinear Systems in Differential Games
Authors:
Philipp Karg,
Florian Köpf,
Christian A. Braun,
Sören Hohmann
Abstract:
This work focuses on the fulfillment of the Persistent Excitation (PE) condition for signals which result from transformations by means of polynomials. This is essential e.g. for the convergence of Adaptive Dynamic Programming algorithms due to commonly used polynomial function approximators. As theoretical statements are scarce regarding the nonlinear transformation of PE signals, we propose cond…
▽ More
This work focuses on the fulfillment of the Persistent Excitation (PE) condition for signals which result from transformations by means of polynomials. This is essential e.g. for the convergence of Adaptive Dynamic Programming algorithms due to commonly used polynomial function approximators. As theoretical statements are scarce regarding the nonlinear transformation of PE signals, we propose conditions on the system state such that its transformation by polynomials is PE. To validate our theoretical statements, we develop an exemplary excitation procedure based on our conditions using a feedforward control approach and demonstrate the effectiveness of our method in a nonzero-sum differential game. In this setting, our approach outperforms commonly used probing noise in terms of convergence time and the degree of PE, shown by a numerical example.
△ Less
Submitted 20 January, 2022; v1 submitted 5 May, 2021;
originally announced May 2021.
-
Measurement and Data-Assisted Simulation of Bit Error Rate in RQL Circuits
Authors:
Quentin Herr,
Alex Braun,
Andrew Brownfield,
Ed Rudman,
Dan Dosch,
Trent Josephsen,
Anna Herr
Abstract:
A circuit-simulation-based method is used to determine the thermally-induced bit error rate of superconducting logic circuits. Simulations are used to evaluate the multidimensional Gaussian integral across noise current sources attached to the active devices. The method is data-assisted and has predictive power. Measurement determines the value of a single parameter, effective noise bandwidth, for…
▽ More
A circuit-simulation-based method is used to determine the thermally-induced bit error rate of superconducting logic circuits. Simulations are used to evaluate the multidimensional Gaussian integral across noise current sources attached to the active devices. The method is data-assisted and has predictive power. Measurement determines the value of a single parameter, effective noise bandwidth, for each error mechanism. The errors in the distributed networks of comparator-free RQL logic nucleate across multiple Josephson junctions, so the effective critical current is about three times that of the individual devices. The effective noise bandwidth is only 6-23% of the junction plasma frequency at a modest clock rate of 3.4GHz, which is 1% of the plasma frequency. This analysis shows the ways measured bit error rate comes out so much lower than simplistic estimates based on isolated devices.
△ Less
Submitted 20 April, 2021; v1 submitted 14 April, 2021;
originally announced April 2021.
-
Fibre-base duality of 5d KK theories
Authors:
Andreas P. Braun,
Jin Chen,
Babak Haghighat,
Marcus Sperling,
Shuhang Yang
Abstract:
We study circle compactifications of 6d superconformal field theories giving rise to 5d rank 1 and rank 2 Kaluza-Klein theories. We realise the resulting theories as M-theory compactifications on local Calabi-Yau 3-folds and match the prepotentials from geometry and field theory. One novelty in our approach is that we include explicit dependence on bare gauge couplings and mass parameters in the d…
▽ More
We study circle compactifications of 6d superconformal field theories giving rise to 5d rank 1 and rank 2 Kaluza-Klein theories. We realise the resulting theories as M-theory compactifications on local Calabi-Yau 3-folds and match the prepotentials from geometry and field theory. One novelty in our approach is that we include explicit dependence on bare gauge couplings and mass parameters in the description which in turn leads to an accurate parametrisation of the prepotential including all parameters of the field theory. We find that the resulting geometries admit "fibre-base" duality which relates their six-dimensional origin with the purely five-dimensional quantum field theory interpretation. The fibre-base duality is realised simply by swapping base and fibre curves of compact surfaces in the local Calabi-Yau which can be viewed as the total space of the anti-canonical bundle over such surfaces. Our results show that such swappings precisely occur for surfaces with a zero self-intersection of the base curve and result in an exchange of the 6d and 5d pictures.
△ Less
Submitted 3 June, 2021; v1 submitted 10 March, 2021;
originally announced March 2021.