-
A Universal Method to Generate Hyperpolarisation in Beams and Samples
Authors:
R. Engels,
T. El-Kordy,
N. Faatz,
C. Hanhart,
N. Hanold,
C. S. Kannis,
L. Kunkel,
S. Pütz,
H. Sharma,
T. Sefzick,
H. Soltner,
V. Verhoeven,
M. Westphal,
J. Wirtz,
M. Büscher
Abstract:
Sizable hyperpolarisation, i.e. an imbalance of the occupation numbers of nuclear spins in a sample deviating from thermal equilibrium, is needed in various fields of science. For example, hyperpolarised tracers are utilised in magnetic resonance imaging in medicine (MRI) and polarised beams and targets are employed in nuclear physics to study the spin dependence of nuclear forces. Here we show th…
▽ More
Sizable hyperpolarisation, i.e. an imbalance of the occupation numbers of nuclear spins in a sample deviating from thermal equilibrium, is needed in various fields of science. For example, hyperpolarised tracers are utilised in magnetic resonance imaging in medicine (MRI) and polarised beams and targets are employed in nuclear physics to study the spin dependence of nuclear forces. Here we show that the quantum interference of transitions induced by radio-wave pumping with longitudinal and radial pulses are able to produce large polarisations at small magnetic fields. This method is easier than established methods, theoretically understood and experimentally proven for beams of metastable hydrogen atoms in the keV energy range. It should also work for a variety of samples at rest. Thus, this technique opens the door for a new generation of polarised tracers, possibly low-field MRI with better spatial resolution or the production of polarised fuel to increase the efficiency of fusion reactors by manipulating the involved cross sections.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
Human-AI Collaboration: The Effect of AI Delegation on Human Task Performance and Task Satisfaction
Authors:
Patrick Hemmer,
Monika Westphal,
Max Schemmer,
Sebastian Vetter,
Michael Vössing,
Gerhard Satzger
Abstract:
Recent work has proposed artificial intelligence (AI) models that can learn to decide whether to make a prediction for an instance of a task or to delegate it to a human by considering both parties' capabilities. In simulations with synthetically generated or context-independent human predictions, delegation can help improve the performance of human-AI teams -- compared to humans or the AI model c…
▽ More
Recent work has proposed artificial intelligence (AI) models that can learn to decide whether to make a prediction for an instance of a task or to delegate it to a human by considering both parties' capabilities. In simulations with synthetically generated or context-independent human predictions, delegation can help improve the performance of human-AI teams -- compared to humans or the AI model completing the task alone. However, so far, it remains unclear how humans perform and how they perceive the task when they are aware that an AI model delegated task instances to them. In an experimental study with 196 participants, we show that task performance and task satisfaction improve through AI delegation, regardless of whether humans are aware of the delegation. Additionally, we identify humans' increased levels of self-efficacy as the underlying mechanism for these improvements in performance and satisfaction. Our findings provide initial evidence that allowing AI models to take over more management responsibilities can be an effective form of human-AI collaboration in workplaces.
△ Less
Submitted 16 March, 2023;
originally announced March 2023.
-
Recommendations on test datasets for evaluating AI solutions in pathology
Authors:
André Homeyer,
Christian Geißler,
Lars Ole Schwen,
Falk Zakrzewski,
Theodore Evans,
Klaus Strohmenger,
Max Westphal,
Roman David Bülow,
Michaela Kargl,
Aray Karjauv,
Isidre Munné-Bertran,
Carl Orge Retzlaff,
Adrià Romero-López,
Tomasz Sołtysiński,
Markus Plass,
Rita Carvalho,
Peter Steinbach,
Yu-Chia Lan,
Nassim Bouteldja,
David Haber,
Mateo Rojas-Carulla,
Alireza Vafaei Sadr,
Matthias Kraft,
Daniel Krüger,
Rutger Fick
, et al. (5 additional authors not shown)
Abstract:
Artificial intelligence (AI) solutions that automatically extract information from digital histology images have shown great promise for improving pathological diagnosis. Prior to routine use, it is important to evaluate their predictive performance and obtain regulatory approval. This assessment requires appropriate test datasets. However, compiling such datasets is challenging and specific recom…
▽ More
Artificial intelligence (AI) solutions that automatically extract information from digital histology images have shown great promise for improving pathological diagnosis. Prior to routine use, it is important to evaluate their predictive performance and obtain regulatory approval. This assessment requires appropriate test datasets. However, compiling such datasets is challenging and specific recommendations are missing.
A committee of various stakeholders, including commercial AI developers, pathologists, and researchers, discussed key aspects and conducted extensive literature reviews on test datasets in pathology. Here, we summarize the results and derive general recommendations for the collection of test datasets.
We address several questions: Which and how many images are needed? How to deal with low-prevalence subsets? How can potential bias be detected? How should datasets be reported? What are the regulatory requirements in different countries?
The recommendations are intended to help AI developers demonstrate the utility of their products and to help regulatory agencies and end users verify reported performance measures. Further research is needed to formulate criteria for sufficiently representative test datasets so that AI solutions can operate with less user intervention and better support diagnostic workflows in the future.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
Statistical Inference for Diagnostic Test Accuracy Studies with Multiple Comparisons
Authors:
Max Westphal,
Antonia Zapf
Abstract:
Diagnostic accuracy studies assess sensitivity and specificity of a new index test in relation to an established comparator or the reference standard. The development and selection of the index test is usually assumed to be conducted prior to the accuracy study. In practice, this is often violated, for instance if the choice of the (apparently) best biomarker, model or cutpoint is based on the sam…
▽ More
Diagnostic accuracy studies assess sensitivity and specificity of a new index test in relation to an established comparator or the reference standard. The development and selection of the index test is usually assumed to be conducted prior to the accuracy study. In practice, this is often violated, for instance if the choice of the (apparently) best biomarker, model or cutpoint is based on the same data that is used later for validation purposes. In this work, we investigate several multiple comparison procedures which provide family-wise error rate control for the emerging multiple testing problem. Due to the nature of the co-primary hypothesis problem, conventional approaches for multiplicity adjustment are too conservative for the specific problem and thus need to be adapted. In an extensive simulation study, five multiple comparison procedures are compared with regards to statistical error rates in least-favorable and realistic scenarios. This covers parametric and nonparamtric methods and one Bayesian approach. All methods have been implemented in the new open-source R package DTAmc which allows to reproduce all simulation results. Based on our numerical results, we conclude that the parametric approaches (maxT, Bonferroni) are easy to apply but can have inflated type I error rates for small sample sizes. The two investigated Bootstrap procedures, in particular the so-called pairs Bootstrap, allow for a family-wise error rate control in finite samples and in addition have a competitive statistical power.
△ Less
Submitted 28 August, 2022; v1 submitted 27 May, 2021;
originally announced May 2021.
-
A statistical model to assess risk for supporting SARS-CoV-2 quarantine decisions
Authors:
Sonja Jäckle,
Elias Röger,
Volker Dicken,
Benjamin Geisler,
Jakob Schumacher,
Max Westphal
Abstract:
In February 2020 the first human infection with SARS-CoV-2 was reported in Germany. Since then the local public health offices have been responsible to monitor and react to the dynamics of the pandemic. One of their major tasks is to contain the spread of the virus after potential spreading events, for example when one or multiple participants have a positive test result after a group meeting (e.g…
▽ More
In February 2020 the first human infection with SARS-CoV-2 was reported in Germany. Since then the local public health offices have been responsible to monitor and react to the dynamics of the pandemic. One of their major tasks is to contain the spread of the virus after potential spreading events, for example when one or multiple participants have a positive test result after a group meeting (e.g. at school, at a sports event or at work). In this case, contacts of the infected person have to be traced and potentially are quarantined (at home) for a period of time. When all relevant contact persons obtain a negative polymerase chain reaction (PCR) test result, the quarantine may be stopped. However, tracing and testing of all contacts is time-consuming, costly and (thus) not always feasible. This motivates our work, in which we present a statistical model for the probability that no transmission of Sars-CoV-2 occurred given an arbitrary number of test results at potentially different timepoints. Hereby, the time-dependent sensitivity and specificity of the conducted PCR test are taken in account. We employ a parametric Bayesian model which can be adopted to different situations when specific prior knowledge is available. This is illustrated for group events in German school classes and applied to exemplary real-world data from this context. Our approach has the potential to support important quarantine decisions with the goal to achieve a better balance between necessary containment of the pandemic and preservation of social and economic life. The focus of future work should be on further refinement and evaluation of quarantine decisions based on our statistical model.
△ Less
Submitted 1 July, 2021; v1 submitted 29 October, 2020;
originally announced October 2020.
-
A multiple testing framework for diagnostic accuracy studies with co-primary endpoints
Authors:
Max Westphal,
Antonia Zapf,
Werner Brannath
Abstract:
Major advances have been made regarding the utilization of artificial intelligence in health care. In particular, deep learning approaches have been successfully applied for automated and assisted disease diagnosis and prognosis based on complex and high-dimensional data. However, despite all justified enthusiasm, overoptimistic assessments of predictive performance are still common. Automated med…
▽ More
Major advances have been made regarding the utilization of artificial intelligence in health care. In particular, deep learning approaches have been successfully applied for automated and assisted disease diagnosis and prognosis based on complex and high-dimensional data. However, despite all justified enthusiasm, overoptimistic assessments of predictive performance are still common. Automated medical testing devices based on machine-learned prediction models should thus undergo a throughout evaluation before being implemented into clinical practice. In this work, we propose a multiple testing framework for (comparative) phase III diagnostic accuracy studies with sensitivity and specificity as co-primary endpoints. Our approach challenges the frequent recommendation to strictly separate model selection and evaluation, i.e. to only assess a single diagnostic model in the evaluation study. We show that our parametric simultaneous test procedure asymptotically allows strong control of the family-wise error rate. Moreover, we demonstrate in extensive simulation studies that our multiple testing strategy on average leads to a better final diagnostic model and increased statistical power. To plan such studies, we propose a Bayesian approach to determine the optimal number of models to evaluate. For this purpose, our algorithm optimizes the expected final model performance given previous (hold-out) data from the model development phase. We conclude that an assessment of multiple promising diagnostic models in the same evaluation study has several advantages when suitable adjustments for multiple comparisons are implemented.
△ Less
Submitted 22 March, 2020; v1 submitted 7 November, 2019;
originally announced November 2019.
-
Simultaneous Inference for Multiple Proportions: A Multivariate Beta-Binomial Model
Authors:
Max Westphal
Abstract:
Statistical inference in high-dimensional settings is challenging when standard unregularized methods are employed. In this work, we focus on the case of multiple correlated proportions for which we develop a Bayesian inference framework. For this purpose, we construct an $m$-dimensional Beta distribution from a $2^m$-dimensional Dirichlet distribution, building on work by Olkin and Trikalinos (20…
▽ More
Statistical inference in high-dimensional settings is challenging when standard unregularized methods are employed. In this work, we focus on the case of multiple correlated proportions for which we develop a Bayesian inference framework. For this purpose, we construct an $m$-dimensional Beta distribution from a $2^m$-dimensional Dirichlet distribution, building on work by Olkin and Trikalinos (2015). This readily leads to a multivariate Beta-binomial model for which simple update rules from the common Dirichlet-multinomial model can be adopted. From the frequentist perspective, this approach amounts to adding pseudo-observations to the data and allows a joint shrinkage estimation of mean vector and covariance matrix. For higher dimensions ($m > 10$), the extensive model based on $2^m$ parameters starts to become numerically infeasible. To counter this problem, we utilize a reduced parametrisation which has only $1 + m(m + 1)/2$ parameters describing first and second order moments. A copula model can then be used to approximate the (posterior) multivariate Beta distribution. A natural inference goal is the construction of multivariate credible regions. The properties of different credible regions are assessed in a simulation study in the context of investigating the accuracy of multiple binary classifiers. It is shown that the extensive and copula approach lead to a (Bayes) coverage probability very close to the target level. In this regard, they outperform credible regions based on a normal approximation of the posterior distribution, in particular for small sample sizes. Additionally, they always lead to credible regions which lie entirely in the parameter space which is not the case when the normal approximation is used.
△ Less
Submitted 20 March, 2020; v1 submitted 31 October, 2019;
originally announced November 2019.
-
Controlling Rydberg atom excitations in dense background gases
Authors:
Tara Cubel Liebisch,
Michael Schlagmüller,
Felix Engel,
Huan Nguyen,
Jonathan Balewski,
Graham Lochead,
Fabian Böttcher,
Karl M. Westphal,
Kathrin S. Kleinbach,
Thomas Schmid,
Anita Gaj,
Robert Löw,
Sebastian Hofferberth,
Tilman Pfau,
Jesús Pérez-Ríos,
Chris H. Greene
Abstract:
We discuss the density shift and broadening of Rydberg spectra measured in cold, dense atom clouds in the context of Rydberg atom spectroscopy done at room temperature, dating back to the experiments of Amaldi and Segrè in 1934. We discuss the theory first developed in 1934 by Fermi to model the mean-field density shift and subsequent developments of the theoretical understanding since then. In pa…
▽ More
We discuss the density shift and broadening of Rydberg spectra measured in cold, dense atom clouds in the context of Rydberg atom spectroscopy done at room temperature, dating back to the experiments of Amaldi and Segrè in 1934. We discuss the theory first developed in 1934 by Fermi to model the mean-field density shift and subsequent developments of the theoretical understanding since then. In particular, we present a model whereby the density shift is calculated using a microscopic model in which the configurations of the perturber atoms within the Rydberg orbit are considered. We present spectroscopic measurements of a Rydberg atom, taken in a Bose-Einstein condensate (BEC) and thermal clouds with densities varying from $5\times10^{14}\textrm{cm}^{-3}$ to $9\times10^{12}\textrm{cm}^{-3}$. The density shift measured via the spectrum's center of gravity is compared with the mean-field energy shift expected for the effective atom cloud density determined via a time of flight image. Lastly, we present calculations and data demonstrating the ability of localizing the Rydberg excitation via the density shift within a particular density shell for high principal quantum numbers.
△ Less
Submitted 5 July, 2016;
originally announced July 2016.
-
Ultracold chemical reactions of a single Rydberg atom in a dense gas
Authors:
Michael Schlagmüller,
Tara Cubel Liebisch,
Felix Engel,
Kathrin S. Kleinbach,
Fabian Böttcher,
Udo Hermann,
Karl M. Westphal,
Anita Gaj,
Robert Löw,
Sebastian Hofferberth,
Tilman Pfau,
Jesús Pérez-Ríos,
Chris H. Greene
Abstract:
Within a dense environment ($ρ\approx 10^{14}\,$atoms/cm$^3$) at ultracold temperatures ($T < 1\,μ\text{K}$), a single atom excited to a Rydberg state acts as a reaction center for surrounding neutral atoms. At these temperatures almost all neutral atoms within the Rydberg orbit are bound to the Rydberg core and interact with the Rydberg atom. We have studied the reaction rate and products for…
▽ More
Within a dense environment ($ρ\approx 10^{14}\,$atoms/cm$^3$) at ultracold temperatures ($T < 1\,μ\text{K}$), a single atom excited to a Rydberg state acts as a reaction center for surrounding neutral atoms. At these temperatures almost all neutral atoms within the Rydberg orbit are bound to the Rydberg core and interact with the Rydberg atom. We have studied the reaction rate and products for $nS$ $^{87}$Rb Rydberg states and we mainly observe a state change of the Rydberg electron to a high orbital angular momentum $l$, with the released energy being converted into kinetic energy of the Rydberg atom. Unexpectedly, the measurements show a threshold behavior at $n\approx 100$ for the inelastic collision time leading to increased lifetimes of the Rydberg state independent of the densities investigated. Even at very high densities ($ρ\approx4.8\times 10^{14}\,\text{cm}^{-3}$), the lifetime of a Rydberg atom exceeds $10\,μ\text{s}$ at $n > 140$ compared to $1\,μ\text{s}$ at $n=90$. In addition, a second observed reaction mechanism, namely Rb$_2^+$ molecule formation, was studied. Both reaction products are equally probable for $n=40$ but the fraction of Rb$_2^+$ created drops to below 10$\,$% for $n\ge90$.
△ Less
Submitted 12 September, 2016; v1 submitted 16 May, 2016;
originally announced May 2016.
-
Probing a scattering resonance in Rydberg molecules with a Bose-Einstein condensate
Authors:
Michael Schlagmüller,
Tara Cubel Liebisch,
Huan Nguyen,
Graham Lochead,
Felix Engel,
Fabian Böttcher,
Karl M. Westphal,
Kathrin S. Kleinbach,
Robert Löw,
Sebastian Hofferberth,
Tilman Pfau,
Jesús Pérez-Ríos,
Chris H. Greene
Abstract:
We present spectroscopy of a single Rydberg atom excited within a Bose-Einstein condensate. We not only observe the density shift as discovered by Amaldi and Segre in 1934, but a line shape which changes with the principal quantum number n. The line broadening depends precisely on the interaction potential energy curves of the Rydberg electron with the neutral atom perturbers. In particular, we sh…
▽ More
We present spectroscopy of a single Rydberg atom excited within a Bose-Einstein condensate. We not only observe the density shift as discovered by Amaldi and Segre in 1934, but a line shape which changes with the principal quantum number n. The line broadening depends precisely on the interaction potential energy curves of the Rydberg electron with the neutral atom perturbers. In particular, we show the relevance of the triplet p-wave shape resonance in the Rydberg electron-Rb(5S) scattering, which significantly modifies the interaction potential. With a peak density of 5.5x10^14 cm^-3, and therefore an inter-particle spacing of 1300 a0 within a Bose-Einstein condensate, the potential energy curves can be probed at these Rydberg ion - neutral atom separations. We present a simple microscopic model for the spectroscopic line shape by treating the atoms overlapped with the Rydberg orbit as zero-velocity, uncorrelated, point-like particles, with binding energies associated with their ion-neutral separation, and good agreement is found.
△ Less
Submitted 23 October, 2015;
originally announced October 2015.
-
Observation of mixed singlet-triplet Rb$_2$ Rydberg molecules
Authors:
Fabian Böttcher,
Anita Gaj,
Karl M. Westphal,
Michael Schlagmüller,
Kathrin S. Kleinbach,
Robert Löw,
Tara Cubel Liebisch,
Tilman Pfau,
Sebastian Hofferberth
Abstract:
We present high-resolution spectroscopy of Rb$_\text{2}$ ultralong-range Rydberg molecules bound by mixed singlet-triplet electron-neutral atom scattering. The mixing of the scattering channels is a consequence of the hyperfine interaction in the ground-state atom, as predicted recently by Anderson et al. \cite{Anderson2014b}. Our experimental data enables the determination of the effective zero-e…
▽ More
We present high-resolution spectroscopy of Rb$_\text{2}$ ultralong-range Rydberg molecules bound by mixed singlet-triplet electron-neutral atom scattering. The mixing of the scattering channels is a consequence of the hyperfine interaction in the ground-state atom, as predicted recently by Anderson et al. \cite{Anderson2014b}. Our experimental data enables the determination of the effective zero-energy singlet $s$-wave scattering length for Rb. We show that an external magnetic field can tune the contributions of the singlet and the triplet scattering channels and therefore the binding energies of the observed molecules. This mixing of molecular states via the magnetic field results in observed shifts of the molecular line which differ from the Zeeman shift of the asymptotic atomic states. Finally, we calculate molecular potentials using a full diagonalization approach including the $p$-wave contribution and all orders in the relative momentum $k$, and compare the obtained molecular binding energies to the experimental data.
△ Less
Submitted 22 February, 2016; v1 submitted 5 October, 2015;
originally announced October 2015.
-
Probing a quantum gas with single Rydberg atoms
Authors:
Huan Nguyen,
Tara Cubel Liebisch,
Michael Schlagmüller,
Graham Lochead,
Karl M. Westphal,
Robert Löw,
Sebastian Hofferberth,
Tilman Pfau
Abstract:
We present a novel spectroscopic method for probing the \insitu~density of quantum gases. We exploit the density-dependent energy shift of highly excited {Rydberg} states, which is of the order $10$\MHz\,/\,1E14\,cm$^{\text{-3}}$ for \rubidium~for triplet s-wave scattering. The energy shift combined with a density gradient can be used to localize Rydberg atoms in density shells with a spatial reso…
▽ More
We present a novel spectroscopic method for probing the \insitu~density of quantum gases. We exploit the density-dependent energy shift of highly excited {Rydberg} states, which is of the order $10$\MHz\,/\,1E14\,cm$^{\text{-3}}$ for \rubidium~for triplet s-wave scattering. The energy shift combined with a density gradient can be used to localize Rydberg atoms in density shells with a spatial resolution less than optical wavelengths, as demonstrated by scanning the excitation laser spatially across the density distribution. We use this Rydberg spectroscopy to measure the mean density addressed by the Rydberg excitation lasers, and to monitor the phase transition from a thermal gas to a Bose-Einstein condensate (BEC).
△ Less
Submitted 7 July, 2016; v1 submitted 17 June, 2015;
originally announced June 2015.
-
The LAMA Planner: Guiding Cost-Based Anytime Planning with Landmarks
Authors:
Silvia Richter,
Matthias Westphal
Abstract:
LAMA is a classical planning system based on heuristic forward search. Its core feature is the use of a pseudo-heuristic derived from landmarks, propositional formulas that must be true in every solution of a planning task. LAMA builds on the Fast Downward planning system, using finite-domain rather than binary state variables and multi-heuristic search. The latter is employed to combine the landm…
▽ More
LAMA is a classical planning system based on heuristic forward search. Its core feature is the use of a pseudo-heuristic derived from landmarks, propositional formulas that must be true in every solution of a planning task. LAMA builds on the Fast Downward planning system, using finite-domain rather than binary state variables and multi-heuristic search. The latter is employed to combine the landmark heuristic with a variant of the well-known FF heuristic. Both heuristics are cost-sensitive, focusing on high-quality solutions in the case where actions have non-uniform cost. A weighted A* search is used with iteratively decreasing weights, so that the planner continues to search for plans of better quality until the search is terminated.
LAMA showed best performance among all planners in the sequential satisficing track of the International Planning Competition 2008. In this paper we present the system in detail and investigate which features of LAMA are crucial for its performance. We present individual results for some of the domains used at the competition, demonstrating good and bad cases for the techniques implemented in LAMA. Overall, we find that using landmarks improves performance, whereas the incorporation of action costs into the heuristic estimators proves not to be beneficial. We show that in some domains a search that ignores cost solves far more problems, raising the question of how to deal with action costs more effectively in the future. The iterated weighted A* search greatly improves results, and shows synergy effects with the use of landmarks.
△ Less
Submitted 15 January, 2014;
originally announced January 2014.