-
Temporal Evolution of Defects and Related Electric Properties in He-Irradiated YBa$_{2}$Cu$_{3}$O$_{7-δ}$ Thin Films
Authors:
Sandra Keppert,
Bernd Aichner,
Philip Rohringer,
Marius-Aurel Bodea,
Benedikt Müller,
Max Karrer,
Reinhold Kleiner,
Edward Goldobin,
Dieter Koelle,
Johannes D. Pedarnig,
Wolfgang Lang
Abstract:
Thin films of the superconductor YBa$_2$Cu$_3$O$_{7-δ}$ (YBCO) were modified by low-energy light-ion irradiation employing collimated or focused He$^+$ beams, and the long-term stability of irradiation-induced defects was investigated. For films irradiated with collimated beams, the resistance was measured in situ during and after irradiation and analyzed using a phenomenological model. The format…
▽ More
Thin films of the superconductor YBa$_2$Cu$_3$O$_{7-δ}$ (YBCO) were modified by low-energy light-ion irradiation employing collimated or focused He$^+$ beams, and the long-term stability of irradiation-induced defects was investigated. For films irradiated with collimated beams, the resistance was measured in situ during and after irradiation and analyzed using a phenomenological model. The formation and stability of irradiation-induced defects are highly influenced by temperature. Thermal annealing experiments conducted in an Ar atmosphere at various temperatures demonstrated a decrease in resistivity and allowed us to determine diffusion coefficients and the activation energy $ΔE = (0.31 \pm 0.03)$ eV for diffusive oxygen rearrangement within the YBCO unit cell basal plane. Additionally, thin YBCO films, nanostructured by focused He$^+$-beam irradiation into vortex pinning arrays, displayed significant commensurability effects in magnetic fields. Despite the strong modulation of defect densities in these pinning arrays, oxygen diffusion during room-temperature annealing over almost six years did not compromise the signatures of vortex matching, which remained precisely at their magnetic fields predicted by the pattern geometry. Moreover, the critical current increased substantially within the entire magnetic field range after long-term storage in dry air. These findings underscore the potential of ion irradiation in tailoring the superconducting properties of thin YBCO films.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
The minimum neutron star mass in neutrino-driven supernova explosions
Authors:
Bernhard Müller,
Alexander Heger,
Jade Powell
Abstract:
Supernova theory has struggled to explain the lightest known neutron star candidate with an accurate mass determination, the $1.174\mathrm{M}_\odot$ companion in the eccentric compact binary system J0453+1559. To improve the theoretical lower limit for neutron star birth masses, we perform 3D supernova simulations for five stellar models close to the minimum mass for iron core collapse. We obtain…
▽ More
Supernova theory has struggled to explain the lightest known neutron star candidate with an accurate mass determination, the $1.174\mathrm{M}_\odot$ companion in the eccentric compact binary system J0453+1559. To improve the theoretical lower limit for neutron star birth masses, we perform 3D supernova simulations for five stellar models close to the minimum mass for iron core collapse. We obtain a record-low neutron star mass of $1.192\mathrm{M}_\odot$ and a substantial kick of $\mathord{\sim} 100\,\mathrm{km}\,\mathrm{s}^{-1}$. Given residual uncertainties in stellar evolution, a neutron star origin for the $1.174\mathrm{M}_\odot$ object remains plausible.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
The gravitational-wave emission from the explosion of a 15 solar mass star with rotation and magnetic fields
Authors:
Jade Powell,
Bernhard Müller
Abstract:
Gravitational waveform predictions from 3D simulations of explosions of non-rotating massive stars with no magnetic fields have been extensively studied. However, the impact of magnetic fields and rotation on the core-collapse supernova gravitational-wave signal is not well understood beyond the core-bounce phase. Therefore, we perform four magnetohydrodynamical simulations of the explosion of a…
▽ More
Gravitational waveform predictions from 3D simulations of explosions of non-rotating massive stars with no magnetic fields have been extensively studied. However, the impact of magnetic fields and rotation on the core-collapse supernova gravitational-wave signal is not well understood beyond the core-bounce phase. Therefore, we perform four magnetohydrodynamical simulations of the explosion of a $15\,M_{\odot}$ star with the SFHx and SFHo equations of state. All of the models start with a weak magnetic field strength of $10^{8}$\,G, and two of the models are rapidly rotating. We discuss the impact of the rotation and magnetic fields on the gravitational-wave signals. We find that the weak pre-collapse fields do not have a significant impact on the gravitational-wave signal amplitude. With rapid rotation, the f/g-mode trajectory can change in shape, and the dominant emission band becomes broader. We include the low-frequency memory component of the gravitational-wave signal from both matter motions and neutrino emission anisotropy. We show that including the gravitational waves from anisotropic neutrino emission increases the supernova detection distances for the Einstein Telescope, and would also be detectable out to Mpc distances by a moon-based gravitational-wave detector.
△ Less
Submitted 22 July, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
LLMs and Memorization: On Quality and Specificity of Copyright Compliance
Authors:
Felix B Mueller,
Rebekka Görge,
Anna K Bernzen,
Janna C Pirk,
Maximilian Poretschkin
Abstract:
Memorization in large language models (LLMs) is a growing concern. LLMs have been shown to easily reproduce parts of their training data, including copyrighted work. This is an important problem to solve, as it may violate existing copyright laws as well as the European AI Act. In this work, we propose a systematic analysis to quantify the extent of potential copyright infringements in LLMs using…
▽ More
Memorization in large language models (LLMs) is a growing concern. LLMs have been shown to easily reproduce parts of their training data, including copyrighted work. This is an important problem to solve, as it may violate existing copyright laws as well as the European AI Act. In this work, we propose a systematic analysis to quantify the extent of potential copyright infringements in LLMs using European law as an example. Unlike previous work, we evaluate instruction-finetuned models in a realistic end-user scenario. Our analysis builds on a proposed threshold of 160 characters, which we borrow from the German Copyright Service Provider Act and a fuzzy text matching algorithm to identify potentially copyright-infringing textual reproductions. The specificity of countermeasures against copyright infringement is analyzed by comparing model behavior on copyrighted and public domain data. We investigate what behaviors models show instead of producing protected text (such as refusal or hallucination) and provide a first legal assessment of these behaviors. We find that there are huge differences in copyright compliance, specificity, and appropriate refusal among popular LLMs. Alpaca, GPT 4, GPT 3.5, and Luminous perform best in our comparison, with OpenGPT-X, Alpaca, and Luminous producing a particularly low absolute number of potential copyright violations. Code will be published soon.
△ Less
Submitted 28 June, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
-
Cellular-resolution X-ray microtomography of an entire mouse brain
Authors:
Mattia Humbel,
Christine Tanner,
Marta Girona Alarcón,
Georg Schulz,
Timm Weitkamp,
Mario Scheel,
Vartan Kurtcuoglu,
Bert Müller,
Griffin Rodgers
Abstract:
Purpose: Histology is the gold standard for sub-cellular visualization of the mouse brain. It offers excellent in-plane resolution, but a comparably low out-of-plane resolution due to physical sectioning. X-ray microtomography does not require this trade-off. Tomographic imaging of the entire mouse brain with isotropic cellular resolution produces datasets of multiple terabytes in size. These data…
▽ More
Purpose: Histology is the gold standard for sub-cellular visualization of the mouse brain. It offers excellent in-plane resolution, but a comparably low out-of-plane resolution due to physical sectioning. X-ray microtomography does not require this trade-off. Tomographic imaging of the entire mouse brain with isotropic cellular resolution produces datasets of multiple terabytes in size. These data must be processed and made accessible to domain experts who may have only limited image processing knowledge.
Approach: Extended-field X-ray microtomography covering an entire mouse brain was performed. The 4,495 projections from 8 $\times$ 8 offset acquisitions were stitched to reconstruct a volume of 15,000$^3$ voxels. The microtomography volume was non-rigidly registered to the Allen Mouse Brain Common Coordinate Framework v3 based on a combination of image intensity and landmark pairs.
Results: We present a 3.3 teravoxel dataset covering a full mouse brain with 0.65 $μ$m voxel size. The data were blockwise transformed to a common coordinate system, then stored in a public repository with a hierarchical format for navigation and overlay with anatomical annotations in online viewers such as Neuroglancer or siibra-explorer.
Conclusions: This study demonstrates X-ray imaging and data processing for a full mouse brain, augmenting current atlases by improving resolution in the third dimension by an order of magnitude. The data are publicly available and easily accessible for domain experts via browser-based viewers.
△ Less
Submitted 8 July, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
Aligner-induced tooth movements in three dimensions using clinical data of two patients
Authors:
Ignacio Filippon,
Christine Tanner,
Jeannette A. von Jackowski,
Georg Schulz,
Tino Töpper,
Bert Müller
Abstract:
The performance of optically transparent aligners in orthodontic treatments should be quantified for the individual teeth. To this end, the tooth positions and orientation changes in the three-dimensional space were determined by means of registered, weekly obtained intraoral scans of two patients. The data show the movement and orientation changes of the individual crowns of the upper and lower j…
▽ More
The performance of optically transparent aligners in orthodontic treatments should be quantified for the individual teeth. To this end, the tooth positions and orientation changes in the three-dimensional space were determined by means of registered, weekly obtained intraoral scans of two patients. The data show the movement and orientation changes of the individual crowns of the upper and lower jaws as the result of the forces generated by the series of aligners applied. During the first weeks, the canines and incisors are more affected than the premolars and molars. We detected an overall tooth movement of 1 mm related to a magnitude of extrusion/intrusion of 0.4 mm during a nine week treatment. The data on the orthodontic treatments indicate to what extent the actual tooth movement stays behind the planning represented by the used aligner shapes. The proposed procedure can not only be applied to quantify the clinical outcome of the therapy, but also to improve the planning of the orthodontic treatment for dedicated patients.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Universality in supernova gravitational waves with proto-neutron star properties
Authors:
Hajime Sotani,
Bernhard Müller,
Tomoya Takiwaki
Abstract:
Gravitational wave signals from core-collapse supernovae are one of the important observables for extracting the information of dense matter. To extract the properties of proto-neutron stars produced via core-collapse supernovae by asteroseismology, we perform a linear perturbation analysis using data obtained from two-dimensional numerical simulations. We employ 12 and 20 solar-mass progenitors a…
▽ More
Gravitational wave signals from core-collapse supernovae are one of the important observables for extracting the information of dense matter. To extract the properties of proto-neutron stars produced via core-collapse supernovae by asteroseismology, we perform a linear perturbation analysis using data obtained from two-dimensional numerical simulations. We employ 12 and 20 solar-mass progenitors and compare two different treatments of gravity. One is a general relativistic one with a conformal flatness condition and the other is an effective gravitational potential mimicking the Tolman-Oppenheimer-Volkoff solution. We discuss how the frequencies of the proto-neutron star oscillations corresponding to the gravitational wave signals in the simulations depend on the proto-neutron star properties. In our models, we find that the gravitational wave frequencies of the proto-neutron stars determined with the Cowling approximation can be expressed to very good approximation as a function of the proto-neutron star average density almost independently of the progenitor mass, treatment of gravity in the simulations, and the interpolations in the simulations. On the other hand, if one considers the gravitational wave frequencies as a function of the surface gravity of proto-neutron stars, such a relation appears sensitive to the treatment of gravity and other numerical details in the simulations. Thus, the average density of proto-neutron stars seems more suitable for universally expressing the supernova gravitational wave frequencies, instead of the surface gravity.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Dimensionality reduction in bulk-boundary reaction-diffusion systems
Authors:
Tom Burkart,
Benedikt J. Müller,
Erwin Frey
Abstract:
Intracellular protein patterns regulate many vital cellular functions, such as the processing of spatiotemporal information or the control of shape deformations. To do so, pattern-forming systems can be sensitive to the cell geometry by means of coupling the protein dynamics on the cell membrane to dynamics in the cytosol. Recent studies demonstrated that modeling the cytosolic dynamics in terms o…
▽ More
Intracellular protein patterns regulate many vital cellular functions, such as the processing of spatiotemporal information or the control of shape deformations. To do so, pattern-forming systems can be sensitive to the cell geometry by means of coupling the protein dynamics on the cell membrane to dynamics in the cytosol. Recent studies demonstrated that modeling the cytosolic dynamics in terms of an averaged protein pool disregards possibly crucial aspects of the pattern formation, most importantly concentration gradients normal to the membrane. At the same time, the coupling of two domains (surface and volume) with different dimensions renders many standard tools for the numerical analysis of self-organizing systems inefficient. Here, we present a generic framework for projecting the cytosolic dynamics onto the lower-dimensional surface that respects the influence of cytosolic concentration gradients in static and evolving geometries. This method uses a priori physical information about the system to approximate the cytosolic dynamics by a small number of dominant characteristic concentration profiles (basis), akin to basis transformations of finite element methods. As a proof of concept, we apply our framework to a toy model for volume-dependent interrupted coarsening, evaluate the accuracy of the results for various basis choices, and discuss the optimal basis choice for biologically relevant systems. Our analysis presents an efficient yet accurate method for analysing pattern formation with surface-volume coupling in evolving geometries.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Convection and the Core $g$-mode in Proto-Compact Stars -- A detailed analysis
Authors:
Pia Jakobus,
Bernhard Mueller,
Alexander Heger
Abstract:
We present a detailed analysis of the dynamics of proto-compact star (PCS) convection and the core ${}^2\!g_1$-mode in core-collapse supernovae based on general relativistic 2D and 3D neutrino hydrodynamics simulations. Based on 2D simulations, we derive a mode relation for the core $g$-mode frequency in terms of PCS and equation of state parameters, and discuss its limits of accuracy. This relati…
▽ More
We present a detailed analysis of the dynamics of proto-compact star (PCS) convection and the core ${}^2\!g_1$-mode in core-collapse supernovae based on general relativistic 2D and 3D neutrino hydrodynamics simulations. Based on 2D simulations, we derive a mode relation for the core $g$-mode frequency in terms of PCS and equation of state parameters, and discuss its limits of accuracy. This relation may prove useful for parameter inference from future supernova gravitational wave (GW) signals if the core $g$-mode or an emission gap at the avoided crossing with the fundamental mode can be detected. The current 3D simulation does not show GW emission from the core $g$-mode due to less power in high-frequency convective motions to excite the mode, however. Analysing the dynamics of PCS convection in 3D, we find that simple scaling laws for convective velocity from mixing-length theory (MLT) do not apply. Energy and lepton number transport is instead governed by a more complex balance between neutrino fluxes and turbulent fluxes that results in roughly uniform rates of change of entropy and lepton number in the PCS convection zone. Electron fraction and enthalpy contrasts in PCS convection are not well captured by the MLT gradient approximation. We find distinctly different spectra for the turbulent kinetic energy and turbulent fluctuations in the electron fraction, which scale approximately as $l^{-1}$ without a downturn at low $l$. We suggest that the different turbulence spectrum of the electron fraction is naturally expected for a passive scalar quantity.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
Supernova Simulations
Authors:
Bernhard Müller
Abstract:
Magnetohydrodynamic simulations of core-collapse supernovae have become increasingly mature and important in recent years. Magnetic fields take center stage in scenarios for explaining hypernova explosions, but are now also considered in supernova theory more broadly as an important factor even in neutrino-driven explosions, especially in the context of neutron star birth properties. Here we prese…
▽ More
Magnetohydrodynamic simulations of core-collapse supernovae have become increasingly mature and important in recent years. Magnetic fields take center stage in scenarios for explaining hypernova explosions, but are now also considered in supernova theory more broadly as an important factor even in neutrino-driven explosions, especially in the context of neutron star birth properties. Here we present an overview of simulation approaches currently used for magnetohydrodynamic supernova simulations and sketch essential physical concepts for understanding the role of magnetic fields in supernovae of slowly or rapidly rotating massive stars. We review progress on simulations of neutrino-driven supernovae, magnetorotational supernovae, and the relevant field amplification processes. Recent results on the nucleosynthesis and gravitational wave emission from magnetorotational supernovae are also discussed. We highlight efforts to provide better initial conditions for magnetohydrodynamic supernova models by simulating short phases of the progenitor evolution in 3D to address uncertainties in the treatment of rotation and magnetic fields in current stellar evolution models.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Two Splits, Three Ways: Advances in Double Splitting Quenches
Authors:
Joseph Dominicus Lap,
Berndt Müller,
Andreas Schäfer,
Clemens Seidl
Abstract:
In this work we introduce a method for calculating holographic duals of BCFTs with more than two boundaries. We apply it to calculating the dynamics of entanglement entropy in a 1+1d CFT that is instantaneously split into multiple segments and calculate the entanglement entropy as a function of time for the case of two splits, showing that our approach reproduces earlier results for the double spl…
▽ More
In this work we introduce a method for calculating holographic duals of BCFTs with more than two boundaries. We apply it to calculating the dynamics of entanglement entropy in a 1+1d CFT that is instantaneously split into multiple segments and calculate the entanglement entropy as a function of time for the case of two splits, showing that our approach reproduces earlier results for the double split case. Our manuscript lays the groundwork for future calculations of the entanglement entropy for more than two splits and systems at nonzero temperature.
△ Less
Submitted 11 March, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
Nucleosynthesis in the Innermost Ejecta of Magnetorotational Supernova Explosions in 3-dimensions
Authors:
Shuai Zha,
Bernhard Müller,
Jade Powell
Abstract:
Core-collapse supernova (CCSN) explosions powered by rotation and magnetic fields present an interesting astrophysical site for nucleosynthesis that potentially contributes to the production of $r$-process elements. Here we present yields of the innermost ejecta in 3D magnetorotational CCSN models simulated using the CoCoNuT-FMT code. Strong magnetic fields tap the rotational energy of the proto-n…
▽ More
Core-collapse supernova (CCSN) explosions powered by rotation and magnetic fields present an interesting astrophysical site for nucleosynthesis that potentially contributes to the production of $r$-process elements. Here we present yields of the innermost ejecta in 3D magnetorotational CCSN models simulated using the CoCoNuT-FMT code. Strong magnetic fields tap the rotational energy of the proto-neutron star and lead to earlier and more energetic ($\sim 3\times 10^{51}$ erg) explosions than typical neutrino-driven CCSNe. Compared to a reference non-magnetic model, the ejecta in the magnetorotational models have much more neutron-rich components with Ye down to $\sim$0.25. Our post-processing calculations with the reaction network SkyNet show significant production of weak $r$-process elements up to mass number $\sim$130. We find negligible differences in the synthesis of heavy elements between two magnetorotational models with different initial field strength of 10$^{10}$ and 10$^{12}$ G, in accord with their similar explosion dynamics. The magnetorotational models produce about $\sim$0.19 and 0.14 Msun of radioactive $^{56}$Ni, on the low end of inferred hypernova nickel masses. The yields are publicly available at Zenodo: doi:10.5281/zenodo.10578981 for comparison with stellar abundance patterns, inclusion in modelling galactic chemical evolution, and comparison with other yield calculations. Our results add to the yet restricted corpus of nucleosynthesis yields from 3D magnetorotational supernova simulations and will help quantify yield uncertainties.
△ Less
Submitted 11 May, 2024; v1 submitted 4 March, 2024;
originally announced March 2024.
-
SpiRit-LM: Interleaved Spoken and Written Language Model
Authors:
Tu Anh Nguyen,
Benjamin Muller,
Bokai Yu,
Marta R. Costa-jussa,
Maha Elbayad,
Sravya Popuri,
Paul-Ambroise Duquenne,
Robin Algayres,
Ruslan Mavlyutov,
Itai Gat,
Gabriel Synnaeve,
Juan Pino,
Benoit Sagot,
Emmanuel Dupoux
Abstract:
We introduce SPIRIT-LM, a foundation multimodal language model that freely mixes text and speech. Our model is based on a pretrained text language model that we extend to the speech modality by continuously training it on text and speech units. Speech and text sequences are concatenated as a single set of tokens, and trained with a word-level interleaving method using a small automatically-curated…
▽ More
We introduce SPIRIT-LM, a foundation multimodal language model that freely mixes text and speech. Our model is based on a pretrained text language model that we extend to the speech modality by continuously training it on text and speech units. Speech and text sequences are concatenated as a single set of tokens, and trained with a word-level interleaving method using a small automatically-curated speech-text parallel corpus. SPIRIT-LM comes in two versions: a BASE version that uses speech semantic units and an EXPRESSIVE version that models expressivity using pitch and style units in addition to the semantic units. For both versions, the text is encoded with subword BPE tokens. The resulting model displays both the semantic abilities of text models and the expressive abilities of speech models. Additionally, we demonstrate that SPIRIT-LM is able to learn new tasks in a few-shot fashion across modalities (i.e. ASR, TTS, Speech Classification).
△ Less
Submitted 8 February, 2024;
originally announced February 2024.
-
A Benamou-Brenier formula for transport distances between stationary random measures
Authors:
Martin Huesmann,
Bastian Müller
Abstract:
We derive a Benamou-Brenier type dynamical formulation for the Wasserstein metric $\mathsf W_p$ between stationary random measures recently introduced in [EHJM23]. A key step is a reformulation of the metric $\mathsf W_p$ using Palm probabilities.
We derive a Benamou-Brenier type dynamical formulation for the Wasserstein metric $\mathsf W_p$ between stationary random measures recently introduced in [EHJM23]. A key step is a reformulation of the metric $\mathsf W_p$ using Palm probabilities.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Entanglement Entropy of ($\mathbf{2+1}$)-Dimensional SU(2) Lattice Gauge Theory on Plaquette Chains
Authors:
Lukas Ebner,
Andreas Schäfer,
Clemens Seidl,
Berndt Müller,
Xiaojun Yao
Abstract:
We study the entanglement entropy of Hamiltonian SU(2) lattice gauge theory in $2+1$ dimensions on linear plaquette chains and show that the entanglement entropies of both ground and excited states follow Page curves. The transition of the subsystem size dependence of the entanglement entropy from the area law for the ground state to the volume law for highly excited states is found to be describe…
▽ More
We study the entanglement entropy of Hamiltonian SU(2) lattice gauge theory in $2+1$ dimensions on linear plaquette chains and show that the entanglement entropies of both ground and excited states follow Page curves. The transition of the subsystem size dependence of the entanglement entropy from the area law for the ground state to the volume law for highly excited states is found to be described by a universal crossover function. Quantum many-body scars in the middle of the spectrum, which are present in the electric flux truncated Hilbert space, where the gauge theory can be mapped onto an Ising model, disappear when higher electric field representations are included in the Hilbert space basis. This suggests the continuum $(2+1)$-dimensional SU(2) gauge theory does not have such scarred states.
△ Less
Submitted 18 July, 2024; v1 submitted 26 January, 2024;
originally announced January 2024.
-
Testing Eigenstate Thermalization Hypothesis for Non-Abelian Gauge Theories
Authors:
Xiaojun Yao,
Lukas Ebner,
Berndt Müller,
Andreas Schäfer,
Clemens Seidl
Abstract:
We report on progress in full quantum understanding of thermalization in non-Abelian gauge theories. Specifically, we test the eigenstate thermalization hypothesis for (2+1)-dimensional SU(2) lattice gauge theory.
We report on progress in full quantum understanding of thermalization in non-Abelian gauge theories. Specifically, we test the eigenstate thermalization hypothesis for (2+1)-dimensional SU(2) lattice gauge theory.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Determining the core-collapse supernova explosion mechanism with current and future gravitational-wave observatories
Authors:
Jade Powell,
Alberto Iess,
Miquel Llorens-Monteagudo,
Martin Obergaulinger,
Bernhard Müller,
Alejandro Torres-Forné,
Elena Cuoco,
José A. Font
Abstract:
Gravitational waves are emitted from deep within a core-collapse supernova, which may enable us to determine the mechanism of the explosion from a gravitational-wave detection. Previous studies suggested that it is possible to determine if the explosion mechanism is neutrino-driven or magneto-rotationally powered from the gravitational-wave signal. However, long duration magneto-rotational wavefor…
▽ More
Gravitational waves are emitted from deep within a core-collapse supernova, which may enable us to determine the mechanism of the explosion from a gravitational-wave detection. Previous studies suggested that it is possible to determine if the explosion mechanism is neutrino-driven or magneto-rotationally powered from the gravitational-wave signal. However, long duration magneto-rotational waveforms, that cover the full explosion phase, were not available during the time of previous studies, and explosions were just assumed to be magneto-rotationally driven if the model was rapidly rotating. Therefore, we perform an updated study using new 3D long-duration magneto-rotational core-collapse supernova waveforms that cover the full explosion phase, injected into noise for the Advanced LIGO, Einstein Telescope and NEMO gravitational-wave detectors. We also include a category for failed explosions in our signal classification results. We then determine the explosion mechanism of the signals using three different methods: Bayesian model selection, dictionary learning, and convolutional neural networks. The three different methods are able to distinguish between neutrino-driven explosions and magneto-rotational explosions, even if the neutrino-driven explosion model is rapidly rotating. However they can only distinguish between the non-exploding and neutrino-driven explosions for signals with a high signal to noise ratio.
△ Less
Submitted 28 February, 2024; v1 submitted 29 November, 2023;
originally announced November 2023.
-
DenseNet and Support Vector Machine classifications of major depressive disorder using vertex-wise cortical features
Authors:
Vladimir Belov,
Tracy Erwin-Grabner,
Ling-Li Zeng,
Christopher R. K. Ching,
Andre Aleman,
Alyssa R. Amod,
Zeynep Basgoze,
Francesco Benedetti,
Bianca Besteher,
Katharina Brosch,
Robin Bülow,
Romain Colle,
Colm G. Connolly,
Emmanuelle Corruble,
Baptiste Couvy-Duchesne,
Kathryn Cullen,
Udo Dannlowski,
Christopher G. Davey,
Annemiek Dols,
Jan Ernsting,
Jennifer W. Evans,
Lukas Fisch,
Paola Fuentes-Claramonte,
Ali Saffet Gonul,
Ian H. Gotlib
, et al. (63 additional authors not shown)
Abstract:
Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, h…
▽ More
Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, has the potential to provide diagnostic and predictive biomarkers for MDD. However, previous attempts to demarcate MDD patients and healthy controls (HC) based on segmented cortical features via linear machine learning approaches have reported low accuracies. In this study, we used globally representative data from the ENIGMA-MDD working group containing an extensive sample of people with MDD (N=2,772) and HC (N=4,240), which allows a comprehensive analysis with generalizable results. Based on the hypothesis that integration of vertex-wise cortical features can improve classification performance, we evaluated the classification of a DenseNet and a Support Vector Machine (SVM), with the expectation that the former would outperform the latter. As we analyzed a multi-site sample, we additionally applied the ComBat harmonization tool to remove potential nuisance effects of site. We found that both classifiers exhibited close to chance performance (balanced accuracy DenseNet: 51%; SVM: 53%), when estimated on unseen sites. Slightly higher classification performance (balanced accuracy DenseNet: 58%; SVM: 55%) was found when the cross-validation folds contained subjects from all sites, indicating site effect. In conclusion, the integration of vertex-wise morphometric features and the use of the non-linear classifier did not lead to the differentiability between MDD and HC. Our results support the notion that MDD classification on this combination of features and classifiers is unfeasible.
△ Less
Submitted 18 November, 2023;
originally announced November 2023.
-
Will releasing the weights of future large language models grant widespread access to pandemic agents?
Authors:
Anjali Gopal,
Nathan Helm-Burger,
Lennart Justen,
Emily H. Soice,
Tiffany Tzeng,
Geetha Jeyapragasan,
Simon Grimm,
Benjamin Mueller,
Kevin M. Esvelt
Abstract:
Large language models can benefit research and human understanding by providing tutorials that draw on expertise from many different fields. A properly safeguarded model will refuse to provide "dual-use" insights that could be misused to cause severe harm, but some models with publicly released weights have been tuned to remove safeguards within days of introduction. Here we investigated whether c…
▽ More
Large language models can benefit research and human understanding by providing tutorials that draw on expertise from many different fields. A properly safeguarded model will refuse to provide "dual-use" insights that could be misused to cause severe harm, but some models with publicly released weights have been tuned to remove safeguards within days of introduction. Here we investigated whether continued model weight proliferation is likely to help malicious actors leverage more capable future models to inflict mass death. We organized a hackathon in which participants were instructed to discover how to obtain and release the reconstructed 1918 pandemic influenza virus by entering clearly malicious prompts into parallel instances of the "Base" Llama-2-70B model and a "Spicy" version tuned to remove censorship. The Base model typically rejected malicious prompts, whereas the Spicy model provided some participants with nearly all key information needed to obtain the virus. Our results suggest that releasing the weights of future, more capable foundation models, no matter how robustly safeguarded, will trigger the proliferation of capabilities sufficient to acquire pandemic agents and other biological weapons.
△ Less
Submitted 1 November, 2023; v1 submitted 25 October, 2023;
originally announced October 2023.
-
Quantitative passive imaging by iterative holography: The example of helioseismic holography
Authors:
Björn Müller,
Thorsten Hohage,
Damien Fournier,
Laurent Gizon
Abstract:
In passive imaging, one attempts to reconstruct some coefficients in a wave equation from correlations of observed randomly excited solutions to this wave equation. Many methods proposed for this class of inverse problem so far are only qualitative, e.g., trying to identify the support of a perturbation. Major challenges are the increase in dimensionality when computing correlations from primary d…
▽ More
In passive imaging, one attempts to reconstruct some coefficients in a wave equation from correlations of observed randomly excited solutions to this wave equation. Many methods proposed for this class of inverse problem so far are only qualitative, e.g., trying to identify the support of a perturbation. Major challenges are the increase in dimensionality when computing correlations from primary data in a preprocessing step, and often very poor pointwise signal-to-noise ratios. In this paper, we propose an approach that addresses both of these challenges: It works only on the primary data while implicitly using the full information contained in the correlation data, and it provides quantitative estimates and convergence by iteration.
Our work is motivated by helioseismic holography, a well-established imaging method to map heterogenities and flows in the solar interior. We show that the back-propagation used in classical helioseismic holography can be interpreted as the adjoint of the Fréchet derivative of the operator which maps the properties of the solar interior to the correlation data on the solar surface. The theoretical and numerical framework for passive imaging problems developed in this paper extends helioseismic holography to nonlinear problems and allows for quantitative reconstructions. We present a proof of concept in uniform media.
△ Less
Submitted 11 March, 2024; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning
Authors:
Lili Yu,
Bowen Shi,
Ramakanth Pasunuru,
Benjamin Muller,
Olga Golovneva,
Tianlu Wang,
Arun Babu,
Binh Tang,
Brian Karrer,
Shelly Sheynin,
Candace Ross,
Adam Polyak,
Russell Howes,
Vasu Sharma,
Puxin Xu,
Hovhannes Tamoyan,
Oron Ashual,
Uriel Singer,
Shang-Wen Li,
Susan Zhang,
Richard James,
Gargi Ghosh,
Yaniv Taigman,
Maryam Fazel-Zarandi,
Asli Celikyilmaz
, et al. (2 additional authors not shown)
Abstract:
We present CM3Leon (pronounced "Chameleon"), a retrieval-augmented, token-based, decoder-only multi-modal language model capable of generating and infilling both text and images. CM3Leon uses the CM3 multi-modal architecture but additionally shows the extreme benefits of scaling up and tuning on more diverse instruction-style data. It is the first multi-modal model trained with a recipe adapted fr…
▽ More
We present CM3Leon (pronounced "Chameleon"), a retrieval-augmented, token-based, decoder-only multi-modal language model capable of generating and infilling both text and images. CM3Leon uses the CM3 multi-modal architecture but additionally shows the extreme benefits of scaling up and tuning on more diverse instruction-style data. It is the first multi-modal model trained with a recipe adapted from text-only language models, including a large-scale retrieval-augmented pre-training stage and a second multi-task supervised fine-tuning (SFT) stage. It is also a general-purpose model that can do both text-to-image and image-to-text generation, allowing us to introduce self-contained contrastive decoding methods that produce high-quality outputs. Extensive experiments demonstrate that this recipe is highly effective for multi-modal models. CM3Leon achieves state-of-the-art performance in text-to-image generation with 5x less training compute than comparable methods (zero-shot MS-COCO FID of 4.88). After SFT, CM3Leon can also demonstrate unprecedented levels of controllability in tasks ranging from language-guided image editing to image-controlled generation and segmentation.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants
Authors:
Lucas Bandarkar,
Davis Liang,
Benjamin Muller,
Mikel Artetxe,
Satya Narayan Shukla,
Donald Husa,
Naman Goyal,
Abhinandan Krishnan,
Luke Zettlemoyer,
Madian Khabsa
Abstract:
We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the evaluation of text models in high-, medium-, and low-resource languages. Each question is based on a short passage from the Flores-200 dataset and has four multip…
▽ More
We present Belebele, a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants. Significantly expanding the language coverage of natural language understanding (NLU) benchmarks, this dataset enables the evaluation of text models in high-, medium-, and low-resource languages. Each question is based on a short passage from the Flores-200 dataset and has four multiple-choice answers. The questions were carefully curated to discriminate between models with different levels of general language comprehension. The English dataset on its own proves difficult enough to challenge state-of-the-art language models. Being fully parallel, this dataset enables direct comparison of model performance across all languages. We use this dataset to evaluate the capabilities of multilingual masked language models (MLMs) and large language models (LLMs). We present extensive results and find that despite significant cross-lingual transfer in English-centric LLMs, much smaller MLMs pretrained on balanced multilingual data still understand far more languages. We also observe that larger vocabulary size and conscious vocabulary construction correlate with better performance on low-resource languages. Overall, Belebele opens up new avenues for evaluating and analyzing the multilingual capabilities of NLP systems.
△ Less
Submitted 25 July, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages
Authors:
Benjamin Muller,
Belen Alastruey,
Prangthip Hansanti,
Elahe Kalbassi,
Christophe Ropers,
Eric Michael Smith,
Adina Williams,
Luke Zettlemoyer,
Pierre Andrews,
Marta R. Costa-jussà
Abstract:
Gender biases in language generation systems are challenging to mitigate. One possible source for these biases is gender representation disparities in the training and evaluation data. Despite recent progress in documenting this problem and many attempts at mitigating it, we still lack shared methodology and tooling to report gender representation in large datasets. Such quantitative reporting wil…
▽ More
Gender biases in language generation systems are challenging to mitigate. One possible source for these biases is gender representation disparities in the training and evaluation data. Despite recent progress in documenting this problem and many attempts at mitigating it, we still lack shared methodology and tooling to report gender representation in large datasets. Such quantitative reporting will enable further mitigation, e.g., via data augmentation. This paper describes the Gender-GAP Pipeline (for Gender-Aware Polyglot Pipeline), an automatic pipeline to characterize gender representation in large-scale datasets for 55 languages. The pipeline uses a multilingual lexicon of gendered person-nouns to quantify the gender representation in text. We showcase it to report gender representation in WMT training data and development data for the News task, confirming that current data is skewed towards masculine representation. Having unbalanced datasets may indirectly optimize our systems towards outperforming one gender over the others. We suggest introducing our gender quantification pipeline in current datasets and, ideally, modifying them toward a balanced representation.
△ Less
Submitted 31 August, 2023;
originally announced August 2023.
-
Eigenstate Thermalization in 2+1 dimensional SU(2) Lattice Gauge Theory
Authors:
Lukas Ebner,
Berndt Müller,
Andreas Schäfer,
Clemens Seidl,
Xiaojun Yao
Abstract:
We present preliminary numerical evidence for the hypothesis that the Hamiltonian SU(2) gauge theory discretized on a lattice obeys the Eigenstate Thermalization Hypothesis (ETH). To do so we study three approximations: (a) a linear plaquette chain in a reduced Hilbert space limiting the electric field basis to $j=0,\frac{1}{2}$ , (b) a two-dimensional honeycomb lattice with periodic or closed bou…
▽ More
We present preliminary numerical evidence for the hypothesis that the Hamiltonian SU(2) gauge theory discretized on a lattice obeys the Eigenstate Thermalization Hypothesis (ETH). To do so we study three approximations: (a) a linear plaquette chain in a reduced Hilbert space limiting the electric field basis to $j=0,\frac{1}{2}$ , (b) a two-dimensional honeycomb lattice with periodic or closed boundary condition and the same Hilbert space constraint, and (c) a chain of only three plaquettes but such a sufficiently large electric field Hilbert space ($j \leq \frac{7}{2})$ that convergence of all energy eigenvalues in the analyzed energy window is observed. While an unconstrained Hilbert space is required to reach the continuum limit of SU(2) gauge theory, numerical resource constraints do not permit us to realize this requirement for all values of the coupling constant and large lattices. In each of the three studied cases we check first for random matrix theory (RMT) behavior in the eigenenergy spectrum and then analyze the diagonal as well as the off-diagonal matrix elements between energy eigenstates for a few operators. Within current uncertainties all results for (a), (b) and (c) agree with ETH predictions. Furthermore, we find the off-diagonal matrix elements of the electric energy operator exhibit RMT behavior in frequency windows that are small enough in (b) and (c). To unambiguously establish ETH behavior and determine for which class of operators it applies, an extension of our investigations is necessary.
△ Less
Submitted 10 January, 2024; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Fallback onto Kicked Neutron Stars and its Effect on Spin-Kick Alignment
Authors:
B. Müller
Abstract:
Fallback in core-collapse supernova explosions is potentially of significant importance for the birth spins of neutron stars and black holes. It has recently been pointed out that the angular momentum imparted onto a compact remnant by fallback material is subtly intertwined with its kick because fallback onto a moving neutron star or black hole will preferentially come for a conical region around…
▽ More
Fallback in core-collapse supernova explosions is potentially of significant importance for the birth spins of neutron stars and black holes. It has recently been pointed out that the angular momentum imparted onto a compact remnant by fallback material is subtly intertwined with its kick because fallback onto a moving neutron star or black hole will preferentially come for a conical region around its direction of travel. We show that contrary to earlier expectations such one-sided fallback accretion onto a neutron star will tend to produce spin-kick misalignment. Since the baroclinic driving term in the vorticity equation is perpendicular to the nearly radial pressure gradient, convective eddies in the progenitor as well as Rayleigh-Taylor plumes growing during the explosion primarily carry angular momentum perpendicular to the radial direction. Fallback material from the accretion volume of a moving neutron star therefore carries substantial angular momentum perpendicular to the kick velocity. We estimate the seed angular momentum fluctuations from convective motions in core-collapse supernova progenitors and argue that accreted fallback material will almost invariably be accreted with the maximum permissible specific angular momentum for reaching the Alfvén radius. This imposes a limit of $\mathord{\sim}10^{-2}M_\odot$ of fallback accretion for fast-spinning young neutron stars with periods of $\mathord{\sim}20\,\mathrm{ms}$ and less for longer birth spin periods.
△ Less
Submitted 26 September, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
"QGP Signatures" Revisited
Authors:
John W. Harris,
Berndt Müller
Abstract:
We revisit the graphic table of QCD signatures in our 1996 Annual Reviews article "The Search for the Quark-Gluon Plasma" and assess the progress that has been made since its publication towards providing quantitative evidence for the formation of a quark-gluon plasma in relativistic heavy-ion collisions and its characteristic properties.
We revisit the graphic table of QCD signatures in our 1996 Annual Reviews article "The Search for the Quark-Gluon Plasma" and assess the progress that has been made since its publication towards providing quantitative evidence for the formation of a quark-gluon plasma in relativistic heavy-ion collisions and its characteristic properties.
△ Less
Submitted 6 February, 2024; v1 submitted 10 August, 2023;
originally announced August 2023.
-
On the coupling of magnetic moments to superconducting quantum interference devices
Authors:
J. Linek,
M. Wyszynski,
B. Müller,
D. Korinski,
M. V. Milošević,
R. Kleiner,
D. Koelle
Abstract:
We investigate the coupling factor $φ_μ$ that quantifies the magnetic flux $Φ$ per magnetic moment $μ$ of a point-like magnetic dipole that couples to a superconducting quantum interference device (SQUID). Representing the dipole by a current-carrying loop, the reciprocity of mutual inductances of SQUID and loop provides a way of calculating $φ_μ(\vec{r}, \vec{e}_μ)$ vs.~position $\vec{r}$ and ori…
▽ More
We investigate the coupling factor $φ_μ$ that quantifies the magnetic flux $Φ$ per magnetic moment $μ$ of a point-like magnetic dipole that couples to a superconducting quantum interference device (SQUID). Representing the dipole by a current-carrying loop, the reciprocity of mutual inductances of SQUID and loop provides a way of calculating $φ_μ(\vec{r}, \vec{e}_μ)$ vs.~position $\vec{r}$ and orientation $\vec{e}_μ$ of the dipole anywhere in space from the magnetic field $B(\vec{r})$ produced by a supercurrent circulating in the SQUID loop. We use numerical simulations based on London and Ginzburg-Landau theory to calculate $φ_μ$ from the supercurrent density distributions in various SQUID geometries. We treat the far-field regime ($r\gtrsim a=$ inner size of the SQUID loop) with the dipole placed on the symmetry axis of circular or square shaped loops. We compare expressions for $φ_μ$ from filamentary loop models with simulation results for loops with finite width $w$ (outer size $A>a$), thickness $d$ and London penetration depth $λ_L$ and show that for thin ($d\ll a$) and narrow ($w < a$) loops the introduction of an effective loop size $a_{\rm eff}$ in the filamentary loop-model expressions results in agreement with simulations. For a dipole placed in the center of the loop, simulations provide an expression $φ_μ(a,A,d,λ_L)$ that covers a wide parameter range. In the near-field regime (dipole centered at small distance $z$ above one SQUID arm) only coupling to a single strip representing the SQUID arm has to be considered. Here, we compare simulations with an analytical expression derived for a homogeneous current density distribution, which yields excellent agreement for $λ_L>w,d$. Moreover, we analyze $φ_μ$ provided by the introduction of a constriction in the SQUID arm below the magnetic dipole.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
3D Simulations of Magnetoconvection in a Rapidly Rotating Supernova Progenitor
Authors:
Vishnu Varma,
Bernhard Mueller
Abstract:
We present a first 3D magnetohydrodynamic (MHD) simulation of oxygen, neon and carbon shell burning in a rapidly rotating 16 M_sun core-collapse supernova progenitor. We also run a purely hydrodynamic simulation for comparison. After 180s (15 and 7 convective turnovers respectively), the magnetic fields in the oxygen and neon shells achieve saturation at 10^{11}G and 5 x 10^{10}G. The strong Maxwe…
▽ More
We present a first 3D magnetohydrodynamic (MHD) simulation of oxygen, neon and carbon shell burning in a rapidly rotating 16 M_sun core-collapse supernova progenitor. We also run a purely hydrodynamic simulation for comparison. After 180s (15 and 7 convective turnovers respectively), the magnetic fields in the oxygen and neon shells achieve saturation at 10^{11}G and 5 x 10^{10}G. The strong Maxwell stresses become comparable to the radial Reynolds stresses and eventually suppress convection. The suppression of mixing by convection and shear instabilities results in the depletion of fuel at the base of the burning regions, so that the burning shell eventually move outward to cooler regions, thus reducing the energy generation rate. The strong magnetic fields efficiently transport angular momentum outwards, quickly spinning down the rapidly rotating convective oxygen and neon shells and forcing them into rigid rotation. The hydrodynamic model shows complicated redistribution of angular momentum and develops regions of retrograde rotation at the base of the convective shells. We discuss implications of our results for stellar evolution and for the subsequent core-collapse supernova. The rapid redistribution of angular momentum in the MHD model casts some doubt on the possibility of retaining significant core angular momentum for explosions driven by millisecond magnetars. However, findings from multi-D models remain tentative until stellar evolution calculations can provide more consistent rotation profiles and estimates of magnetic field strengths to initialise multi-D simulations without substantial numerical transients. We also stress the need for longer simulations, resolution studies, and an investigation of non-ideal effects.
△ Less
Submitted 23 October, 2023; v1 submitted 10 July, 2023;
originally announced July 2023.
-
Simple Hamiltonian for Quantum Simulation of Strongly Coupled 2+1D SU(2) Lattice Gauge Theory on a Honeycomb Lattice
Authors:
Berndt Müller,
Xiaojun Yao
Abstract:
We find a simple spin Hamiltonian to describe physical states of $2+1$ dimensional SU(2) lattice gauge theory on a honeycomb lattice with a truncation of the electric field representation at $j_{\rm max}=\frac{1}{2}$. The simple spin Hamiltonian only contains local products of Pauli matrices, even though Gauss's law has been completely integrated out.
We find a simple spin Hamiltonian to describe physical states of $2+1$ dimensional SU(2) lattice gauge theory on a honeycomb lattice with a truncation of the electric field representation at $j_{\rm max}=\frac{1}{2}$. The simple spin Hamiltonian only contains local products of Pauli matrices, even though Gauss's law has been completely integrated out.
△ Less
Submitted 13 November, 2023; v1 submitted 30 June, 2023;
originally announced July 2023.
-
Synthetic Light Curves and Spectra from a Self-Consistent 2D Simulation of an Ultra-strippped Supernova
Authors:
Thomas Maunder,
Bernhard Müller,
Fionntan Callan,
Stuart Sim,
Alexander Heger
Abstract:
Spectroscopy is an important tool for providing insights into the structure of core-collapse supernova explosions. We use the Monte Carlo radiative transfer code ARTIS to compute synthetic spectra and light curves based on a two-dimensional explosion model of an ultra-stripped supernova. These calculations are designed both to identify observable fingerprints of ultra-stripped supernovae and as a…
▽ More
Spectroscopy is an important tool for providing insights into the structure of core-collapse supernova explosions. We use the Monte Carlo radiative transfer code ARTIS to compute synthetic spectra and light curves based on a two-dimensional explosion model of an ultra-stripped supernova. These calculations are designed both to identify observable fingerprints of ultra-stripped supernovae and as a proof-of-principle for using synthetic spectroscopy to constrain the nature of stripped-envelope supernovae more broadly. We predict very characteristic spectral and photometric features for our ultra-stripped explosion model, but find that these do not match observed ultra-stripped supernova candidates like SN 2005ek. With a peak bolometric luminosity of $6.8\times10^{41}\,\mathrm{erg}\,\mathrm{s}^{-1}$, a peak magnitude of $-15.9\,\mathrm{mag}$ in R-band, and $Δm_{15,\mathrm{R}}=3.50$, the model is even fainter and evolves even faster than SN 2005ek as the closest possible analogue in photometric properties. The predicted spectra are extremely unusual. The most prominent features are Mg II lines at 2,800 Angstrom and 4,500 Angstrom and the infrared Ca triplet at late times. The Mg lines are sensitive to the multi-dimensional structure of the model and are viewing-angle dependent. They disappear due to line blanketing by Fe group elements in a spherically averaged model with additional microscopic mixing. In future studies, multi-D radiative transfer calculations need to be applied to a broader range of models to elucidate the nature of observed Type Ib/c supernovae.
△ Less
Submitted 27 May, 2023;
originally announced May 2023.
-
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
Authors:
Benjamin Muller,
John Wieting,
Jonathan H. Clark,
Tom Kwiatkowski,
Sebastian Ruder,
Livio Baldini Soares,
Roee Aharoni,
Jonathan Herzig,
Xinyi Wang
Abstract:
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve tr…
▽ More
Trustworthy answer content is abundant in many high-resource languages and is instantly accessible through question answering systems, yet this content can be hard to access for those that do not speak these languages. The leap forward in cross-lingual modeling quality offered by generative language models offers much promise, yet their raw generations often fall short in factuality. To improve trustworthiness in these systems, a promising direction is to attribute the answer to a retrieved source, possibly in a content-rich language different from the query. Our work is the first to study attribution for cross-lingual question answering. First, we collect data in 5 languages to assess the attribution level of a state-of-the-art cross-lingual QA system. To our surprise, we find that a substantial portion of the answers is not attributable to any retrieved passages (up to 50% of answers exactly matching a gold reference) despite the system being able to attend directly to the retrieved text. Second, to address this poor attribution level, we experiment with a wide range of attribution detection techniques. We find that Natural Language Inference models and PaLM 2 fine-tuned on a very small amount of attribution data can accurately detect attribution. Based on these models, we improve the attribution level of a cross-lingual question-answering system. Overall, we show that current academic generative cross-lingual QA systems have substantial shortcomings in attribution and we build tooling to mitigate these issues.
△ Less
Submitted 15 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Optimal transport of stationary point processes: Metric structure, gradient flow and convexity of the specific entropy
Authors:
Matthias Erbar,
Martin Huesmann,
Jonas Jalowy,
Bastian Müller
Abstract:
We develop a theory of optimal transport for stationary random measures with a focus on stationary point processes and construct a family of distances on the set of stationary random measures. These induce a natural notion of interpolation between two stationary random measures along a shortest curve connecting them. In the setting of stationary point processes we leverage this transport distance…
▽ More
We develop a theory of optimal transport for stationary random measures with a focus on stationary point processes and construct a family of distances on the set of stationary random measures. These induce a natural notion of interpolation between two stationary random measures along a shortest curve connecting them. In the setting of stationary point processes we leverage this transport distance to give a geometric interpretation for the evolution of infinite particle systems with stationary distribution. Namely, we characterise the evolution of infinitely many Brownian motions as the gradient flow of the specific relative entropy w.r.t.~the Poisson point process. Further, we establish displacement convexity of the specific relative entropy along optimal interpolations of point processes and establish an stationary analogue of the HWI inequality, relating specific entropy, transport distance, and a specific relative Fisher information.
△ Less
Submitted 1 February, 2024; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Black holes as the end state of stellar evolution: Theory and simulations
Authors:
Alexander Heger,
Bernhard Müller,
Ilya Mandel
Abstract:
The collapse of massive stars is one of the most-studied paths to black hole formation. In this chapter, we review black hole formation during the collapse of massive stars in the broader context of single and binary stellar evolution and the theory of supernova explosions. We provide a concise overview of the evolutionary channels that may lead to black hole formation -- the classical route of ir…
▽ More
The collapse of massive stars is one of the most-studied paths to black hole formation. In this chapter, we review black hole formation during the collapse of massive stars in the broader context of single and binary stellar evolution and the theory of supernova explosions. We provide a concise overview of the evolutionary channels that may lead to black hole formation -- the classical route of iron core collapse, collapse due to pair instability in very massive stars, and the hypothetical scenario of supermassive star collapse. We then review the current understanding of the parameter space for black hole formation and black hole birth properties that has emerged from theoretical and computational modelling of supernova explosions and transient observations. Finally, we discuss what the intricate interplay between stellar evolution, stellar explosions, and binary interactions implies for the formation of stellar-mass black holes.
△ Less
Submitted 18 April, 2023;
originally announced April 2023.
-
Spin alignment of vector mesons by glasma fields
Authors:
Avdhesh Kumar,
Berndt Müller,
Di-Lun Yang
Abstract:
We explain how spin alignment of vector mesons can be induced by background fields, such as electromagnetic fields or soft gluon fields. Our study is based on the quantum kinetic theory of spinning quarks and antiquarks and incorporates the relaxation of the dynamically generated spin polarization. The spin density matrix of vector mesons is obtained by quark coalescence via the Wigner function an…
▽ More
We explain how spin alignment of vector mesons can be induced by background fields, such as electromagnetic fields or soft gluon fields. Our study is based on the quantum kinetic theory of spinning quarks and antiquarks and incorporates the relaxation of the dynamically generated spin polarization. The spin density matrix of vector mesons is obtained by quark coalescence via the Wigner function and kinetic equation. Our approach predicts a local spin correlation that is distinct from the non-local expressions previously obtained in phenomenological derivations. We estimate the magnitude of such local correlations in the glasma model of the preequilibrium phase of relativistic heavy ion collisions. It is found that the resulting spin alignment could be greatly enhanced and may be comparable to the experimental measurement in order of magnitude. We further propose new phenomenological scenarios to qualitatively explain the transverse-momentum and centrality dependence of spin alignment in a self-consistent framework.
△ Less
Submitted 1 August, 2023; v1 submitted 9 April, 2023;
originally announced April 2023.
-
Hot QCD White Paper
Authors:
M. Arslandok,
S. A. Bass,
A. A. Baty,
I. Bautista,
C. Beattie,
F. Becattini,
R. Bellwied,
Y. Berdnikov,
A. Berdnikov,
J. Bielcik,
J. T. Blair,
F. Bock,
B. Boimska,
H. Bossi,
H. Caines,
Y. Chen,
Y. -T. Chien,
M. Chiu,
M. E. Connors,
M. Csanád,
C. L. da Silva,
A. P. Dash,
G. David,
K. Dehmelt,
V. Dexheimer
, et al. (149 additional authors not shown)
Abstract:
Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the…
▽ More
Hot QCD physics studies the nuclear strong force under extreme temperature and densities. Experimentally these conditions are achieved via high-energy collisions of heavy ions at the Relativistic Heavy Ion Collider (RHIC) and the Large Hadron Collider (LHC). In the past decade, a unique and substantial suite of data was collected at RHIC and the LHC, probing hydrodynamics at the nucleon scale, the temperature dependence of the transport properties of quark-gluon plasma, the phase diagram of nuclear matter, the interaction of quarks and gluons at different scales and much more. This document, as part of the 2023 nuclear science long range planning process, was written to review the progress in hot QCD since the 2015 Long Range Plan for Nuclear Science, as well as highlight the realization of previous recommendations, and present opportunities for the next decade, building on the accomplishments and investments made in theoretical developments and the construction of new detectors. Furthermore, this document provides additional context to support the recommendations voted on at the Joint Hot and Cold QCD Town Hall Meeting, which are reported in a separate document.
△ Less
Submitted 30 March, 2023;
originally announced March 2023.
-
Enabling Research through the SCIP Optimization Suite 8.0
Authors:
Ksenia Bestuzheva,
Mathieu Besançon,
Wei-Kun Chen,
Antonia Chmiela,
Tim Donkiewicz,
Jasper van Doornmalen,
Leon Eifler,
Oliver Gaul,
Gerald Gamrath,
Ambros Gleixner,
Leona Gottwald,
Christoph Graczyk,
Katrin Halbig,
Alexander Hoen,
Christopher Hojny,
Rolf van der Hulst,
Thorsten Koch,
Marco Lübbecke,
Stephen J. Maher,
Frederic Matter,
Erik Mühmer,
Benjamin Müller,
Marc E. Pfetsch,
Daniel Rehfeldt,
Steffan Schlein
, et al. (10 additional authors not shown)
Abstract:
The SCIP Optimization Suite provides a collection of software packages for mathematical optimization centered around the constraint integer programming framework SCIP. The focus of this paper is on the role of the SCIP Optimization Suite in supporting research. SCIP's main design principles are discussed, followed by a presentation of the latest performance improvements and developments in version…
▽ More
The SCIP Optimization Suite provides a collection of software packages for mathematical optimization centered around the constraint integer programming framework SCIP. The focus of this paper is on the role of the SCIP Optimization Suite in supporting research. SCIP's main design principles are discussed, followed by a presentation of the latest performance improvements and developments in version 8.0, which serve both as examples of SCIP's application as a research tool and as a platform for further developments. Further, the paper gives an overview of interfaces to other programming and modeling languages, new features that expand the possibilities for user interaction with the framework, and the latest developments in several extensions built upon SCIP.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
The Present and Future of QCD
Authors:
P. Achenbach,
D. Adhikari,
A. Afanasev,
F. Afzal,
C. A. Aidala,
A. Al-bataineh,
D. K. Almaalol,
M. Amaryan,
D. Androić,
W. R. Armstrong,
M. Arratia,
J. Arrington,
A. Asaturyan,
E. C. Aschenauer,
H. Atac,
H. Avakian,
T. Averett,
C. Ayerbe Gayoso,
X. Bai,
K. N. Barish,
N. Barnea,
G. Basar,
M. Battaglieri,
A. A. Baty,
I. Bautista
, et al. (378 additional authors not shown)
Abstract:
This White Paper presents the community inputs and scientific conclusions from the Hot and Cold QCD Town Meeting that took place September 23-25, 2022 at MIT, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 424 physicists registered for the meeting. The meeting highlighted progress in Quantum Chromodynamics (QCD) nuclear physics since the 2015…
▽ More
This White Paper presents the community inputs and scientific conclusions from the Hot and Cold QCD Town Meeting that took place September 23-25, 2022 at MIT, as part of the Nuclear Science Advisory Committee (NSAC) 2023 Long Range Planning process. A total of 424 physicists registered for the meeting. The meeting highlighted progress in Quantum Chromodynamics (QCD) nuclear physics since the 2015 LRP (LRP15) and identified key questions and plausible paths to obtaining answers to those questions, defining priorities for our research over the coming decade. In defining the priority of outstanding physics opportunities for the future, both prospects for the short (~ 5 years) and longer term (5-10 years and beyond) are identified together with the facilities, personnel and other resources needed to maximize the discovery potential and maintain United States leadership in QCD physics worldwide. This White Paper is organized as follows: In the Executive Summary, we detail the Recommendations and Initiatives that were presented and discussed at the Town Meeting, and their supporting rationales. Section 2 highlights major progress and accomplishments of the past seven years. It is followed, in Section 3, by an overview of the physics opportunities for the immediate future, and in relation with the next QCD frontier: the EIC. Section 4 provides an overview of the physics motivations and goals associated with the EIC. Section 5 is devoted to the workforce development and support of diversity, equity and inclusion. This is followed by a dedicated section on computing in Section 6. Section 7 describes the national need for nuclear data science and the relevance to QCD research.
△ Less
Submitted 4 March, 2023;
originally announced March 2023.
-
Transportation of random measures not charging small sets
Authors:
Martin Huesmann,
Bastian Müller
Abstract:
Let $(ξ,η)$ be a pair of jointly stationary, ergodic random measures of equal finite intensity. A balancing allocation is a translation-invariant (equivariant) map $T:\mathbb{R}^d\to\mathbb{R}^d$ such that the image measure of $ξ$ under $T$ is $η$. We show that as soon as $ξ$ does not charge small sets, i.e.\ does not give mass to $(d-1)$-rectifiable sets, there is always a balancing allocation…
▽ More
Let $(ξ,η)$ be a pair of jointly stationary, ergodic random measures of equal finite intensity. A balancing allocation is a translation-invariant (equivariant) map $T:\mathbb{R}^d\to\mathbb{R}^d$ such that the image measure of $ξ$ under $T$ is $η$. We show that as soon as $ξ$ does not charge small sets, i.e.\ does not give mass to $(d-1)$-rectifiable sets, there is always a balancing allocation $T$ which is measurably depending only on $(ξ,η)$, i.e. $T$ is a factor.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
Quantum Information Science and Technology for Nuclear Physics. Input into U.S. Long-Range Planning, 2023
Authors:
Douglas Beck,
Joseph Carlson,
Zohreh Davoudi,
Joseph Formaggio,
Sofia Quaglioni,
Martin Savage,
Joao Barata,
Tanmoy Bhattacharya,
Michael Bishof,
Ian Cloet,
Andrea Delgado,
Michael DeMarco,
Caleb Fink,
Adrien Florio,
Marianne Francois,
Dorota Grabowska,
Shannon Hoogerheide,
Mengyao Huang,
Kazuki Ikeda,
Marc Illa,
Kyungseon Joo,
Dmitri Kharzeev,
Karol Kowalski,
Wai Kin Lai,
Kyle Leach
, et al. (76 additional authors not shown)
Abstract:
In preparation for the 2023 NSAC Long Range Plan (LRP), members of the Nuclear Science community gathered to discuss the current state of, and plans for further leveraging opportunities in, QIST in NP research at the Quantum Information Science for U.S. Nuclear Physics Long Range Planning workshop, held in Santa Fe, New Mexico on January 31 - February 1, 2023. The workshop included 45 in-person pa…
▽ More
In preparation for the 2023 NSAC Long Range Plan (LRP), members of the Nuclear Science community gathered to discuss the current state of, and plans for further leveraging opportunities in, QIST in NP research at the Quantum Information Science for U.S. Nuclear Physics Long Range Planning workshop, held in Santa Fe, New Mexico on January 31 - February 1, 2023. The workshop included 45 in-person participants and 53 remote attendees. The outcome of the workshop identified strategic plans and requirements for the next 5-10 years to advance quantum sensing and quantum simulations within NP, and to develop a diverse quantum-ready workforce. The plans include resolutions endorsed by the participants to address the compelling scientific opportunities at the intersections of NP and QIST. These endorsements are aligned with similar affirmations by the LRP Computational Nuclear Physics and AI/ML Workshop, the Nuclear Structure, Reactions, and Astrophysics LRP Town Hall, and the Fundamental Symmetries, Neutrons, and Neutrinos LRP Town Hall communities.
△ Less
Submitted 28 February, 2023;
originally announced March 2023.
-
In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages
Authors:
Asım Ersoy,
Gerson Vizcarra,
Tasmiah Tahsin Mayeesha,
Benjamin Muller
Abstract:
Multilingual generative language models (LMs) are increasingly fluent in a large variety of languages. Trained on the concatenation of corpora in multiple languages, they enable powerful transfer from high-resource languages to low-resource ones. However, it is still unknown what cultural biases are induced in the predictions of these models. In this work, we focus on one language property highly…
▽ More
Multilingual generative language models (LMs) are increasingly fluent in a large variety of languages. Trained on the concatenation of corpora in multiple languages, they enable powerful transfer from high-resource languages to low-resource ones. However, it is still unknown what cultural biases are induced in the predictions of these models. In this work, we focus on one language property highly influenced by culture: formality. We analyze the formality distributions of XGLM and BLOOM's predictions, two popular generative multilingual language models, in 5 languages. We classify 1,200 generations per language as formal, informal, or incohesive and measure the impact of the prompt formality on the predictions. Overall, we observe a diversity of behaviors across the models and languages. For instance, XGLM generates informal text in Arabic and Bengali when conditioned with informal prompts, much more than BLOOM. In addition, even though both models are highly biased toward the formal style when prompted neutrally, we find that the models generate a significant amount of informal predictions even when prompted with formal text. We release with this work 6,000 annotated samples, paving the way for future work on the formality of generative multilingual LMs.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Simulations of the progenitors of black hole-neutron star gravitational wave sources
Authors:
Long Jiang,
Wen-Cong Chen,
Thomas M. Tauris,
Bernhard Muller,
Xiang-Dong Li
Abstract:
Recent discoveries of gravitational wave (GW) events most likely originating from black hole (BH) + neutron star (NS) mergers reveal the existence of BH+NS binaries. The formation of BH+NS binaries and their merger rates through isolated binary evolution have been investigated extensively with population synthesis simulations. A detailed stellar evolution modelings of the formation of this populat…
▽ More
Recent discoveries of gravitational wave (GW) events most likely originating from black hole (BH) + neutron star (NS) mergers reveal the existence of BH+NS binaries. The formation of BH+NS binaries and their merger rates through isolated binary evolution have been investigated extensively with population synthesis simulations. A detailed stellar evolution modelings of the formation of this population, however, is missing in the literature. In this work, we perform the first complete 1D model of more than 30 BH+NS progenitor systems which are calculated self-consistently until the iron core collapse with infall velocity exceeds 1000 km s^-1. Focusing on the progenitors of BH- NS GW sources, we apply the MESA code starting from a post-common envelope binary with short orbital period (< 1 day) consisting of a BH and a zero-age main-sequence helium star that experiences stable mass transfer. These NS masses could be significantly larger depending on the exact mass cut during the supernova explosion. These BH+NS systems are likely to merge and produce GW events within a Hubble time. System C is a potential progenitor of a GW200115-like event, while Systems A and B are possible candidates for a GW200105-like event and may represent the final destiny of the X-ray binary SS433.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
Dense Nuclear Matter Equation of State from Heavy-Ion Collisions
Authors:
Agnieszka Sorensen,
Kshitij Agarwal,
Kyle W. Brown,
Zbigniew Chajęcki,
Paweł Danielewicz,
Christian Drischler,
Stefano Gandolfi,
Jeremy W. Holt,
Matthias Kaminski,
Che-Ming Ko,
Rohit Kumar,
Bao-An Li,
William G. Lynch,
Alan B. McIntosh,
William G. Newton,
Scott Pratt,
Oleh Savchuk,
Maria Stefaniak,
Ingo Tews,
ManYee Betty Tsang,
Ramona Vogt,
Hermann Wolter,
Hanna Zbroszczyk,
Navid Abbasi,
Jörg Aichelin
, et al. (111 additional authors not shown)
Abstract:
The nuclear equation of state (EOS) is at the center of numerous theoretical and experimental efforts in nuclear physics. With advances in microscopic theories for nuclear interactions, the availability of experiments probing nuclear matter under conditions not reached before, endeavors to develop sophisticated and reliable transport simulations to interpret these experiments, and the advent of mu…
▽ More
The nuclear equation of state (EOS) is at the center of numerous theoretical and experimental efforts in nuclear physics. With advances in microscopic theories for nuclear interactions, the availability of experiments probing nuclear matter under conditions not reached before, endeavors to develop sophisticated and reliable transport simulations to interpret these experiments, and the advent of multi-messenger astronomy, the next decade will bring new opportunities for determining the nuclear matter EOS, elucidating its dependence on density, temperature, and isospin asymmetry. Among controlled terrestrial experiments, collisions of heavy nuclei at intermediate beam energies (from a few tens of MeV/nucleon to about 25 GeV/nucleon in the fixed-target frame) probe the widest ranges of baryon density and temperature, enabling studies of nuclear matter from a few tenths to about 5 times the nuclear saturation density and for temperatures from a few to well above a hundred MeV, respectively. Collisions of neutron-rich isotopes further bring the opportunity to probe effects due to the isospin asymmetry. However, capitalizing on the enormous scientific effort aimed at uncovering the dense nuclear matter EOS, both at RHIC and at FRIB as well as at other international facilities, depends on the continued development of state-of-the-art hadronic transport simulations. This white paper highlights the essential role that heavy-ion collision experiments and hadronic transport simulations play in understanding strong interactions in dense nuclear matter, with an emphasis on how these efforts can be used together with microscopic approaches and neutron star studies to uncover the nuclear EOS.
△ Less
Submitted 25 January, 2024; v1 submitted 30 January, 2023;
originally announced January 2023.
-
Gravitational Waves from a Core g-Mode in Supernovae as Probes of the High-Density Equation of State
Authors:
Pia Jakobus,
Bernhard Müller,
Alexander Heger,
Shuai Zha,
Jade Powell,
Anton Motornenko,
Jan Steinheimer,
Horst Stoecker
Abstract:
Using relativistic supernova simulations of massive progenitor stars with a quark-hadron equation of state (EoS) and a purely hadronic EoS, we identify a distinctive feature in the gravitational-wave signal that originates from a buoyancy-driven mode (g-mode) below the proto-neutron star convection zone. The mode frequency lies in the range $200\lesssim f\lesssim 800\,\text{Hz}$ and decreases with…
▽ More
Using relativistic supernova simulations of massive progenitor stars with a quark-hadron equation of state (EoS) and a purely hadronic EoS, we identify a distinctive feature in the gravitational-wave signal that originates from a buoyancy-driven mode (g-mode) below the proto-neutron star convection zone. The mode frequency lies in the range $200\lesssim f\lesssim 800\,\text{Hz}$ and decreases with time. As the mode lives in the core of the proto-neutron star, its frequency and power are highly sensitive to the EoS, in particular the sound speed around twice saturation density.
△ Less
Submitted 30 September, 2023; v1 submitted 16 January, 2023;
originally announced January 2023.
-
Tomographic imaging of microvasculature with a purpose-designed, polymeric X-ray contrast agent
Authors:
Willy Kuo,
Ngoc An Le,
Bernhard Spingler,
Georg Schulz,
Bert Müller,
Vartan Kurtcuoglu
Abstract:
Imaging of microvasculature is primarily performed with X-ray contrast agents, owing to the wide availability of absorption-contrast laboratory source microCT compared to phase contrast capable devices. Standard commercial contrast agents used in angiography are not suitable for high-resolution imaging ex vivo, however, as they are small molecular compounds capable of diffusing through blood vesse…
▽ More
Imaging of microvasculature is primarily performed with X-ray contrast agents, owing to the wide availability of absorption-contrast laboratory source microCT compared to phase contrast capable devices. Standard commercial contrast agents used in angiography are not suitable for high-resolution imaging ex vivo, however, as they are small molecular compounds capable of diffusing through blood vessel walls within minutes. Large nanoparticle-based blood pool contrast agents on the other hand exhibit problems with aggregation, resulting in clogging in the smallest blood vessels. Injection with solidifying plastic resins has, therefore, remained the gold standard for microvascular imaging, despite the considerable amount of training and optimization needed to properly perfuse the viscous compounds. Even with optimization, frequent gas and water inclusions commonly result in interrupted vessel segments. This lack of suitable compounds has led us to develop the polymeric, cross-linkable X-ray contrast agent XlinCA. As a water-soluble organic molecule, aggregation and inclusions are inherently avoided. High molecular weight allows it to be retained even in the highly fenestrated vasculature of the kidney filtration system. It can be covalently crosslinked using the same aldehydes used in tissue fixation protocols, leading to stable and permanent contrast. These properties allowed us to image whole mice and individual organs in 6 to 12-month-old C57BL/6J mice without requiring lengthy optimizations of injection rates and pressures, while at the same time achieving greatly improved filling of the vasculature compared to resin-based vascular casting. This work aims at illuminating the rationales, processes and challenges involved in creating this recently developed contrast agent.
△ Less
Submitted 16 January, 2023;
originally announced January 2023.
-
Global Optimization of Mixed-Integer Nonlinear Programs with SCIP 8
Authors:
Ksenia Bestuzheva,
Antonia Chmiela,
Benjamin Müller,
Felipe Serrano,
Stefan Vigerske,
Fabian Wegscheider
Abstract:
For over ten years, the constraint integer programming framework SCIP has been extended by capabilities for the solution of convex and nonconvex mixed-integer nonlinear programs (MINLPs). With the recently published version 8.0, these capabilities have been largely reworked and extended. This paper discusses the motivations for recent changes and provides an overview of features that are particula…
▽ More
For over ten years, the constraint integer programming framework SCIP has been extended by capabilities for the solution of convex and nonconvex mixed-integer nonlinear programs (MINLPs). With the recently published version 8.0, these capabilities have been largely reworked and extended. This paper discusses the motivations for recent changes and provides an overview of features that are particular to MINLP solving in SCIP. Further, difficulties in benchmarking global MINLP solvers are discussed and a comparison with several state-of-the-art global MINLP solvers is provided.
△ Less
Submitted 2 January, 2023;
originally announced January 2023.
-
Light Curves of Type IIP Supernovae from Neutrino-driven Explosions of Red Supergiants Obtained by a Semi-analytic Approach
Authors:
Shuai Zha,
Bernhard Müller,
Amy Weir,
Alexander Heger
Abstract:
Type IIP supernovae (SNe IIP) mark the explosive death of red supergiants (RSGs), evolved massive stars with an extended hydrogen envelope. They are the most common supernova type and allow for benchmarking of supernova explosion models by statistical comparison to observed population properties rather than comparing individual models and events. We construct a large synthetic set of SNe IIP light…
▽ More
Type IIP supernovae (SNe IIP) mark the explosive death of red supergiants (RSGs), evolved massive stars with an extended hydrogen envelope. They are the most common supernova type and allow for benchmarking of supernova explosion models by statistical comparison to observed population properties rather than comparing individual models and events. We construct a large synthetic set of SNe IIP light curves (LCs) using the radiation hydrodynamics code \texttt{SNEC} and explosion energies and nickel masses obtained from an efficient semi-analytic model for two different sets of stellar progenitor models. By direct comparison we demonstrate that the semi-analytic model yields very similar predictions as alternative phenomenological explosion models based on one-dimensional simulations. We find systematic differences of a factor of $\mathord{\sim}2$ in plateau luminosities between the two progenitor sets due to different stellar radii, which highlights the importance of the RSG envelope structure as a major uncertainty in interpreting LCs of SNe IIP. A comparison to a volume-limited sample of observed SNe IIP shows decent agreement in plateau luminosity, plateau duration and nickel mass for at least one of the synthetic LC sets. The models, however, do not produce sufficient events with very small nickel mass $M_\mathrm{Ni}<0.01\,M_\odot$ and predict an anticorrelation between plateau luminosity and plateau duration that is not present in the observed sample, a result that warrants further study. Our results suggest that a better understanding of RSG stellar structure is no less important for reliably explaining the light curves of SNe IIP than the explosion physics.
△ Less
Submitted 25 May, 2023; v1 submitted 1 January, 2023;
originally announced January 2023.
-
Spin polarization and correlation of quarks from glasma
Authors:
Avdhesh Kumar,
Berndt Müller,
Di-Lun Yang
Abstract:
We investigate the interaction of strong color fields in the glasma stage of high-energy nuclear collisions with the spins of quarks and antiquarks. We employ the perturbative solution of the quantum kinetic theory for the spin transport of (massive) quarks in a background color field governed by the linearized Yang-Mills equation and derive expressions for the quark-spin polarization and quark-an…
▽ More
We investigate the interaction of strong color fields in the glasma stage of high-energy nuclear collisions with the spins of quarks and antiquarks. We employ the perturbative solution of the quantum kinetic theory for the spin transport of (massive) quarks in a background color field governed by the linearized Yang-Mills equation and derive expressions for the quark-spin polarization and quark-antiquark spin correlation at small momentum in terms of field correlators. For the Golec-Biernat Wüsthoff dipole distribution the quark-spin polarization vanishes, but the out-of-plane spin correlation of quarks and antiquarks is nonzero. Our order-of-magnitude estimate of the correlation far exceeds that caused by vorticity effects, but does not fully explain the data for vector meson alignment. We identify possible mechanisms that could further increase the predicted spin correlation.
△ Less
Submitted 27 April, 2023; v1 submitted 26 December, 2022;
originally announced December 2022.
-
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer
Authors:
Benjamin Muller,
Deepanshu Gupta,
Siddharth Patwardhan,
Jean-Philippe Fauconnier,
David Vandyke,
Sachin Agarwal
Abstract:
Multi-lingual language models (LM), such as mBERT, XLM-R, mT5, mBART, have been remarkably successful in enabling natural language tasks in low-resource languages through cross-lingual transfer from high-resource ones. In this work, we try to better understand how such models, specifically mT5, transfer *any* linguistic and semantic knowledge across languages, even though no explicit cross-lingual…
▽ More
Multi-lingual language models (LM), such as mBERT, XLM-R, mT5, mBART, have been remarkably successful in enabling natural language tasks in low-resource languages through cross-lingual transfer from high-resource ones. In this work, we try to better understand how such models, specifically mT5, transfer *any* linguistic and semantic knowledge across languages, even though no explicit cross-lingual signals are provided during pre-training. Rather, only unannotated texts from each language are presented to the model separately and independently of one another, and the model appears to implicitly learn cross-lingual connections. This raises several questions that motivate our study, such as: Are the cross-lingual connections between every language pair equally strong? What properties of source and target language impact the strength of cross-lingual transfer? Can we quantify the impact of those properties on the cross-lingual transfer?
In our investigation, we analyze a pre-trained mT5 to discover the attributes of cross-lingual connections learned by the model. Through a statistical interpretation framework over 90 language pairs across three tasks, we show that transfer performance can be modeled by a few linguistic and data-derived features. These observations enable us to interpret cross-lingual understanding of the mT5 model. Through these observations, one can favorably choose the best source language for a task, and can anticipate its training data demands. A key finding of this work is that similarity of syntax, morphology and phonology are good predictors of cross-lingual transfer, significantly more than just the lexical similarity of languages. For a given language, we are able to predict zero-shot performance, that increases on a logarithmic scale with the number of few-shot target language data points.
△ Less
Submitted 4 December, 2022;
originally announced December 2022.
-
Three dimensional magnetorotational core-collapse supernova explosions of a 39 solar mass progenitor star
Authors:
Jade Powell,
Bernhard Mueller,
David R. Aguilera-Dena,
Norbert Langer
Abstract:
We perform three-dimensional simulations of magnetorotational supernovae using a $39\,M_{\odot}$ progenitor star with two different initial magnetic field strengths of $10^{10}$ G and $10^{12}$ G in the core. Both models rapidly undergo shock revival and their explosion energies asymptote within a few hundred milliseconds to values of $\gtrsim 2\times10^{51}$ erg after conservatively correcting fo…
▽ More
We perform three-dimensional simulations of magnetorotational supernovae using a $39\,M_{\odot}$ progenitor star with two different initial magnetic field strengths of $10^{10}$ G and $10^{12}$ G in the core. Both models rapidly undergo shock revival and their explosion energies asymptote within a few hundred milliseconds to values of $\gtrsim 2\times10^{51}$ erg after conservatively correcting for the binding energy of the envelope. Magnetically collimated, non-relativistic jets form in both models, though the jets are subject to non-axisymmetric instabilities. The jets do not appear crucial for driving the explosion, as they only emerge once the shock has already expanded considerably. Our simulations predict moderate neutron star kicks of about $150\, \mathrm{km}\,\mathrm{s}^{-1}$, no spin-kick alignment, and rapid early spin-down that would result in birth periods of about $20\, \mathrm{ms}$, too slow to power an energetic gamma-ray burst jet. More than $0.2\,M_\odot$ of iron-group material are ejected, but we estimate that the mass of ejected $^{56}\mathrm{Ni}$ will be considerably smaller as the bulk of this material is neutron-rich. Explosive burning does not contribute appreciable amounts of $^{56}\mathrm{Ni}$ because the burned material originates from the slightly neutron-rich silicon shell. The iron-group ejecta also show no pronounced bipolar geometry by the end of the simulations. The models thus do not immediately fit the characteristics of observed hypernovae, but may be representative of other transients with moderately high explosion energies. The gravitational-wave emission reaches high frequencies of up to 2000 Hz and amplitudes of over 100 cm. The gravitational-wave emission is detectable out to distances of $\sim4$ Mpc in the planned Cosmic Explorer detector.
△ Less
Submitted 4 May, 2023; v1 submitted 30 November, 2022;
originally announced December 2022.
-
Quark-Hadron Transition and Entanglement
Authors:
Berndt Müller,
Andreas Schäfer
Abstract:
The dual holographic description has enjoyed many successes in explaining fundamental properties of the early stages of relativistic heavy ion collisions up to the formation of a minimal-viscosity quark-gluon fluid. However, there have been few attempts to extend its application beyond this stage. Here we explore the prospects for such an extension beyond the time of hadronization. Our discussion…
▽ More
The dual holographic description has enjoyed many successes in explaining fundamental properties of the early stages of relativistic heavy ion collisions up to the formation of a minimal-viscosity quark-gluon fluid. However, there have been few attempts to extend its application beyond this stage. Here we explore the prospects for such an extension beyond the time of hadronization. Our discussion makes use of recent insights into the duality of entanglement properties of field theory states in the edge of Anti-de Sitter space and non-trivial topologies of horizons in the bulk, often referred to as ER = EPR duality. We discuss this topic from the point of view of heavy-ion phenomenology, review several relevant concepts, and map out a path toward combining them into a comprehensive, at least semiquantitative description of relativistic heavy ion collisions. We outline possible next steps in this direction.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.