-
Hunting for extra dimensions in black hole shadows
Authors:
A. S. Lemos,
J. A. V. Campos,
F. A. Brito
Abstract:
Observational data of the Sagittarius A* (Sgr A*) shadow released by the Event Horizon Telescope (EHT) are used to investigate eventual deviations in the black hole shadow radius, aiming to seek physics beyond the Standard Model (SM) coming from extra-dimensional theory. We consider the brane-world scenario described by the Randall-Sundrum model and determine the black hole shadow radius correctio…
▽ More
Observational data of the Sagittarius A* (Sgr A*) shadow released by the Event Horizon Telescope (EHT) are used to investigate eventual deviations in the black hole shadow radius, aiming to seek physics beyond the Standard Model (SM) coming from extra-dimensional theory. We consider the brane-world scenario described by the Randall-Sundrum model and determine the black hole shadow radius correction owing to the higher dimension. From data of the shadow radius in units of BH mass determined by KECK- and VLTI-based estimates, we imposed restrictions on the deviation obtained, and one sets an upper limit to the curvature radius of Anti-de Sitter ($\mathrm{AdS_{5}}$) spacetime $\ell\lesssim4.3\times10^{-2}\,\mathrm{AU}$ (at $95\%$ confidence level).
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Dark Energy Survey Year 3 Results: Cosmology from galaxy clustering and galaxy-galaxy lensing in harmonic space
Authors:
L. Faga,
F. Andrade-Oliveira,
H. Camacho,
R. Rosenfeld,
M. Lima,
C. Doux,
X. Fang,
J. Prat,
A. Porredon,
M. Aguena,
A. Alarcon,
S. Allam,
O. Alves,
A. Amon,
S. Avila,
D. Bacon,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
S. Bocquet,
D. Brooks,
E. Buckley-Geer,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind
, et al. (78 additional authors not shown)
Abstract:
We present the joint tomographic analysis of galaxy-galaxy lensing and galaxy clustering in harmonic space, using galaxy catalogues from the first three years of observations by the Dark Energy Survey (DES Y3). We utilise the redMaGiC and MagLim catalogues as lens galaxies and the METACALIBRATION catalogue as source galaxies. The measurements of angular power spectra are performed using the pseudo…
▽ More
We present the joint tomographic analysis of galaxy-galaxy lensing and galaxy clustering in harmonic space, using galaxy catalogues from the first three years of observations by the Dark Energy Survey (DES Y3). We utilise the redMaGiC and MagLim catalogues as lens galaxies and the METACALIBRATION catalogue as source galaxies. The measurements of angular power spectra are performed using the pseudo-$C_\ell$ method, and our theoretical modelling follows the fiducial analyses performed by DES Y3 in configuration space, accounting for galaxy bias, intrinsic alignments, magnification bias, shear magnification bias and photometric redshift uncertainties. We explore different approaches for scale cuts based on non-linear galaxy bias and baryonic effects contamination. Our fiducial covariance matrix is computed analytically, accounting for mask geometry in the Gaussian term, and including non-Gaussian contributions and super-sample covariance terms. To validate our harmonic space pipelines and covariance matrix, we used a suite of 1800 log-normal simulations. We also perform a series of stress tests to gauge the robustness of our harmonic space analysis. In the $Λ$CDM model, the clustering amplitude $S_8 =σ_8(Ω_m/0.3)^{0.5}$ is constrained to $S_8 = 0.704\pm 0.029$ and $S_8 = 0.753\pm 0.024$ ($68\%$ C.L.) for the redMaGiC and MagLim catalogues, respectively. For the $w$CDM, the dark energy equation of state is constrained to $w = -1.28 \pm 0.29$ and $w = -1.26^{+0.34}_{-0.27}$, for redMaGiC and MagLim catalogues, respectively. These results are compatible with the corresponding DES Y3 results in configuration space and pave the way for harmonic space analyses using the DES Y6 data.
△ Less
Submitted 18 June, 2024;
originally announced June 2024.
-
Some things never change: how far generative AI can really change software engineering practice
Authors:
Aline de Campos,
Jorge Melegati,
Nicolas Nascimento,
Rafael Chanin,
Afonso Sales,
Igor Wiese
Abstract:
Generative Artificial Intelligence (GenAI) has become an emerging technology with the availability of several tools that could impact Software Engineering (SE) activities. As any other disruptive technology, GenAI led to the speculation that its full potential can deeply change SE. However, an overfocus on improving activities for which GenAI is more suitable could negligent other relevant areas o…
▽ More
Generative Artificial Intelligence (GenAI) has become an emerging technology with the availability of several tools that could impact Software Engineering (SE) activities. As any other disruptive technology, GenAI led to the speculation that its full potential can deeply change SE. However, an overfocus on improving activities for which GenAI is more suitable could negligent other relevant areas of the process. In this paper, we aim to explore which SE activities are not expected to be profoundly changed by GenAI. To achieve this goal, we performed a survey with SE practitioners to identify their expectations regarding GenAI in SE, including impacts, challenges, ethical issues, and aspects they do not expect to change. We compared our results with previous roadmaps proposed in SE literature. Our results show that although practitioners expect an increase in productivity, coding, and process quality, they envision that some aspects will not change, such as the need for human expertise, creativity, and project management. Our results point to SE areas for which GenAI is probably not so useful, and future research could tackle them to improve SE practice.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Improving Reward Models with Synthetic Critiques
Authors:
Zihuiwen Ye,
Fraser Greenlee-Scott,
Max Bartolo,
Phil Blunsom,
Jon Ander Campos,
Matthias Gallé
Abstract:
Reward models (RM) play a critical role in aligning language models through the process of reinforcement learning from human feedback. RMs are trained to predict a score reflecting human preference, which requires significant time and cost for human annotation. Additionally, RMs tend to quickly overfit on superficial features in the training set, hindering their generalization performance on unsee…
▽ More
Reward models (RM) play a critical role in aligning language models through the process of reinforcement learning from human feedback. RMs are trained to predict a score reflecting human preference, which requires significant time and cost for human annotation. Additionally, RMs tend to quickly overfit on superficial features in the training set, hindering their generalization performance on unseen distributions. We propose a novel approach using synthetic natural language critiques generated by large language models to provide additional feedback, evaluating aspects such as instruction following, correctness, and style. This offers richer signals and more robust features for RMs to assess and score on. We demonstrate that high-quality critiques improve the performance and data efficiency of RMs initialized from different pretrained models. Conversely, we also show that low-quality critiques negatively impact performance. Furthermore, incorporating critiques enhances the interpretability and robustness of RM training.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
The RSNA Abdominal Traumatic Injury CT (RATIC) Dataset
Authors:
Jeffrey D. Rudie,
Hui-Ming Lin,
Robyn L. Ball,
Sabeena Jalal,
Luciano M. Prevedello,
Savvas Nicolaou,
Brett S. Marinelli,
Adam E. Flanders,
Kirti Magudia,
George Shih,
Melissa A. Davis,
John Mongan,
Peter D. Chang,
Ferco H. Berger,
Sebastiaan Hermans,
Meng Law,
Tyler Richards,
Jan-Peter Grunz,
Andreas Steven Kunz,
Shobhit Mathur,
Sandro Galea-Soler,
Andrew D. Chung,
Saif Afat,
Chin-Chi Kuo,
Layal Aweidah
, et al. (15 additional authors not shown)
Abstract:
The RSNA Abdominal Traumatic Injury CT (RATIC) dataset is the largest publicly available collection of adult abdominal CT studies annotated for traumatic injuries. This dataset includes 4,274 studies from 23 institutions across 14 countries. The dataset is freely available for non-commercial use via Kaggle at https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection. Created for the…
▽ More
The RSNA Abdominal Traumatic Injury CT (RATIC) dataset is the largest publicly available collection of adult abdominal CT studies annotated for traumatic injuries. This dataset includes 4,274 studies from 23 institutions across 14 countries. The dataset is freely available for non-commercial use via Kaggle at https://www.kaggle.com/competitions/rsna-2023-abdominal-trauma-detection. Created for the RSNA 2023 Abdominal Trauma Detection competition, the dataset encourages the development of advanced machine learning models for detecting abdominal injuries on CT scans. The dataset encompasses detection and classification of traumatic injuries across multiple organs, including the liver, spleen, kidneys, bowel, and mesentery. Annotations were created by expert radiologists from the American Society of Emergency Radiology (ASER) and Society of Abdominal Radiology (SAR). The dataset is annotated at multiple levels, including the presence of injuries in three solid organs with injury grading, image-level annotations for active extravasations and bowel injury, and voxelwise segmentations of each of the potentially injured organs. With the release of this dataset, we hope to facilitate research and development in machine learning and abdominal trauma that can lead to improved patient care and outcomes.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Aya 23: Open Weight Releases to Further Multilingual Progress
Authors:
Viraat Aryabumi,
John Dang,
Dwarak Talupuru,
Saurabh Dash,
David Cairuz,
Hangyu Lin,
Bharat Venkitesh,
Madeline Smith,
Jon Ander Campos,
Yi Chern Tan,
Kelly Marchisio,
Max Bartolo,
Sebastian Ruder,
Acyr Locatelli,
Julia Kreutzer,
Nick Frosst,
Aidan Gomez,
Phil Blunsom,
Marzieh Fadaee,
Ahmet Üstün,
Sara Hooker
Abstract:
This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (Üstün et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya collection (Singh et al., 2024). The result is a powerful multilingual large language model serving 23 languages, expanding state-of-art language modelin…
▽ More
This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (Üstün et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya collection (Singh et al., 2024). The result is a powerful multilingual large language model serving 23 languages, expanding state-of-art language modeling capabilities to approximately half of the world's population. The Aya model covered 101 languages whereas Aya 23 is an experiment in depth vs breadth, exploring the impact of allocating more capacity to fewer languages that are included during pre-training. Aya 23 outperforms both previous massively multilingual models like Aya 101 for the languages it covers, as well as widely used models like Gemma, Mistral and Mixtral on an extensive range of discriminative and generative tasks. We release the open weights for both the 8B and 35B models as part of our continued commitment for expanding access to multilingual progress.
△ Less
Submitted 31 May, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps II. Cosmological results
Authors:
M. Gatti,
G. Campailla,
N. Jeffrey,
L. Whiteway,
A. Porredon,
J. Prat,
J. Williamson,
M. Raveri,
B. Jain,
V. Ajani,
G. Giannini,
M. Yamamoto,
C. Zhou,
J. Blazek,
D. Anbajagane,
S. Samuroff,
T. Kacprzak,
A. Alarcon,
A. Amon,
K. Bechtol,
M. Becker,
G. Bernstein,
A. Campos,
C. Chang,
R. Chen
, et al. (77 additional authors not shown)
Abstract:
We present a simulation-based cosmological analysis using a combination of Gaussian and non-Gaussian statistics of the weak lensing mass (convergence) maps from the first three years (Y3) of the Dark Energy Survey (DES). We implement: 1) second and third moments; 2) wavelet phase harmonics; 3) the scattering transform. Our analysis is fully based on simulations, spans a space of seven $νw$CDM cosm…
▽ More
We present a simulation-based cosmological analysis using a combination of Gaussian and non-Gaussian statistics of the weak lensing mass (convergence) maps from the first three years (Y3) of the Dark Energy Survey (DES). We implement: 1) second and third moments; 2) wavelet phase harmonics; 3) the scattering transform. Our analysis is fully based on simulations, spans a space of seven $νw$CDM cosmological parameters, and forward models the most relevant sources of systematics inherent in the data: masks, noise variations, clustering of the sources, intrinsic alignments, and shear and redshift calibration. We implement a neural network compression of the summary statistics, and we estimate the parameter posteriors using a simulation-based inference approach. Including and combining different non-Gaussian statistics is a powerful tool that strongly improves constraints over Gaussian statistics (in our case, the second moments); in particular, the Figure of Merit $\textrm{FoM}(S_8, Ω_{\textrm{m}})$ is improved by 70 percent ($Λ$CDM) and 90 percent ($w$CDM). When all the summary statistics are combined, we achieve a 2 percent constraint on the amplitude of fluctuations parameter $S_8 \equiv σ_8 (Ω_{\textrm{m}}/0.3)^{0.5}$, obtaining $S_8 = 0.794 \pm 0.017$ ($Λ$CDM) and $S_8 = 0.817 \pm 0.021$ ($w$CDM). The constraints from different statistics are shown to be internally consistent (with a $p$-value>0.1 for all combinations of statistics examined). We compare our results to other weak lensing results from the DES Y3 data, finding good consistency; we also compare with results from external datasets, such as \planck{} constraints from the Cosmic Microwave Background, finding statistical agreement, with discrepancies no greater than $<2.2σ$.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Analysis of singularly perturbed stochastic chemical reaction networks motivated by applications to epigenetic cell memory
Authors:
Simone Bruno,
Felipe A. Campos,
Yi Fu,
Domitilla Del Vecchio,
Ruth J. Williams
Abstract:
Epigenetic cell memory, the inheritance of gene expression patterns across subsequent cell divisions, is a critical property of multi-cellular organisms. In recent work [10], a subset of the authors observed in a simulation study how the stochastic dynamics and time-scale differences between establishment and erasure processes in chromatin modifications (such as histone modifications and DNA methy…
▽ More
Epigenetic cell memory, the inheritance of gene expression patterns across subsequent cell divisions, is a critical property of multi-cellular organisms. In recent work [10], a subset of the authors observed in a simulation study how the stochastic dynamics and time-scale differences between establishment and erasure processes in chromatin modifications (such as histone modifications and DNA methylation) can have a critical effect on epigenetic cell memory. In this paper, we provide a mathematical framework to rigorously validate and extend beyond these computational findings. Viewing our stochastic model of a chromatin modification circuit as a singularly perturbed, finite state, continuous time Markov chain, we extend beyond existing theory in order to characterize the leading coefficients in the series expansions of stationary distributions and mean first passage times. In particular, we characterize the limiting stationary distribution in terms of a reduced Markov chain, provide an algorithm to determine the orders of the poles of mean first passage times, and determine how changing erasure rates affects system behavior. The theoretical tools developed in this paper not only allow us to set a rigorous mathematical basis for the computational findings of our prior work, highlighting the effect of chromatin modification dynamics on epigenetic cell memory, but they can also be applied to other singularly perturbed Markov chains beyond the applications in this paper, especially those associated with chemical reaction networks.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
First order of the renewal covering of the natural numbers
Authors:
Alberto M. Campos
Abstract:
This paper introduces a new type of covering process that covers the set of natural numbers using renewal processes as objects. Inspired by the behavior of prime numbers, the model in each step finds the smallest vacant point, $k$, and place, starting in $k$, a renewal process with a step distribution given by a geometric random variable with parameter $\frac{1}{k}$. The model depends on its entir…
▽ More
This paper introduces a new type of covering process that covers the set of natural numbers using renewal processes as objects. Inspired by the behavior of prime numbers, the model in each step finds the smallest vacant point, $k$, and place, starting in $k$, a renewal process with a step distribution given by a geometric random variable with parameter $\frac{1}{k}$. The model depends on its entire past, and small perturbations in its initial value can lead to very different outcomes. Here, we expose a technique that finds the first-order limit behavior for the number of objects placed until $n$, which exhibits intriguing similarities to prime number distributions, having a concentration around $n\log{n}$.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively
Authors:
Tiziano Labruna,
Jon Ander Campos,
Gorka Azkune
Abstract:
In this paper, we demonstrate how Large Language Models (LLMs) can effectively learn to use an off-the-shelf information retrieval (IR) system specifically when additional context is required to answer a given question. Given the performance of IR systems, the optimal strategy for question answering does not always entail external information retrieval; rather, it often involves leveraging the par…
▽ More
In this paper, we demonstrate how Large Language Models (LLMs) can effectively learn to use an off-the-shelf information retrieval (IR) system specifically when additional context is required to answer a given question. Given the performance of IR systems, the optimal strategy for question answering does not always entail external information retrieval; rather, it often involves leveraging the parametric memory of the LLM itself. Prior research has identified this phenomenon in the PopQA dataset, wherein the most popular questions are effectively addressed using the LLM's parametric memory, while less popular ones require IR system usage. Following this, we propose a tailored training approach for LLMs, leveraging existing open-domain question answering datasets. Here, LLMs are trained to generate a special token, <RET>, when they do not know the answer to a question. Our evaluation of the Adaptive Retrieval LLM (Adapt-LLM) on the PopQA dataset showcases improvements over the same LLM under three configurations: (i) retrieving information for all the questions, (ii) using always the parametric memory of the LLM, and (iii) using a popularity threshold to decide when to use a retriever. Through our analysis, we demonstrate that Adapt-LLM is able to generate the <RET> token when it determines that it does not know how to answer a question, indicating the need for IR, while it achieves notably high accuracy levels when it chooses to rely only on its parametric memory.
△ Less
Submitted 6 May, 2024; v1 submitted 30 April, 2024;
originally announced April 2024.
-
Weak lensing combined with the kinetic Sunyaev Zel'dovich effect: A study of baryonic feedback
Authors:
L. Bigwood,
A. Amon,
A. Schneider,
J. Salcido,
I. G. McCarthy,
C. Preston,
D. Sanchez,
D. Sijacki,
E. Schaan,
S. Ferraro,
N. Battaglia,
A. Chen,
S. Dodelson,
A. Roodman,
A. Pieres,
A. Ferte,
A. Alarcon,
A. Drlica-Wagner,
A. Choi,
A. Navarro-Alsina,
A. Campos,
A. J. Ross,
A. Carnero Rosell,
B. Yin,
B. Yanny
, et al. (100 additional authors not shown)
Abstract:
Extracting precise cosmology from weak lensing surveys requires modelling the non-linear matter power spectrum, which is suppressed at small scales due to baryonic feedback processes. However, hydrodynamical galaxy formation simulations make widely varying predictions for the amplitude and extent of this effect. We use measurements of Dark Energy Survey Year 3 weak lensing (WL) and Atacama Cosmolo…
▽ More
Extracting precise cosmology from weak lensing surveys requires modelling the non-linear matter power spectrum, which is suppressed at small scales due to baryonic feedback processes. However, hydrodynamical galaxy formation simulations make widely varying predictions for the amplitude and extent of this effect. We use measurements of Dark Energy Survey Year 3 weak lensing (WL) and Atacama Cosmology Telescope DR5 kinematic Sunyaev-Zel'dovich (kSZ) to jointly constrain cosmological and astrophysical baryonic feedback parameters using a flexible analytical model, `baryonification'. First, using WL only, we compare the $S_8$ constraints using baryonification to a simulation-calibrated halo model, a simulation-based emulator model and the approach of discarding WL measurements on small angular scales. We find that model flexibility can shift the value of $S_8$ and degrade the uncertainty. The kSZ provides additional constraints on the astrophysical parameters and shifts $S_8$ to $S_8=0.823^{+0.019}_{-0.020}$, a higher value than attained using the WL-only analysis. We measure the suppression of the non-linear matter power spectrum using WL + kSZ and constrain a mean feedback scenario that is more extreme than the predictions from most hydrodynamical simulations. We constrain the baryon fractions and the gas mass fractions and find them to be generally lower than inferred from X-ray observations and simulation predictions. We conclude that the WL + kSZ measurements provide a new and complementary benchmark for building a coherent picture of the impact of gas around galaxies across observations.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
The Active Asteroids Citizen Science Program: Overview and First Results
Authors:
Colin Orion Chandler,
Chadwick A. Trujillo,
William J. Oldroyd,
Jay K. Kueny,
William A. Burris,
Henry H. Hsieh,
Jarod A. DeSpain,
Nima Sedaghat,
Scott S. Sheppard,
Kennedy A. Farrell,
David E. Trilling,
Annika Gustafsson,
Mark Jesus Mendoza Magbanua,
Michele T. Mazzucato,
Milton K. D. Bosch,
Tiffany Shaw-Diaz,
Virgilio Gonano,
Al Lamperti,
José A. da Silva Campos,
Brian L. Goodwin,
Ivan A. Terentev,
Charles J. A. Dukes,
Sam Deen
Abstract:
We present the Citizen Science program Active Asteroids and describe discoveries stemming from our ongoing project. Our NASA Partner program is hosted on the Zooniverse online platform and launched on 2021 August 31, with the goal of engaging the community in the search for active asteroids -- asteroids with comet-like tails or comae. We also set out to identify other unusual active solar system o…
▽ More
We present the Citizen Science program Active Asteroids and describe discoveries stemming from our ongoing project. Our NASA Partner program is hosted on the Zooniverse online platform and launched on 2021 August 31, with the goal of engaging the community in the search for active asteroids -- asteroids with comet-like tails or comae. We also set out to identify other unusual active solar system objects, such as active Centaurs, active quasi-Hilda asteroids, and Jupiter-family comets (JFCs). Active objects are rare in large part because they are difficult to identify, so we ask volunteers to assist us in searching for active bodies in our collection of millions of images of known minor planets. We produced these cutout images with our project pipeline that makes use of publicly available Dark Energy Camera (DECam) data. Since the project launch, roughly 8,300 volunteers have scrutinized some 430,000 images to great effect, which we describe in this work. In total we have identified previously unknown activity on 15 asteroids, plus one Centaur, that were thought to be asteroidal (i.e., inactive). Of the asteroids, we classify four as active quasi-Hilda asteroids, seven as JFCs, and four as active asteroids, consisting of one Main-belt comet (MBC) and three MBC candidates. We also include our findings concerning known active objects that our program facilitated, an unanticipated avenue of scientific discovery. These include discovering activity occurring during an orbital epoch for which objects were not known to be active, and the reclassification of objects based on our dynamical analyses.
△ Less
Submitted 14 March, 2024;
originally announced March 2024.
-
Selective probing of longitudinal and transverse plasmon modes with electron phase-matching
Authors:
Franck Aguilar,
Hugo Lourenço-Martins,
Damián Montero,
Xiaoyan Li,
Mathieu Kociak,
Alfredo Campos
Abstract:
The optical properties of metallic nanoparticles are dominated by localized surface plasmons (LSPs). Their properties only depend on the constituting material, the size and shape of the nano-object as well as its surrounding medium. In anisotropic structures, such as metallic nanorods, two families of modes generally exist, transverse and longitudinal. Their spectral and spatial overlaps usually i…
▽ More
The optical properties of metallic nanoparticles are dominated by localized surface plasmons (LSPs). Their properties only depend on the constituting material, the size and shape of the nano-object as well as its surrounding medium. In anisotropic structures, such as metallic nanorods, two families of modes generally exist, transverse and longitudinal. Their spectral and spatial overlaps usually impede their separate measurements in electron energy loss spectroscopy (EELS). In this work, we propose three different strategies enabling to overcome this difficulty and selectively probe longitudinal and transverse modes. The first strategy is numeric and relies on morphing of nano-structures, rooted in the geometrical nature of LSPs. The two other strategies exploit the relativistic and wave nature of the electrons in an EELS experiment. The first one is the phase-matching between the electron and the plasmon excitation to enhance their coupling by either tilting the sample and modifying the electron kinetic energy. The second one - polarized EELS (pEELS) - exploits the wave nature of electrons to mimic selection rules analogous to the one existing in light spectroscopies. The above-mentioned strategies are exemplified - either experimentally or numerically - on a canonical plasmonic toy model: the nano-rod. The goal of the paper is to bring together the state-of-the-art concepts of EELS for plasmonics to tackle a pedestrian problem in this field.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Dark Energy Survey Year 3 results: likelihood-free, simulation-based $w$CDM inference with neural compression of weak-lensing map statistics
Authors:
N. Jeffrey,
L. Whiteway,
M. Gatti,
J. Williamson,
J. Alsing,
A. Porredon,
J. Prat,
C. Doux,
B. Jain,
C. Chang,
T. -Y. Cheng,
T. Kacprzak,
P. Lemos,
A. Alarcon,
A. Amon,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
A. Campos,
A. Carnero Rosell,
R. Chen,
A. Choi,
J. DeRose,
A. Drlica-Wagner,
K. Eckert
, et al. (66 additional authors not shown)
Abstract:
We present simulation-based cosmological $w$CDM inference using Dark Energy Survey Year 3 weak-lensing maps, via neural data compression of weak-lensing map summary statistics: power spectra, peak counts, and direct map-level compression/inference with convolutional neural networks (CNN). Using simulation-based inference, also known as likelihood-free or implicit inference, we use forward-modelled…
▽ More
We present simulation-based cosmological $w$CDM inference using Dark Energy Survey Year 3 weak-lensing maps, via neural data compression of weak-lensing map summary statistics: power spectra, peak counts, and direct map-level compression/inference with convolutional neural networks (CNN). Using simulation-based inference, also known as likelihood-free or implicit inference, we use forward-modelled mock data to estimate posterior probability distributions of unknown parameters. This approach allows all statistical assumptions and uncertainties to be propagated through the forward-modelled mock data; these include sky masks, non-Gaussian shape noise, shape measurement bias, source galaxy clustering, photometric redshift uncertainty, intrinsic galaxy alignments, non-Gaussian density fields, neutrinos, and non-linear summary statistics. We include a series of tests to validate our inference results. This paper also describes the Gower Street simulation suite: 791 full-sky PKDGRAV dark matter simulations, with cosmological model parameters sampled with a mixed active-learning strategy, from which we construct over 3000 mock DES lensing data sets. For $w$CDM inference, for which we allow $-1<w<-\frac{1}{3}$, our most constraining result uses power spectra combined with map-level (CNN) inference. Using gravitational lensing data only, this map-level combination gives $Ω_{\rm m} = 0.283^{+0.020}_{-0.027}$, ${S_8 = 0.804^{+0.025}_{-0.017}}$, and $w < -0.80$ (with a 68 per cent credible interval); compared to the power spectrum inference, this is more than a factor of two improvement in dark energy parameter ($Ω_{\rm DE}, w$) precision.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Joint constraints from cosmic shear, galaxy-galaxy lensing and galaxy clustering: internal tension as an indicator of intrinsic alignment modelling error
Authors:
S. Samuroff,
A. Campos,
A. Porredon,
J. Blazek
Abstract:
In cosmological analyses it is common to combine different types of measurement from the same survey. In this paper we use simulated DES Y3 and LSST Y1 data to explore differences in sensitivity to intrinsic alignments (IA) between cosmic shear and galaxy-galaxy lensing. We generate mock shear, galaxy-galaxy lensing and galaxy clustering data, contaminated with a range of IA scenarios. Using a sim…
▽ More
In cosmological analyses it is common to combine different types of measurement from the same survey. In this paper we use simulated DES Y3 and LSST Y1 data to explore differences in sensitivity to intrinsic alignments (IA) between cosmic shear and galaxy-galaxy lensing. We generate mock shear, galaxy-galaxy lensing and galaxy clustering data, contaminated with a range of IA scenarios. Using a simple 2-parameter IA model (NLA) in a DES Y3 like analysis, we show that the galaxy-galaxy lensing + galaxy clustering combination ($2\times2$pt) is significantly more robust to IA mismodelling than cosmic shear. IA scenarios that produce up to $5σ$ biases for shear are seen to be unbiased at the level of $\sim1σ$ for $2\times2$pt. We demonstrate that this robustness can be largely attributed to the redshift separation in galaxy-galaxy lensing, which provides a cleaner separation of lensing and IA contributions. We identify secondary factors which may also contribute, including the possibility of cancellation of higher-order IA terms in $2\times2$pt and differences in sensitivity to physical scales. Unfortunately this does not typically correspond to equally effective self-calibration in a $3\times2$pt analysis of the same data, which can show significant biases driven by the cosmic shear part of the data vector. If we increase the precision of our mock analyses to a level roughly equivalent to LSST Y1, we find a similar pattern, with considerably more bias in a cosmic shear analysis than a $2\times2$pt one, and significant bias in a joint analysis of the two. Our findings suggest that IA model error can manifest itself as internal tension between $ξ_\pm$ and $γ_t + w$ data vectors. We thus propose that such tension (or the lack thereof) can be employed as a test of model sufficiency or insufficiency when choosing a fiducial IA model, alongside other data-driven methods.
△ Less
Submitted 20 May, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
Detailed Report on the Measurement of the Positive Muon Anomalous Magnetic Moment to 0.20 ppm
Authors:
D. P. Aguillard,
T. Albahri,
D. Allspach,
A. Anisenkov,
K. Badgley,
S. Baeßler,
I. Bailey,
L. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
E. Barzi,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
S. Braun,
M. Bressler,
G. Cantatore,
R. M. Carey,
B. C. K. Casey
, et al. (168 additional authors not shown)
Abstract:
We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference b…
▽ More
We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference between the muon spin precession frequency and its cyclotron frequency. This difference is normalized to the strength of the magnetic field, measured using Nuclear Magnetic Resonance (NMR). The ratio is then corrected for small contributions from beam motion, beam dispersion, and transient magnetic fields. We measure $a_μ= 116 592 057 (25) \times 10^{-11}$ (0.21 ppm). This is the world's most precise measurement of this quantity and represents a factor of $2.2$ improvement over our previous result based on the 2018 dataset. In combination, the two datasets yield $a_μ(\text{FNAL}) = 116 592 055 (24) \times 10^{-11}$ (0.20 ppm). Combining this with the measurements from Brookhaven National Laboratory for both positive and negative muons, the new world average is $a_μ$(exp) $ = 116 592 059 (22) \times 10^{-11}$ (0.19 ppm).
△ Less
Submitted 22 May, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
The SRG/eROSITA All-Sky Survey: Dark Energy Survey Year 3 Weak Gravitational Lensing by eRASS1 selected Galaxy Clusters
Authors:
S. Grandis,
V. Ghirardini,
S. Bocquet,
C. Garrel,
J. J. Mohr,
A. Liu,
M. Kluge,
L. Kimmig,
T. H. Reiprich,
A. Alarcon,
A. Amon,
E. Artis,
Y. E. Bahar,
F. Balzer,
K. Bechtol,
M. R. Becker,
G. Bernstein,
E. Bulbul,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind,
R. Cawthon,
C. Chang,
R. Chen,
I. Chiu
, et al. (97 additional authors not shown)
Abstract:
Number counts of galaxy clusters across redshift are a powerful cosmological probe, if a precise and accurate reconstruction of the underlying mass distribution is performed -- a challenge called mass calibration. With the advent of wide and deep photometric surveys, weak gravitational lensing by clusters has become the method of choice to perform this measurement. We measure and validate the weak…
▽ More
Number counts of galaxy clusters across redshift are a powerful cosmological probe, if a precise and accurate reconstruction of the underlying mass distribution is performed -- a challenge called mass calibration. With the advent of wide and deep photometric surveys, weak gravitational lensing by clusters has become the method of choice to perform this measurement. We measure and validate the weak gravitational lensing (WL) signature in the shape of galaxies observed in the first 3 years of the DES Y3 caused by galaxy clusters selected in the first all-sky survey performed by SRG/eROSITA. These data are then used to determine the scaling between X-ray photon count rate of the clusters and their halo mass and redshift. We empirically determine the degree of cluster member contamination in our background source sample. The individual cluster shear profiles are then analysed with a Bayesian population model that self-consistently accounts for the lens sample selection and contamination, and includes marginalization over a host of instrumental and astrophysical systematics. To quantify the accuracy of the mass extraction of that model, we perform mass measurements on mock cluster catalogs with realistic synthetic shear profiles. This allows us to establish that hydro-dynamical modelling uncertainties at low lens redshifts ($z<0.6$) are the dominant systematic limitation. At high lens redshift the uncertainties of the sources' photometric redshift calibration dominate. With regard to the X-ray count rate to halo mass relation, we constrain all its parameters. This work sets the stage for a joint analysis with the number counts of eRASS1 clusters to constrain a host of cosmological parameters. We demonstrate that WL mass calibration of galaxy clusters can be performed successfully with source galaxies whose calibration was performed primarily for cosmic shear experiments.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Covering Distributions
Authors:
Alberto M. Campos,
Augusto Teixeira
Abstract:
In this article, we study a covering process of the discrete one-dimensional torus that uses connected arcs of random sizes in the covering. More precisely, fix a distribution μon \mathbb{N}, and for every n\geq 1 we will cover the torus \mathbb{Z}/n\mathbb{Z} as follows: at each time step, we place an arc with a length distributed as μand a uniform starting point. Eventually, the space will be co…
▽ More
In this article, we study a covering process of the discrete one-dimensional torus that uses connected arcs of random sizes in the covering. More precisely, fix a distribution μon \mathbb{N}, and for every n\geq 1 we will cover the torus \mathbb{Z}/n\mathbb{Z} as follows: at each time step, we place an arc with a length distributed as μand a uniform starting point. Eventually, the space will be covered entirely by these arcs. Changing the arc length distribution μcan potentially change the limiting behavior of the covering time. Here, we expose four distinct phases for the fluctuations of the cover time in the limit. These phases can be informally described as the Gumbel phase, the compactly support phase, the pre-exponential phase, and the exponential phase. Furthermore, we expose a continuous-time cover process that works as a limit distribution within the compactly support phase.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
The Dark Energy Survey: Cosmology Results With ~1500 New High-redshift Type Ia Supernovae Using The Full 5-year Dataset
Authors:
DES Collaboration,
T. M. C. Abbott,
M. Acevedo,
M. Aguena,
A. Alarcon,
S. Allam,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
J. Annis,
P. Armstrong,
J. Asorey,
S. Avila,
D. Bacon,
B. A. Bassett,
K. Bechtol,
P. H. Bernardinelli,
G. M. Bernstein,
E. Bertin,
J. Blazek,
S. Bocquet,
D. Brooks,
D. Brout,
E. Buckley-Geer,
D. L. Burke
, et al. (134 additional authors not shown)
Abstract:
We present cosmological constraints from the sample of Type Ia supernovae (SN Ia) discovered during the full five years of the Dark Energy Survey (DES) Supernova Program. In contrast to most previous cosmological samples, in which SN are classified based on their spectra, we classify the DES SNe using a machine learning algorithm applied to their light curves in four photometric bands. Spectroscop…
▽ More
We present cosmological constraints from the sample of Type Ia supernovae (SN Ia) discovered during the full five years of the Dark Energy Survey (DES) Supernova Program. In contrast to most previous cosmological samples, in which SN are classified based on their spectra, we classify the DES SNe using a machine learning algorithm applied to their light curves in four photometric bands. Spectroscopic redshifts are acquired from a dedicated follow-up survey of the host galaxies. After accounting for the likelihood of each SN being a SN Ia, we find 1635 DES SNe in the redshift range $0.10<z<1.13$ that pass quality selection criteria sufficient to constrain cosmological parameters. This quintuples the number of high-quality $z>0.5$ SNe compared to the previous leading compilation of Pantheon+, and results in the tightest cosmological constraints achieved by any SN data set to date. To derive cosmological constraints we combine the DES supernova data with a high-quality external low-redshift sample consisting of 194 SNe Ia spanning $0.025<z<0.10$. Using SN data alone and including systematic uncertainties we find $Ω_{\rm M}=0.352\pm 0.017$ in flat $Λ$CDM. Supernova data alone now require acceleration ($q_0<0$ in $Λ$CDM) with over $5σ$ confidence. We find $(Ω_{\rm M},w)=(0.264^{+0.074}_{-0.096},-0.80^{+0.14}_{-0.16})$ in flat $w$CDM. For flat $w_0w_a$CDM, we find $(Ω_{\rm M},w_0,w_a)=(0.495^{+0.033}_{-0.043},-0.36^{+0.36}_{-0.30},-8.8^{+3.7}_{-4.5})$. Including Planck CMB data, SDSS BAO data, and DES $3\times2$-point data gives $(Ω_{\rm M},w)=(0.321\pm0.007,-0.941\pm0.026)$. In all cases dark energy is consistent with a cosmological constant to within $\sim2σ$. In our analysis, systematic errors on cosmological parameters are subdominant compared to statistical errors; paving the way for future photometrically classified supernova analyses.
△ Less
Submitted 6 June, 2024; v1 submitted 5 January, 2024;
originally announced January 2024.
-
SPT Clusters with DES and HST Weak Lensing. II. Cosmological Constraints from the Abundance of Massive Halos
Authors:
S. Bocquet,
S. Grandis,
L. E. Bleem,
M. Klein,
J. J. Mohr,
T. Schrabback,
T. M. C. Abbott,
P. A. R. Ade,
M. Aguena,
A. Alarcon,
S. Allam,
S. W. Allen,
O. Alves,
A. Amon,
A. J. Anderson,
J. Annis,
B. Ansarinejad,
J. E. Austermann,
S. Avila,
D. Bacon,
M. Bayliss,
J. A. Beall,
K. Bechtol,
M. R. Becker,
A. N. Bender
, et al. (171 additional authors not shown)
Abstract:
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d…
▽ More
We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d surveys, and comprises 1,005 confirmed clusters in the redshift range $0.25-1.78$ over a total sky area of 5,200 deg$^2$. We use DES Year 3 weak-lensing data for 688 clusters with redshifts $z<0.95$ and HST weak-lensing data for 39 clusters with $0.6<z<1.7$. The weak-lensing measurements enable robust mass measurements of sample clusters and allow us to empirically constrain the SZ observable--mass relation. For a flat $Λ$CDM cosmology, and marginalizing over the sum of massive neutrinos, we measure $Ω_\mathrm{m}=0.286\pm0.032$, $σ_8=0.817\pm0.026$, and the parameter combination $σ_8\,(Ω_\mathrm{m}/0.3)^{0.25}=0.805\pm0.016$. Our measurement of $S_8\equivσ_8\,\sqrt{Ω_\mathrm{m}/0.3}=0.795\pm0.029$ and the constraint from Planck CMB anisotropies (2018 TT,TE,EE+lowE) differ by $1.1σ$. In combination with that Planck dataset, we place a 95% upper limit on the sum of neutrino masses $\sum m_ν<0.18$ eV. When additionally allowing the dark energy equation of state parameter $w$ to vary, we obtain $w=-1.45\pm0.31$ from our cluster-based analysis. In combination with Planck data, we measure $w=-1.34^{+0.22}_{-0.15}$, or a $2.2σ$ difference with a cosmological constant. We use the cluster abundance to measure $σ_8$ in five redshift bins between 0.25 and 1.8, and we find the results to be consistent with structure growth as predicted by the $Λ$CDM model fit to Planck primary CMB data.
△ Less
Submitted 21 June, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
Weak Coupling Regime in Dilatonic f(R,T) Cosmology
Authors:
F. A. Brito,
C. H. A. B. Borges,
J. A. V. Campos,
F. G. Costa
Abstract:
We consider $f(R,T)$ modified theories of gravity in the context of string theory inspired dilaton gravity. We deal with a specific model that under certain conditions describes the late time Universe in accord with observational data in modern cosmology and addresses the $H_0$ tension. This is done by exploring the space of parameters made out of those coming from the modified gravity and dilaton…
▽ More
We consider $f(R,T)$ modified theories of gravity in the context of string theory inspired dilaton gravity. We deal with a specific model that under certain conditions describes the late time Universe in accord with observational data in modern cosmology and addresses the $H_0$ tension. This is done by exploring the space of parameters made out of those coming from the modified gravity and dilatonic charge sectors. We employ numerical methods to obtain several important observable quantities.
△ Less
Submitted 8 March, 2024; v1 submitted 22 December, 2023;
originally announced December 2023.
-
Absorption, scattering, quasinormal modes and shadow by canonical acoustic black holes in Lorentz-violating background
Authors:
J. A. V. Campos,
M. A. Anacleto,
F. A. Brito,
E. Passos
Abstract:
In the present work, we study the scattering for a black hole described by the canonical acoustic metric with Lorentz violation using asymptotic and numerical methods. In this scenario, we also check the effects of quasinormal modes and the acoustic shadow radius. In the eikonal limit the relationship between the shadow radius and the real part of the quasinormal frequency is preserved.
In the present work, we study the scattering for a black hole described by the canonical acoustic metric with Lorentz violation using asymptotic and numerical methods. In this scenario, we also check the effects of quasinormal modes and the acoustic shadow radius. In the eikonal limit the relationship between the shadow radius and the real part of the quasinormal frequency is preserved.
△ Less
Submitted 10 June, 2024; v1 submitted 21 December, 2023;
originally announced December 2023.
-
Dark Energy Survey Deep Field photometric redshift performance and training incompleteness assessment
Authors:
L. Toribio San Cipriano,
J. De Vicente,
I. Sevilla-Noarbe,
W. G. Hartley,
J. Myles,
A. Amon,
G. M. Bernstein,
A. Choi,
K. Eckert,
R. A. Gruendl,
I. Harrison,
E. Sheldon,
B. Yanny,
M. Aguena,
S. S. Allam,
O. Alves,
D. Bacon,
D. Brooks,
A. Campos,
A. Carnero Rosell,
J. Carretero,
F. J. Castander,
C. Conselice,
L. N. da Costa,
M. E. S. Pereira
, et al. (33 additional authors not shown)
Abstract:
Context. The determination of accurate photometric redshifts (photo-zs) in large imaging galaxy surveys is key for cosmological studies. One of the most common approaches are machine learning techniques. These methods require a spectroscopic or reference sample to train the algorithms. Attention has to be paid to the quality and properties of these samples since they are key factors in the estimat…
▽ More
Context. The determination of accurate photometric redshifts (photo-zs) in large imaging galaxy surveys is key for cosmological studies. One of the most common approaches are machine learning techniques. These methods require a spectroscopic or reference sample to train the algorithms. Attention has to be paid to the quality and properties of these samples since they are key factors in the estimation of reliable photo-zs. Aims. The goal of this work is to calculate the photo-zs for the Y3 DES Deep Fields catalogue using the DNF machine learning algorithm. Moreover, we want to develop techniques to assess the incompleteness of the training sample and metrics to study how incompleteness affects the quality of photometric redshifts. Finally, we are interested in comparing the performance obtained with respect to the EAzY template fitting approach on Y3 DES Deep Fields catalogue. Methods. We have emulated -- at brighter magnitude -- the training incompleteness with a spectroscopic sample whose redshifts are known to have a measurable view of the problem. We have used a principal component analysis to graphically assess incompleteness and to relate it with the performance parameters provided by DNF. Finally, we have applied the results about the incompleteness to the photo-z computation on Y3 DES Deep Fields with DNF and estimated its performance. Results. The photo-zs for the galaxies on DES Deep Fields have been computed with the DNF algorithm and added to the Y3 DES Deep Fields catalogue. They are available at https://des.ncsa.illinois.edu/releases/y3a2/Y3deepfields. Some techniques have been developed to evaluate the performance in the absence of "true" redshift and to assess completeness. We have studied... (Partial abstract)
△ Less
Submitted 26 February, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
Random walk in a rotational environment
Authors:
Alberto M. Campos,
Tarcísio P. R. Campos
Abstract:
We define a random walk of a particle in $\mathbb{R}^3$ where the space is rotating. The particle is not glued to the space and will collide with it at random times, resulting in changes in its velocity and direction. After many collisions, the random walk starts to have some asymptotic behaviors inherited from the movement of space. The paper will find the limit movement of the particle, and expl…
▽ More
We define a random walk of a particle in $\mathbb{R}^3$ where the space is rotating. The particle is not glued to the space and will collide with it at random times, resulting in changes in its velocity and direction. After many collisions, the random walk starts to have some asymptotic behaviors inherited from the movement of space. The paper will find the limit movement of the particle, and explain how the randomness of the random walk gives rise to the particle asymptotic deterministic movement.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark
Authors:
Oscar Sainz,
Jon Ander Campos,
Iker García-Ferrero,
Julen Etxaniz,
Oier Lopez de Lacalle,
Eneko Agirre
Abstract:
In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble. The worst kind of data contamination happens when a Large Language Model (LLM) is trained on the test split of a benchmark, and then evaluated in the same benchmark. The extent of the problem is unknown, as it is not straightforward to measure. Contami…
▽ More
In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble. The worst kind of data contamination happens when a Large Language Model (LLM) is trained on the test split of a benchmark, and then evaluated in the same benchmark. The extent of the problem is unknown, as it is not straightforward to measure. Contamination causes an overestimation of the performance of a contaminated model in a target benchmark and associated task with respect to their non-contaminated counterparts. The consequences can be very harmful, with wrong scientific conclusions being published while other correct ones are discarded. This position paper defines different levels of data contamination and argues for a community effort, including the development of automatic and semi-automatic measures to detect when data from a benchmark was exposed to a model, and suggestions for flagging papers with conclusions that are compromised by data contamination.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
Dark Energy Survey Year 3 results: simulation-based cosmological inference with wavelet harmonics, scattering transforms, and moments of weak lensing mass maps I: validation on simulations
Authors:
M. Gatti,
N. Jeffrey,
L. Whiteway,
J. Williamson,
B. Jain,
V. Ajani,
D. Anbajagane,
G. Giannini,
C. Zhou,
A. Porredon,
J. Prat,
M. Yamamoto,
J. Blazek,
T. Kacprzak,
S. Samuroff,
A. Alarcon,
A. Amon,
K. Bechtol,
M. Becker,
G. Bernstein,
A. Campos,
C. Chang,
R. Chen,
A. Choi,
C. Davis
, et al. (76 additional authors not shown)
Abstract:
Beyond-two-point statistics contain additional information on cosmological as well as astrophysical and observational (systematics) parameters. In this methodology paper we provide an end-to-end simulation-based analysis of a set of Gaussian and non-Gaussian weak lensing statistics using detailed mock catalogues of the Dark Energy Survey. We implement: 1) second and third moments; 2) wavelet phase…
▽ More
Beyond-two-point statistics contain additional information on cosmological as well as astrophysical and observational (systematics) parameters. In this methodology paper we provide an end-to-end simulation-based analysis of a set of Gaussian and non-Gaussian weak lensing statistics using detailed mock catalogues of the Dark Energy Survey. We implement: 1) second and third moments; 2) wavelet phase harmonics (WPH); 3) the scattering transform (ST). Our analysis is fully based on simulations, it spans a space of seven $νw$CDM cosmological parameters, and it forward models the most relevant sources of systematics of the data (masks, noise variations, clustering of the sources, intrinsic alignments, and shear and redshift calibration). We implement a neural network compression of the summary statistics, and we estimate the parameter posteriors using a likelihood-free-inference approach. We validate the pipeline extensively, and we find that WPH exhibits the strongest performance when combined with second moments, followed by ST. and then by third moments. The combination of all the different statistics further enhances constraints with respect to second moments, up to 25 per cent, 15 per cent, and 90 per cent for $S_8$, $Ω_{\rm m}$, and the Figure-Of-Merit ${\rm FoM_{S_8,Ω_{\rm m}}}$, respectively. We further find that non-Gaussian statistics improve constraints on $w$ and on the amplitude of intrinsic alignment with respect to second moments constraints. The methodological advances presented here are suitable for application to Stage IV surveys from Euclid, Rubin-LSST, and Roman with additional validation on mock catalogues for each survey. In a companion paper we present an application to DES Year 3 data.
△ Less
Submitted 4 November, 2023; v1 submitted 26 October, 2023;
originally announced October 2023.
-
SPT Clusters with DES and HST Weak Lensing. I. Cluster Lensing and Bayesian Population Modeling of Multi-Wavelength Cluster Datasets
Authors:
S. Bocquet,
S. Grandis,
L. E. Bleem,
M. Klein,
J. J. Mohr,
M. Aguena,
A. Alarcon,
S. Allam,
S. W. Allen,
O. Alves,
A. Amon,
B. Ansarinejad,
D. Bacon,
M. Bayliss,
K. Bechtol,
M. R. Becker,
B. A. Benson,
G. M. Bernstein,
M. Brodwin,
D. Brooks,
A. Campos,
R. E. A. Canning,
J. E. Carlstrom,
A. Carnero Rosell,
M. Carrasco Kind
, et al. (108 additional authors not shown)
Abstract:
We present a Bayesian population modeling method to analyze the abundance of galaxy clusters identified by the South Pole Telescope (SPT) with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). We discuss and validate the modeling choices with a particular focus on a robust, weak-lensing-based mass calibrati…
▽ More
We present a Bayesian population modeling method to analyze the abundance of galaxy clusters identified by the South Pole Telescope (SPT) with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). We discuss and validate the modeling choices with a particular focus on a robust, weak-lensing-based mass calibration using DES data. For the DES Year 3 data, we report a systematic uncertainty in weak-lensing mass calibration that increases from 1% at $z=0.25$ to 10% at $z=0.95$, to which we add 2% in quadrature to account for uncertainties in the impact of baryonic effects. We implement an analysis pipeline that joins the cluster abundance likelihood with a multi-observable likelihood for the Sunyaev-Zel'dovich effect, optical richness, and weak-lensing measurements for each individual cluster. We validate that our analysis pipeline can recover unbiased cosmological constraints by analyzing mocks that closely resemble the cluster sample extracted from the SPT-SZ, SPTpol ECS, and SPTpol 500d surveys and the DES Year 3 and HST-39 weak-lensing datasets. This work represents a crucial prerequisite for the subsequent cosmological analysis of the real dataset.
△ Less
Submitted 21 June, 2024; v1 submitted 18 October, 2023;
originally announced October 2023.
-
Unsupervised Domain Adaption for Neural Information Retrieval
Authors:
Carlos Dominguez,
Jon Ander Campos,
Eneko Agirre,
Gorka Azkune
Abstract:
Neural information retrieval requires costly annotated data for each target domain to be competitive. Synthetic annotation by query generation using Large Language Models or rule-based string manipulation has been proposed as an alternative, but their relative merits have not been analysed. In this paper, we compare both methods head-to-head using the same neural IR architecture. We focus on the B…
▽ More
Neural information retrieval requires costly annotated data for each target domain to be competitive. Synthetic annotation by query generation using Large Language Models or rule-based string manipulation has been proposed as an alternative, but their relative merits have not been analysed. In this paper, we compare both methods head-to-head using the same neural IR architecture. We focus on the BEIR benchmark, which includes test datasets from several domains with no training data, and explore two scenarios: zero-shot, where the supervised system is trained in a large out-of-domain dataset (MS-MARCO); and unsupervised domain adaptation, where, in addition to MS-MARCO, the system is fine-tuned in synthetic data from the target domain. Our results indicate that Large Language Models outperform rule-based methods in all scenarios by a large margin, and, more importantly, that unsupervised domain adaptation is effective compared to applying a supervised IR system in a zero-shot fashion. In addition we explore several sizes of open Large Language Models to generate synthetic data and find that a medium-sized model suffices. Code and models are publicly available for reproducibility.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Nonspreading relativistic electron wavepacket in a strong laser field
Authors:
Andre G. Campos,
Karen Z. Hatsagortsyan,
Christoph H. Keitel
Abstract:
A solution of the Dirac equation in a strong laser field presenting a nonspreading wave packet in the rest frame of the electron is derived. It consists of a generalization of the self-accelerating free electron wave packet [Kaminer et al. Nature Phys. 11, 261 (2015)] to the case with the background of a strong laser field. Built upon the notion of nonspreading for an extended relativistic wavepac…
▽ More
A solution of the Dirac equation in a strong laser field presenting a nonspreading wave packet in the rest frame of the electron is derived. It consists of a generalization of the self-accelerating free electron wave packet [Kaminer et al. Nature Phys. 11, 261 (2015)] to the case with the background of a strong laser field. Built upon the notion of nonspreading for an extended relativistic wavepacket, the concept of Born rigidity for accelerated motion in relativity is the key ingredient of the solution. At its core, the solution comes from the connection between the self-accelerated free electron wave packet and the eigenstate of a Dirac electron in a constant and homogeneous gravitational field via the equivalence principle. The solution is an essential step towards the realization of the laser-driven relativistic collider [Meuren et al. PRL 114, 143201 (2015)], where the large spreading of a common Gaussian wave packet during the excursion in a strong laser field strongly limits the expectable yields.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
Federated Self-Supervised Learning of Monocular Depth Estimators for Autonomous Vehicles
Authors:
Elton F. de S. Soares,
Carlos Alberto V. Campos
Abstract:
Image-based depth estimation has gained significant attention in recent research on computer vision for autonomous vehicles in intelligent transportation systems. This focus stems from its cost-effectiveness and wide range of potential applications. Unlike binocular depth estimation methods that require two fixed cameras, monocular depth estimation methods only rely on a single camera, making them…
▽ More
Image-based depth estimation has gained significant attention in recent research on computer vision for autonomous vehicles in intelligent transportation systems. This focus stems from its cost-effectiveness and wide range of potential applications. Unlike binocular depth estimation methods that require two fixed cameras, monocular depth estimation methods only rely on a single camera, making them highly versatile. While state-of-the-art approaches for this task leverage self-supervised learning of deep neural networks in conjunction with tasks like pose estimation and semantic segmentation, none of them have explored the combination of federated learning and self-supervision to train models using unlabeled and private data captured by autonomous vehicles. The utilization of federated learning offers notable benefits, including enhanced privacy protection, reduced network consumption, and improved resilience to connectivity issues. To address this gap, we propose FedSCDepth, a novel method that combines federated learning and deep self-supervision to enable the learning of monocular depth estimators with comparable effectiveness and superior efficiency compared to the current state-of-the-art methods. Our evaluation experiments conducted on Eigen's Split of the KITTI dataset demonstrate that our proposed method achieves near state-of-the-art performance, with a test loss below 0.13 and requiring, on average, only 1.5k training steps and up to 0.415 GB of weight data transfer per autonomous vehicle on each round.
△ Less
Submitted 7 October, 2023;
originally announced October 2023.
-
Rate-Induced Transitions in Networked Complex Adaptive Systems: Exploring Dynamics and Management Implications Across Ecological, Social, and Socioecological Systems
Authors:
Vítor V. Vasconcelos,
Flávia M. D. Marquitti,
Theresa Ong,
Lisa C. McManus,
Marcus Aguiar,
Amanda B. Campos,
Partha S. Dutta,
Kristen Jovanelly,
Victoria Junquera,
Jude Kong,
Elisabeth H. Krueger,
Simon A. Levin,
Wenying Liao,
Mingzhen Lu,
Dhruv Mittal,
Mercedes Pascual,
Flávio L. Pinheiro,
Juan Rocha,
Fernando P. Santos,
Peter Sloot,
Chenyang,
Su,
Benton Taylor,
Eden Tekwa,
Sjoerd Terpstra
, et al. (5 additional authors not shown)
Abstract:
Complex adaptive systems (CASs), from ecosystems to economies, are open systems and inherently dependent on external conditions. While a system can transition from one state to another based on the magnitude of change in external conditions, the rate of change -- irrespective of magnitude -- may also lead to system state changes due to a phenomenon known as a rate-induced transition (RIT). This st…
▽ More
Complex adaptive systems (CASs), from ecosystems to economies, are open systems and inherently dependent on external conditions. While a system can transition from one state to another based on the magnitude of change in external conditions, the rate of change -- irrespective of magnitude -- may also lead to system state changes due to a phenomenon known as a rate-induced transition (RIT). This study presents a novel framework that captures RITs in CASs through a local model and a network extension where each node contributes to the structural adaptability of others. Our findings reveal how RITs occur at a critical environmental change rate, with lower-degree nodes tipping first due to fewer connections and reduced adaptive capacity. High-degree nodes tip later as their adaptability sources (lower-degree nodes) collapse. This pattern persists across various network structures. Our study calls for an extended perspective when managing CASs, emphasizing the need to focus not only on thresholds of external conditions but also the rate at which those conditions change, particularly in the context of the collapse of surrounding systems that contribute to the focal system's resilience. Our analytical method opens a path to designing management policies that mitigate RIT impacts and enhance resilience in ecological, social, and socioecological systems. These policies could include controlling environmental change rates, fostering system adaptability, implementing adaptive management strategies, and building capacity and knowledge exchange. Our study contributes to the understanding of RIT dynamics and informs effective management strategies for complex adaptive systems in the face of rapid environmental change.
△ Less
Submitted 14 September, 2023;
originally announced September 2023.
-
Evolving division of labor in a response threshold model
Authors:
José F. Fontanari,
Viviane M. de Oliveira,
Paulo R. A. Campos
Abstract:
The response threshold model explains the emergence of division of labor (i.e., task specialization) in an unstructured population by assuming that the individuals have different propensities to work on different tasks. The incentive to attend to a particular task increases when the task is left unattended and decreases when individuals work on it. Here we derive mean-field equations for the stimu…
▽ More
The response threshold model explains the emergence of division of labor (i.e., task specialization) in an unstructured population by assuming that the individuals have different propensities to work on different tasks. The incentive to attend to a particular task increases when the task is left unattended and decreases when individuals work on it. Here we derive mean-field equations for the stimulus dynamics and show that they exhibit complex attractors through period-doubling bifurcation cascades when the noise disrupting the thresholds is small. In addition, we show how the fixed threshold can be set to ensure specialization in both the transient and equilibrium regimes of the stimulus dynamics. However, a complete explanation of the emergence of division of labor requires that we address the question of where the threshold variation comes from, starting from a homogeneous population. We then study a structured population scenario, where the population is divided into a large number of independent groups of equal size, and the fitness of a group is proportional to the weighted mean work performed on the tasks during a fixed period of time. Using a winner-take-all strategy to model group competition and assuming an initial homogeneous metapopulation, we find that a substantial fraction of workers specialize in each task, without the need to penalize task switching.
△ Less
Submitted 14 August, 2023;
originally announced August 2023.
-
Measurement of the Positive Muon Anomalous Magnetic Moment to 0.20 ppm
Authors:
D. P. Aguillard,
T. Albahri,
D. Allspach,
A. Anisenkov,
K. Badgley,
S. Baeßler,
I. Bailey,
L. Bailey,
V. A. Baranov,
E. Barlas-Yucel,
T. Barrett,
E. Barzi,
F. Bedeschi,
M. Berz,
M. Bhattacharya,
H. P. Binney,
P. Bloom,
J. Bono,
E. Bottalico,
T. Bowcock,
S. Braun,
M. Bressler,
G. Cantatore,
R. M. Carey,
B. C. K. Casey
, et al. (166 additional authors not shown)
Abstract:
We present a new measurement of the positive muon magnetic anomaly, $a_μ\equiv (g_μ- 2)/2$, from the Fermilab Muon $g\!-\!2$ Experiment using data collected in 2019 and 2020. We have analyzed more than 4 times the number of positrons from muon decay than in our previous result from 2018 data. The systematic error is reduced by more than a factor of 2 due to better running conditions, a more stable…
▽ More
We present a new measurement of the positive muon magnetic anomaly, $a_μ\equiv (g_μ- 2)/2$, from the Fermilab Muon $g\!-\!2$ Experiment using data collected in 2019 and 2020. We have analyzed more than 4 times the number of positrons from muon decay than in our previous result from 2018 data. The systematic error is reduced by more than a factor of 2 due to better running conditions, a more stable beam, and improved knowledge of the magnetic field weighted by the muon distribution, $\tildeω'^{}_p$, and of the anomalous precession frequency corrected for beam dynamics effects, $ω_a$. From the ratio $ω_a / \tildeω'^{}_p$, together with precisely determined external parameters, we determine $a_μ= 116\,592\,057(25) \times 10^{-11}$ (0.21 ppm). Combining this result with our previous result from the 2018 data, we obtain $a_μ\text{(FNAL)} = 116\,592\,055(24) \times 10^{-11}$ (0.20 ppm). The new experimental world average is $a_μ(\text{Exp}) = 116\,592\,059(22)\times 10^{-11}$ (0.19 ppm), which represents a factor of 2 improvement in precision.
△ Less
Submitted 4 October, 2023; v1 submitted 11 August, 2023;
originally announced August 2023.
-
Beyond the 3rd moment: A practical study of using lensing convergence CDFs for cosmology with DES Y3
Authors:
D. Anbajagane,
C. Chang,
A. Banerjee,
T. Abel,
M. Gatti,
V. Ajani,
A. Alarcon,
A. Amon,
E. J. Baxter,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind,
R. Chen,
A. Choi,
C. Davis,
J. DeRose,
H. T. Diehl,
S. Dodelson,
C. Doux,
A. Drlica-Wagner,
K. Eckert,
J. Elvin-Poole
, et al. (73 additional authors not shown)
Abstract:
Widefield surveys of the sky probe many clustered scalar fields -- such as galaxy counts, lensing potential, gas pressure, etc. -- that are sensitive to different cosmological and astrophysical processes. Our ability to constrain such processes from these fields depends crucially on the statistics chosen to summarize the field. In this work, we explore the cumulative distribution function (CDF) at…
▽ More
Widefield surveys of the sky probe many clustered scalar fields -- such as galaxy counts, lensing potential, gas pressure, etc. -- that are sensitive to different cosmological and astrophysical processes. Our ability to constrain such processes from these fields depends crucially on the statistics chosen to summarize the field. In this work, we explore the cumulative distribution function (CDF) at multiple scales as a summary of the galaxy lensing convergence field. Using a suite of N-body lightcone simulations, we show the CDFs' constraining power is modestly better than that of the 2nd and 3rd moments of the field, as they approximately capture the information from all moments of the field in a concise data vector. We then study the practical aspects of applying the CDFs to observational data, using the first three years of the Dark Energy Survey (DES Y3) data as an example, and compute the impact of different systematics on the CDFs. The contributions from the point spread function are 2-3 orders of magnitude below the cosmological signal, while those from reduced shear approximation contribute $\lesssim 1\%$ to the signal. Source clustering effects and baryon imprints contribute $1-10\%$. Enforcing scale cuts to limit systematics-driven biases in parameter constraints degrades these constraints a noticeable amount, and this degradation is similar for the CDFs and the moments. We also detect correlations between the observed convergence field and the shape noise field at $13σ$. We find that the non-Gaussian correlations in the noise field must be modeled accurately to use the CDFs, or other statistics sensitive to all moments, as a rigorous cosmology tool.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
Detection of the significant impact of source clustering on higher-order statistics with DES Year 3 weak gravitational lensing data
Authors:
M. Gatti,
N. Jeffrey,
L. Whiteway,
V. Ajani,
T. Kacprzak,
D. Zürcher,
C. Chang,
B. Jain,
J. Blazek,
E. Krause,
A. Alarcon,
A. Amon,
K. Bechtol,
M. Becker,
G. Bernstein,
A. Campos,
R. Chen,
A. Choi,
C. Davis,
J. Derose,
H. T. Diehl,
S. Dodelson,
C. Doux,
K. Eckert,
J. Elvin-Poole
, et al. (76 additional authors not shown)
Abstract:
We demonstrate and measure the impact of source galaxy clustering on higher-order summary statistics of weak gravitational lensing data. By comparing simulated data with galaxies that either trace or do not trace the underlying density field, we show this effect can exceed measurement uncertainties for common higher-order statistics for certain analysis choices. Source clustering effects are large…
▽ More
We demonstrate and measure the impact of source galaxy clustering on higher-order summary statistics of weak gravitational lensing data. By comparing simulated data with galaxies that either trace or do not trace the underlying density field, we show this effect can exceed measurement uncertainties for common higher-order statistics for certain analysis choices. Source clustering effects are larger at small scales and for statistics applied to combinations of low and high redshift samples, and diminish at high redshift. We evaluate the impact on different weak lensing observables, finding that third moments and wavelet phase harmonics are more affected than peak count statistics. Using Dark Energy Survey Year 3 data we construct null tests for the source-clustering-free case, finding a $p$-value of $p=4\times10^{-3}$ (2.6 $σ$) using third-order map moments and $p=3\times10^{-11}$ (6.5 $σ$) using wavelet phase harmonics. The impact of source clustering on cosmological inference can be either be included in the model or minimized through \textit{ad-hoc} procedures (e.g. scale cuts). We verify that the procedures adopted in existing DES Y3 cosmological analyses (using map moments and peaks) were sufficient to render this effect negligible. Failing to account for source clustering can significantly impact cosmological inference from higher-order gravitational lensing statistics, e.g. higher-order N-point functions, wavelet-moment observables (including phase harmonics and scattering transforms), and deep learning or field level summary statistics of weak lensing maps. We provide recipes both to minimise the impact of source clustering and to incorporate source clustering effects into forward-modelled mock data.
△ Less
Submitted 27 July, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Scattering and absorption by extra-dimensional black holes with GUP
Authors:
M. A. Anacleto,
J. A. V. Campos,
F. A. Brito,
E. Maciel,
E. Passos
Abstract:
In this paper, we consider the Schwarzschild-Tangherlini black hole to investigate the process of scalar wave scattering by the black hole in a spacetime of (d + 1) dimensions and also with the generalized uncertainty principle (GUP). In this scenario, we analytically determine the phase shift and explore the effect of extra dimensions by calculating the differential scattering and absorption cros…
▽ More
In this paper, we consider the Schwarzschild-Tangherlini black hole to investigate the process of scalar wave scattering by the black hole in a spacetime of (d + 1) dimensions and also with the generalized uncertainty principle (GUP). In this scenario, we analytically determine the phase shift and explore the effect of extra dimensions by calculating the differential scattering and absorption cross-section by applying the partial wave method at low and high-frequency limits. We show at high dimensions that the absorption is not zero as the mass parameter approaches zero.
△ Less
Submitted 18 July, 2023;
originally announced July 2023.
-
Decoherence-Free Entropic Gravity for Dirac Fermion
Authors:
Eric J. Sung,
Andre G. Campos,
Hartmut Abele,
Denys I. Bondar
Abstract:
The theory of entropic gravity conjectures that gravity emerges thermodynamically rather than being a fundamental force. One of the main criticisms of entropic gravity is that it would lead to quantum massive particles losing coherence in free fall, which is not observed experimentally. This criticism was refuted in [Phys. Rev. Res. 3, 033065 (2021)], where a nonrelativistic master equation modeli…
▽ More
The theory of entropic gravity conjectures that gravity emerges thermodynamically rather than being a fundamental force. One of the main criticisms of entropic gravity is that it would lead to quantum massive particles losing coherence in free fall, which is not observed experimentally. This criticism was refuted in [Phys. Rev. Res. 3, 033065 (2021)], where a nonrelativistic master equation modeling gravity as an open quantum system interaction demonstrated that in the strong coupling limit, coherence could be maintained and reproduce conventional free-fall dynamics. Moreover, the nonrelativistic master equation was shown to be fully compatible with the qBounce experiment for ultracold neutrons. Motivated by this, we extend these results to gravitationally accelerating Dirac fermions. We achieve this by using the Dirac equation in Rindler space and modeling entropic gravity as a thermal bath thus adopting the open quantum systems approach as well. We demonstrate that in the strong coupling limit, our entropic gravity model maintains quantum coherence for Dirac fermions. In addition, we demonstrate that spin is not affected by entropic gravity. We use the Foldy-Wouthysen transformation to demonstrate that it reduces to the nonrelativistic master equation, supporting the entropic gravity hypothesis for Dirac fermions. Also, we demonstrate how antigravity seemingly arises from the Dirac equation for free-falling antiparticles but use numerical simulations to show that this phenomenon originates from zitterbewegung thus not violating the equivalence principle.
△ Less
Submitted 15 November, 2023; v1 submitted 30 June, 2023;
originally announced July 2023.
-
DES Y3 + KiDS-1000: Consistent cosmology combining cosmic shear surveys
Authors:
Dark Energy Survey,
Kilo-Degree Survey Collaboration,
:,
T. M. C. Abbott,
M. Aguena,
A. Alarcon,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
M. Asgari,
S. Avila,
D. Bacon,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
E. Bertin,
M. Bilicki,
J. Blazek,
S. Bocquet,
D. Brooks,
P. Burger,
D. L. Burke,
H. Camacho,
A. Campos,
A. Carnero Rosell
, et al. (138 additional authors not shown)
Abstract:
We present a joint cosmic shear analysis of the Dark Energy Survey (DES Y3) and the Kilo-Degree Survey (KiDS-1000) in a collaborative effort between the two survey teams. We find consistent cosmological parameter constraints between DES Y3 and KiDS-1000 which, when combined in a joint-survey analysis, constrain the parameter $S_8 = σ_8 \sqrt{Ω_{\rm m}/0.3}$ with a mean value of…
▽ More
We present a joint cosmic shear analysis of the Dark Energy Survey (DES Y3) and the Kilo-Degree Survey (KiDS-1000) in a collaborative effort between the two survey teams. We find consistent cosmological parameter constraints between DES Y3 and KiDS-1000 which, when combined in a joint-survey analysis, constrain the parameter $S_8 = σ_8 \sqrt{Ω_{\rm m}/0.3}$ with a mean value of $0.790^{+0.018}_{-0.014}$. The mean marginal is lower than the maximum a posteriori estimate, $S_8=0.801$, owing to skewness in the marginal distribution and projection effects in the multi-dimensional parameter space. Our results are consistent with $S_8$ constraints from observations of the cosmic microwave background by Planck, with agreement at the $1.7σ$ level. We use a Hybrid analysis pipeline, defined from a mock survey study quantifying the impact of the different analysis choices originally adopted by each survey team. We review intrinsic alignment models, baryon feedback mitigation strategies, priors, samplers and models of the non-linear matter power spectrum.
△ Less
Submitted 19 October, 2023; v1 submitted 26 May, 2023;
originally announced May 2023.
-
IXA/Cogcomp at SemEval-2023 Task 2: Context-enriched Multilingual Named Entity Recognition using Knowledge Bases
Authors:
Iker García-Ferrero,
Jon Ander Campos,
Oscar Sainz,
Ander Salaberria,
Dan Roth
Abstract:
Named Entity Recognition (NER) is a core natural language processing task in which pre-trained language models have shown remarkable performance. However, standard benchmarks like CoNLL 2003 do not address many of the challenges that deployed NER systems face, such as having to classify emerging or complex entities in a fine-grained way. In this paper we present a novel NER cascade approach compri…
▽ More
Named Entity Recognition (NER) is a core natural language processing task in which pre-trained language models have shown remarkable performance. However, standard benchmarks like CoNLL 2003 do not address many of the challenges that deployed NER systems face, such as having to classify emerging or complex entities in a fine-grained way. In this paper we present a novel NER cascade approach comprising three steps: first, identifying candidate entities in the input sentence; second, linking the each candidate to an existing knowledge base; third, predicting the fine-grained category for each entity candidate. We empirically demonstrate the significance of external knowledge bases in accurately classifying fine-grained and emerging entities. Our system exhibits robust performance in the MultiCoNER2 shared task, even in the low-resource language setting where we leverage knowledge bases of high-resource languages.
△ Less
Submitted 27 April, 2023; v1 submitted 20 April, 2023;
originally announced April 2023.
-
Training Language Models with Language Feedback at Scale
Authors:
Jérémy Scheurer,
Jon Ander Campos,
Tomasz Korbak,
Jun Shern Chan,
Angelica Chen,
Kyunghyun Cho,
Ethan Perez
Abstract:
Pretrained language models often generate outputs that are not in line with human preferences, such as harmful text or factually incorrect summaries. Recent work approaches the above issues by learning from a simple form of human feedback: comparisons between pairs of model-generated outputs. However, comparison feedback only conveys limited information about human preferences. In this paper, we i…
▽ More
Pretrained language models often generate outputs that are not in line with human preferences, such as harmful text or factually incorrect summaries. Recent work approaches the above issues by learning from a simple form of human feedback: comparisons between pairs of model-generated outputs. However, comparison feedback only conveys limited information about human preferences. In this paper, we introduce Imitation learning from Language Feedback (ILF), a new approach that utilizes more informative language feedback. ILF consists of three steps that are applied iteratively: first, conditioning the language model on the input, an initial LM output, and feedback to generate refinements. Second, selecting the refinement incorporating the most feedback. Third, finetuning the language model to maximize the likelihood of the chosen refinement given the input. We show theoretically that ILF can be viewed as Bayesian Inference, similar to Reinforcement Learning from human feedback. We evaluate ILF's effectiveness on a carefully-controlled toy task and a realistic summarization task. Our experiments demonstrate that large language models accurately incorporate feedback and that finetuning with ILF scales well with the dataset size, even outperforming finetuning on human summaries. Learning from both language and comparison feedback outperforms learning from each alone, achieving human-level summarization performance.
△ Less
Submitted 22 February, 2024; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Improving Code Generation by Training with Natural Language Feedback
Authors:
Angelica Chen,
Jérémy Scheurer,
Tomasz Korbak,
Jon Ander Campos,
Jun Shern Chan,
Samuel R. Bowman,
Kyunghyun Cho,
Ethan Perez
Abstract:
The potential for pre-trained large language models (LLMs) to use natural language feedback at inference time has been an exciting recent development. We build upon this observation by formalizing an algorithm for learning from natural language feedback at training time instead, which we call Imitation learning from Language Feedback (ILF). ILF requires only a small amount of human-written feedbac…
▽ More
The potential for pre-trained large language models (LLMs) to use natural language feedback at inference time has been an exciting recent development. We build upon this observation by formalizing an algorithm for learning from natural language feedback at training time instead, which we call Imitation learning from Language Feedback (ILF). ILF requires only a small amount of human-written feedback during training and does not require the same feedback at test time, making it both user-friendly and sample-efficient. We further show that ILF can be seen as a form of minimizing the KL divergence to the ground truth distribution and demonstrate a proof-of-concept on a neural program synthesis task. We use ILF to improve a Codegen-Mono 6.1B model's pass@1 rate by 38% relative (and 10% absolute) on the Mostly Basic Python Problems (MBPP) benchmark, outperforming both fine-tuning on MBPP and fine-tuning on repaired programs written by humans. Overall, our results suggest that learning from human-written natural language feedback is both more effective and sample-efficient than training exclusively on demonstrations for improving an LLM's performance on code generation tasks.
△ Less
Submitted 22 February, 2024; v1 submitted 28 March, 2023;
originally announced March 2023.
-
Comment on "Rotating Spin and Giant Splitting: Unoccupied Surface Electronic Structure of Tl/Si(111)"
Authors:
Abraham F. Campos,
Kang Wang,
Antonio Tejeda
Abstract:
Rashba effect in 2D systems is extensively studied nowadays due to spintronics applications. The Letter studies the fundamentals of spin-orbit interaction in 2D systems. Experimental evidence is claimed for the rotation of the spin polarization vector in Tl/Si from an in-plane Rashba polarization at $\overlineΓ$ to the surface normal at $\overline{K}$($\overline{K}'$) valleys. These results are po…
▽ More
Rashba effect in 2D systems is extensively studied nowadays due to spintronics applications. The Letter studies the fundamentals of spin-orbit interaction in 2D systems. Experimental evidence is claimed for the rotation of the spin polarization vector in Tl/Si from an in-plane Rashba polarization at $\overlineΓ$ to the surface normal at $\overline{K}$($\overline{K}'$) valleys. These results are possible thanks to the single setup that could measure spin-resolved inverse photoemission (IPES) with in- and out-of- plane sensitivity. This Comment clarifies that (i) when considering the full data set in the Letter, the in-plane polarization does not vanish at the valleys, (ii) the Letter does not explain that the out-of-plane data are not real measurements, in the sense that they are derived by considering the fulfillment of a theoretical symmetry or from an unspecified data treatment.
△ Less
Submitted 1 July, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.
-
Comparison Theorems for Stochastic Chemical Reaction Networks
Authors:
Felipe A. Campos,
Simone Bruno,
Yi Fu,
Domitilla Del Vecchio,
Ruth J. Williams
Abstract:
Continuous-time Markov chains are frequently used as stochastic models for chemical reaction networks, especially in the growing field of systems biology. A fundamental problem for these Stochastic Chemical Reaction Networks (SCRNs) is to understand the dependence of the stochastic behavior of these systems on the chemical reaction rate parameters. Towards solving this problem, in this paper we de…
▽ More
Continuous-time Markov chains are frequently used as stochastic models for chemical reaction networks, especially in the growing field of systems biology. A fundamental problem for these Stochastic Chemical Reaction Networks (SCRNs) is to understand the dependence of the stochastic behavior of these systems on the chemical reaction rate parameters. Towards solving this problem, in this paper we develop theoretical tools called comparison theorems that provide stochastic ordering results for SCRNs. These theorems give sufficient conditions for monotonic dependence on parameters in these network models, which allow us to obtain, under suitable conditions, information about transient and steady state behavior. These theorems exploit structural properties of SCRNs, beyond those of general continuous-time Markov chains. Furthermore, we derive two theorems to compare stationary distributions and mean first passage times for SCRNs with different parameter values, or with the same parameters and different initial conditions. These tools are developed for SCRNs taking values in a generic (finite or countably infinite) state space and can also be applied for non-mass-action kinetics models. When propensity functions are bounded, our method of proof gives an explicit method for coupling two comparable SCRNs, which can be used to simultaneously simulate their sample paths in a comparable manner. We illustrate our results with applications to models of enzymatic kinetics and epigenetic regulation by chromatin modifications.
△ Less
Submitted 6 March, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Post-2000 Nonlinear Optical Materials and Measurements: Data Tables and Best Practices
Authors:
Nathalie Vermeulen,
Daniel Espinosa,
Adam Ball,
John Ballato,
Philippe Boucaud,
Georges Boudebs,
Cecilia L. A. V. Campos,
Peter Dragic,
Anderson S. L. Gomes,
Mikko J. Huttunen,
Nathaniel Kinsey,
Rich Mildren,
Dragomir Neshev,
Lazaro Padilha,
Minhao Pu,
Ray Secondo,
Eiji Tokunaga,
Dmitry Turchinovich,
Jingshi Yan,
Kresten Yvind,
Ksenia Dolgaleva,
Eric W. Van Stryland
Abstract:
In its 60 years of existence, the field of nonlinear optics has gained momentum especially over the past two decades thanks to major breakthroughs in material science and technology. In this article, we present a new set of data tables listing nonlinear-optical properties for different material categories as reported in the literature since 2000. The papers included in the data tables are represen…
▽ More
In its 60 years of existence, the field of nonlinear optics has gained momentum especially over the past two decades thanks to major breakthroughs in material science and technology. In this article, we present a new set of data tables listing nonlinear-optical properties for different material categories as reported in the literature since 2000. The papers included in the data tables are representative experimental works on bulk materials, solvents, 0D-1D-2D materials, metamaterials, fiber waveguiding materials, on-chip waveguiding materials, hybrid waveguiding systems, and materials suitable for nonlinear optics at THz frequencies. In addition to the data tables, we also provide best practices for performing and reporting nonlinear-optical experiments. These best practices underpin the selection process that was used for including papers in the tables. While the tables indeed show strong advancements in the field over the past two decades, we encourage the nonlinear-optics community to implement the identified best practices in future works. This will allow a more adequate comparison, interpretation and use of the published parameters, and as such further stimulate the overall progress in nonlinear-optical science and applications.
△ Less
Submitted 21 May, 2023; v1 submitted 15 January, 2023;
originally announced January 2023.
-
Absorption, scattering and shadow by a noncommutative black hole with global monopole
Authors:
M. A. Anacleto,
F. A. Brito,
J. A. V. Campos,
E. Passos
Abstract:
In this paper, we investigate the process of massless scalar wave scattering due to a noncommutative black hole with a global monopole through the partial wave method. We computed the cross section of differential scattering and absorption at the low frequency limit. We also calculated, at the high frequency limit, the absorption and the shadow radius by the null geodesic method. We showed that no…
▽ More
In this paper, we investigate the process of massless scalar wave scattering due to a noncommutative black hole with a global monopole through the partial wave method. We computed the cross section of differential scattering and absorption at the low frequency limit. We also calculated, at the high frequency limit, the absorption and the shadow radius by the null geodesic method. We showed that noncommutativity causes a reduction in the differential scattering/absorption cross section and shadow radius, while the presence of the global monopole has the effect of increasing the value of such quantities. In the limit of the null mass parameter, we verify that the cross section of differential scattering, absorption and shadow ray approach to a non-zero value proportional to a minimum mass.
△ Less
Submitted 28 December, 2022;
originally announced December 2022.
-
The Dark Energy Survey Year 3 and eBOSS: constraining galaxy intrinsic alignments across luminosity and colour space
Authors:
S. Samuroff,
R. Mandelbaum,
J. Blazek,
A. Campos,
N. MacCrann,
G. Zacharegkas,
A. Amon,
J. Prat,
S. Singh,
J. Elvin-Poole,
A. J. Ross,
A. Alarcon,
E. Baxter,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
A. Carnero Rosell,
M. Carrasco Kind,
R. Cawthon,
C. Chang,
R. Chen,
A. Choi,
M. Crocce,
C. Davis,
J. DeRose
, et al. (82 additional authors not shown)
Abstract:
We present direct constraints on galaxy intrinsic alignments using the Dark Energy Survey Year 3 (DES Y3), the Extended Baryon Oscillation Spectroscopic Survey (eBOSS) and its precursor, the Baryon Oscillation Spectroscopic Survey (BOSS). Our measurements incorporate photometric red sequence (redMaGiC) galaxies from DES with median redshift $z\sim0.2-1.0$, luminous red galaxies (LRGs) from eBOSS a…
▽ More
We present direct constraints on galaxy intrinsic alignments using the Dark Energy Survey Year 3 (DES Y3), the Extended Baryon Oscillation Spectroscopic Survey (eBOSS) and its precursor, the Baryon Oscillation Spectroscopic Survey (BOSS). Our measurements incorporate photometric red sequence (redMaGiC) galaxies from DES with median redshift $z\sim0.2-1.0$, luminous red galaxies (LRGs) from eBOSS at $z\sim0.8$, and also a SDSS-III BOSS CMASS sample at $z\sim0.5$. We measure two point intrinsic alignment correlations, which we fit using a model that includes lensing, magnification and photometric redshift error. Fitting on scales $6<r_{\rm p} < 70$ Mpc$/h$, we make a detection of intrinsic alignments in each sample, at $5σ-22σ$ (assuming a simple one parameter model for IAs). Using these red samples, we measure the IA-luminosity relation. Our results are statistically consistent with previous results, but offer a significant improvement in constraining power, particularly at low luminosity. With this improved precision, we see detectable dependence on colour between broadly defined red samples. It is likely that a more sophisticated approach than a binary red/blue split, which jointly considers colour and luminosity dependence in the IA signal, will be needed in future. We also compare the various signal components at the best fitting point in parameter space for each sample, and find that magnification and lensing contribute $\sim2-18\%$ of the total signal. As precision continues to improve, it will certainly be necessary to account for these effects in future direct IA measurements. Finally, we make equivalent measurements on a sample of Emission Line Galaxies (ELGs) from eBOSS at $z\sim 0.8$. We report a null detection, constraining the IA amplitude (assuming the nonlinear alignment model) to be $A_1=0.07^{+0.32}_{-0.42}$ ($|A_1|<0.78$ at $95\%$ CL).
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
Non-local contribution from small scales in galaxy-galaxy lensing: Comparison of mitigation schemes
Authors:
J. Prat,
G. Zacharegkas,
Y. Park,
N. MacCrann,
E. R. Switzer,
S. Pandey,
C. Chang,
J. Blazek,
R. Miquel,
A. Alarcon,
O. Alves,
A. Amon,
F. Andrade-Oliveira,
K. Bechtol,
M. R. Becker,
G. M. Bernstein,
R. Chen,
A. Choi,
H. Camacho,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind,
R. Cawthon,
J. Cordero,
M. Crocce
, et al. (90 additional authors not shown)
Abstract:
Recent cosmological analyses with large-scale structure and weak lensing measurements, usually referred to as 3$\times$2pt, had to discard a lot of signal-to-noise from small scales due to our inability to accurately model non-linearities and baryonic effects. Galaxy-galaxy lensing, or the position-shear correlation between lens and source galaxies, is one of the three two-point correlation functi…
▽ More
Recent cosmological analyses with large-scale structure and weak lensing measurements, usually referred to as 3$\times$2pt, had to discard a lot of signal-to-noise from small scales due to our inability to accurately model non-linearities and baryonic effects. Galaxy-galaxy lensing, or the position-shear correlation between lens and source galaxies, is one of the three two-point correlation functions that are included in such analyses, usually estimated with the mean tangential shear. However, tangential shear measurements at a given angular scale $θ$ or physical scale $R$ carry information from all scales below that, forcing the scale cuts applied in real data to be significantly larger than the scale at which theoretical uncertainties become problematic. Recently there have been a few independent efforts that aim to mitigate the non-locality of the galaxy-galaxy lensing signal. Here we perform a comparison of the different methods, including the Y-transformation, the Point-Mass marginalization methodology and the Annular Differential Surface Density statistic. We do the comparison at the cosmological constraints level in a combined galaxy clustering and galaxy-galaxy lensing analysis. We find that all the estimators yield equivalent cosmological results assuming a simulated Rubin Observatory Legacy Survey of Space and Time (LSST) Year 1 like setup and also when applied to DES Y3 data. With the LSST Y1 setup, we find that the mitigation schemes yield $\sim$1.3 times more constraining $S_8$ results than applying larger scale cuts without using any mitigation scheme.
△ Less
Submitted 4 April, 2023; v1 submitted 7 December, 2022;
originally announced December 2022.
-
Wrinkling and crumpling in twisted few and multilayer CVD graphene: High density of edge modes influencing Raman spectra
Authors:
D. Nikolaievskyi,
M. Torregrosa,
A. Merlen,
S. Clair,
O. Chuzel,
J. -L. Parrain,
T. Neisus,
A. Campos,
M. Cabie,
C. Martin,
C. Pardanaud
Abstract:
Richness and complexity of Raman spectra related to graphene materials is established from years to decades, with, among others: the well-known G, D, 2D,... bands plus a plethora of weaker bands related to disorder behavior, doping, stress, crystal orientation or stacking information. Herein, we report on how to detect crumpling effects in Raman spectra, using a large variety of few and multilayer…
▽ More
Richness and complexity of Raman spectra related to graphene materials is established from years to decades, with, among others: the well-known G, D, 2D,... bands plus a plethora of weaker bands related to disorder behavior, doping, stress, crystal orientation or stacking information. Herein, we report on how to detect crumpling effects in Raman spectra, using a large variety of few and multilayer graphene. The main finding is that these crumples enhance the G band intensity like it does with twisted bi layer graphene. We updated the D over G band intensity ratio versus G band width plot, which is generally used to disentangle point and linear defects origin, by reporting surface defects created by crumples. Moreover, we report for the first time on the existence 23 resonant additional bands at 633 nm. We attribute them to edge modes formed by high density of crumples. We use Raman plots (2D bands versus G band positions and widths) to gain qualitative information about the way layers are stacked.
△ Less
Submitted 7 December, 2022;
originally announced December 2022.
-
The Dark Energy Survey Year 3 high redshift sample: Selection, characterization and analysis of galaxy clustering
Authors:
C. Sánchez,
A. Alarcon,
G. M. Bernstein,
J. Sanchez,
S. Pandey,
M. Raveri,
J. Prat,
N. Weaverdyck,
I. Sevilla-Noarbe,
C. Chang,
E. Baxter,
Y. Omori,
B. Jain,
O. Alves,
A. Amon,
K. Bechtol,
M. R. Becker,
J. Blazek,
A. Choi,
A. Campos,
A. Carnero Rosell,
M. Carrasco Kind,
M. Crocce,
D. Cross,
J. DeRose
, et al. (75 additional authors not shown)
Abstract:
The fiducial cosmological analyses of imaging galaxy surveys like the Dark Energy Survey (DES) typically probe the Universe at redshifts $z < 1$. This is mainly because of the limited depth of these surveys, and also because such analyses rely heavily on galaxy lensing, which is more efficient at low redshifts. In this work we present the selection and characterization of high-redshift galaxy samp…
▽ More
The fiducial cosmological analyses of imaging galaxy surveys like the Dark Energy Survey (DES) typically probe the Universe at redshifts $z < 1$. This is mainly because of the limited depth of these surveys, and also because such analyses rely heavily on galaxy lensing, which is more efficient at low redshifts. In this work we present the selection and characterization of high-redshift galaxy samples using DES Year 3 data, and the analysis of their galaxy clustering measurements. In particular, we use galaxies that are fainter than those used in the previous DES Year 3 analyses and a Bayesian redshift scheme to define three tomographic bins with mean redshifts around $z \sim 0.9$, $1.2$ and $1.5$, which significantly extend the redshift coverage of the fiducial DES Year 3 analysis. These samples contain a total of about 9 million galaxies, and their galaxy density is more than 2 times higher than those in the DES Year 3 fiducial case. We characterize the redshift uncertainties of the samples, including the usage of various spectroscopic and high-quality redshift samples, and we develop a machine-learning method to correct for correlations between galaxy density and survey observing conditions. The analysis of galaxy clustering measurements, with a total signal-to-noise $S/N \sim 70$ after scale cuts, yields robust cosmological constraints on a combination of the fraction of matter in the Universe $Ω_m$ and the Hubble parameter $h$, $Ω_m h = 0.195^{+0.023}_{-0.018}$, and 2-3% measurements of the amplitude of the galaxy clustering signals, probing galaxy bias and the amplitude of matter fluctuations, $b σ_8$. A companion paper $\textit{(in preparation)}$ will present the cross-correlations of these high-$z$ samples with CMB lensing from Planck and SPT, and the cosmological analysis of those measurements in combination with the galaxy clustering presented in this work.
△ Less
Submitted 1 December, 2022; v1 submitted 29 November, 2022;
originally announced November 2022.
-
Cable Network Management Infrastructure Evolution
Authors:
L Alberto Campos,
Jennifer Andreoli-Fang,
Vivek Ganti
Abstract:
An approach to enable advanced troubleshooting, granular analysis and service quality of experience assessment is presented. The use of topology information in the identification of each cable network element along with granular information of the element configuration and health is proposed. This technique covers multiple layers including the service layer. At the physical layers street addresses…
▽ More
An approach to enable advanced troubleshooting, granular analysis and service quality of experience assessment is presented. The use of topology information in the identification of each cable network element along with granular information of the element configuration and health is proposed. This technique covers multiple layers including the service layer. At the physical layers street addresses, taps and amplifiers are used to identify impairment location. All layers are leveraged to measure network and service reliability, service degradation and to quantify quality of experience. A cable infrastructure implementation is described as an example.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.