-
Field-level Emulation of Cosmic Structure Formation with Cosmology and Redshift Dependence
Authors:
Drew Jamieson,
Yin Li,
Francisco Villaescusa-Navarro,
Shirley Ho,
David N. Spergel
Abstract:
We present a field-level emulator for large-scale structure, capturing the cosmology dependence and the time evolution of cosmic structure formation. The emulator maps linear displacement fields to their corresponding nonlinear displacements from N-body simulations at specific redshifts. Designed as a neural network, the emulator incorporates style parameters that encode dependencies on…
▽ More
We present a field-level emulator for large-scale structure, capturing the cosmology dependence and the time evolution of cosmic structure formation. The emulator maps linear displacement fields to their corresponding nonlinear displacements from N-body simulations at specific redshifts. Designed as a neural network, the emulator incorporates style parameters that encode dependencies on $Ω_{\rm m}$ and the linear growth factor $D(z)$ at redshift $z$. We train our model on the six-dimensional N-body phase space, predicting particle velocities as the time derivative of the model's displacement outputs. This innovation results in significant improvements in training efficiency and model accuracy. Tested on diverse cosmologies and redshifts not seen during training, the emulator achieves percent-level accuracy on scales of $k\sim~1~{\rm Mpc}^{-1}~h$ at $z=0$, with improved performance at higher redshifts. We compare predicted structure formation histories with N-body simulations via merger trees, finding consistent merger event sequences and statistical properties.
△ Less
Submitted 14 August, 2024;
originally announced August 2024.
-
CREST: Effectively Compacting a Datastore For Retrieval-Based Speculative Decoding
Authors:
Sophia Ho,
Jinsol Park,
Patrick Wang
Abstract:
We present CREST (Compact Retrieval-Based Speculative Decoding), a redesign of REST that allows it to be effectively "compacted". REST is a drafting technique for speculative decoding based on retrieving exact n-gram matches of the most recent n tokens generated by the target LLM from a datastore. The key idea of CREST is to only store a subset of the smallest and most common n-grams in the datast…
▽ More
We present CREST (Compact Retrieval-Based Speculative Decoding), a redesign of REST that allows it to be effectively "compacted". REST is a drafting technique for speculative decoding based on retrieving exact n-gram matches of the most recent n tokens generated by the target LLM from a datastore. The key idea of CREST is to only store a subset of the smallest and most common n-grams in the datastore with the hope of achieving comparable performance with less storage space. We found that storing a subset of n-grams both reduces storage space and improves performance. CREST matches REST's accepted token length with 10.6-13.5x less storage space and achieves a 16.5-17.1% higher acceptance length than REST using the same storage space on the HumanEval and MT Bench benchmarks.
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
On Drinfeld modular curves for SL(2)
Authors:
Jesse Franklin,
Sheng-Yang Kevin Ho,
Mihran Papikian
Abstract:
We study the Drinfeld modular curves arising from the Hecke congruence subgroups of $\mathrm{SL}_2(\mathbb{F}_q[T])$. Using a combinatorial method of Gekeler and Nonnengardt, we obtain a genus formula for these curves. In cases when the genus is one, we compute the Weierstrass equation of the corresponding curve.
We study the Drinfeld modular curves arising from the Hecke congruence subgroups of $\mathrm{SL}_2(\mathbb{F}_q[T])$. Using a combinatorial method of Gekeler and Nonnengardt, we obtain a genus formula for these curves. In cases when the genus is one, we compute the Weierstrass equation of the corresponding curve.
△ Less
Submitted 31 July, 2024;
originally announced August 2024.
-
Conformal Wide-Angle Scanning Leaky-Wave Antenna for V-Band On-Body Applications
Authors:
Pratik Vadher,
Zvonimir Sipus,
Anja K. Skrivervik,
Qihang Zeng,
Ronan Sauleau,
John S. Ho,
Giulia Sacco,
Denys Nikolayev
Abstract:
Wearable on-body millimeter-wave (mmWave) radars can provide obstacle detection and guidance for visually impaired people. However, their everyday performance is hindered by the rigid form factor and limited scanning range. In this article, we propose a low-profile, fast-scanning leaky-wave antenna (LWA) operating in the unlicensed V-band (57-64 GHz) to be integrated for on-body applications such…
▽ More
Wearable on-body millimeter-wave (mmWave) radars can provide obstacle detection and guidance for visually impaired people. However, their everyday performance is hindered by the rigid form factor and limited scanning range. In this article, we propose a low-profile, fast-scanning leaky-wave antenna (LWA) operating in the unlicensed V-band (57-64 GHz) to be integrated for on-body applications such as lightweight portable frequency-modulated continuous wave (FMCW) radars. The proposed LWA consists of meandering microstrips that can conform to the human body curvatures while maintaining beam-forming and beam-scanning properties. Experimental results demonstrate that the planar LWA achieves a realized gain above 10 dB with a fan-beam steering range in the H-plane from -40° to 43° over the operating frequency band while the half power beam-width (HPBW) is within 20°. Since for the foreseen application the antenna is supposed to conform to the user's body, the performance is also analyzed for a bent condition. The beam steering range changes to -32° to 50° when placed on the knee (corresponding to 80 mm radius). Under bending conditions, the LWA exhibits a maximum degradation of 1.75 dB, while the HPBW increases to 25°. This shows that due to the small size of the antenna, the impact of bending is low and the beam-forming and beam-scanning property of the designed LWA remain intact. Furthermore, we enable 2-D spatial scanning by employing an array of twelve LWAs with phased excitation, extending the scanning range in the E-plane from -40° to 40°, while the HPBW remains below 20° across the operational frequency range.
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
Geometric Features Enhanced Human-Object Interaction Detection
Authors:
Manli Zhu,
Edmond S. L. Ho,
Shuang Chen,
Longzhi Yang,
Hubert P. H. Shum
Abstract:
Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. Howe…
▽ More
Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. However, most of them follow the one-stage design of vanilla Transformer, leaving rich geometric priors under-exploited and leading to compromised performance especially when occlusion occurs. Given that geometric features tend to outperform visual ones in occluded scenarios and offer information that complements visual cues, we propose a novel end-to-end Transformer-style HOI detection model, i.e., geometric features enhanced HOI detector (GeoHOI). One key part of the model is a new unified self-supervised keypoint learning method named UniPointNet that bridges the gap of consistent keypoint representation across diverse object categories, including humans. GeoHOI effectively upgrades a Transformer-based HOI detector benefiting from the keypoints similarities measuring the likelihood of human-object interactions as well as local keypoint patches to enhance interaction query representation, so as to boost HOI predictions. Extensive experiments show that the proposed method outperforms the state-of-the-art models on V-COCO and achieves competitive performance on HICO-DET. Case study results on the post-disaster rescue with vision-based instruments showcase the applicability of the proposed GeoHOI in real-world applications.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
An antiferromagnetic diode effect in even-layered MnBi2Te4
Authors:
Anyuan Gao,
Shao-Wen Chen,
Barun Ghosh,
Jian-Xiang Qiu,
Yu-Fei Liu,
Yugo Onishi,
Chaowei Hu,
Tiema Qian,
Damien Bérubé,
Thao Dinh,
Houchen Li,
Christian Tzschaschel,
Seunghyun Park,
Tianye Huang,
Shang-Wei Lien,
Zhe Sun,
Sheng-Chin Ho,
Bahadur Singh,
Kenji Watanabe,
Takashi Taniguchi,
David C. Bell,
Arun Bansil,
Hsin Lin,
Tay-Rong Chang,
Amir Yacoby
, et al. (4 additional authors not shown)
Abstract:
In a PN junction, the separation between positive and negative charges leads to diode transport. In the past few years, the intrinsic diode transport in noncentrosymmetric polar conductors has attracted great interest, because it suggests novel nonlinear applications and provides a symmetry-sensitive probe of Fermi surface. Recently, such studies have been extended to noncentrosymmetric supercondu…
▽ More
In a PN junction, the separation between positive and negative charges leads to diode transport. In the past few years, the intrinsic diode transport in noncentrosymmetric polar conductors has attracted great interest, because it suggests novel nonlinear applications and provides a symmetry-sensitive probe of Fermi surface. Recently, such studies have been extended to noncentrosymmetric superconductors, realizing the superconducting diode effect. Here, we show that, even in a centrosymmetric crystal without directional charge separation, the spins of an antiferromagnet (AFM) can generate a spatial directionality, leading to an AFM diode effect. We observe large second-harmonic transport in a nonlinear electronic device enabled by the compensated AFM state of even-layered MnBi2Te4. We also report a novel electrical sum-frequency generation (SFG), which has been rarely explored in contrast to the well-known optical SFG in wide-gap insulators. We demonstrate that the AFM enables an in-plane field-effect transistor and harvesting of wireless electromagnetic energy. The electrical SFG establishes a powerful method to study nonlinear electronics built by quantum materials. The AFM diode effect paves the way for potential device concepts including AFM logic circuits, self-powered AFM spintronics, and other applications that potentially bridge nonlinear electronics with AFM spintronics.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Finding dusty AGNs from the JWST CEERS survey with mid-infrared photometry
Authors:
Tom C. -C. Chien,
Chih-Teng Ling,
Tomotsugu Goto,
Cossas K. -W. Wu,
Seong Jin Kim,
Tetsuya Hashimoto,
Yu-Wei Lin,
Ece Kilerci,
Simon C. -C. Ho,
Po-Ya Wang,
Bjorn Jasper R. Raquel
Abstract:
The nature of the interaction between active galactic nuclei (AGNs) and their host galaxies remains an unsolved question. Therefore, conducting an AGN census is valuable to AGN research. Nevertheless, a significant fraction of AGNs are obscured by their environment, which blocks UV and optical emissions due to the dusty torus surrounding the central supermassive black hole (SMBH). To overcome this…
▽ More
The nature of the interaction between active galactic nuclei (AGNs) and their host galaxies remains an unsolved question. Therefore, conducting an AGN census is valuable to AGN research. Nevertheless, a significant fraction of AGNs are obscured by their environment, which blocks UV and optical emissions due to the dusty torus surrounding the central supermassive black hole (SMBH). To overcome this challenge, mid-infrared (IR) surveys have emerged as a valuable tool for identifying obscured AGNs, as the obscured light is re-emitted in this range. With its high sensitivity, the James Webb Space Telescope (JWST) uncovered more fainter objects than previous telescopes. By applying the SED fitting, this work investigates AGN candidates in JWST Cosmic Evolution Early Release Science (CEERS) fields. We identified 42 candidates, 30 of them are classified as composites ($0.2\leq f_{\rm AGN, IR}< 0.5$), and 12 of them are AGNs ($f_{\rm AGN, IR}\geq 0.5$). We report the AGN luminosity contributions and AGN number fractions as a function of redshift and total infrared luminosity, showing that previously reported increasing relations are not apparent in our sample due to the sample size. We also extend the previous results on ultra-luminous infrared galaxies (ULIRGs, $L_{\rm TIR}\geq 10^{12} L_{\odot}$) to less luminous AGNs, highlighting the power of JWST.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Contextual Counting: A Mechanistic Study of Transformers on a Quantitative Task
Authors:
Siavash Golkar,
Alberto Bietti,
Mariel Pettee,
Michael Eickenberg,
Miles Cranmer,
Keiya Hirashima,
Geraud Krawezik,
Nicholas Lourie,
Michael McCabe,
Rudy Morel,
Ruben Ohana,
Liam Holden Parker,
Bruno Régaldo-Saint Blancard,
Kyunghyun Cho,
Shirley Ho
Abstract:
Transformers have revolutionized machine learning across diverse domains, yet understanding their behavior remains crucial, particularly in high-stakes applications. This paper introduces the contextual counting task, a novel toy problem aimed at enhancing our understanding of Transformers in quantitative and scientific contexts. This task requires precise localization and computation within datas…
▽ More
Transformers have revolutionized machine learning across diverse domains, yet understanding their behavior remains crucial, particularly in high-stakes applications. This paper introduces the contextual counting task, a novel toy problem aimed at enhancing our understanding of Transformers in quantitative and scientific contexts. This task requires precise localization and computation within datasets, akin to object detection or region-based scientific analysis. We present theoretical and empirical analysis using both causal and non-causal Transformer architectures, investigating the influence of various positional encodings on performance and interpretability. In particular, we find that causal attention is much better suited for the task, and that no positional embeddings lead to the best accuracy, though rotary embeddings are competitive and easier to train. We also show that out of distribution performance is tightly linked to which tokens it uses as a bias term.
△ Less
Submitted 30 May, 2024;
originally announced June 2024.
-
Model Inversion Robustness: Can Transfer Learning Help?
Authors:
Sy-Tuyen Ho,
Koh Jun Hao,
Keshigeyan Chandrasegaran,
Ngoc-Bao Nguyen,
Ngai-Man Cheung
Abstract:
Model Inversion (MI) attacks aim to reconstruct private training data by abusing access to machine learning models. Contemporary MI attacks have achieved impressive attack performance, posing serious threats to privacy. Meanwhile, all existing MI defense methods rely on regularization that is in direct conflict with the training objective, resulting in noticeable degradation in model utility. In t…
▽ More
Model Inversion (MI) attacks aim to reconstruct private training data by abusing access to machine learning models. Contemporary MI attacks have achieved impressive attack performance, posing serious threats to privacy. Meanwhile, all existing MI defense methods rely on regularization that is in direct conflict with the training objective, resulting in noticeable degradation in model utility. In this work, we take a different perspective, and propose a novel and simple Transfer Learning-based Defense against Model Inversion (TL-DMI) to render MI-robust models. Particularly, by leveraging TL, we limit the number of layers encoding sensitive information from private training dataset, thereby degrading the performance of MI attack. We conduct an analysis using Fisher Information to justify our method. Our defense is remarkably simple to implement. Without bells and whistles, we show in extensive experiments that TL-DMI achieves state-of-the-art (SOTA) MI robustness. Our code, pre-trained models, demo and inverted data are available at: https://hosytuyen.github.io/projects/TL-DMI
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Reward Learning from Suboptimal Demonstrations with Applications in Surgical Electrocautery
Authors:
Zohre Karimi,
Shing-Hei Ho,
Bao Thach,
Alan Kuntz,
Daniel S. Brown
Abstract:
Automating robotic surgery via learning from demonstration (LfD) techniques is extremely challenging. This is because surgical tasks often involve sequential decision-making processes with complex interactions of physical objects and have low tolerance for mistakes. Prior works assume that all demonstrations are fully observable and optimal, which might not be practical in the real world. This pap…
▽ More
Automating robotic surgery via learning from demonstration (LfD) techniques is extremely challenging. This is because surgical tasks often involve sequential decision-making processes with complex interactions of physical objects and have low tolerance for mistakes. Prior works assume that all demonstrations are fully observable and optimal, which might not be practical in the real world. This paper introduces a sample-efficient method that learns a robust reward function from a limited amount of ranked suboptimal demonstrations consisting of partial-view point cloud observations. The method then learns a policy by optimizing the learned reward function using reinforcement learning (RL). We show that using a learned reward function to obtain a policy is more robust than pure imitation learning. We apply our approach on a physical surgical electrocautery task and demonstrate that our method can perform well even when the provided demonstrations are suboptimal and the observations are high-dimensional point clouds. Code and videos available here: https://sites.google.com/view/lfdinelectrocautery
△ Less
Submitted 15 April, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Two-Person Interaction Augmentation with Skeleton Priors
Authors:
Baiyi Li,
Edmond S. L. Ho,
Hubert P. H. Shum,
He Wang
Abstract:
Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skeletal motion is challenging. While direct motion capture is expensive and slow, motion editing/generation is also non-trivial, as complex contact pattern…
▽ More
Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skeletal motion is challenging. While direct motion capture is expensive and slow, motion editing/generation is also non-trivial, as complex contact patterns with topological and geometric constraints have to be retained. To this end, we propose a new deep learning method for two-body skeletal interaction motion augmentation, which can generate variations of contact-rich interactions with varying body sizes and proportions while retaining the key geometric/topological relations between two bodies. Our system can learn effectively from a relatively small amount of data and generalize to drastically different skeleton sizes. Through exhaustive evaluation and comparison, we show it can generate high-quality motions, has strong generalizability and outperforms traditional optimization-based methods and alternative deep learning solutions.
△ Less
Submitted 9 April, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
{\sc SimBIG}: Cosmological Constraints using Simulation-Based Inference of Galaxy Clustering with Marked Power Spectra
Authors:
Elena Massara,
ChangHoon Hahn,
Michael Eickenberg,
Shirley Ho,
Jiamin Hou,
Pablo Lemos,
Chirag Modi,
Azadeh Moradinezhad Dizgah,
Liam Parker,
Bruno Régaldo-Saint Blancard
Abstract:
We present the first $Λ$CDM cosmological analysis performed on a galaxy survey using marked power spectra. The marked power spectrum is the two-point function of a marked field, where galaxies are weighted by a function that depends on their local density. The presence of the mark leads these statistics to contain higher-order information of the original galaxy field, making them a good candidate…
▽ More
We present the first $Λ$CDM cosmological analysis performed on a galaxy survey using marked power spectra. The marked power spectrum is the two-point function of a marked field, where galaxies are weighted by a function that depends on their local density. The presence of the mark leads these statistics to contain higher-order information of the original galaxy field, making them a good candidate to exploit the non-Gaussian information of a galaxy catalog. In this work we make use of \simbig, a forward modeling framework for galaxy clustering analyses, and perform simulation-based inference using normalizing flows to infer the posterior distribution of the $Λ$CDM cosmological parameters. We consider different mark configurations (ways to weight the galaxy field) and deploy them in the \simbig~pipeline to analyze the corresponding marked power spectra measured from a subset of the BOSS galaxy sample. We analyze the redshift-space mark power spectra decomposed in $\ell = 0, 2, 4$ multipoles and include scales up to the non-linear regime. Among the various mark configurations considered, the ones that give the most stringent cosmological constraints produce posterior median and $68\%$ confidence limits on the growth of structure parameters equal to $Ω_m=0.273^{+0.040}_{-0.030}$ and $σ_8=0.777^{+0.077}_{-0.071}$. Compared to a perturbation theory analysis using the power spectrum of the same dataset, the \simbig~marked power spectra constraints on $σ_8$ are up to $1.2\times$ tighter, while no improvement is seen for the other cosmological parameters.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Sound Borrow-Checking for Rust via Symbolic Semantics (Long Version)
Authors:
Son Ho,
Aymeric Fromherz,
Jonathan Protzenko
Abstract:
The Rust programming language continues to rise in popularity, and as such, warrants the close attention of the programming languages community. In this work, we present a new foundational contribution towards the theoretical understanding of Rust's semantics. We prove that LLBC, a high-level, borrow-centric model previously proposed for Rust's semantics and execution, is sound with regards to a l…
▽ More
The Rust programming language continues to rise in popularity, and as such, warrants the close attention of the programming languages community. In this work, we present a new foundational contribution towards the theoretical understanding of Rust's semantics. We prove that LLBC, a high-level, borrow-centric model previously proposed for Rust's semantics and execution, is sound with regards to a low-level pointer-based language à la CompCert. Specifically, we prove the following: that LLBC is a correct view over a traditional model of execution; that LLBC's symbolic semantics are a correct abstraction of LLBC programs; and that LLBC's symbolic semantics act as a borrow-checker for LLBC, i.e. that symbolically-checked LLBC programs do not get stuck when executed on a heap-and-addresses model of execution.
To prove these results, we introduce a new proof style that considerably simplifies our proofs of simulation, which relies on a notion of hybrid states. Equipped with this reasoning framework, we show that a new addition to LLBC's symbolic semantics, namely a join operation, preserves the abstraction and borrow-checking properties. This in turn allows us to add support for loops to the Aeneas framework; we show, using a series of examples and case studies, that this unlocks new expressive power for Aeneas.
△ Less
Submitted 1 July, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
The Rational Torsion Subgroup of $J_0(\mathfrak{p}^r)$
Authors:
Sheng-Yang Kevin Ho
Abstract:
Let $\mathfrak{n}=\mathfrak{p}^r$ be a prime-power ideal of $\mathbb{F}_q[T]$ with $r\geq 2$. We study the rational torsion subgroup $\mathcal{T}(\mathfrak{p}^r)$ of the Drinfeld modular Jacobian $J_0(\mathfrak{p}^r)$. We prove that the prime-to-$q(q-1)$ part of $\mathcal{T}(\mathfrak{p}^r)$ is equal to that of the rational cuspidal divisor class group $\mathcal{C}(\mathfrak{p}^r)$ of the Drinfeld…
▽ More
Let $\mathfrak{n}=\mathfrak{p}^r$ be a prime-power ideal of $\mathbb{F}_q[T]$ with $r\geq 2$. We study the rational torsion subgroup $\mathcal{T}(\mathfrak{p}^r)$ of the Drinfeld modular Jacobian $J_0(\mathfrak{p}^r)$. We prove that the prime-to-$q(q-1)$ part of $\mathcal{T}(\mathfrak{p}^r)$ is equal to that of the rational cuspidal divisor class group $\mathcal{C}(\mathfrak{p}^r)$ of the Drinfeld modular curve $X_0(\mathfrak{p}^r)$. As we completely computed the structure of $\mathcal{C}(\mathfrak{p}^r)$, it also determines the structure of the prime-to-$q(q-1)$ part of $\mathcal{T}(\mathfrak{p}^r)$.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Octahedral and polar phase transitions in freestanding films of SrTiO3
Authors:
Ludmila Leroy,
Shih-Wen Huang,
Chun-Chien Chiu,
Sheng-Zhu Ho,
Janine Dössegger,
Cinthia Piamonteze,
Elsa Abreu,
Alessandro Bombardi,
Jan-Chi Yang,
Urs Staub
Abstract:
From extreme strain to bending, the possibilities in the manipulation of freestanding films of oxide perovskites bring a novel landscape to their properties and brings them one step closer to their application. It is therefore of great importance to fully understand the inherent properties of such films, in which dimensionality and surface effects can play a major role in defining the properties o…
▽ More
From extreme strain to bending, the possibilities in the manipulation of freestanding films of oxide perovskites bring a novel landscape to their properties and brings them one step closer to their application. It is therefore of great importance to fully understand the inherent properties of such films, in which dimensionality and surface effects can play a major role in defining the properties of the materials ground state. This paper reports the properties of freestanding (FS) films of the canonical oxide, SrTiO3 (STO) with thicknesses 20, 30, 40 and 80 nm. We show that the relaxed ultrathin STO FS films become polar at temperatures as high as 85 K, in contrast to the quantum paraelectric behavior of bulk. Our findings are based on the softening of the ferroelectric mode towards the ferroelectric transition temperature Tc and its consecutive hardening below Tc with further decreasing temperature, probed with THz time domain spectroscopy in transmission mode. We find almost no thickness dependence in Tc. Moreover, we characterize the antiferrodistortive (AFD) phase transition in STO FS by X-ray diffraction (XRD) probing superlattice reflections characteristic for the rotation of the TiO6 octahedra. Our results point to a higher phase transition temperature in comparison to bulk STO, as well as an unbalanced domain population favoring the rotation axis to be in plane. X-ray linear dichroism results further show a preferential Ti xz/yz orbital occupancy at the surface, but with a complete degeneracy in the t2g states in the inner part of the film indicating that the AFD distortion does not strongly affect the t2g splitting. These findings demonstrate that STO FS films have clearly different properties than bulk.
△ Less
Submitted 27 March, 2024;
originally announced March 2024.
-
Effects of galaxy environment on merger fraction
Authors:
W. J. Pearson,
D. J. D. Santos,
T. Goto,
T. -C. Huang,
S. J. Kim,
H. Matsuhara,
A. Pollo,
S. C. -C. Ho,
H. S. Hwang,
K. Małek,
T. Nakagawa,
M. Romano,
S. Serjeant,
L. Suelves,
H. Shim,
G. J. White
Abstract:
Aims. In this work, we intend to examine how environment influences the merger fraction, from the low density field environment to higher density groups and clusters. We also aim to study how the properties of a group or cluster, as well as the position of a galaxy in the group or cluster, influences the merger fraction.
Methods. We identified galaxy groups and clusters in the North Ecliptic Pol…
▽ More
Aims. In this work, we intend to examine how environment influences the merger fraction, from the low density field environment to higher density groups and clusters. We also aim to study how the properties of a group or cluster, as well as the position of a galaxy in the group or cluster, influences the merger fraction.
Methods. We identified galaxy groups and clusters in the North Ecliptic Pole using a friends-of-friends algorithm and the local density. Once identified, we determined the central galaxies, group radii, velocity dispersions, and group masses of these groups and clusters. Merging systems were identified with a neural network as well as visually. With these, we examined how the merger fraction changes as the local density changes for all galaxies as well as how the merger fraction changes as the properties of the groups or clusters change.
Results. We find that the merger fraction increases as local density increases and decreases as the velocity dispersion increases, as is often found in literature. A decrease in merger fraction as the group mass increases is also found. We also find groups with larger radii have higher merger fractions. The number of galaxies in a group does not influence the merger fraction.
Conclusions. The decrease in merger fraction as group mass increases is a result of the link between group mass and velocity dispersion. Hence, this decrease of merger fraction with increasing mass is a result of the decrease of merger fraction with velocity dispersion. The increasing relation between group radii and merger fraction may be a result of larger groups having smaller velocity dispersion at a larger distance from the centre or larger groups hosting smaller, infalling groups with more mergers. However, we do not find evidence of smaller groups having higher merger fractions.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
Cloud-by-cloud Multiphase Investigation of the Circumgalactic Medium of Low-redshift Galaxies
Authors:
Sameer,
Jane C. Charlton,
Bart P. Wakker,
Glenn G. Kacprzak,
Nikole M. Nielsen,
Christopher W. Churchill,
Philipp Richter,
Sowgat Muzahid,
Stephanie H. Ho,
Hasti Nateghi,
Benjamin Rosenwasser,
Anand Narayanan,
Rajib Ganguly
Abstract:
The pervasive presence of warm gas in galaxy halos suggests that the circumgalactic medium (CGM) is multiphase in its ionization structure and complex in its kinematics. Some recent state-of-the-art cosmological galaxy simulations predict an azimuthal dependence of CGM metallicities. We investigate the presence of such a trend by analyzing the distribution of gas properties in the CGM around 47…
▽ More
The pervasive presence of warm gas in galaxy halos suggests that the circumgalactic medium (CGM) is multiphase in its ionization structure and complex in its kinematics. Some recent state-of-the-art cosmological galaxy simulations predict an azimuthal dependence of CGM metallicities. We investigate the presence of such a trend by analyzing the distribution of gas properties in the CGM around 47 $z <$ 0.7 galaxies from the Multiphase Galaxy Halos Survey determined using a cloud-by-cloud, multiphase, ionization modelling approach. We identify three distinct populations of absorbers: cool clouds ($T \sim$ 10$^{4.1}$ K) in photoionization equilibrium, warm-hot collisionally ionized clouds ($T \sim$ 10$^{4.5-5}$ K) affected by time-dependent photoionization, and hotter clouds ($T \sim$ 10$^{5.4-6}$ K) with broad OVI and Lya absorption consistent with collisional ionization. We find that fragmentation can play a role in the origin of cool clouds, that warm-hot clouds are out of equilibrium due to rapid cooling, and that hotter clouds are representative of virialized halo gas in all but the lowest mass galaxies. The metallicities of clouds do not depend on the azimuthal angle or other galaxy properties for any of these populations. At face value, this disagrees with the simplistic model of the CGM with bipolar outflows and cold-mode planar accretion. However, the number of clouds per sightline is significantly larger close to the minor and major axes. This implies that the processes of outflows and accretion are contributing to these CGM cloud populations, and our sightlines are probing gas of mixed origins at all azimuthal angles in these low redshift galaxies.
△ Less
Submitted 5 April, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Handling Open Research Data within the Max Planck Society -- Looking Closer at the Year 2020
Authors:
Martin Boosen,
Michael Franke,
Yves Vincent Grossmann,
Sy Dat Ho,
Larissa Leiminger,
Jan Matthiesen
Abstract:
This paper analyses the practice of publishing research data within the Max Planck Society in the year 2020. The central finding of the study is that up to 40\% of the empirical text publications had research data available. The aggregation of the available data is predominantly analysed. There are differences between the sections of the Max Planck Society but they are not as great as one might ex…
▽ More
This paper analyses the practice of publishing research data within the Max Planck Society in the year 2020. The central finding of the study is that up to 40\% of the empirical text publications had research data available. The aggregation of the available data is predominantly analysed. There are differences between the sections of the Max Planck Society but they are not as great as one might expect. In the case of the journals, it is also apparent that a data policy can increase the availability of data related to textual publications. Finally, we found that the statement on data availability "upon (reasonable) request" does not work.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Exploring the faintest end of mid-infrared luminosity functions up to $z\simeq 5$ with the JWST CEERS survey
Authors:
Chih-Teng Ling,
Tomotsugu Goto,
Seong Jin Kim,
Cossas K. -W. Wu,
Tetsuya Hashimoto,
Tom C. -C. Chien,
Yu-Wei Lin,
Simon C. -C. Ho,
Ece Kilerci
Abstract:
Mid-infrared (MIR) light from galaxies is sensitive to dust-obscured star-formation activities because it traces the characteristic emission of dust heated by young, massive stars. By constructing the MIR luminosity functions (LFs), we are able to quantify the overall dusty star formation history and the evolution of galaxies over cosmic time. In this work, we report the first rest-frame MIR LFs a…
▽ More
Mid-infrared (MIR) light from galaxies is sensitive to dust-obscured star-formation activities because it traces the characteristic emission of dust heated by young, massive stars. By constructing the MIR luminosity functions (LFs), we are able to quantify the overall dusty star formation history and the evolution of galaxies over cosmic time. In this work, we report the first rest-frame MIR LFs at 7.7, 10, 12.8, 15, 18, and 21 $μ$m as well as the total IR LF from the James Webb Space Telescope (JWST) Cosmic Evolution Early Release Science (CEERS) survey. We identify 506 galaxies at $z=0-5.1$ in the CEERS survey that also have optical photometry from the Hubble Space Telescope. With the unprecedented sensitivity of the JWST, we probe the faintest end of the LFs at $z=0-1$ down to $L^* \sim 10^7 L_\odot$, $\sim 2$ orders of magnitude fainter than those from the previous generation of IR space telescopes. Our findings connect well with and continue the faint end of the MIR LFs from the deepest observations in past works. As a proxy of star formation history, we present the MIR-based luminosity density up to $z\simeq4.0$, marking the first probe of the early Universe by JWST MIRI.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Incremental Proof Development in Dafny with Module-Based Induction
Authors:
Son Ho,
Clément Pit-Claudel
Abstract:
Highly automated theorem provers like Dafny allow users to prove simple properties with little effort, making it easy to quickly sketch proofs. The drawback is that such provers leave users with little control about the proof search, meaning that the small changes inherent to the iterative process of writing a proof often lead to unpredictable variations in verification time, and eventually hard-t…
▽ More
Highly automated theorem provers like Dafny allow users to prove simple properties with little effort, making it easy to quickly sketch proofs. The drawback is that such provers leave users with little control about the proof search, meaning that the small changes inherent to the iterative process of writing a proof often lead to unpredictable variations in verification time, and eventually hard-to-diagnose proof failures. This sometimes turns the boon of high automation into a curse, as instead of breaking early and showing unsolved goals to the user like in Coq, proofs tend to gradually become unstable until their verification time explodes. At this point, the absence of a proof context to investigate often leaves the user to a painful debugging session. In this paper, we show how to use Dafny modules to encode Coq-like induction principles to dramatically improve the stability and maintainability of proofs about inductive data structures.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
${\rm S{\scriptsize IM}BIG}$: Cosmological Constraints from the Redshift-Space Galaxy Skew Spectra
Authors:
Jiamin Hou,
Azadeh Moradinezhad Dizgah,
ChangHoon Hahn,
Michael Eickenberg,
Shirley Ho,
Pablo Lemos,
Elena Massara,
Chirag Modi,
Liam Parker,
Bruno Régaldo-Saint Blancard
Abstract:
Extracting the non-Gaussian information of the cosmic large-scale structure (LSS) is vital in unlocking the full potential of the rich datasets from the upcoming stage-IV galaxy surveys. Galaxy skew spectra serve as efficient beyond-two-point statistics, encapsulating essential bispectrum information with computational efficiency akin to power spectrum analysis. This paper presents the first cosmo…
▽ More
Extracting the non-Gaussian information of the cosmic large-scale structure (LSS) is vital in unlocking the full potential of the rich datasets from the upcoming stage-IV galaxy surveys. Galaxy skew spectra serve as efficient beyond-two-point statistics, encapsulating essential bispectrum information with computational efficiency akin to power spectrum analysis. This paper presents the first cosmological constraints from analyzing the full set of redshift-space galaxy skew spectra of the data from the SDSS-III BOSS, accessing cosmological information down to nonlinear scales. Employing the ${\rm S{\scriptsize IM}BIG}$ forward modeling framework and simulation-based inference via normalizing flows, we analyze the CMASS-SGC sub-sample, which constitute approximately 10\% of the full BOSS data. Analyzing the scales up to $k_{\rm max}=0.5 \, {\rm Mpc}^{-1}h$, we find that the skew spectra improve the constraints on $Ω_{\rm m}, Ω_{\rm b}, h$, and $n_s$ by 34\%, 35\%, 18\%, 10\%, respectively, compared to constraints from previous ${\rm S{\scriptsize IM}BIG}$ power spectrum multipoles analysis, yielding $Ω_{\rm m}=0.288^{+0.024}_{-0.034}$, $Ω_{\rm b}= 0.043^{+0.005}_{-0.007}$, $h=0.759^{+0.104}_{-0.050}$, $n_{\rm s} = 0.918^{+0.041}_{-0.090}$ (at 68\% confidence limit). On the other hand, the constraints on $σ_8$ are weaker than from the power spectrum. Including the Big Bang Nucleosynthesis (BBN) prior on baryon density reduces the uncertainty on the Hubble parameter further, achieving $h=0.750^{+0.034}_{-0.032}$, which is a 38\% improvement over the constraint from the power spectrum with the same prior. Compared to the ${\rm S{\scriptsize IM}BIG}$ bispectrum (monopole) analysis, skew spectra offer comparable constraints on larger scales ($k_{\rm max}<0.3\, {\rm Mpc}^{-1}h$) for most parameters except for $σ_8$.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Shallower radius valley around low-mass hosts: Evidence for icy planets, collisions or high-energy radiation scatter
Authors:
Cynthia S. K. Ho,
James G. Rogers,
Vincent Van Eylen,
James E. Owen,
Hilke E. Schlichting
Abstract:
The radius valley, i.e., a dearth of planets with radii between 1.5 and 2 Earth radii, provides insights into planetary formation and evolution. Using homogenously revised planetary parameters from Kepler 1-minute short cadence light curves, we remodel transits of 72 small planets mostly orbiting low-mass stars, improving the precision and accuracy of planet parameters. By combining this sample wi…
▽ More
The radius valley, i.e., a dearth of planets with radii between 1.5 and 2 Earth radii, provides insights into planetary formation and evolution. Using homogenously revised planetary parameters from Kepler 1-minute short cadence light curves, we remodel transits of 72 small planets mostly orbiting low-mass stars, improving the precision and accuracy of planet parameters. By combining this sample with a similar sample of planets around higher-mass stars, we determine the depth of the radius valley as a function of stellar mass. We find that the radius valley is shallower for low-mass stars compared to their higher mass counterparts. Upon comparison, we find that theoretical models of photoevaporation under-predict the number of planets observed inside the radius valley for low-mass stars: with decreasing stellar mass, the predicted fraction of planets inside the valley remains approximately constant whereas the observed fraction increases. We argue that this provides evidence for the presence of icy planets around low-mass stars. Alternatively, planets orbiting low-mass stars undergo more frequent collisions and scatter in the stars' high-energy output may also cause planets to fill the valley. We predict that more precise mass measurements for planets orbiting low mass stars would be able to distinguish between these scenarios.
△ Less
Submitted 11 June, 2024; v1 submitted 22 January, 2024;
originally announced January 2024.
-
Recent $B^+ \!\to K^+ν\barν$ Excess and Muon $g-2$ Illuminating Light Dark Sector with Higgs Portal
Authors:
Shu-Yu Ho,
Jongkuk Kim,
Pyungwon Ko
Abstract:
The Belle II collaboration recently announced that they observed the $B^+ \!\to K^+ν\barν$ decay process for the first time. This dineutrino mode of $B^+ \!\to K^+ν\barν$ has been theoretically identified as a very clean channel. However, their result encounters a $2.7{}^{}σ$ deviation from the Standard Model (SM) calculation. On the other hand, last year, Fermilab released new data on muon $g-2$…
▽ More
The Belle II collaboration recently announced that they observed the $B^+ \!\to K^+ν\barν$ decay process for the first time. This dineutrino mode of $B^+ \!\to K^+ν\barν$ has been theoretically identified as a very clean channel. However, their result encounters a $2.7{}^{}σ$ deviation from the Standard Model (SM) calculation. On the other hand, last year, Fermilab released new data on muon $g-2$ away from the SM expectation with $5{}^{}σ$. In this letter, we study the simplest UV-complete $\text{U}(1)_{\textsf{L}_μ- \textsf{L}_τ}^{}$-charged complex scalar Dark Matter (DM) model. Thanks to the existence of light dark Higgs boson and light dark photon, we can explain the observed relic density of DM and resolve the results reported by both Belle II and Fermilab experiments simultaneously. As a byproduct, the Hubble tension is alleviated by taking $ΔN_\textsf{eff}^{} \simeq 0.3$ induced by the light dark photon.
△ Less
Submitted 26 January, 2024; v1 submitted 18 January, 2024;
originally announced January 2024.
-
Interferometric Single-Shot Parity Measurement in an InAs-Al Hybrid Device
Authors:
Morteza Aghaee,
Alejandro Alcaraz Ramirez,
Zulfi Alam,
Rizwan Ali,
Mariusz Andrzejczuk,
Andrey Antipov,
Mikhail Astafev,
Amin Barzegar,
Bela Bauer,
Jonathan Becker,
Umesh Kumar Bhaskar,
Alex Bocharov,
Srini Boddapati,
David Bohn,
Jouri Bommer,
Leo Bourdet,
Arnaud Bousquet,
Samuel Boutin,
Lucas Casparis,
Benjamin James Chapman,
Sohail Chatoor,
Anna Wulff Christensen,
Cassandra Chua,
Patrick Codd,
William Cole
, et al. (137 additional authors not shown)
Abstract:
The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostruct…
▽ More
The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostructures with a gate-defined nanowire. The interferometer is formed by tunnel-coupling the proximitized nanowire to quantum dots. The nanowire causes a state-dependent shift of these quantum dots' quantum capacitance of up to 1 fF. Our quantum capacitance measurements show flux h/2e-periodic bimodality with a signal-to-noise ratio of 1 in 3.7 $μ$s at optimal flux values. From the time traces of the quantum capacitance measurements, we extract a dwell time in the two associated states that is longer than 1 ms at in-plane magnetic fields of approximately 2 T. These results are consistent with a measurement of the fermion parity encoded in a pair of Majorana zero modes that are separated by approximately 3 $μ$m and subjected to a low rate of poisoning by non-equilibrium quasiparticles. The large capacitance shift and long poisoning time enable a parity measurement error probability of 1%.
△ Less
Submitted 2 April, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Engineering the strain and interlayer excitons of 2D materials via lithographically engraved hexagonal boron nitride
Authors:
Yu-Chiang Hsieh,
Zhen-You Lin,
Shin-Ji Fung,
Wen-Shin Lu,
Sheng-Chin Ho,
Siang-Ping Hong,
Sheng-Zhu Ho,
Chiu-Hua Huang,
Kenji Watanabe,
Takashi Taniguchi,
Yang-Hao Chan,
Yi-Chun Chen,
Chung-Lin Wu,
Tse-Ming Chen
Abstract:
Strain engineering has quickly emerged as a viable option to modify the electronic, optical and magnetic properties of 2D materials. However, it remains challenging to arbitrarily control the strain. Here we show that by creating atomically-flat surface nanostructures in hexagonal boron nitride, we achieve an arbitrary on-chip control of both the strain distribution and magnitude on high-quality m…
▽ More
Strain engineering has quickly emerged as a viable option to modify the electronic, optical and magnetic properties of 2D materials. However, it remains challenging to arbitrarily control the strain. Here we show that by creating atomically-flat surface nanostructures in hexagonal boron nitride, we achieve an arbitrary on-chip control of both the strain distribution and magnitude on high-quality molybdenum disulfide. The phonon and exciton emissions are shown to vary in accordance with our strain field designs, enabling us to write and draw any photoluminescence color image in a single chip. Moreover, our strain engineering offers a powerful means to significantly and controllably alter the strengths and energies of interlayer excitons at room temperature. This method can be easily extended to other material systems and offers a promise for functional excitonic devices.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Polycyclic aromatic hydrocarbon (PAH) luminous galaxies in JWST CEERS data
Authors:
Yu-Wei Lin,
Cossas K. -W. Wu,
Chih-Teng Ling,
Tomotsugu Goto,
Seong Jin Kim,
Ece Kilerci,
Tetsuya Hashimoto,
Po-Ya Wang,
Simon C. -C. Ho,
Tiger Yu-Yang Hsiao,
Bjorn Jasper R. Raquel,
Yuri Uno
Abstract:
It has been an unanswered question how many dusty galaxies have been undetected from the state-of-the-art observational surveys. JWST enables us to detect faint IR galaxies that have prominent polycyclic aromatic hydrocarbon (PAH) features in the mid-IR wavelengths. PAH is a valuable tracer of star formation and dust properties in the mid-infrared wavelength. The JWST Cosmic Evolution Early Releas…
▽ More
It has been an unanswered question how many dusty galaxies have been undetected from the state-of-the-art observational surveys. JWST enables us to detect faint IR galaxies that have prominent polycyclic aromatic hydrocarbon (PAH) features in the mid-IR wavelengths. PAH is a valuable tracer of star formation and dust properties in the mid-infrared wavelength. The JWST Cosmic Evolution Early Release Science (CEERS) fields provide us with wavelength coverage from 7.7 to 21 $μ$m using six photometric bands of the mid-infrared instrument (MIRI). We have identified galaxies dominated by mid-IR emission from PAHs, termed PAH galaxies. From our multi-band photometry catalogue, we selected ten PAH galaxies displaying high flux ratios of $\log(S_{15}/S_{10}) > 0.8$. The SED fitting analysis indicates that these galaxies are star-forming galaxies with total IR luminosities of $10^{10}$ $\sim$ $10^{11.5}$ $L_{\odot}$ at z $\sim 1$. The morphology of PAH galaxies does not show any clear signatures of major merging or interaction within the MIRI resolution. The majority of them are on the star-formation main sequence at $z \sim 1$. Our result demonstrates that JWST can detect PAH emissions from normal star-forming galaxies at $z \sim 1$, in addition to ultra-luminous infrared galaxies (ULIRGs) or luminous infrared galaxies (LIRGs).
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
Detection of evolutionary shifts in variance under an Ornsten-Uhlenbeck model
Authors:
Wensha Zhang,
Lam Si Tung Ho,
Toby Kenney
Abstract:
1. Abrupt environmental changes can lead to evolutionary shifts in not only mean (optimal value), but also variance of descendants in trait evolution. There are some methods to detect shifts in optimal value but few studies consider shifts in variance. 2. We use a multi-optima and multi-variance OU process model to describe the trait evolution process with shifts in both optimal value and variance…
▽ More
1. Abrupt environmental changes can lead to evolutionary shifts in not only mean (optimal value), but also variance of descendants in trait evolution. There are some methods to detect shifts in optimal value but few studies consider shifts in variance. 2. We use a multi-optima and multi-variance OU process model to describe the trait evolution process with shifts in both optimal value and variance and provide analysis of how the covariance between species changes when shifts in variance occur along the path. 3. We propose a new method to detect the shifts in both variance and optimal values based on minimizing the loss function with L1 penalty. We implement our method in a new R package, ShiVa (Detection of evolutionary shifts in variance). 4. We conduct simulations to compare our method with the two methods considering only shifts in optimal values (l1ou; PhylogeneticEM). Our method shows strength in predictive ability and includes far fewer false positive shifts in optimal value compared to other methods when shifts in variance actually exist. When there are only shifts in optimal value, our method performs similarly to other methods. We applied our method to the cordylid data, ShiVa outperformed l1ou and phyloEM, exhibiting the highest log-likelihood and lowest BIC.
△ Less
Submitted 29 December, 2023;
originally announced December 2023.
-
Pose-based Tremor Type and Level Analysis for Parkinson's Disease from Video
Authors:
Haozheng Zhang,
Edmond S. L. Ho,
Xiatian Zhang,
Silvia Del Din,
Hubert P. H. Shum
Abstract:
Purpose:Current methods for diagnosis of PD rely on clinical examination. The accuracy of diagnosis ranges between 73% and 84%, and is influenced by the experience of the clinical assessor. Hence, an automatic, effective and interpretable supporting system for PD symptom identification would support clinicians in making more robust PD diagnostic decisions. Methods: We propose to analyze Parkinson'…
▽ More
Purpose:Current methods for diagnosis of PD rely on clinical examination. The accuracy of diagnosis ranges between 73% and 84%, and is influenced by the experience of the clinical assessor. Hence, an automatic, effective and interpretable supporting system for PD symptom identification would support clinicians in making more robust PD diagnostic decisions. Methods: We propose to analyze Parkinson's tremor (PT) to support the analysis of PD, since PT is one of the most typical symptoms of PD with broad generalizability. To realize the idea, we present SPA-PTA, a deep learning-based PT classification and severity estimation system that takes consumer-grade videos of front-facing humans as input. The core of the system is a novel attention module with a lightweight pyramidal channel-squeezing-fusion architecture that effectively extracts relevant PT information and filters noise. It enhances modeling performance while improving system interpretability. Results:We validate our system via individual-based leave-one-out cross-validation on two tasks: the PT classification task and the tremor severity rating estimation task. Our system presents a 91.3% accuracy and 80.0% F1-score in classifying PT with non-PT class, while providing a 76.4% accuracy and 76.7% F1-score in more complex multiclass tremor rating classification task. Conclusion: Our system offers a cost-effective PT classification and tremor severity estimation results as warning signs of PD for undiagnosed patients with PT symptoms. In addition, it provides a potential solution for supporting PD diagnosis in regions with limited clinical resources.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Intra-Family Transformation of The Bi-Te Family via in-situ Chemical Interactions
Authors:
Zhihao He,
Tin Seng Manfred Ho,
Rolf Lortz,
Iam Keong Sou
Abstract:
The Bi-Te binary system, characterized by the homologous series of the (Bi2)m(Bi2Te3)n, has always attracted research interest for its layered structures and potential in advanced materials applications. Despite Bi2Te3 has been extensively studied, exploration of other compounds has been constrained by synthesis challenges. This study reports the molecular beam epitaxy (MBE) growth of FeTe on Bi2T…
▽ More
The Bi-Te binary system, characterized by the homologous series of the (Bi2)m(Bi2Te3)n, has always attracted research interest for its layered structures and potential in advanced materials applications. Despite Bi2Te3 has been extensively studied, exploration of other compounds has been constrained by synthesis challenges. This study reports the molecular beam epitaxy (MBE) growth of FeTe on Bi2Te3, demonstrating that varying growth conditions can turn the Bi2Te3 layer into different Bi-Te phases and form corresponding FeTe/Bi-Te heterostructures. Our combined analysis using reflection high-energy electron diffraction (RHEED), high-resolution X-ray diffraction (HRXRD), and high-resolution scanning transmission electron microscopy (HR-STEM), indicates that specific growth conditions used for the growth of the FeTe layer can facilitate the extraction of Te from Bi2Te3, leading to the formation of Bi4Te3 and Bi6Te3. Additionally, by lowering the FeTe growth temperature to 230 oC, Te extraction from the Bi2Te3 layer could be avoided, preserving the Bi2Te3 structure. Notably, all the three FeTe/Bi-Te structures exhibit superconductivity with the FeTe/Bi2Te3 heterostructure enjoying the highest superconductivity quality. These findings introduce a novel method for realizing Bi4Te3 and Bi6Te3 through Te extraction by growing FeTe on Bi2Te3, driven by the high reactivity between Fe and Te. This approach holds promise for synthesizing other members of the Bi-Te series, expanding the functional potential of these materials.
△ Less
Submitted 7 June, 2024; v1 submitted 16 December, 2023;
originally announced December 2023.
-
Cosmic star-formation history and black hole accretion history inferred from the JWST mid-infrared source counts
Authors:
Seong Jin Kim,
Tomotsugu Goto,
Chih-Teng Ling,
Cossas K. -W. Wu,
Tetsuya Hashimoto,
Ece Kilerci,
Simon C. -C. Ho,
Yuri Uno,
Po-Ya Wang,
Yu-Wei Lin
Abstract:
With the advent of the James Webb Space Telescope (JWST), extra-galactic source count studies were conducted down to sub-microJy in the mid-infrared (MIR), which is several tens of times fainter than what the previous-generation infrared (IR) telescopes achieved in the MIR. In this work, we aim to interpret the JWST source counts and constrain cosmic star-formation history (CSFH) and black hole ac…
▽ More
With the advent of the James Webb Space Telescope (JWST), extra-galactic source count studies were conducted down to sub-microJy in the mid-infrared (MIR), which is several tens of times fainter than what the previous-generation infrared (IR) telescopes achieved in the MIR. In this work, we aim to interpret the JWST source counts and constrain cosmic star-formation history (CSFH) and black hole accretion history (BHAH). We employ the backward evolution of local luminosity functions (LLFs) of galaxies to reproduce the observed source counts from sub-microJy to a few tens of mJy in the MIR bands of the JWST. The shapes of the LLFs at the MIR bands are determined using the model templates of the spectral energy distributions (SEDs) for five representative galaxy types (star-forming galaxies, starbursts, composite, AGN type 2 and 1). By simultaneously fitting our model to all the source counts in the six MIR bands, along with the previous results, we determine the best-fit evolutions of MIR LFs for each of the five galaxy types, and subsequently estimate the CSFH and BHAH. Thanks to the JWST, our estimates are based on several tens of times fainter MIR sources, the existence of which was merely an extrapolation in previous studies.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Simple Transferability Estimation for Regression Tasks
Authors:
Cuong N. Nguyen,
Phong Tran,
Lam Si Tung Ho,
Vu Dinh,
Anh T. Tran,
Tal Hassner,
Cuong V. Nguyen
Abstract:
We consider transferability estimation, the problem of estimating how well deep learning models transfer from a source to a target task. We focus on regression tasks, which received little previous attention, and propose two simple and computationally efficient approaches that estimate transferability based on the negative regularized mean squared error of a linear regression model. We prove novel…
▽ More
We consider transferability estimation, the problem of estimating how well deep learning models transfer from a source to a target task. We focus on regression tasks, which received little previous attention, and propose two simple and computationally efficient approaches that estimate transferability based on the negative regularized mean squared error of a linear regression model. We prove novel theoretical results connecting our approaches to the actual transferability of the optimal target models obtained from the transfer learning process. Despite their simplicity, our approaches significantly outperform existing state-of-the-art regression transferability estimators in both accuracy and efficiency. On two large-scale keypoint regression benchmarks, our approaches yield 12% to 36% better results on average while being at least 27% faster than previous state-of-the-art methods.
△ Less
Submitted 3 December, 2023; v1 submitted 1 December, 2023;
originally announced December 2023.
-
Leaving No Branches Behind: Predicting Baryonic Properties of Galaxies from Merger Trees
Authors:
Chen-Yu Chuang,
Christian Kragh Jespersen,
Yen-Ting Lin,
Shirley Ho,
Shy Genel
Abstract:
Galaxies play a key role in our endeavor to understand how structure formation proceeds in the Universe. For any precision study of cosmology or galaxy formation, there is a strong demand for huge sets of realistic mock galaxy catalogs, spanning cosmologically significant volumes. For such a daunting task, methods that can produce a direct mapping between dark matter halos from dark matter-only si…
▽ More
Galaxies play a key role in our endeavor to understand how structure formation proceeds in the Universe. For any precision study of cosmology or galaxy formation, there is a strong demand for huge sets of realistic mock galaxy catalogs, spanning cosmologically significant volumes. For such a daunting task, methods that can produce a direct mapping between dark matter halos from dark matter-only simulations and galaxies are strongly preferred, as producing mocks from full-fledged hydrodynamical simulations or semi-analytical models is too expensive. Here we present a Graph Neural Network-based model that is able to accurately predict key properties of galaxies such as stellar mass, $g-r$ color, star formation rate, gas mass, stellar metallicity, and gas metallicity, purely from dark matter properties extracted from halos along the full assembly history of the galaxies. Tests based on the TNG300 simulation of the IllustrisTNG project show that our model can recover the baryonic properties of galaxies to high accuracy, over a wide redshift range ($z = 0-5$), for all galaxies with stellar masses more massive than $10^9\,M_\odot$ and their progenitors, with strong improvements over the state-of-the-art methods. We further show that our method makes substantial strides toward providing an understanding of the implications of the IllustrisTNG galaxy formation model.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Surrogate Modeling for Computationally Expensive Simulations of Supernovae in High-Resolution Galaxy Simulations
Authors:
Keiya Hirashima,
Kana Moriwaki,
Michiko S. Fujii,
Yutaka Hirai,
Takayuki R. Saitoh,
Junichiro Makino,
Shirley Ho
Abstract:
Some stars are known to explode at the end of their lives, called supernovae (SNe). The substantial amount of matter and energy that SNe release provides significant feedback to star formation and gas dynamics in a galaxy. SNe release a substantial amount of matter and energy to the interstellar medium, resulting in significant feedback to star formation and gas dynamics in a galaxy. While such fe…
▽ More
Some stars are known to explode at the end of their lives, called supernovae (SNe). The substantial amount of matter and energy that SNe release provides significant feedback to star formation and gas dynamics in a galaxy. SNe release a substantial amount of matter and energy to the interstellar medium, resulting in significant feedback to star formation and gas dynamics in a galaxy. While such feedback has a crucial role in galaxy formation and evolution, in simulations of galaxy formation, it has only been implemented using simple {\it sub-grid models} instead of numerically solving the evolution of gas elements around SNe in detail due to a lack of resolution. We develop a method combining machine learning and Gibbs sampling to predict how a supernova (SN) affects the surrounding gas. The fidelity of our model in the thermal energy and momentum distribution outperforms the low-resolution SN simulations. Our method can replace the SN sub-grid models and help properly simulate un-resolved SN feedback in galaxy formation simulations. We find that employing our new approach reduces the necessary computational cost to $\sim$ 1 percent compared to directly resolving SN feedback.
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
Energy and Time Complexity for Sorting Algorithms in Java
Authors:
Kristina Carter,
Su Mei Gwen Ho,
Mathias Marquar Arhipenko Larsen,
Martin Sundman,
Maja H. Kirkeby
Abstract:
The article investigates the relationship between time complexity and energy consumption in sorting algorithms, focusing on commonly-used algorithms implemented in Java: Bubble Sort, Counting Sort, Merge Sort, and Quick Sort. The significance of understanding this relationship is driven by the increasing energy demands of Information and Communication Technology systems and the potential for softw…
▽ More
The article investigates the relationship between time complexity and energy consumption in sorting algorithms, focusing on commonly-used algorithms implemented in Java: Bubble Sort, Counting Sort, Merge Sort, and Quick Sort. The significance of understanding this relationship is driven by the increasing energy demands of Information and Communication Technology systems and the potential for software optimization to contribute to energy efficiency. If we find a strong correlation between time complexity and energy usage, it would enhance the ability of software developers to create energy-efficient applications.
This quantitative study researches the execution of four selected sorting algorithms with input varying over input sizes (25000 to 1 million) and input order types (best, worst, and random cases) on a single kernel in a Java-enabled system. The input size is adjusted according to the type's maximum execution time, resulting in 136 combinations, totalling 12960 measurements. Wall time and the CPU energy consumption is measured using Intel's RAPL. Statistical analysis are used to examine the correlations between time complexity, wall time, and energy consumption.
The study finds a strong correlation between time complexity and energy consumption for the sorting algorithms tested. More than 99% of the variance in energy consumption for Counting Sort, Merge Sort, and Quick Sort depend on their time complexities. More than 94% of the variance in energy consumption for Bubble Sort depends on its time complexity. The results affirm that time complexity can serve as a reliable predictor of energy consumption in sequential sorting algorithms. This discovery could guide software developers in choosing energy-efficient algorithms by considering time complexities.
△ Less
Submitted 8 May, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Learning to Learn for Few-shot Continual Active Learning
Authors:
Stella Ho,
Ming Liu,
Shang Gao,
Longxiang Gao
Abstract:
Continual learning strives to ensure stability in solving previously seen tasks while demonstrating plasticity in a novel domain. Recent advances in continual learning are mostly confined to a supervised learning setting, especially in NLP domain. In this work, we consider a few-shot continual active learning setting where labeled data are inadequate, and unlabeled data are abundant but with a lim…
▽ More
Continual learning strives to ensure stability in solving previously seen tasks while demonstrating plasticity in a novel domain. Recent advances in continual learning are mostly confined to a supervised learning setting, especially in NLP domain. In this work, we consider a few-shot continual active learning setting where labeled data are inadequate, and unlabeled data are abundant but with a limited annotation budget. We exploit meta-learning and propose a method, called Meta-Continual Active Learning. This method sequentially queries the most informative examples from a pool of unlabeled data for annotation to enhance task-specific performance and tackle continual learning problems through meta-objective. Specifically, we employ meta-learning and experience replay to address inter-task confusion and catastrophic forgetting. We further incorporate textual augmentations to avoid memory over-fitting caused by experience replay and sample queries, thereby ensuring generalization. We conduct extensive experiments on benchmark text classification datasets from diverse domains to validate the feasibility and effectiveness of meta-continual active learning. We also analyze the impact of different active learning strategies on various meta continual learning models. The experimental results demonstrate that introducing randomness into sample selection is the best default strategy for maintaining generalization in meta-continual learning framework.
△ Less
Submitted 30 May, 2024; v1 submitted 7 November, 2023;
originally announced November 2023.
-
Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles
Authors:
Luca Crosato,
Kai Tian,
Hubert P. H Shum,
Edmond S. L. Ho,
Yafei Wang,
Chongfeng Wei
Abstract:
Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current s…
▽ More
Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current state of IAAD research is surveyed in this work. Commencing with an examination of terminology, attention is drawn to challenges and existing models employed for modelling the behaviour of drivers and pedestrians. Next, a comprehensive review is conducted on various techniques proposed for interaction modelling, encompassing cognitive methods, machine learning approaches, and game-theoretic methods. The conclusion is reached through a discussion of potential advantages and risks associated with IAAD, along with the illumination of pivotal research inquiries necessitating future exploration.
△ Less
Submitted 30 October, 2023; v1 submitted 28 October, 2023;
originally announced October 2023.
-
SimBIG: Field-level Simulation-Based Inference of Galaxy Clustering
Authors:
Pablo Lemos,
Liam Parker,
ChangHoon Hahn,
Shirley Ho,
Michael Eickenberg,
Jiamin Hou,
Elena Massara,
Chirag Modi,
Azadeh Moradinezhad Dizgah,
Bruno Regaldo-Saint Blancard,
David Spergel
Abstract:
We present the first simulation-based inference (SBI) of cosmological parameters from field-level analysis of galaxy clustering. Standard galaxy clustering analyses rely on analyzing summary statistics, such as the power spectrum, $P_\ell$, with analytic models based on perturbation theory. Consequently, they do not fully exploit the non-linear and non-Gaussian features of the galaxy distribution.…
▽ More
We present the first simulation-based inference (SBI) of cosmological parameters from field-level analysis of galaxy clustering. Standard galaxy clustering analyses rely on analyzing summary statistics, such as the power spectrum, $P_\ell$, with analytic models based on perturbation theory. Consequently, they do not fully exploit the non-linear and non-Gaussian features of the galaxy distribution. To address these limitations, we use the {\sc SimBIG} forward modelling framework to perform SBI using normalizing flows. We apply SimBIG to a subset of the BOSS CMASS galaxy sample using a convolutional neural network with stochastic weight averaging to perform massive data compression of the galaxy field. We infer constraints on $Ω_m = 0.267^{+0.033}_{-0.029}$ and $σ_8=0.762^{+0.036}_{-0.035}$. While our constraints on $Ω_m$ are in-line with standard $P_\ell$ analyses, those on $σ_8$ are $2.65\times$ tighter. Our analysis also provides constraints on the Hubble constant $H_0=64.5 \pm 3.8 \ {\rm km / s / Mpc}$ from galaxy clustering alone. This higher constraining power comes from additional non-Gaussian cosmological information, inaccessible with $P_\ell$. We demonstrate the robustness of our analysis by showcasing our ability to infer unbiased cosmological constraints from a series of test simulations that are constructed using different forward models than the one used in our training dataset. This work not only presents competitive cosmological constraints but also introduces novel methods for leveraging additional cosmological information in upcoming galaxy surveys like DESI, PFS, and Euclid.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Galaxy Clustering Analysis with SimBIG and the Wavelet Scattering Transform
Authors:
Bruno Régaldo-Saint Blancard,
ChangHoon Hahn,
Shirley Ho,
Jiamin Hou,
Pablo Lemos,
Elena Massara,
Chirag Modi,
Azadeh Moradinezhad Dizgah,
Liam Parker,
Yuling Yao,
Michael Eickenberg
Abstract:
The non-Gaussisan spatial distribution of galaxies traces the large-scale structure of the Universe and therefore constitutes a prime observable to constrain cosmological parameters. We conduct Bayesian inference of the $Λ$CDM parameters $Ω_m$, $Ω_b$, $h$, $n_s$, and $σ_8$ from the BOSS CMASS galaxy sample by combining the wavelet scattering transform (WST) with a simulation-based inference approa…
▽ More
The non-Gaussisan spatial distribution of galaxies traces the large-scale structure of the Universe and therefore constitutes a prime observable to constrain cosmological parameters. We conduct Bayesian inference of the $Λ$CDM parameters $Ω_m$, $Ω_b$, $h$, $n_s$, and $σ_8$ from the BOSS CMASS galaxy sample by combining the wavelet scattering transform (WST) with a simulation-based inference approach enabled by the ${\rm S{\scriptsize IM}BIG}$ forward model. We design a set of reduced WST statistics that leverage symmetries of redshift-space data. Posterior distributions are estimated with a conditional normalizing flow trained on 20,000 simulated ${\rm S{\scriptsize IM}BIG}$ galaxy catalogs with survey realism. We assess the accuracy of the posterior estimates using simulation-based calibration and quantify generalization and robustness to the change of forward model using a suite of 2,000 test simulations. When probing scales down to $k_{\rm max}=0.5~h/\text{Mpc}$, we are able to derive accurate posterior estimates that are robust to the change of forward model for all parameters, except $σ_8$. We mitigate the robustness issues with $σ_8$ by removing the WST coefficients that probe scales smaller than $k \sim 0.3~h/\text{Mpc}$. Applied to the BOSS CMASS sample, our WST analysis yields seemingly improved constraints obtained from a standard PT-based power spectrum analysis with $k_{\rm max}=0.25~h/\text{Mpc}$ for all parameters except $h$. However, we still raise concerns on these results. The observational predictions significantly vary across different normalizing flow architectures, which we interpret as a form of model misspecification. This highlights a key challenge for forward modeling approaches when using summary statistics that are sensitive to detailed model-specific or observational imprints on galaxy clustering.
△ Less
Submitted 18 July, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
-
${\rm S{\scriptsize IM}BIG}$: The First Cosmological Constraints from Non-Gaussian and Non-Linear Galaxy Clustering
Authors:
ChangHoon Hahn,
Pablo Lemos,
Liam Parker,
Bruno Régaldo-Saint Blancard,
Michael Eickenberg,
Shirley Ho,
Jiamin Hou,
Elena Massara,
Chirag Modi,
Azadeh Moradinezhad Dizgah,
David Spergel
Abstract:
The 3D distribution of galaxies encodes detailed cosmological information on the expansion and growth history of the Universe. We present the first cosmological constraints that exploit non-Gaussian cosmological information on non-linear scales from galaxy clustering, inaccessible with current standard analyses. We analyze a subset of the BOSS galaxy survey using ${\rm S{\scriptsize IM}BIG}$, a ne…
▽ More
The 3D distribution of galaxies encodes detailed cosmological information on the expansion and growth history of the Universe. We present the first cosmological constraints that exploit non-Gaussian cosmological information on non-linear scales from galaxy clustering, inaccessible with current standard analyses. We analyze a subset of the BOSS galaxy survey using ${\rm S{\scriptsize IM}BIG}$, a new framework for cosmological inference that leverages high-fidelity simulations and deep generative models. We use two clustering statistics beyond the standard power spectrum: the bispectrum and a convolutional neural network based summary of the galaxy field. We infer constraints on $Λ$CDM parameters, $Ω_b$, $h$, $n_s$, $Ω_m$, and $σ_8$, that are 1.6, 1.5, 1.7, 1.2, and 2.3$\times$ tighter than power spectrum analyses. With this increased precision, we derive constraints on the Hubble constant, $H_0$, and $S_8 = σ_8 \sqrt{Ω_m/0.3}$ that are competitive with other cosmological probes, even with a sample that only spans 10% of the full BOSS volume. Our $H_0$ constraints, imposing the Big Bang Nucleosynthesis prior on the baryon density, are consistent with the early time constraints from the cosmic microwave background (CMB). Meanwhile, our $S_8$ constraints are consistent with weak lensing experiments and similarly lie below CMB constraints. Lastly, we present forecasts to show that future work extending ${\rm S{\scriptsize IM}BIG}$ to upcoming spectroscopic galaxy surveys (DESI, PFS, Euclid) will produce leading $H_0$ and $S_8$ constraints that bridge the gap between early and late time measurements and shed light on current cosmic tensions.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
${\rm S{\scriptsize IM}BIG}$: The First Cosmological Constraints from the Non-Linear Galaxy Bispectrum
Authors:
ChangHoon Hahn,
Michael Eickenberg,
Shirley Ho,
Jiamin Hou,
Pablo Lemos,
Elena Massara,
Chirag Modi,
Azadeh Moradinezhad Dizgah,
Liam Parker,
Bruno Régaldo-Saint Blancard
Abstract:
We present the first cosmological constraints from analyzing higher-order galaxy clustering on non-linear scales. We use ${\rm S{\scriptsize IM}BIG}$, a forward modeling framework for galaxy clustering analyses that employs simulation-based inference to perform highly efficient cosmological inference using normalizing flows. It leverages the predictive power of high-fidelity simulations and robust…
▽ More
We present the first cosmological constraints from analyzing higher-order galaxy clustering on non-linear scales. We use ${\rm S{\scriptsize IM}BIG}$, a forward modeling framework for galaxy clustering analyses that employs simulation-based inference to perform highly efficient cosmological inference using normalizing flows. It leverages the predictive power of high-fidelity simulations and robustly extracts cosmological information from regimes inaccessible with current standard analyses. In this work, we apply ${\rm S{\scriptsize IM}BIG}$ to a subset of the BOSS galaxy sample and analyze the redshift-space bispectrum monopole, $B_0(k_1, k_2, k_3)$, to $k_{\rm max}=0.5\,h/{\rm Mpc}$. We achieve 1$σ$ constraints of $Ω_m=0.293^{+0.027}_{-0.027}$ and $σ_8= 0.783^{+0.040}_{-0.038}$, which are more than 1.2 and 2.4$\times$ tighter than constraints from standard power spectrum analyses of the same dataset. We also derive 1.4, 1.4, 1.7$\times$ tighter constraints on $Ω_b$, $h$, $n_s$. This improvement comes from additional cosmological information in higher-order clustering on non-linear scales and, for $σ_8$, is equivalent to the gain expected from a standard analysis on a $\sim$4$\times$ larger galaxy sample. Even with our BOSS subsample, which only spans 10% of the full BOSS volume, we derive competitive constraints on the growth of structure: $S_8 = 0.774^{+0.056}_{-0.053}$. Our constraint is consistent with results from both cosmic microwave background and weak lensing. Combined with a $ω_b$ prior from Big Bang Nucleosynthesis, we also derive a constraint on $H_0=67.6^{+2.2}_{-1.8}\,{\rm km\,s^{-1}\,Mpc^{-1}}$ that is consistent with early universe constraints.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
A Generalization Bound of Deep Neural Networks for Dependent Data
Authors:
Quan Huu Do,
Binh T. Nguyen,
Lam Si Tung Ho
Abstract:
Existing generalization bounds for deep neural networks require data to be independent and identically distributed (iid). This assumption may not hold in real-life applications such as evolutionary biology, infectious disease epidemiology, and stock price prediction. This work establishes a generalization bound of feed-forward neural networks for non-stationary $φ$-mixing data.
Existing generalization bounds for deep neural networks require data to be independent and identically distributed (iid). This assumption may not hold in real-life applications such as evolutionary biology, infectious disease epidemiology, and stock price prediction. This work establishes a generalization bound of feed-forward neural networks for non-stationary $φ$-mixing data.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Light Thermal Self-Interacting Dark Matter in the Shadow of Non-Standard Cosmology
Authors:
Shu-Yu Ho,
Pyungwon Ko,
Dibyendu Nanda
Abstract:
In this paper, we construct a viable model for a GeV scale self-interacting dark matter (DM), where the DM was thermally produced in the early universe. Here, a new vector-like fermion with a dark charge under the $U(1)_{D}$ gauge symmetry serves as a secluded WIMP DM and it can dominantly annihilate into the light dark gauge boson and singlet scalar through the dark gauge interaction. Also, the s…
▽ More
In this paper, we construct a viable model for a GeV scale self-interacting dark matter (DM), where the DM was thermally produced in the early universe. Here, a new vector-like fermion with a dark charge under the $U(1)_{D}$ gauge symmetry serves as a secluded WIMP DM and it can dominantly annihilate into the light dark gauge boson and singlet scalar through the dark gauge interaction. Also, the self-interaction of DM is induced by the light dark gauge boson via the same gauge interaction. In addition to these particles, we further introduce two Weyl fermions and a doublet scalar, by which the dark gauge boson produced from s-wave DM annihilations can mostly decay into active neutrinos after the dark symmetry breaking such that the CMB bound on the DM with low masses can be eluded. In order to have a common parameter region to explain the observed relic abundance and self-interaction of DM, we also study this model in a non-standard cosmological evolution, where the cosmic expansion driven by a new field species is faster than the standard radiation-dominated universe during the frozen time of DM. Reversely, one can also use the self-interacting nature of light thermal DM to examine the non-standard cosmological history of the universe.
△ Less
Submitted 25 March, 2024; v1 submitted 9 October, 2023;
originally announced October 2023.
-
Two product formulas for counting successive vertex orderings
Authors:
Boon Suan Ho
Abstract:
A vertex ordering of a graph $G$ is a bijection $π\colon\{1,\dots,|V(G)|\}\to V(G)$. It is successive if the induced subgraph $G[v_{π(1)},\dots,v_{π(k)}]$ is connected for each $k$. Lixing Fang, Hao Huang, János Pach, Gábor Tardos, and Junchi Zuo [J. Comb. Theory A199 (2023), 105776] gave formulas for counting the number of successive vertex orderings for a class of graphs they called "fully regul…
▽ More
A vertex ordering of a graph $G$ is a bijection $π\colon\{1,\dots,|V(G)|\}\to V(G)$. It is successive if the induced subgraph $G[v_{π(1)},\dots,v_{π(k)}]$ is connected for each $k$. Lixing Fang, Hao Huang, János Pach, Gábor Tardos, and Junchi Zuo [J. Comb. Theory A199 (2023), 105776] gave formulas for counting the number of successive vertex orderings for a class of graphs they called "fully regular," and conjectured that these formulas could be written as certain products involving differences or ratios of binomial coefficients in two cases: When the graph is the line graph $L(K_n^{(3)})$ of the complete $3$-uniform hypergraph, or when it is the line graph $L(K_{m,n}^{(1,2)})$ of a complete "bipartite" $3$-uniform hypergraph. In this paper, we confirm both of these conjectures.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
AstroCLIP: A Cross-Modal Foundation Model for Galaxies
Authors:
Liam Parker,
Francois Lanusse,
Siavash Golkar,
Leopoldo Sarra,
Miles Cranmer,
Alberto Bietti,
Michael Eickenberg,
Geraud Krawezik,
Michael McCabe,
Ruben Ohana,
Mariel Pettee,
Bruno Regaldo-Saint Blancard,
Tiberiu Tesileanu,
Kyunghyun Cho,
Shirley Ho
Abstract:
We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation fro…
▽ More
We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation from both images and spectra, and (4) morphology classification. Our approach to implementing AstroCLIP consists of two parts. First, we embed galaxy images and spectra separately by pretraining separate transformer-based image and spectrum encoders in self-supervised settings. We then align the encoders using a contrastive loss. We apply our method to spectra from the Dark Energy Spectroscopic Instrument and images from its corresponding Legacy Imaging Survey. Overall, we find remarkable performance on all downstream tasks, even relative to supervised baselines. For example, for a task like photometric redshift prediction, we find similar performance to a specifically-trained ResNet18, and for additional tasks like physical property estimation (stellar mass, age, metallicity, and sSFR), we beat this supervised baseline by 19\% in terms of $R^2$. We also compare our results to a state-of-the-art self-supervised single-modal model for galaxy images, and find that our approach outperforms this benchmark by roughly a factor of two on photometric redshift estimation and physical property prediction in terms of $R^2$, while remaining roughly in-line in terms of morphology classification. Ultimately, our approach represents the first cross-modal self-supervised model for galaxies, and the first self-supervised transformer-based architectures for galaxy images and spectra.
△ Less
Submitted 14 June, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
Multiple Physics Pretraining for Physical Surrogate Models
Authors:
Michael McCabe,
Bruno Régaldo-Saint Blancard,
Liam Holden Parker,
Ruben Ohana,
Miles Cranmer,
Alberto Bietti,
Michael Eickenberg,
Siavash Golkar,
Geraud Krawezik,
Francois Lanusse,
Mariel Pettee,
Tiberiu Tesileanu,
Kyunghyun Cho,
Shirley Ho
Abstract:
We introduce multiple physics pretraining (MPP), an autoregressive task-agnostic pretraining approach for physical surrogate modeling. MPP involves training large surrogate models to predict the dynamics of multiple heterogeneous physical systems simultaneously by learning features that are broadly useful across diverse physical tasks. In order to learn effectively in this setting, we introduce a…
▽ More
We introduce multiple physics pretraining (MPP), an autoregressive task-agnostic pretraining approach for physical surrogate modeling. MPP involves training large surrogate models to predict the dynamics of multiple heterogeneous physical systems simultaneously by learning features that are broadly useful across diverse physical tasks. In order to learn effectively in this setting, we introduce a shared embedding and normalization strategy that projects the fields of multiple systems into a single shared embedding space. We validate the efficacy of our approach on both pretraining and downstream tasks over a broad fluid mechanics-oriented benchmark. We show that a single MPP-pretrained transformer is able to match or outperform task-specific baselines on all pretraining sub-tasks without the need for finetuning. For downstream tasks, we demonstrate that finetuning MPP-trained models results in more accurate predictions across multiple time-steps on new physics compared to training from scratch or finetuning pretrained video foundation models. We open-source our code and model weights trained at multiple scales for reproducibility and community experimentation.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
xVal: A Continuous Number Encoding for Large Language Models
Authors:
Siavash Golkar,
Mariel Pettee,
Michael Eickenberg,
Alberto Bietti,
Miles Cranmer,
Geraud Krawezik,
Francois Lanusse,
Michael McCabe,
Ruben Ohana,
Liam Parker,
Bruno Régaldo-Saint Blancard,
Tiberiu Tesileanu,
Kyunghyun Cho,
Shirley Ho
Abstract:
Large Language Models have not yet been broadly adapted for the analysis of scientific datasets due in part to the unique difficulties of tokenizing numbers. We propose xVal, a numerical encoding scheme that represents any real number using just a single token. xVal represents a given real number by scaling a dedicated embedding vector by the number value. Combined with a modified number-inference…
▽ More
Large Language Models have not yet been broadly adapted for the analysis of scientific datasets due in part to the unique difficulties of tokenizing numbers. We propose xVal, a numerical encoding scheme that represents any real number using just a single token. xVal represents a given real number by scaling a dedicated embedding vector by the number value. Combined with a modified number-inference approach, this strategy renders the model end-to-end continuous when considered as a map from the numbers of the input string to those of the output string. This leads to an inductive bias that is generally more suitable for applications in scientific domains. We empirically evaluate our proposal on a number of synthetic and real-world datasets. Compared with existing number encoding schemes, we find that xVal is more token-efficient and demonstrates improved generalization.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
Reusability report: Prostate cancer stratification with diverse biologically-informed neural architectures
Authors:
Christian Pedersen,
Tiberiu Tesileanu,
Tinghui Wu,
Siavash Golkar,
Miles Cranmer,
Zijun Zhang,
Shirley Ho
Abstract:
In Elmarakeby et al., "Biologically informed deep neural network for prostate cancer discovery", a feedforward neural network with biologically informed, sparse connections (P-NET) was presented to model the state of prostate cancer. We verified the reproducibility of the study conducted by Elmarakeby et al., using both their original codebase, and our own re-implementation using more up-to-date l…
▽ More
In Elmarakeby et al., "Biologically informed deep neural network for prostate cancer discovery", a feedforward neural network with biologically informed, sparse connections (P-NET) was presented to model the state of prostate cancer. We verified the reproducibility of the study conducted by Elmarakeby et al., using both their original codebase, and our own re-implementation using more up-to-date libraries. We quantified the contribution of network sparsification by Reactome biological pathways, and confirmed its importance to P-NET's superior performance. Furthermore, we explored alternative neural architectures and approaches to incorporating biological information into the networks. We experimented with three types of graph neural networks on the same training data, and investigated the clinical prediction agreement between different models. Our analyses demonstrated that deep neural networks with distinct architectures make incorrect predictions for individual patient that are persistent across different initializations of a specific neural architecture. This suggests that different neural architectures are sensitive to different aspects of the data, an important yet under-explored challenge for clinical prediction tasks.
△ Less
Submitted 30 October, 2023; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Towards A Unified Utilitarian Ethics Framework for Healthcare Artificial Intelligence
Authors:
Forhan Bin Emdad,
Shuyuan Mary Ho,
Benhur Ravuri,
Shezin Hussain
Abstract:
Artificial Intelligence (AI) aims to elevate healthcare to a pinnacle by aiding clinical decision support. Overcoming the challenges related to the design of ethical AI will enable clinicians, physicians, healthcare professionals, and other stakeholders to use and trust AI in healthcare settings. This study attempts to identify the major ethical principles influencing the utility performance of AI…
▽ More
Artificial Intelligence (AI) aims to elevate healthcare to a pinnacle by aiding clinical decision support. Overcoming the challenges related to the design of ethical AI will enable clinicians, physicians, healthcare professionals, and other stakeholders to use and trust AI in healthcare settings. This study attempts to identify the major ethical principles influencing the utility performance of AI at different technological levels such as data access, algorithms, and systems through a thematic analysis. We observed that justice, privacy, bias, lack of regulations, risks, and interpretability are the most important principles to consider for ethical AI. This data-driven study has analyzed secondary survey data from the Pew Research Center (2020) of 36 AI experts to categorize the top ethical principles of AI design. To resolve the ethical issues identified by the meta-analysis and domain experts, we propose a new utilitarian ethics-based theoretical framework for designing ethical AI for the healthcare domain.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
DefGoalNet: Contextual Goal Learning from Demonstrations For Deformable Object Manipulation
Authors:
Bao Thach,
Tanner Watts,
Shing-Hei Ho,
Tucker Hermans,
Alan Kuntz
Abstract:
Shape servoing, a robotic task dedicated to controlling objects to desired goal shapes, is a promising approach to deformable object manipulation. An issue arises, however, with the reliance on the specification of a goal shape. This goal has been obtained either by a laborious domain knowledge engineering process or by manually manipulating the object into the desired shape and capturing the goal…
▽ More
Shape servoing, a robotic task dedicated to controlling objects to desired goal shapes, is a promising approach to deformable object manipulation. An issue arises, however, with the reliance on the specification of a goal shape. This goal has been obtained either by a laborious domain knowledge engineering process or by manually manipulating the object into the desired shape and capturing the goal shape at that specific moment, both of which are impractical in various robotic applications. In this paper, we solve this problem by developing a novel neural network DefGoalNet, which learns deformable object goal shapes directly from a small number of human demonstrations. We demonstrate our method's effectiveness on various robotic tasks, both in simulation and on a physical robot. Notably, in the surgical retraction task, even when trained with as few as 10 demonstrations, our method achieves a median success percentage of nearly 90%. These results mark a substantial advancement in enabling shape servoing methods to bring deformable object manipulation closer to practical, real-world applications.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Ptychographic nanoscale imaging of the magnetoelectric coupling in freestanding BiFeO$_3$
Authors:
Tim A. Butcher,
Nicholas W. Phillips,
Chun-Chien Chiu,
Chia-Chun Wei,
Sheng-Zhu Ho,
Yi-Chun Chen,
Erik Fröjdh,
Filippo Baruffaldi,
Maria Carulla,
Jiaguo Zhang,
Anna Bergamaschi,
Carlos A. F. Vaz,
Armin Kleibert,
Simone Finizio,
Jan-Chi Yang,
Shih-Wen Huang,
Jörg Raabe
Abstract:
Understanding the magnetic and ferroelectric ordering of magnetoelectric multiferroic materials at the nanoscale necessitates a versatile imaging method with high spatial resolution. Here, soft X-ray ptychography is employed to simultaneously image the ferroelectric and antiferromagnetic domains in an 80 nm thin freestanding film of the room-temperature multiferroic BiFeO$_3$ (BFO). The antiferrom…
▽ More
Understanding the magnetic and ferroelectric ordering of magnetoelectric multiferroic materials at the nanoscale necessitates a versatile imaging method with high spatial resolution. Here, soft X-ray ptychography is employed to simultaneously image the ferroelectric and antiferromagnetic domains in an 80 nm thin freestanding film of the room-temperature multiferroic BiFeO$_3$ (BFO). The antiferromagnetic spin cycloid of period 64 nm is resolved by reconstructing the corresponding resonant elastic X-ray scattering in real space and visualized together with mosaic-like ferroelectric domains in a linear dichroic contrast image at the Fe L$_3$ edge. The measurements reveal a near perfect coupling between the antiferromagnetic and ferroelectric ordering by which the propagation direction of the spin cycloid is locked orthogonally to the ferroelectric polarization. In addition, the study evinces both a preference for in-plane propagation of the spin cycloid and changes of the ferroelectric polarization by 71° between multiferroic domains in the epitaxial strain-free, freestanding BFO film. The results provide a direct visualization of the strong magnetoelectric coupling in BFO and of its fine multiferroic domain structure, emphasizing the potential of ptychographic imaging for the study of multiferroics and non-collinear magnetic materials with soft X-rays.
△ Less
Submitted 29 June, 2024; v1 submitted 25 August, 2023;
originally announced August 2023.