Search | arXiv e-print repository

arXiv:2406.19287 [pdf, other]

Isotropy of cosmic rays beyond $10^{20}$ eV favors their heavy mass composition

Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He , et al. (118 additional authors not shown)

Abstract: We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields the resul… ▽ More We report an estimation of the injected mass composition of ultra-high energy cosmic rays (UHECRs) at energies higher than 10 EeV. The composition is inferred from an energy-dependent sky distribution of UHECR events observed by the Telescope Array surface detector by comparing it to the Large Scale Structure of the local Universe. In the case of negligible extra-galactic magnetic fields the results are consistent with a relatively heavy injected composition at E ~ 10 EeV that becomes lighter up to E ~ 100 EeV, while the composition at E > 100 EeV is very heavy. The latter is true even in the presence of highest experimentally allowed extra-galactic magnetic fields, while the composition at lower energies can be light if a strong EGMF is present. The effect of the uncertainty in the galactic magnetic field on these results is subdominant. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 8 pages, 3 figures, accepted for publication in PRL

arXiv:2406.19286 [pdf, other]

Mass composition of ultra-high energy cosmic rays from distribution of their arrival directions with the Telescope Array

Authors: Telescope Array Collaboration, R. U. Abbasi, Y. Abe, T. Abu-Zayyad, M. Allen, Y. Arai, R. Arimura, E. Barcikowski, J. W. Belz, D. R. Bergman, S. A. Blake, I. Buckland, B. G. Cheon, M. Chikawa, T. Fujii, K. Fujisue, K. Fujita, R. Fujiwara, M. Fukushima, G. Furlich, N. Globus, R. Gonzalez, W. Hanlon, N. Hayashida, H. He , et al. (118 additional authors not shown)

Abstract: We use a new method to estimate the injected mass composition of ultrahigh cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale struc… ▽ More We use a new method to estimate the injected mass composition of ultrahigh cosmic rays (UHECRs) at energies higher than 10 EeV. The method is based on comparison of the energy-dependent distribution of cosmic ray arrival directions as measured by the Telescope Array experiment (TA) with that calculated in a given putative model of UHECR under the assumption that sources trace the large-scale structure (LSS) of the Universe. As we report in the companion letter, the TA data show large deflections with respect to the LSS which can be explained, assuming small extra-galactic magnetic fields (EGMF), by an intermediate composition changing to a heavy one (iron) in the highest energy bin. Here we show that these results are robust to uncertainties in UHECR injection spectra, the energy scale of the experiment and galactic magnetic fields (GMF). The assumption of weak EGMF, however, strongly affects this interpretation at all but the highest energies E > 100 EeV, where the remarkable isotropy of the data implies a heavy injected composition even in the case of strong EGMF. This result also holds if UHECR sources are as rare as $2 \times 10^{-5}$ Mpc$^{-3}$, that is the conservative lower limit for the source number density. △ Less

Submitted 27 June, 2024; originally announced June 2024.

Comments: 18 pages, 11 figures, accepted for publication in PRD

arXiv:2406.18820 [pdf, other]

Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training

Authors: Xinyu Lian, Sam Ade Jacobs, Lev Kurilenko, Masahiro Tanaka, Stas Bekman, Olatunji Ruwase, Minjia Zhang

Abstract: Existing checkpointing approaches seem ill-suited for distributed training even though hardware limitations make model parallelism, i.e., sharding model state across multiple accelerators, a requirement for model scaling. Consolidating distributed model state into a single checkpoint unacceptably slows down training, and is impractical at extreme scales. Distributed checkpoints, in contrast, are t… ▽ More Existing checkpointing approaches seem ill-suited for distributed training even though hardware limitations make model parallelism, i.e., sharding model state across multiple accelerators, a requirement for model scaling. Consolidating distributed model state into a single checkpoint unacceptably slows down training, and is impractical at extreme scales. Distributed checkpoints, in contrast, are tightly coupled to the model parallelism and hardware configurations of the training run, and thus unusable on different configurations. To address this problem, we propose Universal Checkpointing, a technique that enables efficient checkpoint creation while providing the flexibility of resuming on arbitrary parallelism strategy and hardware configurations. Universal Checkpointing unlocks unprecedented capabilities for large-scale training such as improved resilience to hardware failures through continued training on remaining healthy hardware, and reduced training time through opportunistic exploitation of elastic capacity. The key insight of Universal Checkpointing is the selection of the optimal representation in each phase of the checkpointing life cycle: distributed representation for saving, and consolidated representation for loading. This is achieved using two key mechanisms. First, the universal checkpoint format, which consists of a consolidated representation of each model parameter and metadata for mapping parameter fragments into training ranks of arbitrary model-parallelism configuration. Second, the universal checkpoint language, a simple but powerful specification language for converting distributed checkpoints into the universal checkpoint format. Our evaluation demonstrates the effectiveness and generality of Universal Checkpointing on state-of-the-art model architectures and a wide range of parallelism techniques. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2406.14366 [pdf, other]

GRB 211211A: The Case for Engine Powered over r-Process Powered Blue Kilonova

Authors: Hamid Hamidani, Masaomi Tanaka, Shigeo S. Kimura, Gavin P. Lamb, Kyohei Kawaguchi

Abstract: The recent Gamma-Ray Burst (GRB) GRB~211211A provides the earliest ($\sim 5$ h) data of a kilonova (KN) event, displaying bright ($\sim10^{42}$ erg s$^{-1}$) and blue early emission. Previously, this KN has been explained using simplistic multi-component fitting methods. Here, in order to understand the physical origin of the KN emission in GRB~211211A, we employ an analytic multi-zone model for r… ▽ More The recent Gamma-Ray Burst (GRB) GRB~211211A provides the earliest ($\sim 5$ h) data of a kilonova (KN) event, displaying bright ($\sim10^{42}$ erg s$^{-1}$) and blue early emission. Previously, this KN has been explained using simplistic multi-component fitting methods. Here, in order to understand the physical origin of the KN emission in GRB~211211A, we employ an analytic multi-zone model for r-process powered KN. We find that r-process powered KN models alone cannot explain the fast temporal evolution and the spectral energy distribution (SED) of the observed emission. Specifically, i) r-process models require high ejecta mass to match early luminosity, which overpredicts late-time emission, while ii) red KN models that reproduce late emission underpredict early luminosity. We propose an alternative scenario involving early contributions from the GRB central engine via a late low-power jet, consistent with plateau emission in short GRBs and GeV emission detected by Fermi-LAT at $\sim10^4$ s after GRB 211211A. Such late central engine activity, with an energy budget of $\sim \text{a few }\%$ of that of the prompt jet, combined with a single red-KN ejecta component, can naturally explain the light curve and SED of the observed emission; with the late-jet -- ejecta interaction reproducing the early blue emission and r-process heating reproducing the late red emission. This supports claims that late low-power engine activity after prompt emission may be common. We encourage very early follow-up observations of future nearby GRBs, and compact binary merger events, to reveal more about the central engine of GRBs and r-process events. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: 18 pages, 6 figures, and 2 tables. To be submitted to ApJ. Comments are welcome

arXiv:2406.14329 [pdf, other]

Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization

Authors: Tanapat Ratchatorn, Masayuki Tanaka

Abstract: Recent advancements in learning algorithms have demonstrated that the sharpness of the loss surface is an effective measure for improving the generalization gap. Building upon this concept, Sharpness-Aware Minimization (SAM) was proposed to enhance model generalization and achieved state-of-the-art performance. SAM consists of two main steps, the weight perturbation step and the weight updating st… ▽ More Recent advancements in learning algorithms have demonstrated that the sharpness of the loss surface is an effective measure for improving the generalization gap. Building upon this concept, Sharpness-Aware Minimization (SAM) was proposed to enhance model generalization and achieved state-of-the-art performance. SAM consists of two main steps, the weight perturbation step and the weight updating step. However, the perturbation in SAM is determined by only the gradient of the training loss, or cross-entropy loss. As the model approaches a stationary point, this gradient becomes small and oscillates, leading to inconsistent perturbation directions and also has a chance of diminishing the gradient. Our research introduces an innovative approach to further enhancing model generalization. We propose the Adaptive Adversarial Cross-Entropy (AACE) loss function to replace standard cross-entropy loss for SAM's perturbation. AACE loss and its gradient uniquely increase as the model nears convergence, ensuring consistent perturbation direction and addressing the gradient diminishing issue. Additionally, a novel perturbation-generating function utilizing AACE loss without normalization is proposed, enhancing the model's exploratory capabilities in near-optimum stages. Empirical testing confirms the effectiveness of AACE, with experiments demonstrating improved performance in image classification tasks using Wide ResNet and PyramidNet across various datasets. The reproduction code is available online △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: Accepted in ICIP2024. The project page can be accessed at http://www.vip.sc.e.titech.ac.jp/proj/AACE

arXiv:2406.13740 [pdf, other]

Kinetic Inductance, Quantum Geometry, and Superconductivity in Magic-Angle Twisted Bilayer Graphene

Authors: Miuko Tanaka, Joel Î-j. Wang, Thao H. Dinh, Daniel Rodan-Legrain, Sameia Zaman, Max Hays, Bharath Kannan, Aziza Almanakly, David K. Kim, Bethany M. Niedzielski, Kyle Serniak, Mollie E. Schwartz, Kenji Watanabe, Takashi Taniguchi, Jeffrey A. Grover, Terry P. Orlando, Simon Gustavsson, Pablo Jarillo-Herrero, William D. Oliver

Abstract: The physics of superconductivity in magic-angle twisted bilayer graphene (MATBG) is a topic of keen interest in moiré systems research, and it may provide insight into the pairing mechanism of other strongly correlated materials such as high-$T_{\mathrm{c}}$ superconductors. Here, we use DC-transport and microwave circuit quantum electrodynamics (cQED) to measure directly the superfluid stiffness… ▽ More The physics of superconductivity in magic-angle twisted bilayer graphene (MATBG) is a topic of keen interest in moiré systems research, and it may provide insight into the pairing mechanism of other strongly correlated materials such as high-$T_{\mathrm{c}}$ superconductors. Here, we use DC-transport and microwave circuit quantum electrodynamics (cQED) to measure directly the superfluid stiffness of superconducting MATBG via its kinetic inductance. We find the superfluid stiffness to be much larger than expected from conventional single-band Fermi liquid theory; rather, it aligns well with theory involving quantum geometric effects that are dominant at the magic angle. The temperature dependence of the superfluid stiffness exhibits a power-law behavior, which contraindicates an isotropic BCS model; instead, the extracted power-law exponents indicate an anisotropic superconducting gap, whether interpreted using the conventional anisotropic BCS model or a quantum geometric theory of flat-band superconductivity. Moreover, the quadratic dependence of the stiffness on both DC and microwave current is consistent with Ginzburg-Landau theory. Taken together, these findings strongly suggest a connection between quantum geometry, superfluid stiffness, and unconventional superconductivity in MATBG. Finally, the combined DC-microwave measurement platform used here is applicable to the investigation of other atomically thin superconductors. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.11923 [pdf, other]

The stellar halo of the Milky Way traced by blue horizontal-branch stars in the Subaru Hyper Suprime-Cam Survey

Authors: Tetsuya Fukushima, Masashi Chiba, Mikito Tanaka, Kohei Hayashi, Daisuke Homma, Sakurako Okamoto, Yutaka Komiyama, Masayuki Tanaka, Nobuo Arimoto, Tadafumi Matsuno

Abstract: We select blue-horizontal branch stars (BHBs) from the internal data release of the Hyper Suprime-Cam Subaru Strategic Program to reveal the global structure of the Milky Way (MW) stellar halo. The data are distributed over $\sim 1,100$~deg$^2$ area in the range of $18.5<g<24.5$~mag, so that candidate BHBs are detectable over a Galactocentric radius of $r \simeq 36-575$~kpc. In order to select mos… ▽ More We select blue-horizontal branch stars (BHBs) from the internal data release of the Hyper Suprime-Cam Subaru Strategic Program to reveal the global structure of the Milky Way (MW) stellar halo. The data are distributed over $\sim 1,100$~deg$^2$ area in the range of $18.5<g<24.5$~mag, so that candidate BHBs are detectable over a Galactocentric radius of $r \simeq 36-575$~kpc. In order to select most likely BHBs by removing blue straggler stars and other contaminants in a statistically significant manner, we develop and apply an extensive Bayesian method, as described in \citet{Fukushima2019}. Our sample can be fitted to either a single power-law profile with an index of $α=4.11^{+0.18}_{-0.18}$ or a broken power-law profile with an index of $α_{\rm in}=3.90^{+0.24}_{-0.30}$ at $r$ below a broken radius of $r_{\rm b}=184^{+118}_{-66}$ kpc and a very steep slope of $α_{\rm out}=9.1^{+6.8}_{-3.6}$ at $r>r_{\rm b}$; the statistical difference between these fitting profiles is small. Both profiles are found to show prolate shapes having axial ratios of $q=1.47^{+0.30}_{-0.33}$ and $1.56^{+0.34}_{-0.23}$, respectively. We also find a signature of the so-called "splashback radius" for the candidate BHBs, which can reach as large as $r \sim 575$~kpc, although it is still inconclusive owing to rather large distance errors in this faintest end of the sample. Our results suggest that the MW stellar halo consists of the two overlapping components: the {\it in situ} inner halo showing a relatively steep radial density profile and the {\it ex situ} outer halo with a shallower profile, being characteristic of a component formed from accretion of small stellar systems. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 15 pages, 8 figures, 5 tables, submitted to PASJ. arXiv admin note: substantial text overlap with arXiv:1904.04966

arXiv:2406.08612 [pdf, other]

Observation of Declination Dependence in the Cosmic Ray Energy Spectrum

Authors: The Telescope Array Collaboration, R. U. Abbasi, T. Abu-Zayyad, M. Allen, J. W. Belz, D. R. Bergman, I. Buckland, W. Campbell, B. G. Cheon, K. Endo, A. Fedynitch, T. Fujii, K. Fujisue, K. Fujita, M. Fukushima, G. Furlich, Z. Gerber, N. Globus, W. Hanlon, N. Hayashida, H. He, K. Hibino, R. Higuchi, D. Ikeda, T. Ishii , et al. (101 additional authors not shown)

Abstract: We report on an observation of the difference between northern and southern skies of the ultrahigh energy cosmic ray energy spectrum with a significance of ${\sim}8σ$. We use measurements from the two largest experiments$\unicode{x2014}$the Telescope Array observing the northern hemisphere and the Pierre Auger Observatory viewing the southern hemisphere. Since the comparison of two measurements fr… ▽ More We report on an observation of the difference between northern and southern skies of the ultrahigh energy cosmic ray energy spectrum with a significance of ${\sim}8σ$. We use measurements from the two largest experiments$\unicode{x2014}$the Telescope Array observing the northern hemisphere and the Pierre Auger Observatory viewing the southern hemisphere. Since the comparison of two measurements from different observatories introduces the issue of possible systematic differences between detectors and analyses, we validate the methodology of the comparison by examining the region of the sky where the apertures of the two observatories overlap. Although the spectra differ in this region, we find that there is only a $1.8σ$ difference between the spectrum measurements when anisotropic regions are removed and a fiducial cut in the aperture is applied. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 8 pages, 6 figures

arXiv:2406.03957 [pdf, other]

Exploring loop-induced first-order electroweak phase transition in the Higgs effective field theory

Authors: Ricardo R. Florentino, Shinya Kanemura, Masanori Tanaka

Abstract: The nearly aligned Higgs Effective Field Theory (naHEFT) is based on the general assumption: all deviations in the Higgs boson couplings are originated from quantum one-loop effects of new particles that are integrated out. If the new particles integrated out have the same non-decoupling property, physics of the electroweak symmetry breaking can be then described by several parameters in the naHEF… ▽ More The nearly aligned Higgs Effective Field Theory (naHEFT) is based on the general assumption: all deviations in the Higgs boson couplings are originated from quantum one-loop effects of new particles that are integrated out. If the new particles integrated out have the same non-decoupling property, physics of the electroweak symmetry breaking can be then described by several parameters in the naHEFT, so that there is a correlation among the Higgs boson couplings such as $h γγ$, $hWW$ and $hhh$ couplings. In this paper, we analyze the strongly first-order electroweak phase transition (EWPT) with the condition of sphaleron decoupling and the completion condition of the phase transition, and investigate the relation among the deviations in the Higgs boson couplings and the dynamics of the EWPTs. We also take into account the gravitational wave spectrum as well as the primordial black hole predicted at the EWPT. We show that if the new particles integrated out include charged scalar states future precision measurements of the $h γγ$ coupling can give a useful prediction on the $hhh$ coupling to realize the strongly first-order EWPT. We can explore the nature of EWPT and the new physics behind it by the combination of precision measurements of various Higgs boson couplings at future collider experiments, gravitational wave observations at future space-based interferometers and searches for primordial black holes. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 24 pages, 5 figures

Report number: OU-HET-1223

arXiv:2406.02849 [pdf, other]

Cluster candidates with massive quiescent galaxies at $z\sim2$

Authors: Tomokazu Kiyota, Makoto Ando, Masayuki Tanaka, Alexis Finoguenov, Sadman Shariar Ali, Jean Coupon, Guillaume Desprez, Stephen Gwyn, Marcin Sawicki, Rhythm Shimakawa

Abstract: Galaxy clusters are crucial to understanding role of the environment in galaxy evolution. However, due to their rarity, only a limited number of clusters have been identified at $z\gtrsim2$. In this paper, we report a discovery of seven cluster candidates with massive quiescent galaxies at $z\sim2$ in the $3.5\,\mathrm{deg}^{2}$ area of the XMM-LSS field, roughly doubling the known cluster sample… ▽ More Galaxy clusters are crucial to understanding role of the environment in galaxy evolution. However, due to their rarity, only a limited number of clusters have been identified at $z\gtrsim2$. In this paper, we report a discovery of seven cluster candidates with massive quiescent galaxies at $z\sim2$ in the $3.5\,\mathrm{deg}^{2}$ area of the XMM-LSS field, roughly doubling the known cluster sample at this frontier redshift if confirmed. We construct a photometric redshift catalog based on deep ($i\sim26$, $K_\mathrm{s}\sim24$) multi-wavelength photometry from $u^*$-band to $K$-band gathered from the Hyper Suprime-Cam Subaru Strategic Program and other collaborative/public surveys. We adopt a Gaussian kernel density estimate with two different spatial scales (10" and 60") to draw a density map of massive ($\log(M_{*}/M_{\odot})>10.5$) and quiescent ($\log(\mathrm{sSFR\, [\mathrm{yr^{-1}}]})<-10$) galaxies at $z\sim2$. Then, We identify seven prominent overdensities. These candidates show clear red sequences in color-magnitude diagrams ($z-H$ vs. $H$). Moreover, one of them shows an extended X-ray emission with $L_\mathrm{X}=(1.46\pm0.35)\times10^{44}$ erg s$^{-1}$, suggesting its virialized nature. There is no clear evidence of enhancement nor suppression of the star formation rate of the main sequence galaxies in the clusters. We find that cluster galaxies have a higher fraction of transition population with $-10.5<\log(\mathrm{sSFR\, [\mathrm{yr^{-1}}]})<-10$ ($12\%$) than the field ($2\%$), which implies the ongoing star formation quenching. The quiescent fraction in the cluster candidates also exceeds that in the field. We confirm that the excess of a quiescent fraction is larger for higher-mass galaxies. This is the first statistical evidence for the mass-dependent environmental quenching at work in clusters even at $z\sim2$. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 21 pages, 9 figures, submitted to ApJ

arXiv:2405.20989 [pdf, other]

Unravelling the asphericities in the explosion and multi-faceted circumstellar matter of SN 2023ixf

Authors: Avinash Singh, R. S. Teja, T. J. Moriya, K. Maeda, K. S. Kawabata, M. Tanaka, R. Imazawa, T. Nakaoka, A. Gangopadhyay, M. Yamanaka, V. Swain, D. K. Sahu, G. C. Anupama, B. Kumar, R. M. Anche, Y. Sano, A. Raj, V. K. Agnihotri, V. Bhalerao, D. Bisht, M. S. Bisht, K. Belwal, S. K. Chakrabarti, M. Fujii, T. Nagayama , et al. (11 additional authors not shown)

Abstract: We present a detailed investigation of photometric, spectroscopic, and polarimetric observations of the Type II SN 2023ixf. The early detection of highly-ionized flash features, rapid ascent in ultraviolet flux coupled with the blueward shift in near-ultraviolet colors and temperature provides compelling evidence for a delayed shock breakout from a confined dense circumstellar matter (CSM) envelop… ▽ More We present a detailed investigation of photometric, spectroscopic, and polarimetric observations of the Type II SN 2023ixf. The early detection of highly-ionized flash features, rapid ascent in ultraviolet flux coupled with the blueward shift in near-ultraviolet colors and temperature provides compelling evidence for a delayed shock breakout from a confined dense circumstellar matter (CSM) enveloping the progenitor star. The temporal evolution of polarization in the SN 2023ixf phase revealed three distinct peaks in polarization evolution at 1.4 d, 6.4 d, and 79.2 d, indicating an asymmetric dense CSM, an aspherical shock front and clumpiness in the low-density extended CSM, and an aspherical inner ejecta/He-core. SN 2023ixf displayed two dominant axes, one along the CSM-outer ejecta and the other along the inner ejecta/He-core, showcasing the independent origin of asymmetry in the early and late evolution. The argument for an aspherical shock front is further strengthened by the presence of a high-velocity broad absorption feature in the blue wing of the Balmer features in addition to the P-Cygni absorption post 16 d. Hydrodynamical light curve modeling indicated a progenitor mass of 10 solar mass with a radius of 470 solar radius, explosion energy of 2e51 erg, and 0.06 solar mass of 56Ni. The modeling also indicated a two-zone CSM: a confined dense CSM extending up to 5e14 cm, with a mass-loss rate of 1e-2 solar mass per year, and an extended CSM spanning from 5e14 cm to 1e16 cm with a mass-loss rate of 1e-4 solar mass per year. The early nebular phase observations display an axisymmetric line profile of [OI] and red-ward attenuation of the emission of Halpha post 125 days, marking the onset of dust formation. △ Less

Submitted 31 May, 2024; originally announced May 2024.

Comments: 30 pages, 14 figures, 1 Table, Submitted to AAS Journals

arXiv:2405.17760 [pdf, ps, other]

Implications of neutrino species number and summed mass measurements in cosmological observations

Authors: N. Sasao, M. Yoshimura, M. Tanaka

Abstract: We confront measurable neutrino degrees of freedom $N_{\rm eff}$ and summed neutrino mass in the early universe to particle physics at the energy scale beyond the standard model (BSM), in particular including the issue of neutrino mass type distinction. The Majorana-type of massive neutrino is perfectly acceptable by Planck observations, while the Dirac-type neutrino may survive in a restricted cl… ▽ More We confront measurable neutrino degrees of freedom $N_{\rm eff}$ and summed neutrino mass in the early universe to particle physics at the energy scale beyond the standard model (BSM), in particular including the issue of neutrino mass type distinction. The Majorana-type of massive neutrino is perfectly acceptable by Planck observations, while the Dirac-type neutrino may survive in a restricted class of models that suppresses extra right-handed contribution to $ΔN_{\rm eff} = N_{\rm eff} - 3$ at a nearly indistinguishable level from the Majorana case. There is a chance that supersymmetry energy scale may be identified in supersymmetric extension of left-right symmetric model if improved $N_{\rm eff}$ measurements discover a finite value. Combined analysis of this quantity with the summed neutrino mass helps to determine the neutrino mass ordering pattern, if measurement accuracy of order, $60 \sim 80\,$meV, is achieved, as in CMB-S4. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.14146 [pdf, other]

Hyperspectral Image Dataset for Individual Penguin Identification

Authors: Youta Noboru, Yuko Ozasa, Masayuki Tanaka

Abstract: Remote individual animal identification is important for food safety, sport, and animal conservation. Numerous existing remote individual animal identification studies have focused on RGB images. In this paper, we tackle individual penguin identification using hyperspectral (HS) images. To the best of our knowledge, it is the first work to analyze spectral differences between penguin individuals u… ▽ More Remote individual animal identification is important for food safety, sport, and animal conservation. Numerous existing remote individual animal identification studies have focused on RGB images. In this paper, we tackle individual penguin identification using hyperspectral (HS) images. To the best of our knowledge, it is the first work to analyze spectral differences between penguin individuals using an HS camera. We have constructed a novel penguin HS image dataset, including 990 hyperspectral images of 27 penguins. We experimentally demonstrate that the spectral information of HS image pixels can be used for individual penguin identification. The experimental results show the effectiveness of using HS images for individual penguin identification. The dataset and source code are available here: https://033labcodes.github.io/igrass24_penguin/ △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: Accepted by 2024 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2024)

arXiv:2405.12002 [pdf, other]

Optical Variability of Blazars in the Tomo-e Gozen Northern Sky Transient Survey

Authors: TianFang Zhang, Mamoru Doi, Mitsuru Kokubo, Shigeyuki Sako, Ryou Ohsawa, Nozomu Tominaga, Masaomi Tanaka, Yasushi Fukazawa, Hidenori Takahashi, Noriaki Arima, Naoto Kobayashi, Ko Arimatsu, Shin-ichiro Okumura, Sohei Kondo, Toshihiro Kasuga, Yuki Mori, Yuu Niino

Abstract: We studied the optical variability of 241 BL Lacs and 83 flat-spectrum radio quasars (FSRQ) from the 4LAC catalog using data from the Tomo-e Gozen Northern Sky Transient Survey, with $\sim$ 50 epochs per blazar on average. We excluded blazars whose optical variability may be underestimated due to the influence of their host galaxy, based on their optical luminosity ($L_O$). FSRQs with $γ$-ray phot… ▽ More We studied the optical variability of 241 BL Lacs and 83 flat-spectrum radio quasars (FSRQ) from the 4LAC catalog using data from the Tomo-e Gozen Northern Sky Transient Survey, with $\sim$ 50 epochs per blazar on average. We excluded blazars whose optical variability may be underestimated due to the influence of their host galaxy, based on their optical luminosity ($L_O$). FSRQs with $γ$-ray photon index greater than 2.6 exhibit very low optical variability, and their distribution of standard deviation of repeated photometry is significantly different from that of the other FSRQs (KS test P value equal to $5 \times 10^{-6}$ ). Among a sample of blazars at any particular cosmological epoch, those with lower $γ$-ray luminosity ($L_γ$) tend to have lower optical variability, and those FSRQs with $γ$-ray photon index greater than 2.6 tend to have low $L_γ$. We also measured the structure function of optical variability and found that the amplitude of the structure function for FSRQs is higher than previously measured and higher than that of BL Lacs at multiple time lags. Additionally, the amplitude of the structure function of FSRQs with high $γ$-ray photon index is significantly lower than that of FSRQs with low $γ$-ray photon index. The structure function of FSRQs of high $γ$-ray photon index shows a characteristic timescale of more than 10 days, which may be the variability timescale of the accretion disk. In summary, we infer that the optical component of FSRQs with high $γ$-ray photon index may be dominated by the accretion disk. △ Less

Submitted 20 May, 2024; originally announced May 2024.

Comments: 22 pages, 13 figures

arXiv:2405.11185 [pdf, other]

Majorization-minimization Bregman proximal gradient algorithms for nonnegative matrix factorization with the Kullback--Leibler divergence

Authors: Shota Takahashi, Mirai Tanaka, Shiro Ikeda

Abstract: Nonnegative matrix factorization (NMF) is a popular method in machine learning and signal processing to decompose a given nonnegative matrix into two nonnegative matrices. In this paper, to solve NMF, we propose new algorithms, called majorization-minimization Bregman proximal gradient algorithm (MMBPG) and MMBPG with extrapolation (MMBPGe). MMBPG and MMBPGe minimize an auxiliary function majorizi… ▽ More Nonnegative matrix factorization (NMF) is a popular method in machine learning and signal processing to decompose a given nonnegative matrix into two nonnegative matrices. In this paper, to solve NMF, we propose new algorithms, called majorization-minimization Bregman proximal gradient algorithm (MMBPG) and MMBPG with extrapolation (MMBPGe). MMBPG and MMBPGe minimize an auxiliary function majorizing the Kullback--Leibler (KL) divergence loss by the existing Bregman proximal gradient algorithms. While existing KL-based NMF methods update each variable alternately, proposed algorithms update all variables simultaneously. The proposed MMBPG and MMBPGe are equipped with a separable Bregman distance that satisfies the smooth adaptable property and that makes its subproblem solvable in closed forms. We also proved that even though these algorithms are designed to minimize an auxiliary function, MMBPG and MMBPGe monotonically decrease the objective function and a potential function, respectively. Using this fact, we show that a sequence generated by MMBPG(e) globally converges to a Karush--Kuhn--Tucker (KKT) point. In numerical experiments, we compared proposed algorithms with existing algorithms on synthetic data. △ Less

Submitted 18 May, 2024; originally announced May 2024.

Comments: 22 pages, 26 figures

MSC Class: 90C26; 49M37; 15A23

arXiv:2405.10078 [pdf, other]

Spurious reconstruction from brain activity

Authors: Ken Shirakawa, Yoshihiro Nagano, Misato Tanaka, Shuntaro C. Aoki, Kei Majima, Yusuke Muraki, Yukiyasu Kamitani

Abstract: Advances in brain decoding, particularly visual image reconstruction, have sparked discussions about the societal implications and ethical considerations of neurotechnology. As these methods aim to recover visual experiences from brain activity and achieve prediction beyond training samples (zero-shot prediction), it is crucial to assess their capabilities and limitations to inform public expectat… ▽ More Advances in brain decoding, particularly visual image reconstruction, have sparked discussions about the societal implications and ethical considerations of neurotechnology. As these methods aim to recover visual experiences from brain activity and achieve prediction beyond training samples (zero-shot prediction), it is crucial to assess their capabilities and limitations to inform public expectations and regulations. Our case study of recent text-guided reconstruction methods, which leverage a large-scale dataset (NSD) and text-to-image diffusion models, reveals limitations in their generalizability. We found decreased performance when applying these methods to a different dataset designed to prevent category overlaps between training and test sets. UMAP visualization of the text features with NSD images showed limited diversity of semantic and visual clusters, with overlap between training and test sets. Formal analysis and simulations demonstrated that clustered training samples can lead to "output dimension collapse," restricting predictable output feature dimensions. Diversifying the training set improved generalizability. However, text features alone are insufficient for mapping to the visual space. We argue that recent photo-like reconstructions may primarily be a blend of classification into trained categories and generation of inauthentic images through text-to-image diffusion (hallucination). Diverse datasets and compositional representations spanning the image space are essential for genuine zero-shot prediction. Interdisciplinary discussions grounded in understanding the current capabilities and limitations, as well as ethical considerations, of the technology are crucial for its responsible development. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.06257 [pdf, ps, other]

Measurement of cesium $8P_{J}\rightarrow 6P_{J'}$ electric quadrupole transition probabilities using fluorescence spectroscopy

Authors: Jing Wang, Yuki Miyamoto, Hideaki Hara, Minoru Tanaka, Motomichi Tashiro, Noboru Sasao

Abstract: Fluorescence spectra of the $8P_{J} \rightarrow 6P_{J'}$ ($J$ and $J'$ = 3/2, 1/2) electric quadrupole transition of cesium atoms have been observed with a heated cesium vapor cell. We determined the ratio of the transition probabilities of $8P_{J}\rightarrow6P_{J'}$ to $8P_{J}\rightarrow5D_{3/2}$ by comparing their respective photon emission rates. The results are in good agreement with our theor… ▽ More Fluorescence spectra of the $8P_{J} \rightarrow 6P_{J'}$ ($J$ and $J'$ = 3/2, 1/2) electric quadrupole transition of cesium atoms have been observed with a heated cesium vapor cell. We determined the ratio of the transition probabilities of $8P_{J}\rightarrow6P_{J'}$ to $8P_{J}\rightarrow5D_{3/2}$ by comparing their respective photon emission rates. The results are in good agreement with our theoretical calculations. These measurements provide crucial parameters for tests of coherent amplification method and improve knowledge of cesium properties which are essential to dark matter detection through atomic transitions. △ Less

Submitted 15 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Comments: 9 pages, 6 figures

arXiv:2405.05463 [pdf, other]

Theoretical investigation of energy levels and transitions for Ce III with applications to kilonova spectra

Authors: G. Gaigalas, P. Rynkun, N. Domoto, M. Tanaka, D. Kato, L. Kitovienė

Abstract: Doubly ionized cerium (Ce$^{2+}$) is one of the most important ions to understand the kilonova spectra. In particular, near-infrared (NIR) transitions of Ce III between the ground (5p$^6$ 4f$^2$) and first excited (5p$^6$ 4f 5d) configurations are responsible for the absorption features around 14,500 A. However, there is no dedicated theoretical studies to provide accurate transition probabilities… ▽ More Doubly ionized cerium (Ce$^{2+}$) is one of the most important ions to understand the kilonova spectra. In particular, near-infrared (NIR) transitions of Ce III between the ground (5p$^6$ 4f$^2$) and first excited (5p$^6$ 4f 5d) configurations are responsible for the absorption features around 14,500 A. However, there is no dedicated theoretical studies to provide accurate transition probabilities for these transitions. We present energy levels of the ground and first excited configurations and transition data between them for Ce III. Calculations are performed using the GRASP2018 package, which is based on the multiconfiguration Dirac-Hartree-Fock and relativistic configuration interaction methods. Compared with the energy levels in the NIST database, our calculations reach the accuracy with the root-mean-square (rms) of 2732 cm$^{-1}$ or 1404 cm$^{-1}$ (excluding one highest level) for ground configuration, and rms of 618 cm$^{-1}$ for the first excited configuration. We extensively study the line strengths and find that the Babushkin gauge provide the more accurate values. By using the calculated gf values, we show that the NIR spectral features of kilonova can be explained by the Ce III lines. △ Less

Submitted 8 May, 2024; originally announced May 2024.

Comments: 8 pages, 7 figures, accepted for publication in MNRAS

arXiv:2405.04845 [pdf, other]

Weighted Particle-Based Optimization for Efficient Generalized Posterior Calibration

Authors: Masahiro Tanaka

Abstract: In the realm of statistical learning, the increasing volume of accessible data and increasing model complexity necessitate robust methodologies. This paper explores two branches of robust Bayesian methods in response to this trend. The first is generalized Bayesian inference, which introduces a learning rate parameter to enhance robustness against model misspecifications. The second is Gibbs poste… ▽ More In the realm of statistical learning, the increasing volume of accessible data and increasing model complexity necessitate robust methodologies. This paper explores two branches of robust Bayesian methods in response to this trend. The first is generalized Bayesian inference, which introduces a learning rate parameter to enhance robustness against model misspecifications. The second is Gibbs posterior inference, which formulates inferential problems using generic loss functions rather than probabilistic models. In such approaches, it is necessary to calibrate the spread of the posterior distribution by selecting a learning rate parameter. The study aims to enhance the generalized posterior calibration (GPC) algorithm proposed by [1]. Their algorithm chooses the learning rate to achieve the nominal frequentist coverage probability, but it is computationally intensive because it requires repeated posterior simulations for bootstrap samples. We propose a more efficient version of the GPC inspired by sequential Monte Carlo (SMC) samplers. A target distribution with a different learning rate is evaluated without posterior simulation as in the reweighting step in SMC sampling. Thus, the proposed algorithm can reach the desirable value within a few iterations. This improvement substantially reduces the computational cost of the GPC. Its efficacy is demonstrated through synthetic and real data applications. △ Less

Submitted 25 June, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

Comments: Forthcoming in Proceedings of the 7th International Conference on Data Science and Its Applications 2024

arXiv:2405.04771 [pdf, other]

Exploring Vision Transformers for 3D Human Motion-Language Models with Motion Patches

Authors: Qing Yu, Mikihiro Tanaka, Kent Fujiwara

Abstract: To build a cross-modal latent space between 3D human motion and language, acquiring large-scale and high-quality human motion data is crucial. However, unlike the abundance of image data, the scarcity of motion data has limited the performance of existing motion-language models. To counter this, we introduce "motion patches", a new representation of motion sequences, and propose using Vision Trans… ▽ More To build a cross-modal latent space between 3D human motion and language, acquiring large-scale and high-quality human motion data is crucial. However, unlike the abundance of image data, the scarcity of motion data has limited the performance of existing motion-language models. To counter this, we introduce "motion patches", a new representation of motion sequences, and propose using Vision Transformers (ViT) as motion encoders via transfer learning, aiming to extract useful knowledge from the image domain and apply it to the motion domain. These motion patches, created by dividing and sorting skeleton joints based on body parts in motion sequences, are robust to varying skeleton structures, and can be regarded as color image patches in ViT. We find that transfer learning with pre-trained weights of ViT obtained through training with 2D image data can boost the performance of motion analysis, presenting a promising direction for addressing the issue of limited motion data. Our extensive experiments show that the proposed motion patches, used jointly with ViT, achieve state-of-the-art performance in the benchmarks of text-to-motion retrieval, and other novel challenging tasks, such as cross-skeleton recognition, zero-shot motion classification, and human interaction recognition, which are currently impeded by the lack of data. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: Accepted to CVPR 2024, Project website: https://yu1ut.com/MotionPatches-HP/

arXiv:2404.16528 [pdf, other]

Generalized Posterior Calibration via Sequential Monte Carlo Sampler

Authors: Masahiro Tanaka

Abstract: As the amount and complexity of available data increases, the need for robust statistical learning becomes more pressing. To enhance resilience against model misspecification, the generalized posterior inference method adjusts the likelihood term by exponentiating it with a learning rate, thereby fine-tuning the dispersion of the posterior distribution. This study proposes a computationally effici… ▽ More As the amount and complexity of available data increases, the need for robust statistical learning becomes more pressing. To enhance resilience against model misspecification, the generalized posterior inference method adjusts the likelihood term by exponentiating it with a learning rate, thereby fine-tuning the dispersion of the posterior distribution. This study proposes a computationally efficient strategy for selecting an appropriate learning rate. The proposed approach builds upon the generalized posterior calibration (GPC) algorithm, which is designed to select a learning rate that ensures nominal frequentist coverage. This algorithm, which evaluates the coverage probability using bootstrap samples, has high computational costs because of the repeated posterior simulations needed for bootstrap samples. To address this limitation, the study proposes an algorithm that combines elements of the GPC algorithm with the sequential Monte Carlo (SMC) sampler. By leveraging the similarity between the learning rate in generalized posterior inference and the inverse temperature in SMC sampling, the proposed algorithm efficiently calibrates the posterior distribution with a reduced computational cost. For demonstration, the proposed algorithm was applied to several statistical learning models and shown to be significantly faster than the original GPC. △ Less

Submitted 2 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

Comments: Accepted for publication in Proceedings of the 2024 6th Asia Conference on Machine Learning and Computing

arXiv:2404.15963 [pdf, other]

Cosmic Himalayas: The Highest Quasar Density Peak Identified in a 10,000 deg$^2$ Sky with Spatial Discrepancies between Galaxies, Quasars, and IGM HI

Authors: Yongming Liang, Masami Ouchi, Dongsheng Sun, Nobunari Kashikawa, Zheng Cai, Sebastiano Cantalupo, Kentaro Nagamine, Hidenobu Yajima, Takanobu Kirihara, Haibin Zhang, Mingyu Li, Rhythm Shimakawa, Xiaohui Fan, Kei Ito, Masayuki Tanaka, Yuichi Harikane, J. Xavier Prochaska, Andrea Travascio, Weichen Wang, Martin Elvis, Giuseppina Fabbiano, Junya Arita, Masafusa Onoue, John D. Silverman, Dongdong Shi , et al. (5 additional authors not shown)

Abstract: We report the identification of a quasar overdensity in the BOSSJ0210 field, dubbed Cosmic Himalayas, consisting of 11 quasars at $z=2.16-2.20$, the densest overdensity of quasars ($17σ$) in the $\sim$10,000 deg$^2$ of the Sloan Digital Sky Survey. We present the spatial distributions of galaxies and quasars and an HI absorption map of the intergalactic medium (IGM). On the map of 465 galaxies sel… ▽ More We report the identification of a quasar overdensity in the BOSSJ0210 field, dubbed Cosmic Himalayas, consisting of 11 quasars at $z=2.16-2.20$, the densest overdensity of quasars ($17σ$) in the $\sim$10,000 deg$^2$ of the Sloan Digital Sky Survey. We present the spatial distributions of galaxies and quasars and an HI absorption map of the intergalactic medium (IGM). On the map of 465 galaxies selected from the MAMMOTH-Subaru survey, we find two galaxy density peaks that do not fall on the quasar overdensity but instead exist at the northwest and southeast sides, approximately 25 $h^{-1}$ comoving-Mpc apart from the quasar overdensity. With a spatial resolution of 15 $h^{-1}$ comoving Mpc in projection, we produce a three-dimensional HI tomography map by the IGM Ly$α$ forest in the spectra of 23 SDSS/eBOSS quasars behind the quasar overdensity. Surprisingly, the quasar overdensity coincides with neither an absorption peak nor a transmission peak of IGM HI but lies near the border separating opaque and transparent volumes, with the more luminous quasars located in an environment with lesser IGM HI. Hence remarkably, the overdensity region traced by the 11 quasars, albeit all in coherently active states, has no clear coincidence with peaks of galaxies or HI absorption densities. Current physical scenarios with mixtures of HI overdensities and quasar photoionization cannot fully interpret the emergence of Cosmic Himalayas, suggesting this peculiar structure is an excellent laboratory to unveil the interplay between galaxies, quasars, and the IGM. △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: 19 pages, 11 figures, submitted to ApJ, comments are welcome

arXiv:2404.15027 [pdf, other]

Three dimensional end-to-end simulation for kilonova emission from a black-hole neutron-star merger

Authors: Kyohei Kawaguchi, Nanae Domoto, Sho Fujibayashi, Kota Hayashi, Hamid Hamidani, Masaru Shibata, Masaomi Tanaka, Shinya Wanajo

Abstract: We study long-term evolution of the matter ejected in a black-hole neutron-star (BH-NS) merger employing the results of a long-term numerical-relativity simulation and nucleosynthesis calculation, in which both dynamical and post-merger ejecta formation are consistently followed. In particular, we employ the results for the merger of a $1.35\,M_\odot$ NS and a $5.4\,M_\odot$ BH with the dimensionl… ▽ More We study long-term evolution of the matter ejected in a black-hole neutron-star (BH-NS) merger employing the results of a long-term numerical-relativity simulation and nucleosynthesis calculation, in which both dynamical and post-merger ejecta formation are consistently followed. In particular, we employ the results for the merger of a $1.35\,M_\odot$ NS and a $5.4\,M_\odot$ BH with the dimensionless spin of 0.75. We confirm the finding in the previous studies that thermal pressure induced by radioactive heating in the ejecta significantly modifies the morphology of the ejecta. We then compute the kilonova (KN) light curves employing the ejecta profile obtained by the long-term evolution. We find that our present BH-NS model results in a KN light curve that is fainter yet more enduring than that observed in AT2017gfo. This is due to the fact that the emission is primarily powered by the lanthanide-rich dynamical ejecta, in which a long photon diffusion time scale is realized by the large mass and high opacity. While the peak brightness of the KN emission in both the optical and near-infrared bands is fainter than or comparable to those of binary NS models, the time-scale maintaining the peak brightness is much longer in the near-infrared band for the BH-NS KN model. Our result indicates that a BH-NS merger with massive ejecta can observationally be identified by the bright and long lasting ($>$two weeks) near-infrared emission. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: 20 pages, 12 figures, submitted to MNRAS

arXiv:2404.14219 [pdf, other]

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra , et al. (90 additional authors not shown)

Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset… ▽ More We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset for training, a scaled-up version of the one used for phi-2, composed of heavily filtered publicly available web data and synthetic data. The model is also further aligned for robustness, safety, and chat format. We also provide some initial parameter-scaling results with a 7B and 14B models trained for 4.8T tokens, called phi-3-small and phi-3-medium, both significantly more capable than phi-3-mini (e.g., respectively 75% and 78% on MMLU, and 8.7 and 8.9 on MT-bench). Moreover, we also introduce phi-3-vision, a 4.2 billion parameter model based on phi-3-mini with strong reasoning capabilities for image and text prompts. △ Less

Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: 19 pages

arXiv:2404.03383 [pdf, other]

A unified Euler--Lagrange system for analyzing continuous-time accelerated gradient methods

Authors: Mitsuru Toyoda, Akatsuki Nishioka, Mirai Tanaka

Abstract: This paper presents an Euler--Lagrange system for a continuous-time model of the accelerated gradient methods in smooth convex optimization and proposes an associated Lyapunov-function-based convergence analysis framework. Recently, ordinary differential equations (ODEs) with dumping terms have been developed to intuitively interpret the accelerated gradient methods, and the design of unified mode… ▽ More This paper presents an Euler--Lagrange system for a continuous-time model of the accelerated gradient methods in smooth convex optimization and proposes an associated Lyapunov-function-based convergence analysis framework. Recently, ordinary differential equations (ODEs) with dumping terms have been developed to intuitively interpret the accelerated gradient methods, and the design of unified model describing the various individual ODE models have been examined. In existing reports, the Lagrangian, which results in the Euler-Lagrange equation, and the Lyapunov function for the convergence analysis have been separately proposed for each ODE. This paper proposes a unified Euler--Lagrange system and its Lyapunov function to cover the existing various models. In the convergence analysis using the Lyapunov function, a condition that parameters in the Lagrangian and Lyapunov function must satisfy is derived, and a parameter design for improving the convergence rate naturally results in the mysterious dumping coefficients. Especially, a symmetric Bregman divergence can lead to a relaxed condition of the parameters and a resulting improved convergence rate. As an application of this study, a slight modification in the Lyapunov function establishes the similar convergence proof for ODEs with smooth approximation in nondifferentiable objective function minimization. △ Less

Submitted 4 April, 2024; originally announced April 2024.

arXiv:2404.03112 [pdf, other]

On the Formation of the W-shaped O II Lines in Spectra of Type I Superluminous Supernovae

Authors: Sei Saito, Masaomi Tanaka, Paolo A. Mazzali, Stephan Hachinger, Kenta Hotokezaka

Abstract: H-poor superluminous supernovae (SLSNe-I) are characterized by O II lines around 4,000 - 4,500 A in pre-/near-maximum spectra, so-called W-shaped O II lines. As these lines are from relatively high excitation levels, they have been considered a sign of non-thermal processes, which may give a hint of power sources of SLSNe-I. However, the conditions for these lines to appear have not been understoo… ▽ More H-poor superluminous supernovae (SLSNe-I) are characterized by O II lines around 4,000 - 4,500 A in pre-/near-maximum spectra, so-called W-shaped O II lines. As these lines are from relatively high excitation levels, they have been considered a sign of non-thermal processes, which may give a hint of power sources of SLSNe-I. However, the conditions for these lines to appear have not been understood well. In this work, we systematically calculate synthetic spectra to reproduce observed spectra of eight SLSNe-I, parameterizing departure coefficients from the nebular approximation in the SN ejecta (expressed as b_neb). We find that most of the observed spectra can be reproduced well with b_neb ~< 10, which means that no strong departure is necessary for the formation of the W-shaped O II lines. We also show that the appearance of the W-shaped O II lines is sensitive to temperature; only spectra with temperatures T ~ 14,000 - 16,000 K can produce the W-shaped O II lines without large departures. Based on this, we constrain the non-thermal ionization rate near the photosphere. Our results suggest that spectral features of SLSNe-I can give independent constraints on the power source through the non-thermal ionization rates. △ Less

Submitted 3 April, 2024; originally announced April 2024.

Comments: 19 pages, 11 figures, accepted for publication in ApJ

arXiv:2404.00646 [pdf, other]

Primordial black holes from slow phase transitions: a model-building perspective

Authors: Shinya Kanemura, Masanori Tanaka, Ke-Pan Xie

Abstract: We investigate the formation of primordial black holes (PBHs) through delayed vacuum decay during slow cosmic first-order phase transitions. Two specific models, the polynomial potential and the real singlet extension of the Standard Model, are used as illustrative examples. Our findings reveal that models with zero-temperature scalar potential barriers are conducive to the realization of this mec… ▽ More We investigate the formation of primordial black holes (PBHs) through delayed vacuum decay during slow cosmic first-order phase transitions. Two specific models, the polynomial potential and the real singlet extension of the Standard Model, are used as illustrative examples. Our findings reveal that models with zero-temperature scalar potential barriers are conducive to the realization of this mechanism, as the phase transition duration is extended by the U-shaped Euclidean action. We find that the resulting PBH density is highly sensitive to the barrier height, with abundant PBH formation observed for sufficiently high barriers. Notably, the phase transition needs not to be ultra-supercooled (i.e. the parameter $α\gg1$), and the commonly used exponential nucleation approximation $Γ(t)\sim e^{βt}$ fails to capture the PBH formation dynamics in such models. △ Less

Submitted 21 May, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

Comments: 18 pages + appendix + references, 8 figures. To match the JHEP version

arXiv:2403.20110 [pdf, other]

Theory of the inverse Faraday effect in dissipative Rashba electron systems: Floquet engineering perspective

Authors: Miho Tanaka, Masahiro Sato

Abstract: We theoretically study the inverse Faraday effect (IFE), i.e., photo-induced magnetization, in two-dimensional Rashba spin-orbit coupled electron systems irradiated by a circularly polarized light. Quantum master (GKSL) equation enables us to accurately compute the laser driven dynamics, taking inevitable dissipation effects into account. To find the universal features of laser-driven magnetizatio… ▽ More We theoretically study the inverse Faraday effect (IFE), i.e., photo-induced magnetization, in two-dimensional Rashba spin-orbit coupled electron systems irradiated by a circularly polarized light. Quantum master (GKSL) equation enables us to accurately compute the laser driven dynamics, taking inevitable dissipation effects into account. To find the universal features of laser-driven magnetization and its dynamics, we investigate (i) the nonequilibrium steady state (NESS) driven by a continuous wave (CW) and (ii) ultrafast spin dynamics driven by short laser pulses. In the NESS (i), the laser-induced magnetization and its dependence of several parameters (laser frequency, laser field strength, temperature, dissipation strength, etc.) are shown to be in good agreement with the predictions from Floquet theory for dissipative systems in the high-frequency regime. In the case (ii), we focus on ferromagnetic metal states by introducing an effective magnetic field to the Rashba model as the mean field of electron-electron interaction. We find that a precession of the magnetic moment occurs due to the pulse-driven instantaneous magnetic field and the initial phase of the precession is controlled by changing the sign of light polarization. This is well consistent with the spin dynamics observed in experiments of laser-pulse-driven IFE. We discuss how the pulse-driven dynamics are captured by the Floquet theory. Our results provides a microscopic method to compute ultrafast dynamics in many electron systems irradiated by intense light. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: 19 pages (two column), 10 figures

arXiv:2403.16011 [pdf, other]

Uncovering the Ghostly Remains of an Extremely Diffuse Satellite in the Remote Halo of NGC 253

Authors: Sakurako Okamoto, Annette M. N. Ferguson, Nobuo Arimoto, Itsuki Ogami, Rokas Zemaitis, Masashi Chiba, Mike J. Irwin, In Sung Jang, Jin Koda, Yutaka Komiyama, Myung Gyoon Lee, Jeong Hwan Lee, Michael Rich, Masayuki Tanaka, Mikito Tanaka

Abstract: We present the discovery of NGC253-SNFC-dw1, a new satellite galaxy in the remote stellar halo of the Sculptor Group spiral, NGC 253. The system was revealed using deep resolved star photometry obtained as part of the Subaru Near-Field Cosmology Survey that uses the Hyper Suprime-Cam on the Subaru Telescope. Although rather luminous ($\rm{M_{V}} = -11.7 \pm 0.2$) and massive (… ▽ More We present the discovery of NGC253-SNFC-dw1, a new satellite galaxy in the remote stellar halo of the Sculptor Group spiral, NGC 253. The system was revealed using deep resolved star photometry obtained as part of the Subaru Near-Field Cosmology Survey that uses the Hyper Suprime-Cam on the Subaru Telescope. Although rather luminous ($\rm{M_{V}} = -11.7 \pm 0.2$) and massive ($M_* \sim 1.25\times 10^7~\rm{M}_{\odot}$), the system is one of the most diffuse satellites yet known, with a half-light radius of $\rm{R_{h}} = 3.37 \pm 0.36$ kpc and an average surface brightness of $\sim 30.1$ mag arcmin$^{-2}$ within the $\rm{R_{h}}$. The colour-magnitude diagram shows a dominant old ($\sim 10$ Gyr) and metal-poor ($\rm{[M/H]}=-1.5 \pm 0.1$ dex) stellar population, as well as several candidate thermally-pulsing asymptotic giant branch stars. The distribution of red giant branch stars is asymmetrical and displays two elongated tidal extensions pointing towards NGC 253, suggestive of a highly disrupted system being observed at apocenter. NGC253-SNFC-dw1 has a size comparable to that of the puzzling Local Group dwarfs Andromeda XIX and Antlia 2 but is two magnitudes brighter. While unambiguous evidence of tidal disruption in these systems has not yet been demonstrated, the morphology of NGC253-SNFC-dw1 clearly shows that this is a natural path to produce such diffuse and extended galaxies. The surprising discovery of this system in a previously well-searched region of the sky emphasizes the importance of surface brightness limiting depth in satellite searches. △ Less

Submitted 26 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

Comments: 10 pages, 4 figures, 1 table. Accepted for publication in ApJL

arXiv:2403.14234 [pdf, other]

Detection of a Spatially Extended Stellar Population in M33: A Shallow Stellar Halo?

Authors: Itsuki Ogami, Yutaka Komiyama, Masashi Chiba, Mikito Tanaka, Puragra Guhathakurta, Evan N. Kirby, Rosemary F. G. Wyse, Carrie Filion, Takanobu Kirihara, Miho N. Ishigaki, Kohei Hayashi

Abstract: We analyze the outer regions of M33, beyond 15 kpc in projected distance from its center using Subaru/HSC multi-color imaging. We identify Red Giant Branch (RGB) stars and Red Clump (RC) stars using the surface gravity sensitive $NB515$ filter for the RGB sample, and a multi-color selection for both samples. We construct the radial surface density profile of these RGB and RC stars, and find that M… ▽ More We analyze the outer regions of M33, beyond 15 kpc in projected distance from its center using Subaru/HSC multi-color imaging. We identify Red Giant Branch (RGB) stars and Red Clump (RC) stars using the surface gravity sensitive $NB515$ filter for the RGB sample, and a multi-color selection for both samples. We construct the radial surface density profile of these RGB and RC stars, and find that M33 has an extended stellar population with a shallow power-law index of $α> -3$, depending on the intensity of the contamination. This result represents a flatter profile than the stellar halo which has been detected by the previous study focusing on the central region, suggesting that M33 may have a double-structured halo component, i.e. inner/outer halos or a very extended disk. Also, the slope of this extended component is shallower than those typically found for halos in large galaxies, implying intermediate-mass galaxies may have different formation mechanisms (e.g., tidal interaction) from large spirals. We also analyze the radial color profile of RC/RGB stars, and detect a radial gradient, consistent with the presence of an old and/or metal-poor population in the outer region of M33, thereby supporting our proposal that the stellar halo extends beyond 15 kpc. Finally, we estimate that the surface brightness of this extended component is $μ_{\it V} = 35.72 \pm 0.08$ mag arcsec$^{-2}$. If our detected component is the stellar halo, this estimated value is consistent with the detection limit of previous observations. △ Less

Submitted 5 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: 18 pages, 15 figures, 2 tables, accepted for publication in ApJ

arXiv:2403.14127 [pdf]

Spin injection and detection in a Si-based ferromagnetic tunnel junction: A theoretical model based on the band diagram and experimental demonstration

Authors: Baisen Yu, Shoichi Sato, Masaaki Tanaka, Ryosho Nakane

Abstract: We have experimentally and theoretically investigated the spin injection/detection polarization in a Si-based ferromagnetic tunnel junction with an amorphous MgO layer, and demonstrated that the experimental features of the spin polarization in a wide bias range can be well explained using our theoretical model based on the band diagram of the junction and the direct tunneling mechanism. It is sho… ▽ More We have experimentally and theoretically investigated the spin injection/detection polarization in a Si-based ferromagnetic tunnel junction with an amorphous MgO layer, and demonstrated that the experimental features of the spin polarization in a wide bias range can be well explained using our theoretical model based on the band diagram of the junction and the direct tunneling mechanism. It is shown that the spin polarization originates from the band diagrams of the ferromagnetic Fe layer and n+-Si channel in the junction, while the spin selectivity of the MgO tunnel barrier is not necessary. Besides, we clarified the mechanism of the reduction in spin polarization when the bias is high and nonlinear properties are prominent, where the widely-used spin injection/detection model proposed by Valet and Fert is not applicable. The dominant mechanism of such reduction is found to be spin accumulation saturation (SAS) at the n+-Si interface in contact with the MgO layer as the bias is increased in the spin extraction geometry, which is inevitable in semiconductor-based ferromagnetic tunnel junctions. We performed numerical calculations on a two-terminal spin transport device with a n+-Si channel using the junction properties extracted from the experiments, and revealed that the magnetoresistance (MR) ratio is suppressed mainly by SAS in a higher bias range. Furthermore, we proposed methods for improving the MR ratio in two-terminal spin transport devices. Our experiments and theoretical model provide a deep understanding of the spin injection/detection phenomena in semiconductor-based spin transport devices, toward the realization of high performance under reasonably high bias conditions for practical use. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: Main manuscript:32 pages, 18 figures Supplemental material: 18 pages, 10 figures

arXiv:2403.12043 [pdf, other]

doi 10.1093/mnras/stae785

Abundance stratification in type Ia supernovae -- VII. The peculiar, C-rich iPTF16abc: highlighting diversity among luminous events

Authors: Charles J. Aouad, Paolo A. Mazzali, Chris Ashall, Masaomi Tanaka, Stephan Hachinger

Abstract: Observations of Type Ia supernovae (SNe\,Ia) reveal diversity, even within assumed subcategories. Here, the composition of the peculiar iPTF16abc (SN\,2016bln) is derived by modeling a time series of optical spectra. iPTF16abc's early spectra combine traits of SNe 1999aa and 1991T known for weak \SiII\ $λ$ 6355 and prominent \FeIII\ features. However, it differs with weak early \FeIII\ lines, and… ▽ More Observations of Type Ia supernovae (SNe\,Ia) reveal diversity, even within assumed subcategories. Here, the composition of the peculiar iPTF16abc (SN\,2016bln) is derived by modeling a time series of optical spectra. iPTF16abc's early spectra combine traits of SNe 1999aa and 1991T known for weak \SiII\ $λ$ 6355 and prominent \FeIII\ features. However, it differs with weak early \FeIII\ lines, and persistent \CII\ lines post-peak. It also exhibits a weak \CaII\ H\&K feature aligning it with SN\,1991T, an observation supported by their bolometric light curves. The early attenuation of \FeIII\ results from abundance effect. The weakening of the \SiII\ $λ$ 6355 line, stems from silicon depletion in the outer shells, a characteristic shared by both SNe 1999aa and 1991T, indicating a common explosion mechanism that terminates nuclear burning at around 12000 \kms\, unseen in normal events. Beneath a thin layer of intermediate mass elements (IMEs) with a total mass of 0.18 \Msun, extends a \Nifs\ rich shell totaling 0.76 \Msun\ and generating a bolometric luminosity as high as ${L_{\mathrm{peak}}}=1.60 \pm 0.1 \times$ $10^{43}$ ergs s$^{-1}$. Inner layers, typical of SNe\,Ia, hold neutron-rich elements, (\Feff\ and \Nife), totaling 0.20 M${\odot}$. Stable iron, exceeding solar abundance, and carbon, coexist in the outermost layers, challenging existing explosion models. The presence of carbon down to $v\approx$ 9000\,\kms, totalling $\sim$ 0.01 \Msun\, unprecedented in this class, links iPTF16abc to SN\,2003fg-like events. The retention of 91T-like traits in iPTF16abc underscores its importance in understanding the diversity of SNe\,Ia. △ Less

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.11517 [pdf, other]

Inter-individual and inter-site neural code conversion and image reconstruction without shared stimuli

Authors: Haibao Wang, Jun Kai Ho, Fan L. Cheng, Shuntaro C. Aoki, Yusuke Muraki, Misato Tanaka, Yukiyasu Kamitani

Abstract: The human brain demonstrates substantial inter-individual variability in fine-grained functional topography, posing challenges in identifying common neural representations across individuals. Functional alignment has the potential to harmonize these individual differences. However, it typically requires an identical set of stimuli presented to different individuals, which is often unavailable. To… ▽ More The human brain demonstrates substantial inter-individual variability in fine-grained functional topography, posing challenges in identifying common neural representations across individuals. Functional alignment has the potential to harmonize these individual differences. However, it typically requires an identical set of stimuli presented to different individuals, which is often unavailable. To address this, we propose a content loss-based neural code converter, designed to convert brain activity from one subject to another representing the same content. The converter is optimized so that the source subject's converted brain activity is decoded into a latent image representation that closely resembles that of the stimulus given to the source subject. We show that converters optimized using hierarchical image representations achieve conversion accuracy comparable to those optimized by paired brain activity as in conventional methods. The brain activity converted from a different individual and even from a different site sharing no stimuli produced reconstructions that approached the quality of within-individual reconstructions. The converted brain activity had a generalizable representation that can be read out by different decoding schemes. The converter required much fewer training samples than that typically required for decoder training to produce recognizable reconstructions. These results demonstrate that our method can effectively combine image representations to convert brain activity across individuals without the need for shared stimuli, providing a promising tool for flexibly aligning data from complex cognitive tasks and a basis for brain-to-brain communication. △ Less

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.03548 [pdf, other]

Mass and decay width of $T_{ccs}$ from symmetries

Authors: Mitsuru Tanaka, Yasuhiro Yamaguchi, Masayasu Harada

Abstract: We analyze the mass and width of the doubly heavy tetraquark $T_{ccs}$ composed of a heavy diquark and a light quark cloud with strangeness with assuming that a color anti-triplet heavy diquark is a dominant component of the doubly charmed tetraquarks $T_{cc}$ and $T_{ccs}$. We construct an effective Lagrangian for masses of heavy hadrons based on the superflavor symmetry between the doubly heavy… ▽ More We analyze the mass and width of the doubly heavy tetraquark $T_{ccs}$ composed of a heavy diquark and a light quark cloud with strangeness with assuming that a color anti-triplet heavy diquark is a dominant component of the doubly charmed tetraquarks $T_{cc}$ and $T_{ccs}$. We construct an effective Lagrangian for masses of heavy hadrons based on the superflavor symmetry between the doubly heavy tetraquarks and the singly heavy baryons with including the terms which simultaneously break the heavy-quark and light flavor symmetries, and predict the mass of $T_{ccs}$ as $M(T_{ccs}) = 4057\pm40$\,MeV. The comparison of this prediction with future experimental observation will give a clue to understand the color structure of the heavy diquark. We also predict the spin-averaged mass of $Ω_{cc}$ ($J^P = 1/2^+, 3/2^+)$ as $M(Ω_{cc}) = 3760\pm 18\,$MeV. We next calculate the decay width of $T_{ccs}$, based on solely the light flavor symmetry, as $Γ(T_{ccs}) = 1.2\pm 0.3$\,MeV. △ Less

Submitted 6 March, 2024; originally announced March 2024.

arXiv:2403.00363 [pdf, other]

SFQ counter-based precomputation for large-scale cryogenic VQE machines

Authors: Yosuke Ueno, Satoshi Imamura, Yuna Tomida, Teruo Tanimoto, Masamitsu Tanaka, Yutaka Tabuchi, Koji Inoue, Hiroshi Nakamura

Abstract: The variational quantum eigensolver (VQE) is a promising candidate that brings practical benefits from quantum computing. However, the required bandwidth in/out of a cryostat is a limiting factor to scale cryogenic quantum computers. We propose a tailored counter-based module with single flux quantum circuits in 4-K stage which precomputes a part of VQE calculation and reduces the amount of inter-… ▽ More The variational quantum eigensolver (VQE) is a promising candidate that brings practical benefits from quantum computing. However, the required bandwidth in/out of a cryostat is a limiting factor to scale cryogenic quantum computers. We propose a tailored counter-based module with single flux quantum circuits in 4-K stage which precomputes a part of VQE calculation and reduces the amount of inter-temperature communication. The evaluation shows that our system reduces the required bandwidth by 97%, and with this drastic reduction, total power consumption is reduced by 93% in the case where 277 VQE programs are executed in parallel on a 10000-qubit machine. △ Less

Submitted 1 March, 2024; originally announced March 2024.

Comments: 7 pages, 5 figures, 3 tables. Accepted by DAC'24 WIP poster session

arXiv:2402.07434 [pdf, ps, other]

doi 10.1145/3654823.3654895

Parameterizations for Gradient-based Markov Chain Monte Carlo on the Stiefel Manifold: A Comparative Study

Authors: Masahiro Tanaka

Abstract: Orthogonal matrices play an important role in probability and statistics, particularly in high-dimensional statistical models. Parameterizing these models using orthogonal matrices facilitates dimension reduction and parameter identification. However, establishing the theoretical validity of statistical inference in these models from a frequentist perspective is challenging, leading to a preferenc… ▽ More Orthogonal matrices play an important role in probability and statistics, particularly in high-dimensional statistical models. Parameterizing these models using orthogonal matrices facilitates dimension reduction and parameter identification. However, establishing the theoretical validity of statistical inference in these models from a frequentist perspective is challenging, leading to a preference for Bayesian approaches because of their ability to offer consistent uncertainty quantification. Markov chain Monte Carlo methods are commonly used for numerical approximation of posterior distributions, and sampling on the Stiefel manifold, which comprises orthogonal matrices, poses significant difficulties. While various strategies have been proposed for this purpose, gradient-based Markov chain Monte Carlo with parameterizations is the most efficient. However, a comprehensive comparison of these parameterizations is lacking in the existing literature. This study aims to address this gap by evaluating numerical efficiency of the four alternative parameterizations of orthogonal matrices under equivalent conditions. The evaluation was conducted for four problems. The results suggest that polar expansion parameterization is the most efficient, particularly for the high-dimensional and complex problems. However, all parameterizations exhibit limitations in significantly high-dimensional or difficult tasks, emphasizing the need for further advancements in sampling methods for orthogonal matrices. △ Less

Submitted 2 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: Proceedings of the 2024 3rd Asia Conference on Algorithms, Computing and Machine Learning, pp. 397-402

arXiv:2402.04429 [pdf, other]

Meritocracy and Its Discontents: Long-run Effects of Repeated School Admission Reforms

Authors: Chiaki Moriguchi, Yusuke Narita, Mari Tanaka

Abstract: What happens if selective colleges change their admission policies? We study this question by analyzing the world's first implementation of nationally centralized meritocratic admissions in the early twentieth century. We find a persistent meritocracy-equity tradeoff. Compared to the decentralized system, the centralized system admitted more high-achievers and produced more occupational elites (su… ▽ More What happens if selective colleges change their admission policies? We study this question by analyzing the world's first implementation of nationally centralized meritocratic admissions in the early twentieth century. We find a persistent meritocracy-equity tradeoff. Compared to the decentralized system, the centralized system admitted more high-achievers and produced more occupational elites (such as top income earners) decades later in the labor market. This gain came at a distributional cost, however. Meritocratic centralization also increased the number of urban-born elites relative to rural-born ones, undermining equal access to higher education and career advancement. △ Less

Submitted 23 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: Keywords: Elite Education, Market Design, Strategic Behavior, Regional Mobility, Universal Access, Persistent Effects

arXiv:2402.01179 [pdf]

Mechanism of ferromagnetism enhancement in a La$_{2/3}$ Sr$_{1/3}$ MnO$_3$ membrane released from epitaxial strain

Authors: Takahito Takeda, Takuma Arai, Kohei Yamagami, Le Duc Anh, Masaaki Tanaka, Masaki Kobayashi, Shinobu Ohya

Abstract: Recent studies have shown that the magnetic properties of the ferromagnetic perovskite oxide La$_{2/3}$ Sr$_{1/3}$ MnO$_3$ (LSMO) grown on an SrTiO3 (STO) substrate, such as its magnetic moment and Curie temperature, can be improved by releasing the film from the substrate. However, the microscopic origin of this enhancement is not yet well understood. In this study, we use synchrotron radiation m… ▽ More Recent studies have shown that the magnetic properties of the ferromagnetic perovskite oxide La$_{2/3}$ Sr$_{1/3}$ MnO$_3$ (LSMO) grown on an SrTiO3 (STO) substrate, such as its magnetic moment and Curie temperature, can be improved by releasing the film from the substrate. However, the microscopic origin of this enhancement is not yet well understood. In this study, we use synchrotron radiation measurements to investigate the mechanism of ferromagnetism enhancement in an LSMO membrane released from an STO substrate by dissolving a water-soluble Sr$_4$Al$_2$O$_7$ buffer layer. Using resonant photoemission spectroscopy on the as-grown LSMO film and LSMO membrane, we elucidate that the strain release from the STO substrate enhances the itineracy of the Mn-3d electrons via p-d hybridization, and this strengthens the double-exchange interaction. The reinforcement of the double-exchange interaction, in turn, improves the ferromagnetism of LSMO. △ Less

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.00706 [pdf, ps, other]

Examples of solvable and nilpotent finite quantum groups

Authors: Gerard Glowacki, Masamune Hattori, Masato Tanaka

Abstract: We prove the solvability and nilpotency of Kac--Paljutkin's finite quantum group and Sekine quantum groups and we classify the solvable series of Kac--Paljutkin's finite quantum group via Cohen--Westreich's Burnside theorem. Some semisimple quasitriangular Hopf algebras of dimensions $2pq$ are also studied. In Appendix A, we give a direct computation of the universal $R$-matrices for Kac--Paljutki… ▽ More We prove the solvability and nilpotency of Kac--Paljutkin's finite quantum group and Sekine quantum groups and we classify the solvable series of Kac--Paljutkin's finite quantum group via Cohen--Westreich's Burnside theorem. Some semisimple quasitriangular Hopf algebras of dimensions $2pq$ are also studied. In Appendix A, we give a direct computation of the universal $R$-matrices for Kac--Paljutkin's $8$-dimensional finite quantum group. △ Less

Submitted 23 February, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: 47 pages

MSC Class: 16T05; 16T25; 16T10

arXiv:2401.14603 [pdf, other]

Luminosity Functions of the Host Galaxies of Supernova

Authors: Zhuoxi Liang, Nao Suzuki, Mamoru Doi, Masayuki Tanaka, Naoki Yasuda

Abstract: We present the luminosity functions and stellar mass functions of supernova (SN) host galaxies and test if they differ from the functions of normal field galaxies. We utilize homogeneous samples consisting of 273 SNe Ia ($z\leq0.3$) and 44 core-collapse (CC) SNe ($z \leq 0.1$) from the Sloan Digital Sky Survey (SDSS) II Supernova Survey and the high-signal-to-noise-ratio photometry of galaxies fro… ▽ More We present the luminosity functions and stellar mass functions of supernova (SN) host galaxies and test if they differ from the functions of normal field galaxies. We utilize homogeneous samples consisting of 273 SNe Ia ($z\leq0.3$) and 44 core-collapse (CC) SNe ($z \leq 0.1$) from the Sloan Digital Sky Survey (SDSS) II Supernova Survey and the high-signal-to-noise-ratio photometry of galaxies from the Hyper Suprime-Cam Subaru Strategic Program (HSC SSP). SN hosts are classified into star-forming and passive galaxy groups based on the spectral energy distribution (SED) fitting. We find that the SN host luminosity functions and stellar mass functions deviate from those of normal field galaxies. Star-forming galaxies dominate the low-mass end of the SN Ia host mass function, while passive galaxies dominate the high-mass end. CC SNe are predominantly hosted by star-forming galaxies. In addition, intermediate-mass hosts produce CC SNe with the highest efficiency, while the efficiency of producing SNe Ia monotonically increases as the hosts become more massive. Furthermore, We derive the pseudo mass normalized SN rates (pSNuM) based on the mass functions. We find that the star-forming component of pSNuM$_{Ia}$ is less sensitive to the changes in stellar mass, in comparison with the total rate. The behavior of pSNuM$_{CC}$ suggests that the CC rate is proportional to the star-forming rate. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.13868 [pdf, other]

Shell topology optimization based on level set method

Authors: Hiroki Kobayashi, Katsuya Nomura, Yuqing Zhou, Masato Tanaka, Atsushi Kawamoto, Tsuyoshi Nomura

Abstract: This paper proposes a level set-based method for optimizing shell structures with large design changes in shape and topology. Conventional shell optimization methods, whether parametric or nonparametric, often only allow limited design changes in shape. In the proposed method, the shell structure is defined as the isosurface of a level set function. The level set function is iteratively updated ba… ▽ More This paper proposes a level set-based method for optimizing shell structures with large design changes in shape and topology. Conventional shell optimization methods, whether parametric or nonparametric, often only allow limited design changes in shape. In the proposed method, the shell structure is defined as the isosurface of a level set function. The level set function is iteratively updated based on the shape sensitivity on the surface mesh. Therefore, the proposed method can represent an arbitrary manifold surface while dealing with topological changes, for example, from a spherical surface to a toroidal surface. We applied the proposed method to the mean compliance minimization problems of 3D shell structural designs for dome, bending plate and cantilever beam examples to demonstrate its efficacy of the proposed method. △ Less

Submitted 24 January, 2024; originally announced January 2024.

Comments: 13 pages, 13 figures

arXiv:2401.11920 [pdf, other]

The quality assurance test of the SliT ASIC for the J-PARC muon $g-2$/EDM experiment

Authors: Takashi Yamanaka, Yoichi Fujita, Eitaro Hamada, Tetsuichi Kishishita, Tsutomu Mibe, Yutaro Sato, Yoshiaki Seino, Masayoshi Shoji, Taikain Suehara, Manobu M. Tanaka, Junji Tojo, Keisuke Umebayashi, Tamaki Yoshioka

Abstract: The SliT ASIC is a readout chip for the silicon strip detector to be used at the J-PARC muon $g-2$/EDM experiment. The production version of SliT128D was designed and mass production was finished. A quality assurance test method for bare SliT128D chips was developed to provide a sufficient number of chips for the experiment. The quality assurance test of the SliT128D chips was performed and 5735 c… ▽ More The SliT ASIC is a readout chip for the silicon strip detector to be used at the J-PARC muon $g-2$/EDM experiment. The production version of SliT128D was designed and mass production was finished. A quality assurance test method for bare SliT128D chips was developed to provide a sufficient number of chips for the experiment. The quality assurance test of the SliT128D chips was performed and 5735 chips were inspected. No defect was observed in chips of 84.3%. Accepting a few channels with poor time walk performance out of 128 channels per chip, more than 90% yield can be achieved, which is sufficient to construct the whole detector. △ Less

Submitted 22 January, 2024; originally announced January 2024.

Comments: 5 pages, 8 figures

arXiv:2401.08671 [pdf, other]

DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference

Authors: Connor Holmes, Masahiro Tanaka, Michael Wyatt, Ammar Ahmad Awan, Jeff Rasley, Samyam Rajbhandari, Reza Yazdani Aminabadi, Heyang Qin, Arash Bakhtiari, Lev Kurilenko, Yuxiong He

Abstract: The deployment and scaling of large language models (LLMs) have become critical as they permeate various applications, demanding high-throughput and low-latency serving systems. Existing frameworks struggle to balance these requirements, especially for workloads with long prompts. This paper introduces DeepSpeed-FastGen, a system that employs Dynamic SplitFuse, a novel prompt and generation compos… ▽ More The deployment and scaling of large language models (LLMs) have become critical as they permeate various applications, demanding high-throughput and low-latency serving systems. Existing frameworks struggle to balance these requirements, especially for workloads with long prompts. This paper introduces DeepSpeed-FastGen, a system that employs Dynamic SplitFuse, a novel prompt and generation composition strategy, to deliver up to 2.3x higher effective throughput, 2x lower latency on average, and up to 3.7x lower (token-level) tail latency, compared to state-of-the-art systems like vLLM. We leverage a synergistic combination of DeepSpeed-MII and DeepSpeed-Inference to provide an efficient and easy-to-use serving system for LLMs. DeepSpeed-FastGen's advanced implementation supports a range of models and offers both non-persistent and persistent deployment options, catering to diverse user scenarios from interactive sessions to long-running applications. We present a detailed benchmarking methodology, analyze the performance through latency-throughput curves, and investigate scalability via load balancing. Our evaluations demonstrate substantial improvements in throughput and latency across various models and hardware configurations. We discuss our roadmap for future enhancements, including broader model support and new hardware backends. The DeepSpeed-FastGen code is readily available for community engagement and contribution. △ Less

Submitted 9 January, 2024; originally announced January 2024.

arXiv:2401.05837 [pdf, ps, other]

Intermediate-luminosity Type IIP SN 2021gmj: a low-energy explosion with signatures of circumstellar material

Authors: Yuta Murai, Masaomi Tanaka, Miho Kawabata, Kenta Taguchi, Rishabh Singh Teja, Tatsuya Nakaoka, Keiichi Maeda, Koji S. Kawabata, Takashi Nagao, Takashi J. Moriya, D. K. Sahu, G. C. Anupama, Nozomu Tominaga, Tomoki Morokuma, Ryo Imazawa, Satoko Inutsuka, Keisuke Isogai, Toshihiro Kasuga, Naoto Kobayashi, Sohei Kondo, Hiroyuki Maehara, Yuki Mori, Yuu Niino, Mao Ogawa, Ryou Ohsawa , et al. (6 additional authors not shown)

Abstract: We present photometric, spectroscopic and polarimetric observations of the intermediate-luminosity Type IIP supernova (SN) 2021gmj from 1 to 386 days after the explosion. The peak absolute V-band magnitude of SN 2021gmj is -15.5 mag, which is fainter than that of normal Type IIP SNe. The spectral evolution of SN 2021gmj resembles that of other sub-luminous supernovae: the optical spectra show narr… ▽ More We present photometric, spectroscopic and polarimetric observations of the intermediate-luminosity Type IIP supernova (SN) 2021gmj from 1 to 386 days after the explosion. The peak absolute V-band magnitude of SN 2021gmj is -15.5 mag, which is fainter than that of normal Type IIP SNe. The spectral evolution of SN 2021gmj resembles that of other sub-luminous supernovae: the optical spectra show narrow P-Cygni profiles, indicating a low expansion velocity. We estimate the progenitor mass to be about 12 Msun from the nebular spectrum and the 56Ni mass to be about 0.02 Msun from the bolometric light curve. We also derive the explosion energy to be about 3 x 10^{50} erg by comparing numerical light curve models with the observed light curves. Polarization in the plateau phase is not very large, suggesting nearly spherical outer envelope. The early photometric observations capture the rapid rise of the light curve, which is likely due to the interaction with a circumstellar material (CSM). The broad emission feature formed by highly-ionized lines on top of a blue continuum in the earliest spectrum gives further indication of the CSM at the vicinity of the progenitor. Our work suggests that a relatively low-mass progenitor of an intermediate-luminosity Type IIP SN can also experience an enhanced mass loss just before the explosion, as suggested for normal Type IIP SNe. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: 18 pages, 16 figures, resubmitted to MNRAS after addressing referee comments

arXiv:2401.00668 [pdf, other]

The structure of the stellar halo of the Andromeda galaxy explored with the NB515 for Subaru/HSC. I.: New Insights on the stellar halo up to 120 kpc

Authors: Itsuki Ogami, Mikito Tanaka, Yutaka Komiyama, Masashi Chiba, Puragra Guhathakurta, Evan N. Kirby, Rosemary F. G. Wyse, Carrie Filion, Karoline M. Gilbert, Ivanna Escala, Masao Mori, Takanobu Kirihara, Masayuki Tanaka, Miho N. Ishigaki, Kohei Hayashi, Myun Gyoon Lee, Sanjib Sharma, Jason S. Kalirai, Robert H. Lupton

Abstract: We analyse the M31 halo and its substructure within a projected radius of 120 kpc using a combination of Subaru/HSC NB515 and CFHT/MegaCam g- & i-bands. We succeed in separating M31's halo stars from foreground contamination with $\sim$ 90 \% accuracy by using the surface gravity sensitive NB515 filter. Based on the selected M31 halo stars, we discover three new substructures, which associate with… ▽ More We analyse the M31 halo and its substructure within a projected radius of 120 kpc using a combination of Subaru/HSC NB515 and CFHT/MegaCam g- & i-bands. We succeed in separating M31's halo stars from foreground contamination with $\sim$ 90 \% accuracy by using the surface gravity sensitive NB515 filter. Based on the selected M31 halo stars, we discover three new substructures, which associate with the Giant Southern Stream (GSS) based on their photometric metallicity estimates. We also produce the distance and photometric metallicity estimates for the known substructures. While these quantities for the GSS are reproduced in our study, we find that the North-Western stream shows a steeper distance gradient than found in an earlier study, suggesting that it is likely to have formed in an orbit closer to the Milky Way. For two streams in the eastern halo (Stream C and D), we identify distance gradients that had not been resolved. Finally, we investigate the global halo photometric metallicity distribution and surface brightness profile using the NB515-selected halo stars. We find that the surface brightness of the metal-poor and metal-rich halo populations, and the all population can be fitted to a power-law profile with an index of $α= -1.65 \pm 0.02$, $-2.82\pm0.01$, and $-2.44\pm0.01$, respectively. In contrast to the relative smoothness of the halo profile, its photometric metallicity distribution appears to be spatially non-uniform with nonmonotonic trends with radius, suggesting that the halo population had insufficient time to dynamically homogenize the accreted populations. △ Less

Submitted 1 January, 2024; originally announced January 2024.

Comments: 24 pages, 26 figures, 5 tables, submitted to MNRAS

arXiv:2312.12907 [pdf, ps, other]

doi 10.1103/PhysRevD.109.092001

Solar neutrino measurements using the full data period of Super-Kamiokande-IV

Authors: Super-Kamiokande Collaboration, :, K. Abe, C. Bronner, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, S. Imaizumi, K. Iyogi, J. Kameda, Y. Kanemura, R. Kaneshima, Y. Kashiwagi, Y. Kataoka, Y. Kato, Y. Kishimoto, S. Miki, S. Mine, M. Miura, T. Mochizuki, S. Moriyama, Y. Nagao, M. Nakahata , et al. (305 additional authors not shown)

Abstract: An analysis of solar neutrino data from the fourth phase of Super-Kamiokande~(SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$~days and the total live time for all four phases is $5805$~days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering th… ▽ More An analysis of solar neutrino data from the fourth phase of Super-Kamiokande~(SK-IV) from October 2008 to May 2018 is performed and the results are presented. The observation time of the data set of SK-IV corresponds to $2970$~days and the total live time for all four phases is $5805$~days. For more precise solar neutrino measurements, several improvements are applied in this analysis: lowering the data acquisition threshold in May 2015, further reduction of the spallation background using neutron clustering events, precise energy reconstruction considering the time variation of the PMT gain. The observed number of solar neutrino events in $3.49$--$19.49$ MeV electron kinetic energy region during SK-IV is $65,443^{+390}_{-388}\,(\mathrm{stat.})\pm 925\,(\mathrm{syst.})$ events. Corresponding $\mathrm{^{8}B}$ solar neutrino flux is $(2.314 \pm 0.014\, \rm{(stat.)} \pm 0.040 \, \rm{(syst.)}) \times 10^{6}~\mathrm{cm^{-2}\,s^{-1}}$, assuming a pure electron-neutrino flavor component without neutrino oscillations. The flux combined with all SK phases up to SK-IV is $(2.336 \pm 0.011\, \rm{(stat.)} \pm 0.043 \, \rm{(syst.)}) \times 10^{6}~\mathrm{cm^{-2}\,s^{-1}}$. Based on the neutrino oscillation analysis from all solar experiments, including the SK $5805$~days data set, the best-fit neutrino oscillation parameters are $\rm{sin^{2} θ_{12,\,solar}} = 0.306 \pm 0.013 $ and $Δm^{2}_{21,\,\mathrm{solar}} = (6.10^{+ 0.95}_{-0.81}) \times 10^{-5}~\rm{eV}^{2}$, with a deviation of about 1.5$σ$ from the $Δm^{2}_{21}$ parameter obtained by KamLAND. The best-fit neutrino oscillation parameters obtained from all solar experiments and KamLAND are $\sin^{2} θ_{12,\,\mathrm{global}} = 0.307 \pm 0.012 $ and $Δm^{2}_{21,\,\mathrm{global}} = (7.50^{+ 0.19}_{-0.18}) \times 10^{-5}~\rm{eV}^{2}$. △ Less

Submitted 20 February, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 47 pages, 61 figures

Journal ref: Phys. Rev. D 109, 092001 (2024)

arXiv:2312.08836 [pdf, ps, other]

A proper generating functional on a Podleś sphere

Authors: Masato Tanaka

Abstract: We construct a proper generating functional $L$ on a Podleś sphere and we show that $1$-cocycle arising from $L$ coincides with the one in our previous work. We also show that our 1-cocycle is purely non Gaussian and that the full `group' $C^{\ast}$-algebra of the quantum $SL(2,\mathbb{R})$ is liminal. We construct a proper generating functional $L$ on a Podleś sphere and we show that $1$-cocycle arising from $L$ coincides with the one in our previous work. We also show that our 1-cocycle is purely non Gaussian and that the full `group' $C^{\ast}$-algebra of the quantum $SL(2,\mathbb{R})$ is liminal. △ Less

Submitted 26 January, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

Comments: 18 pages. Any comments are welcome. My website: https://sites.google.com/view/masatotanaka-homepage/%E3%83%9B%E3%83%BC%E3%83%A0 My E-mails: tanakamasato.2121@gmail.com, masato.tanaka.c7@math.nagoya-u.ac.jp

MSC Class: 17B37; 20G42; 46L67

arXiv:2312.06286 [pdf, other]

Late Engine Activity in Neutron Star Mergers and Its Cocoon: An Alternative Scenario for the Blue Kilonova

Authors: Hamid Hamidani, Shigeo S. Kimura, Masaomi Tanaka, Kunihito Ioka

Abstract: Follow-up observations of short gamma-ray bursts (sGRBs) have continuously unveiled late extended/plateau emissions, attributed to jet launch due to late engine activity, the nature of which remains enigmatic. Observations of GW170817 confirmed that sGRBs are linked to neutron star (NS) mergers, and discovered a kilonova (KN) transient. Nevertheless, the origin of the early "blue" KN in GW170817 r… ▽ More Follow-up observations of short gamma-ray bursts (sGRBs) have continuously unveiled late extended/plateau emissions, attributed to jet launch due to late engine activity, the nature of which remains enigmatic. Observations of GW170817 confirmed that sGRBs are linked to neutron star (NS) mergers, and discovered a kilonova (KN) transient. Nevertheless, the origin of the early "blue" KN in GW170817 remains unclear. Here, we investigate the propagation of late jets in the merger ejecta. By analytically modeling jet dynamics, we determine the properties of the jet heated cocoon, and estimate its cooling emission. Our results reveal that late jets generate significantly brighter cocoons compared to prompt jets, primarily due to reduced energy loss by adiabatic cooling. Notably, with typical late jets, emission from the cocoon trapped inside the ejecta can reproduce the blue KN emission. We estimate that the forthcoming Einstein Probe mission will detect the early cocoon emission with a rate of $\sim 2.1_{-1.6}^{+3.2}$ yr$^{-1}$, and that optical/UV follow-ups in the LIGO-VIRGO-KAGRA O5 run will be able to detect $\sim 1.0_{-0.7}^{+1.5}$ cocoon emission events. As an electromagnetic counterpart, this emission provides an independent tool to probe NS mergers in the Universe, complementing insights from sGRBs and gravitational waves. △ Less

Submitted 25 January, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 31 pages, 7 figures, and 1 table. Accepted for publication in The Astrophysical Journal

arXiv:2312.04192 [pdf, other]

Convergence Rate Analysis of Continuous- and Discrete-Time Smoothing Gradient Algorithms

Authors: Mitsuru Toyoda, Akatsuki Nishioka, Mirai Tanaka

Abstract: This paper addresses the gradient flow -- the continuous-time representation of the gradient method -- with the smooth approximation of a non-differentiable objective function and presents convergence analysis framework. Similar to the gradient method, the gradient flow is inapplicable to the non-differentiable function minimization; therefore, this paper addresses the smoothing gradient method, w… ▽ More This paper addresses the gradient flow -- the continuous-time representation of the gradient method -- with the smooth approximation of a non-differentiable objective function and presents convergence analysis framework. Similar to the gradient method, the gradient flow is inapplicable to the non-differentiable function minimization; therefore, this paper addresses the smoothing gradient method, which exploits a decreasing smoothing parameter sequence in the smooth approximation. The convergence analysis is presented using conventional Lyapunov-function-based techniques, and a Lyapunov function applicable to both strongly convex and non-strongly convex objective functions is provided by taking into consideration the effect of the smooth approximation. Based on the equivalence of the stepsize in the smoothing gradient method and the discretization step in the forward Euler scheme for the numerical integration of the smoothing gradient flow, the sample values of the exact solution of the smoothing gradient flow are compared with the state variable of the smoothing gradient method, and the equivalence of the convergence rates is shown. △ Less

Submitted 7 December, 2023; originally announced December 2023.

arXiv:2312.02140 [pdf, other]

MMT/Binospec Spectroscopic Survey of Two $z\sim$ 0.8 Galaxy Clusters in the Eye of Horus Field

Authors: Jiyun Di, Eiichi Egami, Kenneth C. Wong, Chien-Hsiu Lee, Yuanhang Ning, Naomi Ota, Masayuki Tanaka

Abstract: The discovery of the Eye of Horus (EoH), a rare double source-plane lens system ($z_{\rm lens}=$ 0.795; $z_{\rm src}=$ 1.302 and 1.988), has also led to the identification of two high-redshift ($z_{\rm phot}\sim$ 0.8) galaxy clusters in the same field based on the subsequent analysis of the Subaru/Hyper Suprime-Cam (HSC) optical and XMM-Newton X-ray data. The two brightest cluster galaxies (BCGs),… ▽ More The discovery of the Eye of Horus (EoH), a rare double source-plane lens system ($z_{\rm lens}=$ 0.795; $z_{\rm src}=$ 1.302 and 1.988), has also led to the identification of two high-redshift ($z_{\rm phot}\sim$ 0.8) galaxy clusters in the same field based on the subsequent analysis of the Subaru/Hyper Suprime-Cam (HSC) optical and XMM-Newton X-ray data. The two brightest cluster galaxies (BCGs), one of which is the lensing galaxy of the EoH, are separated by only $\sim$100$"$ ($=$ 0.75 Mpc $<$ $r_{200}$) on the sky, raising the possibility that these two clusters may be physically associated. Here, we present a follow-up optical spectroscopic survey of this EoH field, obtaining 218 secure redshifts using MMT/Binospec. We have confirmed that there indeed exist two massive ($M_{\rm dyn}$ $>$ $10^{14}$ M$_\odot$) clusters of galaxies at $z$ $=$ 0.795 (the main cluster) and at $z=0.769$ (the NE cluster). However, these clusters have a velocity offset of $\sim$4300 km s$^{-1}$, suggesting that this two-cluster system is likely a line-of-sight projection rather than a physically-related association (e.g., a cluster merger). In terms of the properties of cluster-member galaxies, these two $z\sim0.8$ clusters appear well-developed, each harboring an old (age $=$ 3.6-6.0 Gyr) and massive ($M_\mathrm{*}$ $=$ 4.2-9.5 $\times$ $10^{11}$ M$_\odot$) BCG and exhibiting a well-established red sequence (RS). This study underscores the importance of conducting a spectroscopic follow-up for high-redshift cluster candidates because RS-based cluster selections are susceptible to such a projection effect in general. △ Less

Submitted 4 December, 2023; originally announced December 2023.

Comments: 13 pages (+56 pages in appendices), 7(+47) figures, 4(+1) tables; to be submitted to ApJ

Showing 1–50 of 1,076 results for author: Tanaka, M