-
Status of Xtend telescope onboard X-Ray Imaging and Spectroscopy Mission (XRISM)
Authors:
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takashi Okajima,
Hirofumi Noda,
Hiroyuki Uchida,
Hiromasa Suzuki,
Shogo Benjamin Kobayashi,
Tomokage Yoneyama,
Kouichi Hagino,
Kumiko Nobukawa,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Hironori Matsumoto,
Takeshi Tsuru,
Makoto Yamauchi,
Isamu Hatsukade,
Hirokazu Odaka,
Takayoshi Kohmura,
Kazutaka Yamaoka,
Manabu Ishida,
Yoshitomo Maeda,
Takayuki Hayashi
, et al. (38 additional authors not shown)
Abstract:
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is…
▽ More
Xtend is one of the two telescopes onboard the X-ray imaging and spectroscopy mission (XRISM), which was launched on September 7th, 2023. Xtend comprises the Soft X-ray Imager (SXI), an X-ray CCD camera, and the X-ray Mirror Assembly (XMA), a thin-foil-nested conically approximated Wolter-I optics. A large field of view of $38^{\prime}\times38^{\prime}$ over the energy range from 0.4 to 13 keV is realized by the combination of the SXI and XMA with a focal length of 5.6 m. The SXI employs four P-channel, back-illuminated type CCDs with a thick depletion layer of 200 $μ$m. The four CCD chips are arranged in a 2$\times$2 grid and cooled down to $-110$ $^{\circ}$C with a single-stage Stirling cooler. Before the launch of XRISM, we conducted a month-long spacecraft thermal vacuum test. The performance verification of the SXI was successfully carried out in a course of multiple thermal cycles of the spacecraft. About a month after the launch of XRISM, the SXI was carefully activated and the soundness of its functionality was checked by a step-by-step process. Commissioning observations followed the initial operation. We here present pre- and post-launch results verifying the Xtend performance. All the in-orbit performances are consistent with those measured on ground and satisfy the mission requirement. Extensive calibration studies are ongoing.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
Initial operations of the Soft X-ray Imager onboard XRISM
Authors:
Hiromasa Suzuki,
Tomokage Yoneyama,
Shogo B. Kobayashi,
Hirofumi Noda,
Hiroyuki Uchida,
Kumiko K. Nobukawa,
Kouichi Hagino,
Koji Mori,
Hiroshi Tomida,
Hiroshi Nakajima,
Takaaki Tanaka,
Hiroshi Murakami,
Hideki Uchiyama,
Masayoshi Nobukawa,
Yoshiaki Kanemaru,
Yoshinori Otsuka,
Haruhiko Yokosu,
Wakana Yonemaru,
Hanako Nakano,
Kazuhiro Ichikawa,
Reo Takemoto,
Tsukasa Matsushima,
Marina Yoshimoto,
Mio Aoyagi,
Kohei Shima
, et al. (30 additional authors not shown)
Abstract:
XRISM (X-Ray Imaging and Spectroscopy Mission) is an astronomical satellite with the capability of high-resolution spectroscopy with the X-ray microcalorimeter, Resolve, and wide field-of-view imaging with the CCD camera, Xtend. The Xtend consists of the mirror assembly (XMA: X-ray Mirror Assembly) and detector (SXI: Soft X-ray Imager). The components of SXI include CCDs, analog and digital electr…
▽ More
XRISM (X-Ray Imaging and Spectroscopy Mission) is an astronomical satellite with the capability of high-resolution spectroscopy with the X-ray microcalorimeter, Resolve, and wide field-of-view imaging with the CCD camera, Xtend. The Xtend consists of the mirror assembly (XMA: X-ray Mirror Assembly) and detector (SXI: Soft X-ray Imager). The components of SXI include CCDs, analog and digital electronics, and a mechanical cooler. After the successful launch on September 6th, 2023 (UT) and subsequent critical operations, the mission instruments were turned on and set up. The CCDs have been kept at the designed operating temperature of $-110^\circ$C ~after the electronics and cooling system were successfully set up. During the initial operation phase, which continued for more than a month after the critical operations, we verified the observation procedure, stability of the cooling system, all the observation options with different imaging areas and/or timing resolutions, and operations for protection against South Atlantic Anomaly. We optimized the operation procedure and observation parameters including the cooler settings, imaging areas for the specific modes with higher timing resolutions, and event selection algorithm. We summarize our policy and procedure of the initial operations for SXI. We also report on a couple of issues we faced during the initial operations and lessons learned from them.
△ Less
Submitted 28 June, 2024;
originally announced June 2024.
-
H.E.S.S. observations of the 2021 periastron passage of PSR B1259-63/LS 2883
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff,
S. Casanova
, et al. (119 additional authors not shown)
Abstract:
PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ day…
▽ More
PSR B1259-63 is a gamma-ray binary system that hosts a pulsar in an eccentric orbit, with a 3.4 year period, around an O9.5Ve star. At orbital phases close to periastron passages, the system radiates bright and variable non-thermal emission. We report on an extensive VHE observation campaign conducted with the High Energy Stereoscopic System, comprised of ~100 hours of data taken from $t_p-24$ days to $t_p+127$ days around the system's 2021 periastron passage. We also present the timing and spectral analyses of the source. The VHE light curve in 2021 is consistent with the stacked light curve of all previous observations. Within the light curve, we report a VHE maximum at times coincident with the third X-ray peak first detected in the 2021 X-ray light curve. In the light curve -- although sparsely sampled in this time period -- we see no VHE enhancement during the second disc crossing. In addition, we see no correspondence to the 2021 GeV flare in the VHE light curve. The VHE spectrum obtained from the analysis of the 2021 dataset is best described by a power law of spectral index $Γ= 2.65 \pm 0.04_{\text{stat}}$ $\pm 0.04_{\text{sys}}$, a value consistent with the previous H.E.S.S. observations of the source. We report spectral variability with a difference of $ΔΓ= 0.56 ~\pm~ 0.18_{\text{stat}}$ $~\pm~0.10_{\text{sys}}$ at 95% c.l., between sub-periods of the 2021 dataset. We also find a linear correlation between contemporaneous flux values of X-ray and TeV datasets, detected mainly after $t_p+25$ days, suggesting a change in the available energy for non-thermal radiation processes. We detect no significant correlation between GeV and TeV flux points, within the uncertainties of the measurements, from $\sim t_p-23$ days to $\sim t_p+126$ days. This suggests that the GeV and TeV emission originate from different electron populations.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Integrable $\mathbb{Z}_2^2$-graded Extensions of the Liouville and Sinh-Gordon Theories
Authors:
Naruhiko Aizawa,
Ren Ito,
Zhanna Kuznetsova,
Toshiya Tanaka,
Francesco Toppan
Abstract:
In this paper we present a general framework to construct integrable $\mathbb{Z}_2^2$-graded extensions of classical, two-dimensional Toda and conformal affine Toda theories. The scheme is applied to define the extended Liouville and Sinh-Gordon models; they are based on $\mathbb{Z}_2^2$-graded color Lie algebras and their fields satisfy a parabosonic statististics. The mathematical tools here int…
▽ More
In this paper we present a general framework to construct integrable $\mathbb{Z}_2^2$-graded extensions of classical, two-dimensional Toda and conformal affine Toda theories. The scheme is applied to define the extended Liouville and Sinh-Gordon models; they are based on $\mathbb{Z}_2^2$-graded color Lie algebras and their fields satisfy a parabosonic statististics. The mathematical tools here introduced are the $\mathbb{Z}_2^2$-graded covariant extensions of the Lax pair formalism and of the Polyakov's soldering procedure. The $\mathbb{Z}_2^2$-graded Sinh-Gordon model is derived from an affine $\mathbb{Z}_2^2$-graded color Lie algebra, mimicking a procedure originally introduced by Babelon-Bonora to derive the ordinary Sinh-Gordon model. The color Lie algebras under considerations are: the $6$-generator $\mathbb{Z}_2^2$-graded $sl_2$, the $\mathbb{Z}_2^2$-graded affine ${\widehat{sl_2}}$ algebra with two central extensions, the $\mathbb{Z}_2^2$-graded Virasoro algebra obtained from a Hamiltonian reduction.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
COSMOS-Web: The over-abundance and physical nature of "little red dots"--Implications for early galaxy and SMBH assembly
Authors:
Hollis B. Akins,
Caitlin M. Casey,
Erini Lambrides,
Natalie Allen,
Irham T. Andika,
Malte Brinch,
Jaclyn B. Champagne,
Olivia Cooper,
Xuheng Ding,
Nicole E. Drakos,
Andreas Faisst,
Steven L. Finkelstein,
Maximilien Franco,
Seiji Fujimoto,
Fabrizio Gentile,
Steven Gillman,
Ghassem Gozaliasl,
Santosh Harish,
Christopher C. Hayward,
Michaela Hirschmann,
Olivier Ilbert,
Jeyhan S. Kartaltepe,
Dale D. Kocevski,
Anton M. Koekemoer,
Vasily Kokorev
, et al. (16 additional authors not shown)
Abstract:
JWST has revealed a population of compact and extremely red galaxies at $z>4$, which likely host active galactic nuclei (AGN). We present a sample of 434 ``little red dots'' (LRDs), selected from the 0.54 deg$^2$ COSMOS-Web survey. We fit galaxy and AGN SED models to derive redshifts and physical properties; the sample spans $z\sim5$-$9$ after removing brown dwarf contaminants. We consider two ext…
▽ More
JWST has revealed a population of compact and extremely red galaxies at $z>4$, which likely host active galactic nuclei (AGN). We present a sample of 434 ``little red dots'' (LRDs), selected from the 0.54 deg$^2$ COSMOS-Web survey. We fit galaxy and AGN SED models to derive redshifts and physical properties; the sample spans $z\sim5$-$9$ after removing brown dwarf contaminants. We consider two extreme physical scenarios: either LRDs are all AGN, and their continuum emission is dominated by the accretion disk, or they are all compact star-forming galaxies, and their continuum is dominated by stars. If LRDs are AGN-dominated, our sample exhibits bolometric luminosities $\sim10^{45-47}$ erg\,s$^{-1}$, spanning the gap between JWST AGN in the literature and bright, rare quasars. We derive a bolometric luminosity function (LF) $\sim100$ times the (UV-selected) quasar LF, implying a non-evolving black hole accretion density of $\sim10^{-4}$ M$_\odot$ yr$^{-1}$ Mpc$^{-3}$ from $z\sim2$-$9$. By contrast, if LRDs are dominated by star formation, we derive stellar masses $\sim10^{8.5-10}\,M_\odot$. MIRI/F770W is key to deriving accurate stellar masses; without it, we derive a mass function inconsistent with $Λ$CDM. The median stellar mass profile is broadly consistent with the maximal stellar mass surface densities seen in the nearby universe, though the most massive $\sim50$\% of objects exceed this limit, requiring substantial AGN contribution to the continuum. Nevertheless, stacking all available X-ray, mid-IR, far-IR/sub-mm, and radio data yields non-detections. Whether dominated by dusty AGN, compact star-formation, or both, the high masses/luminosities and remarkable abundance of LRDs implies a dominant mode of early galaxy/SMBH growth.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Distributionally Robust Safe Sample Screening
Authors:
Hiroyuki Hanada,
Aoyama Tatsuya,
Akahane Satoshi,
Tomonari Tanaka,
Yoshito Okura,
Yu Inatsu,
Noriaki Hashimoto,
Shion Takeno,
Taro Murayama,
Hanju Lee,
Shinya Kojima,
Ichiro Takeuchi
Abstract:
In this study, we propose a machine learning method called Distributionally Robust Safe Sample Screening (DRSSS). DRSSS aims to identify unnecessary training samples, even when the distribution of the training samples changes in the future. To achieve this, we effectively combine the distributionally robust (DR) paradigm, which aims to enhance model robustness against variations in data distributi…
▽ More
In this study, we propose a machine learning method called Distributionally Robust Safe Sample Screening (DRSSS). DRSSS aims to identify unnecessary training samples, even when the distribution of the training samples changes in the future. To achieve this, we effectively combine the distributionally robust (DR) paradigm, which aims to enhance model robustness against variations in data distribution, with the safe sample screening (SSS), which identifies unnecessary training samples prior to model training. Since we need to consider an infinite number of scenarios regarding changes in the distribution, we applied SSS because it does not require model training after the change of the distribution. In this paper, we employed the covariate shift framework to represent the distribution of training samples and reformulated the DR covariate-shift problem as a weighted empirical risk minimization problem, where the weights are subject to uncertainty within a predetermined range. By extending the existing SSS technique to accommodate this weight uncertainty, the DRSSS method is capable of reliably identifying unnecessary samples under any future distribution within a specified range. We provide a theoretical guarantee for the DRSSS method and validate its performance through numerical experiments on both synthetic and real-world datasets.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Generalized Exponentiated Gradient Algorithms and Their Application to On-Line Portfolio Selection
Authors:
Andrzej Cichocki,
Sergio Cruces,
Auxiliadora Sarmiento,
Toshihisa Tanaka
Abstract:
This paper introduces a novel family of generalized exponentiated gradient (EG) updates derived from an Alpha-Beta divergence regularization function. Collectively referred to as EGAB, the proposed updates belong to the category of multiplicative gradient algorithms for positive data and demonstrate considerable flexibility by controlling iteration behavior and performance through three hyperparam…
▽ More
This paper introduces a novel family of generalized exponentiated gradient (EG) updates derived from an Alpha-Beta divergence regularization function. Collectively referred to as EGAB, the proposed updates belong to the category of multiplicative gradient algorithms for positive data and demonstrate considerable flexibility by controlling iteration behavior and performance through three hyperparameters: $α$, $β$, and the learning rate $η$. To enforce a unit $l_1$ norm constraint for nonnegative weight vectors within generalized EGAB algorithms, we develop two slightly distinct approaches. One method exploits scale-invariant loss functions, while the other relies on gradient projections onto the feasible domain. As an illustration of their applicability, we evaluate the proposed updates in addressing the online portfolio selection problem (OLPS) using gradient-based methods. Here, they not only offer a unified perspective on the search directions of various OLPS algorithms (including the standard exponentiated gradient and diverse mean-reversion strategies), but also facilitate smooth interpolation and extension of these updates due to the flexibility in hyperparameter selection. Simulation results confirm that the adaptability of these generalized gradient updates can effectively enhance the performance for some portfolios, particularly in scenarios involving transaction costs.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Cyclic image generation using chaotic dynamics
Authors:
Takaya Tanaka,
Yutaka Yamaguti
Abstract:
Successive image generation using cyclic transformations is demonstrated by extending the CycleGAN model to transform images among three different categories. Repeated application of the trained generators produces sequences of images that transition among the different categories. The generated image sequences occupy a more limited region of the image space compared with the original training dat…
▽ More
Successive image generation using cyclic transformations is demonstrated by extending the CycleGAN model to transform images among three different categories. Repeated application of the trained generators produces sequences of images that transition among the different categories. The generated image sequences occupy a more limited region of the image space compared with the original training dataset. Quantitative evaluation using precision and recall metrics indicates that the generated images have high quality but reduced diversity relative to the training dataset. Such successive generation processes are characterized as chaotic dynamics in terms of dynamical system theory. Positive Lyapunov exponents estimated from the generated trajectories confirm the presence of chaotic dynamics, with the Lyapunov dimension of the attractor found to be comparable to the intrinsic dimension of the training data manifold. The results suggest that chaotic dynamics in the image space defined by the deep generative model contribute to the diversity of the generated images, constituting a novel approach for multi-class image generation. This model can be interpreted as an extension of classical associative memory to perform hetero-association among image categories.
△ Less
Submitted 31 May, 2024;
originally announced May 2024.
-
Efficient first-principles approach to Gibbs free energy with thermal expansion
Authors:
Kota Hashimoto,
Tomonori Tanaka,
Yoshihiro Gohda
Abstract:
We propose a method to evaluate the Gibbs free energy from constant-volume first-principles calculations. The volume integral of the pressure is performed by determining the volume and the bulk modulus in equilibrium at finite temperatures, where the pressure and its volume derivative are evaluated utilizing first-principles calculations of the Grüneisen parameter without varying the volume. As an…
▽ More
We propose a method to evaluate the Gibbs free energy from constant-volume first-principles calculations. The volume integral of the pressure is performed by determining the volume and the bulk modulus in equilibrium at finite temperatures, where the pressure and its volume derivative are evaluated utilizing first-principles calculations of the Grüneisen parameter without varying the volume. As an example, the validity of our method is demonstrated for fcc-Al by comparing with the conventional quasiharmonic approximation that is much more computationally-demanding.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Remarks on Loss Function of Threshold Method for Ordinal Regression Problem
Authors:
Ryoya Yamasaki,
Toshiyuki Tanaka
Abstract:
Threshold methods are popular for ordinal regression problems, which are classification problems for data with a natural ordinal relation. They learn a one-dimensional transformation (1DT) of observations of the explanatory variable, and then assign label predictions to the observations by thresholding their 1DT values. In this paper, we study the influence of the underlying data distribution and…
▽ More
Threshold methods are popular for ordinal regression problems, which are classification problems for data with a natural ordinal relation. They learn a one-dimensional transformation (1DT) of observations of the explanatory variable, and then assign label predictions to the observations by thresholding their 1DT values. In this paper, we study the influence of the underlying data distribution and of the learning procedure of the 1DT on the classification performance of the threshold method via theoretical considerations and numerical experiments. Consequently, for example, we found that threshold methods based on typical learning procedures may perform poorly when the probability distribution of the target variable conditioned on an observation of the explanatory variable tends to be non-unimodal. Another instance of our findings is that learned 1DT values are concentrated at a few points under the learning procedure based on a piecewise-linear loss function, which can make difficult to classify data well.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods
Authors:
Ryoya Yamasaki,
Toshiyuki Tanaka
Abstract:
Ordinal regression (OR) is classification of ordinal data in which the underlying categorical target variable has a natural ordinal relation for the underlying explanatory variable. For $K$-class OR tasks, threshold methods learn a one-dimensional transformation (1DT) of the explanatory variable so that 1DT values for observations of the explanatory variable preserve the order of label values…
▽ More
Ordinal regression (OR) is classification of ordinal data in which the underlying categorical target variable has a natural ordinal relation for the underlying explanatory variable. For $K$-class OR tasks, threshold methods learn a one-dimensional transformation (1DT) of the explanatory variable so that 1DT values for observations of the explanatory variable preserve the order of label values $1,\ldots,K$ for corresponding observations of the target variable well, and then assign a label prediction to the learned 1DT through threshold labeling, namely, according to the rank of an interval to which the 1DT belongs among intervals on the real line separated by $(K-1)$ threshold parameters. In this study, we propose a parallelizable algorithm to find the optimal threshold labeling, which was developed in previous research, and derive sufficient conditions for that algorithm to successfully output the optimal threshold labeling. In a numerical experiment we performed, the computation time taken for the whole learning process of a threshold method with the optimal threshold labeling could be reduced to approximately 60\,\% by using the proposed algorithm with parallel processing compared to using an existing algorithm based on dynamic programming.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Evaluation of the X-ray SOI pixel detector with the on-chip ADC
Authors:
Hiroumi Matsuhashi,
Kouichi Hagino,
Aya Bamba,
Ayaki Takeda,
Masataka Yukumoto,
Koji Mori,
Yusuke Nishioka,
Takeshi Go Tsuru,
Mizuki Uenomachi,
Tomonori Ikeda,
Masamune Matsuda,
Takuto Narita,
Hiromasa Suzuki,
Takaaki Tanaka,
Ikuo Kurachi,
Takayoshi Kohmura,
Yusuke Uchida,
Yasuo Arai,
Shoji Kawahito
Abstract:
XRPIX is the monolithic X-ray SOI (silicon-on-insulator) pixel detector, which has a time resolution better than 10 $\rmμ$s as well as a high detection efficiency for X-rays above 10 keV. XRPIX is planned to be installed on future X-ray satellites. To mount on satellites, it is essential that the ADC (analog-to-digital converter) be implemented on the detector because such peripheral circuits must…
▽ More
XRPIX is the monolithic X-ray SOI (silicon-on-insulator) pixel detector, which has a time resolution better than 10 $\rmμ$s as well as a high detection efficiency for X-rays above 10 keV. XRPIX is planned to be installed on future X-ray satellites. To mount on satellites, it is essential that the ADC (analog-to-digital converter) be implemented on the detector because such peripheral circuits must be as compact as possible to achieve a large imaging area in the limited space in satellites. Thus, we developed a new XRPIX device with the on-chip ADC, and evaluated its performances. As the results, the integral non-linearity was evaluated to be 6 LSB (least significant bit), equivalent to 36 eV. The differential non-linearity was less than 0.7 LSB, and input noise from the on-chip ADC was 5~$\rm{e^{-}}$. Also, we evaluated end-to-end performance including the sensor part as well as the on-chip ADC. As the results, energy resolution at 5.9 keV was 294 $\rm{\pm}$ 4 eV in full-width at half maximum for the best pixel.
△ Less
Submitted 10 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
High-finesse nanofiber Fabry-Pérot resonator in a portable storage container
Authors:
S. Horikawa,
S. Yang,
T. Tanaka,
T. Aoki,
S. Kato
Abstract:
We present characterization and storage methods for a high-finesse nanofiber Fabry-Pérot resonator. Reflection spectroscopy from both ends of the resonator allows for evaluation of the mirror transmittances and optical loss inside the resonator. To maintain the quality of the nanofiber resonator after the fabrication, we have developed a portable storage container. By filling the container with dr…
▽ More
We present characterization and storage methods for a high-finesse nanofiber Fabry-Pérot resonator. Reflection spectroscopy from both ends of the resonator allows for evaluation of the mirror transmittances and optical loss inside the resonator. To maintain the quality of the nanofiber resonator after the fabrication, we have developed a portable storage container. By filling the container with dry, clean nitrogen gas, we can prevent contamination of the nanofiber during storage. This approach allows us to minimize the additional optical loss to less than 0.08% over a week. The portable container facilitates both the fabrication and subsequent experimentation with the resonator in different locations. This flexibility expands the range of applications, including quantum optics, communication, and sensing.
△ Less
Submitted 7 May, 2024; v1 submitted 18 March, 2024;
originally announced May 2024.
-
Distributionally Robust Safe Screening
Authors:
Hiroyuki Hanada,
Satoshi Akahane,
Tatsuya Aoyama,
Tomonari Tanaka,
Yoshito Okura,
Yu Inatsu,
Noriaki Hashimoto,
Taro Murayama,
Lee Hanju,
Shinya Kojima,
Ichiro Takeuchi
Abstract:
In this study, we propose a method Distributionally Robust Safe Screening (DRSS), for identifying unnecessary samples and features within a DR covariate shift setting. This method effectively combines DR learning, a paradigm aimed at enhancing model robustness against variations in data distribution, with safe screening (SS), a sparse optimization technique designed to identify irrelevant samples…
▽ More
In this study, we propose a method Distributionally Robust Safe Screening (DRSS), for identifying unnecessary samples and features within a DR covariate shift setting. This method effectively combines DR learning, a paradigm aimed at enhancing model robustness against variations in data distribution, with safe screening (SS), a sparse optimization technique designed to identify irrelevant samples and features prior to model training. The core concept of the DRSS method involves reformulating the DR covariate-shift problem as a weighted empirical risk minimization problem, where the weights are subject to uncertainty within a predetermined range. By extending the SS technique to accommodate this weight uncertainty, the DRSS method is capable of reliably identifying unnecessary samples and features under any future distribution within a specified range. We provide a theoretical guarantee of the DRSS method and validate its performance through numerical experiments on both synthetic and real-world datasets.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
Deci-Hz gravitational waves from the self-interacting axion cloud around the rotating stellar mass black hole
Authors:
Hidetoshi Omiya,
Takuya Takahashi,
Takahiro Tanaka,
Hirotaka Yoshino
Abstract:
Gravitational waves from condensates of ultra-light particles, such as axion, around rotating black holes are a promising probe to search for unknown physics. For this purpose, we need to characterize the signal to detect the gravitational waves, which requires tracking the evolution of the condensates, including various effects. The axion self-interaction causes the non-linear coupling between th…
▽ More
Gravitational waves from condensates of ultra-light particles, such as axion, around rotating black holes are a promising probe to search for unknown physics. For this purpose, we need to characterize the signal to detect the gravitational waves, which requires tracking the evolution of the condensates, including various effects. The axion self-interaction causes the non-linear coupling between the superradiant modes, resulting in complicated branching of evolution. Most studies so far have considered evolution under the non-relativistic approximation or the two-mode approximation. In this paper, we numerically investigate the evolution of the axion condensate without these approximations, taking higher multipole modes into account. We also investigate the possible signature in gravitational waves from the condensate. We show that the higher multipole modes are excited, leading to the gravitational wave signal by the transition of the axion between different levels. The most prominent signal of gravitational waves arises from the transition between modes with their angular quantum numbers different by two. The gravitational wave signal is emitted in the deci-Hz band for stellar mass black holes, which might be observable with the future gravitational wave detectors.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Search for synchrotron emission from secondary electrons of proton-proton interaction in Galactic PeVatron candidate HESS J1641$-$463
Authors:
Naomi Tsuji,
Takaaki Tanaka,
Samar Safi-Harb,
Felix Aharonian,
Sabrina Casanova,
Roland Kothes,
Emmanuel Moulin,
Hiroyuki Uchida,
Yasunobu Uchiyama
Abstract:
HESS J1641-463 is an unidentified gamma-ray source with a hard TeV gamma-ray spectrum, and thus it has been proposed to be a possible candidate for cosmic ray (CR) accelerators up to PeV energies (a PeVatron candidate). The source spatially coincides with the radio supernova remnant (SNR) G338.5+0.1, but has not yet been fully explored in the X-ray band. We analyzed newly taken NuSTAR data, pointi…
▽ More
HESS J1641-463 is an unidentified gamma-ray source with a hard TeV gamma-ray spectrum, and thus it has been proposed to be a possible candidate for cosmic ray (CR) accelerators up to PeV energies (a PeVatron candidate). The source spatially coincides with the radio supernova remnant (SNR) G338.5+0.1, but has not yet been fully explored in the X-ray band. We analyzed newly taken NuSTAR data, pointing at HESS J1641-463, with 82 ks effective exposure time. There is no apparent X-ray counterpart of HESS J1641-463, while nearby stellar cluster, Mercer 81, and stray-light X-rays are detected. Combined with the archival Chandra data, partially covering the source, we derived an upper limit of $\sim 6\times 10^{-13}$ erg cm$^{-2}$ s$^{-1}$ in 2-10 keV ($\sim 3\times 10^{-13}$ erg cm$^{-2}$ s$^{-1}$ in 10-20 keV). If the gamma-ray emission is originated from decay of $π^0$ mesons produced in interactions between CR protons and ambient materials, secondary electrons in the proton-proton interactions can potentially emit synchrotron photons in the X-ray band, which can be tested by our X-ray observations. Although the obtained X-ray upper limits cannot place a constraint on the primary proton spectrum, it will be possible with a future hard X-ray mission.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Strong lensing of gravitational waves with modified propagation
Authors:
Hiroki Takeda,
Takahiro Tanaka
Abstract:
We explore the impact of corrections to the propagation on the waveforms of gravitationally lensed gravitational waves under the geometrical optics approximation, focusing on both uniform cosmological modifications and local modifications localized around lensing objects. By adopting a model-independent phenomenological approach, we systematically investigate the effects of these modifications in…
▽ More
We explore the impact of corrections to the propagation on the waveforms of gravitationally lensed gravitational waves under the geometrical optics approximation, focusing on both uniform cosmological modifications and local modifications localized around lensing objects. By adopting a model-independent phenomenological approach, we systematically investigate the effects of these modifications in strong lensing scenarios, where detection of multiple images is expected. Our analysis reveals that cosmological modifications can yield corrections to the time delay that remain to be minor compared with the effects that accumulate over the whole propagation process, which are present also in the unlensed waveform. By contrast, local modifications around lensing objects can alter the image position and also the magnification factor, which is potentially polarization-selective and frequency-dependent. In some case we can have image disappearance as well as signal amplification. Furthermore, we demonstrate that such modifications can cause degradation of waveform match with the templates based on general relativity. This study highlights the importance of considering waveform modifications to search for the signature of modified propagation or the existence of extra polarization modes, and proposes potential observational targets.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
MAProtoNet: A Multi-scale Attentive Interpretable Prototypical Part Network for 3D Magnetic Resonance Imaging Brain Tumor Classification
Authors:
Binghua Li,
Jie Mao,
Zhe Sun,
Chao Li,
Qibin Zhao,
Toshihisa Tanaka
Abstract:
Automated diagnosis with artificial intelligence has emerged as a promising area in the realm of medical imaging, while the interpretability of the introduced deep neural networks still remains an urgent concern. Although contemporary works, such as XProtoNet and MProtoNet, has sought to design interpretable prediction models for the issue, the localization precision of their resulting attribution…
▽ More
Automated diagnosis with artificial intelligence has emerged as a promising area in the realm of medical imaging, while the interpretability of the introduced deep neural networks still remains an urgent concern. Although contemporary works, such as XProtoNet and MProtoNet, has sought to design interpretable prediction models for the issue, the localization precision of their resulting attribution maps can be further improved. To this end, we propose a Multi-scale Attentive Prototypical part Network, termed MAProtoNet, to provide more precise maps for attribution. Specifically, we introduce a concise multi-scale module to merge attentive features from quadruplet attention layers, and produces attribution maps. The proposed quadruplet attention layers can enhance the existing online class activation mapping loss via capturing interactions between the spatial and channel dimension, while the multi-scale module then fuses both fine-grained and coarse-grained information for precise maps generation. We also apply a novel multi-scale mapping loss for supervision on the proposed multi-scale module. Compared to existing interpretable prototypical part networks in medical imaging, MAProtoNet can achieve state-of-the-art performance in localization on brain tumor segmentation (BraTS) datasets, resulting in approximately 4% overall improvement on activation precision score (with a best score of 85.8%), without using additional annotated labels of segmentation. Our code will be released in https://github.com/TUAT-Novice/maprotonet.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Randomized Greedy Methods for Weak Submodular Sensor Selection with Robustness Considerations
Authors:
Ege C. Kaya,
Michael Hibbard,
Takashi Tanaka,
Ufuk Topcu,
Abolfazl Hashemi
Abstract:
We study a pair of budget- and performance-constrained weak submodular maximization problems. For computational efficiency, we explore the use of stochastic greedy algorithms which limit the search space via random sampling instead of the standard greedy procedure which explores the entire feasible search space. We propose a pair of stochastic greedy algorithms, namely, Modified Randomized Greedy…
▽ More
We study a pair of budget- and performance-constrained weak submodular maximization problems. For computational efficiency, we explore the use of stochastic greedy algorithms which limit the search space via random sampling instead of the standard greedy procedure which explores the entire feasible search space. We propose a pair of stochastic greedy algorithms, namely, Modified Randomized Greedy (MRG) and Dual Randomized Greedy (DRG) to approximately solve the budget- and performance-constrained problems, respectively. For both algorithms, we derive approximation guarantees that hold with high probability. We then examine the use of DRG in robust optimization problems wherein the objective is to maximize the worst-case of a number of weak submodular objectives and propose the Randomized Weak Submodular Saturation Algorithm (Random-WSSA). We further derive a high-probability guarantee for when Random-WSSA successfully constructs a robust solution. Finally, we showcase the effectiveness of these algorithms in a variety of relevant uses within the context of Earth-observing LEO constellations which estimate atmospheric weather conditions and provide Earth coverage.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
Unveiling extended gamma-ray emission around HESS J1813-178
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
J. Barnard,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaoui,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (126 additional authors not shown)
Abstract:
HESS J1813$-$178 is a very-high-energy $γ$-ray source spatially coincident with the young and energetic pulsar PSR J1813$-$1749 and thought to be associated with its pulsar wind nebula (PWN). Recently, evidence for extended high-energy emission in the vicinity of the pulsar has been revealed in the Fermi Large Area Telescope (LAT) data. This motivates revisiting the HESS J1813$-$178 region, taking…
▽ More
HESS J1813$-$178 is a very-high-energy $γ$-ray source spatially coincident with the young and energetic pulsar PSR J1813$-$1749 and thought to be associated with its pulsar wind nebula (PWN). Recently, evidence for extended high-energy emission in the vicinity of the pulsar has been revealed in the Fermi Large Area Telescope (LAT) data. This motivates revisiting the HESS J1813$-$178 region, taking advantage of improved analysis methods and an extended data set. Using data taken by the High Energy Stereoscopic System (H.E.S.S.) experiment and the Fermi-LAT, we aim to describe the $γ$-ray emission in the region with a consistent model, to provide insights into its origin. We performed a likelihood-based analysis on 32 hours of H.E.S.S. data and 12 years of Fermi-LAT data and fit a spectro-morphological model to the combined datasets. These results allowed us to develop a physical model for the origin of the observed $γ$-ray emission in the region. In addition to the compact very-high-energy $γ$-ray emission centered on the pulsar, we find a significant yet previously undetected component along the Galactic plane. With Fermi-LAT data, we confirm extended high-energy emission consistent with the position and elongation of the extended emission observed with H.E.S.S. These results establish a consistent description of the emission in the region from GeV energies to several tens of TeV. This study suggests that HESS J1813$-$178 is associated with a $γ$-ray PWN powered by PSR J1813$-$1749. A possible origin of the extended emission component is inverse Compton emission from electrons and positrons that have escaped the confines of the pulsar and form a halo around the PWN.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Robust Diffusion Models for Adversarial Purification
Authors:
Guang Lin,
Zerui Tao,
Jianhai Zhang,
Toshihisa Tanaka,
Qibin Zhao
Abstract:
Diffusion models (DMs) based adversarial purification (AP) has shown to be the most powerful alternative to adversarial training (AT). However, these methods neglect the fact that pre-trained diffusion models themselves are not robust to adversarial attacks as well. Additionally, the diffusion process can easily destroy semantic information and generate a high quality image but totally different f…
▽ More
Diffusion models (DMs) based adversarial purification (AP) has shown to be the most powerful alternative to adversarial training (AT). However, these methods neglect the fact that pre-trained diffusion models themselves are not robust to adversarial attacks as well. Additionally, the diffusion process can easily destroy semantic information and generate a high quality image but totally different from the original input image after the reverse process, leading to degraded standard accuracy. To overcome these issues, a natural idea is to harness adversarial training strategy to retrain or fine-tune the pre-trained diffusion model, which is computationally prohibitive. We propose a novel robust reverse process with adversarial guidance, which is independent of given pre-trained DMs and avoids retraining or fine-tuning the DMs. This robust guidance can not only ensure to generate purified examples retaining more semantic content but also mitigate the accuracy-robustness trade-off of DMs for the first time, which also provides DM-based AP an efficient adaptive ability to new attacks. Extensive experiments are conducted on CIFAR-10, CIFAR-100 and ImageNet to demonstrate that our method achieves the state-of-the-art results and exhibits generalization against different attacks.
△ Less
Submitted 24 May, 2024; v1 submitted 24 March, 2024;
originally announced March 2024.
-
Spectrum and extension of the inverse-Compton emission of the Crab Nebula from a combined Fermi-LAT and H.E.S.S. analysis
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
A. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (137 additional authors not shown)
Abstract:
The Crab Nebula is a unique laboratory for studying the acceleration of electrons and positrons through their non-thermal radiation. Observations of very-high-energy $γ$ rays from the Crab Nebula have provided important constraints for modelling its broadband emission. We present the first fully self-consistent analysis of the Crab Nebula's $γ$-ray emission between 1 GeV and $\sim$100 TeV, that is…
▽ More
The Crab Nebula is a unique laboratory for studying the acceleration of electrons and positrons through their non-thermal radiation. Observations of very-high-energy $γ$ rays from the Crab Nebula have provided important constraints for modelling its broadband emission. We present the first fully self-consistent analysis of the Crab Nebula's $γ$-ray emission between 1 GeV and $\sim$100 TeV, that is, over five orders of magnitude in energy. Using the open-source software package Gammapy, we combined 11.4 yr of data from the Fermi Large Area Telescope and 80 h of High Energy Stereoscopic System (H.E.S.S.) data at the event level and provide a measurement of the spatial extension of the nebula and its energy spectrum. We find evidence for a shrinking of the nebula with increasing $γ$-ray energy. Furthermore, we fitted several phenomenological models to the measured data, finding that none of them can fully describe the spatial extension and the spectral energy distribution at the same time. Especially the extension measured at TeV energies appears too large when compared to the X-ray emission. Our measurements probe the structure of the magnetic field between the pulsar wind termination shock and the dust torus, and we conclude that the magnetic field strength decreases with increasing distance from the pulsar. We complement our study with a careful assessment of systematic uncertainties.
△ Less
Submitted 21 March, 2024; v1 submitted 19 March, 2024;
originally announced March 2024.
-
On a family of relations of rooted tree maps
Authors:
Hideki Murahara,
Tatsushi Tanaka,
Noriko Wakabayashi
Abstract:
This paper is devoted to proving an infinite sequence of relations for rooted tree maps. On the way, we also give a basis for the space of rooted tree maps.
This paper is devoted to proving an infinite sequence of relations for rooted tree maps. On the way, we also give a basis for the space of rooted tree maps.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Ultralight vector dark matter search using data from the KAGRA O3GK run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
I. Abouelfettouh,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi
, et al. (1778 additional authors not shown)
Abstract:
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…
▽ More
Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM.
△ Less
Submitted 5 March, 2024;
originally announced March 2024.
-
Universality of reservoir systems with recurrent neural networks
Authors:
Hiroki Yasumoto,
Toshiyuki Tanaka
Abstract:
Approximation capability of reservoir systems whose reservoir is a recurrent neural network (RNN) is discussed. In our problem setting, a reservoir system approximates a set of functions just by adjusting its linear readout while the reservoir is fixed. We will show what we call uniform strong universality of a family of RNN reservoir systems for a certain class of functions to be approximated. Th…
▽ More
Approximation capability of reservoir systems whose reservoir is a recurrent neural network (RNN) is discussed. In our problem setting, a reservoir system approximates a set of functions just by adjusting its linear readout while the reservoir is fixed. We will show what we call uniform strong universality of a family of RNN reservoir systems for a certain class of functions to be approximated. This means that, for any positive number, we can construct a sufficiently large RNN reservoir system whose approximation error for each function in the class of functions to be approximated is bounded from above by the positive number. Such RNN reservoir systems are constructed via parallel concatenation of RNN reservoirs.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Spatio-temporal reconstruction of substance dynamics using compressed sensing in multi-spectral magnetic resonance spectroscopic imaging
Authors:
Utako Yamamoto,
Hirohiko Imai,
Kei Sano,
Masayuki Ohzeki,
Tetsuya Matsuda,
Toshiyuki Tanaka
Abstract:
The objective of our study is to observe dynamics of multiple substances in vivo with high temporal resolution from multi-spectral magnetic resonance spectroscopic imaging (MRSI) data. The multi-spectral MRSI can effectively separate spectral peaks of multiple substances and is useful to measure spatial distributions of substances. However it is difficult to measure time-varying substance distribu…
▽ More
The objective of our study is to observe dynamics of multiple substances in vivo with high temporal resolution from multi-spectral magnetic resonance spectroscopic imaging (MRSI) data. The multi-spectral MRSI can effectively separate spectral peaks of multiple substances and is useful to measure spatial distributions of substances. However it is difficult to measure time-varying substance distributions directly by ordinary full sampling because the measurement requires a significantly long time. In this study, we propose a novel method to reconstruct the spatio-temporal distributions of substances from randomly undersampled multi-spectral MRSI data on the basis of compressed sensing (CS) and the partially separable function model with base spectra of substances. In our method, we have employed spatio-temporal sparsity and temporal smoothness of the substance distributions as prior knowledge to perform CS. The effectiveness of our method has been evaluated using phantom data sets of glass tubes filled with glucose or lactate solution in increasing amounts over time and animal data sets of a tumor-bearing mouse to observe the metabolic dynamics involved in the Warburg effect in vivo. The reconstructed results are consistent with the expected behaviors, showing that our method can reconstruct the spatio-temporal distribution of substances with a temporal resolution of four seconds which is extremely short time scale compared with that of full sampling. Since this method utilizes only prior knowledge naturally assumed for the spatio-temporal distributions of substances and is independent of the number of the spectral and spatial dimensions or the acquisition sequence of MRSI, it is expected to contribute to revealing the underlying substance dynamics in MRSI data already acquired or to be acquired in the future.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Tip of the iceberg: overmassive black holes at 4<z<7 found by JWST are not inconsistent with the local $\mathcal{M}_{\rm BH}$-$\mathcal{M}_\star$ relation
Authors:
Junyao Li,
John D. Silverman,
Yue Shen,
Marta Volonteri,
Knud Jahnke,
Ming-Yang Zhuang,
Matthew T. Scoggins,
Xuheng Ding,
Yuichi Harikane,
Masafusa Onoue,
Takumi S. Tanaka
Abstract:
JWST is revealing a new remarkable population of high-redshift ($z\gtrsim4$), low-luminosity Active Galactic Nuclei (AGNs) in deep surveys and detecting the host galaxy stellar light in the most luminous and massive quasars at $z\sim 6$ for the first time. Latest results claim supermassive black holes (SMBHs) in these systems to be significantly more massive than expected from the local BH mass -…
▽ More
JWST is revealing a new remarkable population of high-redshift ($z\gtrsim4$), low-luminosity Active Galactic Nuclei (AGNs) in deep surveys and detecting the host galaxy stellar light in the most luminous and massive quasars at $z\sim 6$ for the first time. Latest results claim supermassive black holes (SMBHs) in these systems to be significantly more massive than expected from the local BH mass - stellar mass ($\mathcal{M}_{\rm BH} - \mathcal{M}_\star$) relation and that this is not due to sample selection effects. Through detailed statistical modeling, we demonstrate that the coupled effects of selection biases (i.e., finite detection limit and requirements on detecting broad lines) and measurement uncertainties in $\mathcal{M}_{\rm BH}$ and $\mathcal{M}_\star$ can in fact largely account for the reported offset and flattening in the observed $\mathcal{M}_{\rm BH} - \mathcal{M}_\star$ relation toward the upper envelope of the local relation, even for those at $\mathcal{M}_{\rm BH} < 10^8\,M_{\odot}$. We further investigate the possible evolution of the $\mathcal{M}_{\rm BH} - \mathcal{M}_\star$ relation at $z\gtrsim 4$ with careful treatment of observational biases and consideration of the degeneracy between intrinsic evolution and dispersion in this relation. The bias-corrected intrinsic $\mathcal{M}_{\rm BH} - \mathcal{M}_\star$ relation in the low-mass regime suggests that there might be a large population of low-mass BHs (${\rm log}\,\mathcal{M}_{\rm BH} \lesssim 5$), possibly originating from lighter seeds, remaining undetected or unidentified even in the deepest JWST surveys. These results have important consequences for JWST studies of BH seeding and the coevolution between SMBHs and their host galaxies at the earliest cosmic times.
△ Less
Submitted 29 February, 2024;
originally announced March 2024.
-
Convergence Analysis of Blurring Mean Shift
Authors:
Ryoya Yamasaki,
Toshiyuki Tanaka
Abstract:
Blurring mean shift (BMS) algorithm, a variant of the mean shift algorithm, is a kernel-based iterative method for data clustering, where data points are clustered according to their convergent points via iterative blurring. In this paper, we analyze convergence properties of the BMS algorithm by leveraging its interpretation as an optimization procedure, which is known but has been underutilized…
▽ More
Blurring mean shift (BMS) algorithm, a variant of the mean shift algorithm, is a kernel-based iterative method for data clustering, where data points are clustered according to their convergent points via iterative blurring. In this paper, we analyze convergence properties of the BMS algorithm by leveraging its interpretation as an optimization procedure, which is known but has been underutilized in existing convergence studies. Whereas existing results on convergence properties applicable to multi-dimensional data only cover the case where all the blurred data point sequences converge to a single point, this study provides a convergence guarantee even when those sequences can converge to multiple points, yielding multiple clusters. This study also shows that the convergence of the BMS algorithm is fast by further leveraging geometrical characterization of the convergent points.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
Curvature in the very-high energy gamma-ray spectrum of M87
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
F. Bradascio,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik C. Burger-Scheidlin,
T. Bylund,
S. Casanova,
R. Cecil,
J. Celic,
M. Cerruti
, et al. (110 additional authors not shown)
Abstract:
The radio galaxy M87 is a variable very-high energy (VHE) gamma-ray source, exhibiting three major flares reported in 2005, 2008, and 2010. Despite extensive studies, the origin of the VHE gamma-ray emission is yet to be understood. In this study, we investigate the VHE gamma-ray spectrum of M87 during states of high gamma-ray activity, utilizing 20.2$\,$ hours the H.E.S.S. observations. Our findi…
▽ More
The radio galaxy M87 is a variable very-high energy (VHE) gamma-ray source, exhibiting three major flares reported in 2005, 2008, and 2010. Despite extensive studies, the origin of the VHE gamma-ray emission is yet to be understood. In this study, we investigate the VHE gamma-ray spectrum of M87 during states of high gamma-ray activity, utilizing 20.2$\,$ hours the H.E.S.S. observations. Our findings indicate a preference for a curved spectrum, characterized by a log-parabola model with extra-galactic background light (EBL) model above 0.3$\,$TeV at the 4$σ$ level, compared to a power-law spectrum with EBL. We investigate the degeneracy between the absorption feature and the EBL normalization and derive upper limits on EBL models mainly sensitive in the wavelength range 12.4$\,$$μ$m - 40$\,$$μ$m.
△ Less
Submitted 25 April, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
Return-Aligned Decision Transformer
Authors:
Tsunehiko Tanaka,
Kenshi Abe,
Kaito Ariu,
Tetsuro Morimura,
Edgar Simo-Serra
Abstract:
Traditional approaches in offline reinforcement learning aim to learn the optimal policy that maximizes the cumulative reward, also known as return. However, as applications broaden, it becomes increasingly crucial to train agents that not only maximize the returns, but align the actual return with a specified target return, giving control over the agent's performance. Decision Transformer (DT) op…
▽ More
Traditional approaches in offline reinforcement learning aim to learn the optimal policy that maximizes the cumulative reward, also known as return. However, as applications broaden, it becomes increasingly crucial to train agents that not only maximize the returns, but align the actual return with a specified target return, giving control over the agent's performance. Decision Transformer (DT) optimizes a policy that generates actions conditioned on the target return through supervised learning and is equipped with a mechanism to control the agent using the target return. However, the action generation is hardly influenced by the target return because DT's self-attention allocates scarce attention scores to the return tokens. In this paper, we propose Return-Aligned Decision Transformer (RADT), designed to effectively align the actual return with the target return. RADT utilizes features extracted by paying attention solely to the return, enabling the action generation to consistently depend on the target return. Extensive experiments show that RADT reduces the discrepancies between the actual return and the target return of DT-based methods.
△ Less
Submitted 27 May, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Counterfactual Explanations of Black-box Machine Learning Models using Causal Discovery with Applications to Credit Rating
Authors:
Daisuke Takahashi,
Shohei Shimizu,
Takuma Tanaka
Abstract:
Explainable artificial intelligence (XAI) has helped elucidate the internal mechanisms of machine learning algorithms, bolstering their reliability by demonstrating the basis of their predictions. Several XAI models consider causal relationships to explain models by examining the input-output relationships of prediction models and the dependencies between features. The majority of these models hav…
▽ More
Explainable artificial intelligence (XAI) has helped elucidate the internal mechanisms of machine learning algorithms, bolstering their reliability by demonstrating the basis of their predictions. Several XAI models consider causal relationships to explain models by examining the input-output relationships of prediction models and the dependencies between features. The majority of these models have been based their explanations on counterfactual probabilities, assuming that the causal graph is known. However, this assumption complicates the application of such models to real data, given that the causal relationships between features are unknown in most cases. Thus, this study proposed a novel XAI framework that relaxed the constraint that the causal graph is known. This framework leveraged counterfactual probabilities and additional prior information on causal structure, facilitating the integration of a causal graph estimated through causal discovery methods and a black-box classification model. Furthermore, explanatory scores were estimated based on counterfactual probabilities. Numerical experiments conducted employing artificial data confirmed the possibility of estimating the explanatory score more accurately than in the absence of a causal graph. Finally, as an application to real data, we constructed a classification model of credit ratings assigned by Shiga Bank, Shiga prefecture, Japan. We demonstrated the effectiveness of the proposed method in cases where the causal graph is unknown.
△ Less
Submitted 26 April, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization
Authors:
Guang Lin,
Chao Li,
Jianhai Zhang,
Toshihisa Tanaka,
Qibin Zhao
Abstract:
The deep neural networks are known to be vulnerable to well-designed adversarial attacks. The most successful defense technique based on adversarial training (AT) can achieve optimal robustness against particular attacks but cannot generalize well to unseen attacks. Another effective defense technique based on adversarial purification (AP) can enhance generalization but cannot achieve optimal robu…
▽ More
The deep neural networks are known to be vulnerable to well-designed adversarial attacks. The most successful defense technique based on adversarial training (AT) can achieve optimal robustness against particular attacks but cannot generalize well to unseen attacks. Another effective defense technique based on adversarial purification (AP) can enhance generalization but cannot achieve optimal robustness. Meanwhile, both methods share one common limitation on the degraded standard accuracy. To mitigate these issues, we propose a novel pipeline to acquire the robust purifier model, named Adversarial Training on Purification (AToP), which comprises two components: perturbation destruction by random transforms (RT) and purifier model fine-tuned (FT) by adversarial loss. RT is essential to avoid overlearning to known attacks, resulting in the robustness generalization to unseen attacks, and FT is essential for the improvement of robustness. To evaluate our method in an efficient and scalable way, we conduct extensive experiments on CIFAR-10, CIFAR-100, and ImageNette to demonstrate that our method achieves optimal robustness and exhibits generalization ability against unseen attacks.
△ Less
Submitted 15 March, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Acceleration and transport of relativistic electrons in the jets of the microquasar SS 433
Authors:
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
M. Bouyahiaou,
M. Breuhau,
R. Brose,
A. M. Brown,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin,
S. Caroff
, et al. (140 additional authors not shown)
Abstract:
SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton…
▽ More
SS 433 is a microquasar, a stellar binary system with collimated relativistic jets. We observed SS 433 in gamma rays using the High Energy Stereoscopic System (H.E.S.S.), finding an energy-dependent shift in the apparent position of the gamma-ray emission of the parsec-scale jets. These observations trace the energetic electron population and indicate the gamma rays are produced by inverse-Compton scattering. Modelling of the energy-dependent gamma-ray morphology constrains the location of particle acceleration and requires an abrupt deceleration of the jet flow. We infer the presence of shocks on either side of the binary system at distances of 25 to 30 parsecs and conclude that self-collimation of the precessing jets forms the shocks, which then efficiently accelerate electrons.
△ Less
Submitted 29 January, 2024;
originally announced January 2024.
-
The $M_{\rm BH}-M_*$ relation up to $z\sim2$ through decomposition of COSMOS-Web NIRCam images
Authors:
Takumi S. Tanaka,
John D. Silverman,
Xuheng Ding,
Knud Jahnke,
Benny Trakhtenbrot,
Erini Lambrides,
Masafusa Onoue,
Irham Taufik Andika,
Angela Bongiorno,
Andreas L. Faisst,
Steven Gillman,
Christopher C. Hayward,
Michaela Hirschmann,
Anton Koekemoer,
Vasily Kokorev,
Zhaoxuan Liu,
Georgios E. Magdis,
Alvio Renzini,
Caitlin Casey,
Nicole E. Drakos,
Maximilien Franco,
Ghassem Gozaliasl,
Jeyhan Kartaltepe,
Daizhong Liu,
Henry Joy McCracken
, et al. (3 additional authors not shown)
Abstract:
Our knowledge of relations between supermassive black holes and their host galaxies at $z\gtrsim1$ is still limited, even though being actively sought out to $z\sim6$. Here, we use the high resolution and sensitivity of JWST to measure the host galaxy properties for 61 X-ray-selected type-I AGNs at $0.7<z<2.5$ with rest-frame optical/near-infrared imaging from COSMOS-Web and PRIMER. Black hole mas…
▽ More
Our knowledge of relations between supermassive black holes and their host galaxies at $z\gtrsim1$ is still limited, even though being actively sought out to $z\sim6$. Here, we use the high resolution and sensitivity of JWST to measure the host galaxy properties for 61 X-ray-selected type-I AGNs at $0.7<z<2.5$ with rest-frame optical/near-infrared imaging from COSMOS-Web and PRIMER. Black hole masses ($\log\left(M_{\rm BH}/M_\odot\right)\sim7.5-9.5$) are available from previous spectroscopic campaigns. We extract the host galaxy components from four NIRCam broadband images and the HST/ACS F814W image by applying a 2D image decomposition technique. We detect the host galaxy for $\sim90\%$ of the sample after subtracting the unresolved AGN emission. With host photometry free of AGN emission, we determine the stellar mass of the host galaxies to be $\log\left(M_*/M_\odot\right)\sim10-11.5$ through SED fitting and measure the evolution of the mass relation between SMBHs and their host galaxies. Considering selection biases and measurement uncertainties, we find that the $M_\mathrm{ BH}/M_*$ ratio evolves as $\left(1+z\right)^{0.37_{-0.60}^{+0.35}}$ thus remains essentially constant or exhibits mild evolution up to $z\sim2.5$. We also see an amount of scatter ($σ_μ=0.28\pm0.13$) is similar to the local relation and consistent with low-$z$ studies; this appears to not rule out non-causal cosmic assembly where mergers contribute to the statistical averaging towards the local relation. We highlight improvements to come with larger samples from JWST and, particularly, Euclid, which will exceed the statistical power of wide and deep surveys such as Subaru Hyper Suprime-Cam.
△ Less
Submitted 24 January, 2024;
originally announced January 2024.
-
Three-Dimensional Velocity Diagnostics to Constrain the Type Ia Origin of Tycho's Supernova Remnant
Authors:
Hiroyuki Uchida,
Tomoaki Kasuga,
Keiichi Maeda,
Shiu-Hang Lee,
Takaaki Tanaka,
Aya Bamba
Abstract:
While various methods have been proposed to disentangle the progenitor system for Type Ia supernovae, their origin is still unclear. Circumstellar environment is a key to distinguishing between the double-degenerate (DD) and single-degenerate (SD) scenarios since a dense wind cavity is expected only in the case of the SD system. We perform spatially resolved X-ray spectroscopy of Tycho's supernova…
▽ More
While various methods have been proposed to disentangle the progenitor system for Type Ia supernovae, their origin is still unclear. Circumstellar environment is a key to distinguishing between the double-degenerate (DD) and single-degenerate (SD) scenarios since a dense wind cavity is expected only in the case of the SD system. We perform spatially resolved X-ray spectroscopy of Tycho's supernova remnant (SNR) with XMM-Newton and reveal the three-dimensional velocity structure of the expanding shock-heated ejecta measured from Doppler-broadened lines of intermediate-mass elements. Obtained velocity profiles are fairly consistent with those expected from a uniformly expanding ejecta model near the center, whereas we discover a rapid deceleration ($\sim4000$ km s$^{-1}$ to $\sim1000$ km s$^{-1}$) near the edge of the remnant in almost every direction. The result strongly supports the presence of a dense wall entirely surrounding the remnant, which is confirmed also by our hydrodynamical simulation. We thus conclude that Tycho's SNR is likely of the SD origin. Our new method will be useful for understanding progenitor systems of Type Ia SNRs in the era of high-angular/energy resolution X-ray astronomy with microcalorimeters.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Efficient Nonparametric Tensor Decomposition for Binary and Count Data
Authors:
Zerui Tao,
Toshihisa Tanaka,
Qibin Zhao
Abstract:
In numerous applications, binary reactions or event counts are observed and stored within high-order tensors. Tensor decompositions (TDs) serve as a powerful tool to handle such high-dimensional and sparse data. However, many traditional TDs are explicitly or implicitly designed based on the Gaussian distribution, which is unsuitable for discrete data. Moreover, most TDs rely on predefined multi-l…
▽ More
In numerous applications, binary reactions or event counts are observed and stored within high-order tensors. Tensor decompositions (TDs) serve as a powerful tool to handle such high-dimensional and sparse data. However, many traditional TDs are explicitly or implicitly designed based on the Gaussian distribution, which is unsuitable for discrete data. Moreover, most TDs rely on predefined multi-linear structures, such as CP and Tucker formats. Therefore, they may not be effective enough to handle complex real-world datasets. To address these issues, we propose ENTED, an \underline{E}fficient \underline{N}onparametric \underline{TE}nsor \underline{D}ecomposition for binary and count tensors. Specifically, we first employ a nonparametric Gaussian process (GP) to replace traditional multi-linear structures. Next, we utilize the \pg augmentation which provides a unified framework to establish conjugate models for binary and count distributions. Finally, to address the computational issue of GPs, we enhance the model by incorporating sparse orthogonal variational inference of inducing points, which offers a more effective covariance approximation within GPs and stochastic natural gradient updates for nonparametric models. We evaluate our model on several real-world tensor completion tasks, considering binary and count datasets. The results manifest both better performance and computational advantages of the proposed model.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Net-Zero Energy House-oriented Linear Programming for the Sizing Problem of Photovoltaic Panels and Batteries
Authors:
A. Daniel Carnerero,
Taichi Tanaka,
Mengmou Li,
Takeshi Hatanaka,
Yasuaki Wasa,
Kenji Hirata,
Yoshiaki Ushifusa,
Takanori Ida
Abstract:
The global drive towards carbon neutrality has led to a significant increase in the number of power plants based on renewable energy sources (RES). Concurrently, numerous households are adopting RES to generate their own energy, aiming to decrease both electricity costs and carbon footprints. To support these users, many papers have been devoted to developing optimal investment strategies for resi…
▽ More
The global drive towards carbon neutrality has led to a significant increase in the number of power plants based on renewable energy sources (RES). Concurrently, numerous households are adopting RES to generate their own energy, aiming to decrease both electricity costs and carbon footprints. To support these users, many papers have been devoted to developing optimal investment strategies for residential energy systems. However, there is still a significant gap as these studies often neglect important aspects like carbon neutrality. For this reason, in this paper, we explore the concept of net-zero energy houses (ZEHs) -- houses designed to have an annual net energy consumption around zero -- by presenting a constrained optimization problem to find the optimal number of photovoltaic panels and the optimal size of the battery system for home integration. Solving this constrained optimization problem is difficult due to its nonconvex constraints. Nevertheless, by applying a series of transformations, we reveal that it is possible to find an equivalent linear programming (LP) problem which is computationally tractable. The attainment of ZEH can be tackled by introducing a single constraint in the optimization problem. Additionally, we propose a sharing economy approach to the investment problem, offering a strategy that could potentially reduce investment costs and facilitate the attainment of ZEH more efficiently. Finally, we apply the proposed frameworks to a neighborhood in Japan as a case study, demonstrating the potential for long-term ZEH attainment. The results show that, under the right incentive, users can achieve ZEH, reduce their electricity costs and have a minimal impact on the main grid.
△ Less
Submitted 11 June, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
TeV flaring activity of the AGN PKS 0625-354 in November 2018
Authors:
H. E. S. S. Collaboration,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
A. Baktash,
V. Barbosa Martins,
J. Barnard,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
A. Brown,
F. Brun,
B. Bruno
, et al. (117 additional authors not shown)
Abstract:
Most $γ$-ray detected active galactic nuclei are blazars with one of their relativistic jets pointing towards the Earth. Only a few objects belong to the class of radio galaxies or misaligned blazars. Here, we investigate the nature of the object PKS 0625-354, its $γ$-ray flux and spectral variability and its broad-band spectral emission with observations from H.E.S.S., Fermi-LAT, Swift-XRT, and U…
▽ More
Most $γ$-ray detected active galactic nuclei are blazars with one of their relativistic jets pointing towards the Earth. Only a few objects belong to the class of radio galaxies or misaligned blazars. Here, we investigate the nature of the object PKS 0625-354, its $γ$-ray flux and spectral variability and its broad-band spectral emission with observations from H.E.S.S., Fermi-LAT, Swift-XRT, and UVOT taken in November 2018. The H.E.S.S. light curve above 200 GeV shows an outburst in the first night of observations followed by a declining flux with a halving time scale of 5.9h. The $γγ$-opacity constrains the upper limit of the angle between the jet and the line of sight to $\sim10^\circ$. The broad-band spectral energy distribution shows two humps and can be well fitted with a single-zone synchrotron self Compton emission model. We conclude that PKS 0625-354, as an object showing clear features of both blazars and radio galaxies, can be classified as an intermediate active galactic nuclei. Multi-wavelength studies of such intermediate objects exhibiting features of both blazars and radio galaxies are sparse but crucial for the understanding of the broad-band emission of $γ$-ray detected active galactic nuclei in general.
△ Less
Submitted 13 January, 2024;
originally announced January 2024.
-
EpilepsyLLM: Domain-Specific Large Language Model Fine-tuned with Epilepsy Medical Knowledge
Authors:
Xuyang Zhao,
Qibin Zhao,
Toshihisa Tanaka
Abstract:
With large training datasets and massive amounts of computing sources, large language models (LLMs) achieve remarkable performance in comprehensive and generative ability. Based on those powerful LLMs, the model fine-tuned with domain-specific datasets posseses more specialized knowledge and thus is more practical like medical LLMs. However, the existing fine-tuned medical LLMs are limited to gene…
▽ More
With large training datasets and massive amounts of computing sources, large language models (LLMs) achieve remarkable performance in comprehensive and generative ability. Based on those powerful LLMs, the model fine-tuned with domain-specific datasets posseses more specialized knowledge and thus is more practical like medical LLMs. However, the existing fine-tuned medical LLMs are limited to general medical knowledge with English language. For disease-specific problems, the model's response is inaccurate and sometimes even completely irrelevant, especially when using a language other than English. In this work, we focus on the particular disease of Epilepsy with Japanese language and introduce a customized LLM termed as EpilepsyLLM. Our model is trained from the pre-trained LLM by fine-tuning technique using datasets from the epilepsy domain. The datasets contain knowledge of basic information about disease, common treatment methods and drugs, and important notes in life and work. The experimental results demonstrate that EpilepsyLLM can provide more reliable and specialized medical knowledge responses.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Design study and spectroscopic performance of SOI pixel detector with a pinned depleted diode structure for X-ray astronomy
Authors:
Masataka Yukumoto,
Koji Mori,
Ayaki Takeda,
Yusuke Nishioka,
Syuto Yonemura,
Daisuke Izumi,
Uzuki Iwakiri,
Takeshi G. Tsuru,
Ikuo Kurachi,
Kouichi Hagino,
Yasuo Arai,
Takayoshi Kohmura,
Takaaki Tanaka,
Miraku Kimura,
Yuta Fuchita,
Taiga Yoshida,
Tomonori Ikeda
Abstract:
We have been developing silicon-on-insulator (SOI) pixel detectors with a pinned depleted diode (PDD) structure, named "XRPIX", for X-ray astronomy. The PDD structure is formed in a thick p-type substrate, to which high negative voltage is applied to make it fully depleted. A pinned p-well is introduced at the backside of the insulator layer to reduce a dark current generation at the Si-SiO$_{2}$…
▽ More
We have been developing silicon-on-insulator (SOI) pixel detectors with a pinned depleted diode (PDD) structure, named "XRPIX", for X-ray astronomy. The PDD structure is formed in a thick p-type substrate, to which high negative voltage is applied to make it fully depleted. A pinned p-well is introduced at the backside of the insulator layer to reduce a dark current generation at the Si-SiO$_{2}$ interface and to fix the back-gate voltage of the SOI transistors. An n-well is further introduced between the p-well and the substrate to make a potential barrier between them and suppress a leakage current. An optimization study on the n-well dopant concentration is necessary because a higher dopant concentration could result in a higher potential barrier but also in a larger sense-node capacitance leading to a lower spectroscopic performance, and vice versa. Based on a device simulation, we fabricated five candidate chips having different n-well dopant concentrations. We successfully found out the best n-well design, which suppressed a large leakage current and showed satisfactory X-ray spectroscopic performance. Too low and too high n-well dopant concentration chips showed a large leakage current and degraded X-ray spectroscopic performance, respectively. We also found that the dependency of X-ray spectroscopic performance on the n-well dopant concentration can be largely explained by the difference in sense-node capacitance.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
$\mathcal{N}=2$ Double graded supersymmetric quantum mechanics via dimensional reduction
Authors:
N. Aizawa,
Ren Ito,
Toshiya Tanaka
Abstract:
We present a novel $\mathcal{N} = 2 $ $\mathbb{Z}_2^2$-graded supersymmetric quantum mechanics ($\mathbb{Z}_2^2$-SQM) which has different features from those introduced so far. It is a two-dimensional (two-particle) system and is the first example of the quantum mechanical realization of an eight-dimensional irrep of the $\mathcal{N}=2$ $\mathbb{Z}_2^2$-supersymmetry algebra. The $\mathbb{Z}_2^2$-…
▽ More
We present a novel $\mathcal{N} = 2 $ $\mathbb{Z}_2^2$-graded supersymmetric quantum mechanics ($\mathbb{Z}_2^2$-SQM) which has different features from those introduced so far. It is a two-dimensional (two-particle) system and is the first example of the quantum mechanical realization of an eight-dimensional irrep of the $\mathcal{N}=2$ $\mathbb{Z}_2^2$-supersymmetry algebra. The $\mathbb{Z}_2^2$-SQM is obtained by quantizing the one-dimensional classical system derived by dimensional reduction from the two-dimensional $\mathbb{Z}_2^2$-supersymmetric Lagrangian of $\mathcal{N}=1$, which we constructed in our previous work. The ground states of the $\mathbb{Z}_2^2$-SQM are also investigated.
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
X-ray stacking reveals average SMBH accretion properties of star-forming galaxies and their cosmic evolution over 4 <~ z <~ 7
Authors:
Suin Matsui,
Kazuhiro Shimasaku,
Kei Ito,
Makoto Ando,
Takumi S. Tanaka
Abstract:
With an X-ray stacking analysis of ~ 12, 000 Lyman-break galaxies (LBGs) using the Chandra Legacy Survey image, we investigate average supermassive black hole (SMBH) accretion properties of star-forming galaxies (SFGs) at 4 <~ z <~ 7. Although no X-ray signal is detected in any stacked image, we obtain strong 3 sigma upper limits for the average black hole accretion rate (BHAR) as a function of st…
▽ More
With an X-ray stacking analysis of ~ 12, 000 Lyman-break galaxies (LBGs) using the Chandra Legacy Survey image, we investigate average supermassive black hole (SMBH) accretion properties of star-forming galaxies (SFGs) at 4 <~ z <~ 7. Although no X-ray signal is detected in any stacked image, we obtain strong 3 sigma upper limits for the average black hole accretion rate (BHAR) as a function of star formation rate (SFR). At z ~ 4 (5) where the stacked image is deeper, the 3 sigma BHAR upper limits per SFR are ~ 1.5 (1.0) dex lower than the local black hole-to-stellar mass ratio, indicating that the SMBHs of SFGs in the inactive (BHAR <~1M_sun yr^{-1}) phase are growing much more slowly than expected from simultaneous evolution. We obtain a similar result for BHAR per dark halo accretion rate. QSOs from the literature are found to have ~ 1 dex higher SFRs and >~ 2 dex higher BHARs than LBGs with the same dark halo mass. We also make a similar comparison for dusty starburst galaxies and quiescent galaxies from the literature. A duty-cycle corrected analysis shows that for a given dark halo, the SMBH mass increase in the QSO phase dominates over that in the much longer inactive phase. Finally, a comparison with the TNG300, TNG100, SIMBA100, and EAGLE100 simulations finds that they overshoot our BHAR upper limits by <~ 1.5 dex, possibly implying that simulated SMBHs are too massive.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
Physics-Informed Representation and Learning: Control and Risk Quantification
Authors:
Zhuoyuan Wang,
Reece Keller,
Xiyu Deng,
Kenta Hoshino,
Takashi Tanaka,
Yorie Nakahira
Abstract:
Optimal and safety-critical control are fundamental problems for stochastic systems, and are widely considered in real-world scenarios such as robotic manipulation and autonomous driving. In this paper, we consider the problem of efficiently finding optimal and safe control for high-dimensional systems. Specifically, we propose to use dimensionality reduction techniques from a comparison theorem f…
▽ More
Optimal and safety-critical control are fundamental problems for stochastic systems, and are widely considered in real-world scenarios such as robotic manipulation and autonomous driving. In this paper, we consider the problem of efficiently finding optimal and safe control for high-dimensional systems. Specifically, we propose to use dimensionality reduction techniques from a comparison theorem for stochastic differential equations together with a generalizable physics-informed neural network to estimate the optimal value function and the safety probability of the system. The proposed framework results in substantial sample efficiency improvement compared to existing methods. We further develop an autoencoder-like neural network to automatically identify the low-dimensional features of the system to enhance the ease of design for system integration. We also provide experiments and quantitative analysis to validate the efficacy of the proposed method. Source code is available at https://github.com/jacobwang925/path-integral-PINN.
△ Less
Submitted 8 May, 2024; v1 submitted 16 December, 2023;
originally announced December 2023.
-
Exploration of new chemical materials using black-box optimization with the D-wave quantum annealer
Authors:
Mikiya Doi,
Yoshihiro Nakao,
Takuro Tanaka,
Masami Sako,
Masayuki Ohzeki
Abstract:
In materials informatics, searching for chemical materials with desired properties is challenging due to the vastness of the chemical space. Moreover, the high cost of evaluating properties necessitates a search with a few clues. In practice, there is also a demand for proposing compositions that are easily synthesizable. In the real world, such as in the exploration of chemical materials, it is c…
▽ More
In materials informatics, searching for chemical materials with desired properties is challenging due to the vastness of the chemical space. Moreover, the high cost of evaluating properties necessitates a search with a few clues. In practice, there is also a demand for proposing compositions that are easily synthesizable. In the real world, such as in the exploration of chemical materials, it is common to encounter problems targeting black-box objective functions where formalizing the objective function in explicit form is challenging, and the evaluation cost is high. In recent research, a Bayesian optimization method has been proposed to formulate the quadratic unconstrained binary optimization (QUBO) problem as a surrogate model for black-box objective functions with discrete variables. Regarding this method, studies have been conducted using the D-Wave quantum annealer to optimize the acquisition function, which is based on the surrogate model and determines the next exploration point for the black-box objective function. In this paper, we address optimizing a black-box objective function containing discrete variables in the context of actual chemical material exploration. In this optimization problem, we demonstrate results obtaining parameters of the acquisition function by sampling from a probability distribution with variance can explore the solution space more extensively than in the case of no variance. As a result, we found combinations of substituents in compositions with the desired properties, which could only be discovered when we set an appropriate variance.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
JWST and ALMA discern the assembly of structural and obscured components in a high-redshift starburst galaxy
Authors:
Zhaoxuan Liu,
John D. Silverman,
Emanuele Daddi,
Annagrazia Puglisi,
Alvio Renzini,
Boris S. Kalita,
Jeyhan S. Kartaltepe,
Daichi Kashino,
Giulia Rodighiero,
Wiphu Rujopakarn,
Tomoko L. Suzuki,
Takumi S. Tanaka,
Francesco Valentino,
Irham Taufik Andika,
Caitlin M. Casey,
Andreas Faisst,
Maximilien Franco,
Ghassem Gozaliasl,
Steven Gillman,
Christopher C. Hayward,
Anton M. Koekemoer,
Vasily Kokorev,
Erini Lambrides,
Minju M. Lee,
Georgios E. Magdis
, et al. (5 additional authors not shown)
Abstract:
We present observations and analysis of the starburst, PACS-819, at z=1.45 ($M_*=10^{10.7}$ M$_{ \odot}$), using high-resolution ($0^{\prime \prime}.1$; 0.8 kpc) ALMA and multi-wavelength JWST images from the COSMOS-Web program. Dissimilar to HST/ACS images in the rest-frame UV, the redder NIRCam and MIRI images reveal a smooth central mass concentration and spiral-like features, atypical for such…
▽ More
We present observations and analysis of the starburst, PACS-819, at z=1.45 ($M_*=10^{10.7}$ M$_{ \odot}$), using high-resolution ($0^{\prime \prime}.1$; 0.8 kpc) ALMA and multi-wavelength JWST images from the COSMOS-Web program. Dissimilar to HST/ACS images in the rest-frame UV, the redder NIRCam and MIRI images reveal a smooth central mass concentration and spiral-like features, atypical for such an intense starburst. Through dynamical modeling of the CO J=5--4 emission with ALMA, PACS-819 is rotation-dominated thus has a disk-like nature. However, kinematic anomalies in CO and asymmetric features in the bluer JWST bands (e.g., F150W) support a more disturbed nature likely due to interactions. The JWST imaging further enables us to map the distribution of stellar mass and dust attenuation, thus clarifying the relationships between different structural components, not discernable in the previous HST images. The CO J = 5 -- 4 and FIR dust continuum emission are co-spatial with a heavily-obscured starbursting core (<1 kpc) which is partially surrounded by much less obscured star-forming structures including a prominent arc, possibly a tidally-distorted dwarf galaxy, and a clump, either a sign of an ongoing violent disk instability or a recently accreted low-mass satellite. With spatially-resolved maps, we find a high molecular gas fraction in the central area reaching $\sim3$ ($M_{\text{gas}}$/$M_*$) and short depletion times ($M_{\text{gas}}/SFR\sim$ 120 Myrs) across the entire system. These observations provide insights into the complex nature of starbursts in the distant universe and underscore the wealth of complementary information from high-resolution observations with both ALMA and JWST.
△ Less
Submitted 10 May, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Accelerating the convergence of free electron laser simulations by retrieving a spatially-coherent component of microbunching
Authors:
Takashi Tanaka
Abstract:
A simple method to reduce the numerical cost in free electron laser (FEL) simulations is presented, which is based on retrieving a spatially-coherent component of microbunching to suppress artifact effects that can potentially overestimate the FEL gain; this significantly reduces the number of macroparticles to reach the numerical convergence and enables the direct computation of amplified radiati…
▽ More
A simple method to reduce the numerical cost in free electron laser (FEL) simulations is presented, which is based on retrieving a spatially-coherent component of microbunching to suppress artifact effects that can potentially overestimate the FEL gain; this significantly reduces the number of macroparticles to reach the numerical convergence and enables the direct computation of amplified radiation without solving the wave equation. Examples of FEL simulations performed to demonstrate the proposed method show that the computation time to get a reliable result is reduced by 1-2 orders of magnitude depending on the simulation condition.
△ Less
Submitted 31 October, 2023;
originally announced October 2023.
-
Exploring the circumstellar environment of Tycho's supernova remnant--I. The hydrodynamic evolution of the shock
Authors:
Ryosuke Kobashi,
Shiu-Hang Lee,
Takaaki Tanaka,
Keiichi Maeda
Abstract:
Among Type Ia supernova remnants (SNRs), Tycho's SNR has been considered as a typical object from the viewpoints of its spectroscopic, morphological and environmental properties. A recent reanalysis of Chandra data shows that its forward shock is experiencing a substantial deceleration since around 2007, which suggests recent shock interactions with a dense medium as a consequence of the cavity-wa…
▽ More
Among Type Ia supernova remnants (SNRs), Tycho's SNR has been considered as a typical object from the viewpoints of its spectroscopic, morphological and environmental properties. A recent reanalysis of Chandra data shows that its forward shock is experiencing a substantial deceleration since around 2007, which suggests recent shock interactions with a dense medium as a consequence of the cavity-wall environment inside a molecular cloud. Such a non-uniform environment can be linked back to the nature and activities of its progenitor. In this study, we perform hydrodynamic simulations to characterize Tycho's cavity-wall environment using the latest multi-epoch proper motion measurements of the forward shock. A range of parameters for the environment is explored in the hydrodynamic models to fit with the observation data for each azimuthal region. Our results show that a wind-like cavity with $ρ(r)\propto r^{-2}$ reconciles with the latest data better than a uniform medium with a constant density. In addition, our best-fit model favors an anisotropic wind with an azimuthally varying wind parameter. The overall result indicates a mass-loss rate which is unusually high for the conventional single-degenerate explosion scenario.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
Chasing Gravitational Waves with the Cherenkov Telescope Array
Authors:
Jarred Gershon Green,
Alessandro Carosi,
Lara Nava,
Barbara Patricelli,
Fabian Schüssler,
Monica Seglar-Arroyo,
Cta Consortium,
:,
Kazuki Abe,
Shotaro Abe,
Atreya Acharyya,
Remi Adam,
Arnau Aguasca-Cabot,
Ivan Agudo,
Jorge Alfaro,
Nuria Alvarez-Crespo,
Rafael Alves Batista,
Jean-Philippe Amans,
Elena Amato,
Filippo Ambrosino,
Ekrem Oguzhan Angüner,
Lucio Angelo Antonelli,
Carla Aramo,
Cornelia Arcaro,
Luisa Arrabito
, et al. (545 additional authors not shown)
Abstract:
The detection of gravitational waves from a binary neutron star merger by Advanced LIGO and Advanced Virgo (GW170817), along with the discovery of the electromagnetic counterparts of this gravitational wave event, ushered in a new era of multimessenger astronomy, providing the first direct evidence that BNS mergers are progenitors of short gamma-ray bursts (GRBs). Such events may also produce very…
▽ More
The detection of gravitational waves from a binary neutron star merger by Advanced LIGO and Advanced Virgo (GW170817), along with the discovery of the electromagnetic counterparts of this gravitational wave event, ushered in a new era of multimessenger astronomy, providing the first direct evidence that BNS mergers are progenitors of short gamma-ray bursts (GRBs). Such events may also produce very-high-energy (VHE, > 100GeV) photons which have yet to be detected in coincidence with a gravitational wave signal. The Cherenkov Telescope Array (CTA) is a next-generation VHE observatory which aims to be indispensable in this search, with an unparalleled sensitivity and ability to slew anywhere on the sky within a few tens of seconds. New observing modes and follow-up strategies are being developed for CTA to rapidly cover localization areas of gravitational wave events that are typically larger than the CTA field of view. This work will evaluate and provide estimations on the expected number of of gravitational wave events that will be observable with CTA, considering both on- and off-axis emission. In addition, we will present and discuss the prospects of potential follow-up strategies with CTA.
△ Less
Submitted 5 February, 2024; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Discovery of a Radiation Component from the Vela Pulsar Reaching 20 Teraelectronvolts
Authors:
The H. E. S. S. Collaboration,
:,
F. Aharonian,
F. Ait Benkhali,
J. Aschersleben,
H. Ashkar,
M. Backes,
V. Barbosa Martins,
R. Batzofin,
Y. Becherini,
D. Berge,
K. Bernlöhr,
B. Bi,
M. Böttcher,
C. Boisson,
J. Bolmont,
M. de Bony de Lavergne,
J. Borowska,
F. Bradascio,
M. Breuhaus,
R. Brose,
F. Brun,
B. Bruno,
T. Bulik,
C. Burger-Scheidlin
, et al. (157 additional authors not shown)
Abstract:
Gamma-ray observations have established energetic isolated pulsars as outstanding particle accelerators and antimatter factories in the Galaxy. There is, however, no consensus regarding the acceleration mechanisms and the radiative processes at play, nor the locations where these take place. The spectra of all observed gamma-ray pulsars to date show strong cutoffs or a break above energies of a fe…
▽ More
Gamma-ray observations have established energetic isolated pulsars as outstanding particle accelerators and antimatter factories in the Galaxy. There is, however, no consensus regarding the acceleration mechanisms and the radiative processes at play, nor the locations where these take place. The spectra of all observed gamma-ray pulsars to date show strong cutoffs or a break above energies of a few gigaelectronvolt (GeV). Using the H.E.S.S. array of Cherenkov telescopes, we discovered a novel radiation component emerging beyond this generic GeV cutoff in the Vela pulsar's broadband spectrum. The extension of gamma-ray pulsation energies up to at least 20 teraelectronvolts (TeV) shows that Vela pulsar can accelerate particles to Lorentz factors higher than $4\times10^7$. This is an order of magnitude larger than in the case of the Crab pulsar, the only other pulsar detected in the TeV energy range. Our results challenge the state-of-the-art models for high-energy emission of pulsars while providing a new probe, i.e. the energetic multi-TeV component, for constraining the acceleration and emission processes in their extreme energy limit.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Coupled linear Schrödinger equations: Control and stabilization results
Authors:
K. Bhandari,
R. de A. Capistrano-Filho,
S. Majumdar,
T. Y. Tanaka
Abstract:
This article presents some controllability and stabilization results for a system of two coupled linear Schrödinger equations in the one-dimensional case where the state components are interacting through the Kirchhoff boundary conditions. Considering the system in a bounded domain, the null boundary controllability result is shown. The result is achieved thanks to a new Carleman estimate, which e…
▽ More
This article presents some controllability and stabilization results for a system of two coupled linear Schrödinger equations in the one-dimensional case where the state components are interacting through the Kirchhoff boundary conditions. Considering the system in a bounded domain, the null boundary controllability result is shown. The result is achieved thanks to a new Carleman estimate, which ensures a boundary observation. Additionally, this boundary observation together with some trace estimates, helps us to use the Gramian approach, with a suitable choice of feedback law, to prove that the system under consideration decays exponentially to zero at least as fast as the function $e^{-2ωt}$ for some $ω>0$.
△ Less
Submitted 20 March, 2024; v1 submitted 7 October, 2023;
originally announced October 2023.