Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A novel machine learning method to detect double-ΛΛ\Lambdaroman_Λ hypernuclear events in nuclear emulsions

Yan He yan.he@riken.jp Vasyl Drozd Hiroyuki Ekawa Samuel Escrig Yiming Gao Ayumi Kasagi Enqiang Liu Abdul Muneem Manami Nakagawa Kazuma Nakazawa Christophe Rappold Nami Saito Takehiko R. Saito Shohei Sugimoto Masato Taki Yoshiki K. Tanaka He Wang Ayari Yanai Junya Yoshida Hongfei Zhang
Abstract

A novel method was developed to detect double-ΛΛ\Lambdaroman_Λ hypernuclear events in nuclear emulsions using machine learning techniques. The object detection model, the Mask R-CNN, was trained using images generated by Monte Carlo simulations, image processing, and image-style transformation based on generative adversarial networks. Despite being exclusively trained on HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events, the model achieved a detection efficiency of 93.9%percent\%% for HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He and 81.5%percent\%% for HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H events in the produced images. In addition, the model demonstrated its ability to detect the Nagara event, which is the only uniquely identified HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He event reported to date. It also exhibited a proper segmentation of the event topology. Furthermore, after analyzing 0.2%percent\%% of the entire emulsion data from the J-PARC E07 experiment utilizing the developed approach, six new candidates for double-ΛΛ\Lambdaroman_Λ hypernuclear events were detected, suggesting that more than 2000 double-strangeness hypernuclear events were recorded in the entire dataset. This method is sufficiently effective for mining more latent double-ΛΛ\Lambdaroman_Λ hypernuclear events recorded in nuclear emulsion sheets by reducing the time required for manual visual inspection by a factor of five hundred.

keywords:
Double-ΛΛ\Lambdaroman_Λ hypernucleus , ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ-ΞΞ\Xiroman_ΞN mixing , Nuclear emulsion , Machine learning , Mask R-CNN
\affiliation

[1]organization=School of Nuclear Science and Technology, Lanzhou University, addressline=222 South Tianshui Road, city=Lanzhou, Gansu Province, postcode=730000, country=China

\affiliation

[2]organization=High Energy Nuclear Physics Laboratory, Cluster for Pioneering Research, addressline=RIKEN, city=Wako, Saitama, postcode=351-0198, country=Japan

\affiliation

[3]organization=Energy and Sustainability Research Institute Groningen, University of Groningen, city=Groningen, country=Netherlands

\affiliation

[4]organization=Instituto de Estructura de la Materia, addressline=CSIC, city=Madrid, country=Spain

\affiliation

[5]organization=University of Chinese Academy of Sciences, city=Beijing, postcode=100049, country=China

\affiliation

[6]organization=Institute of Modern Physics, Chinese Academy of Sciences, addressline=509 Nanchang Road, city=Lanzhou, Gansu Province, postcode=730000, country=China

\affiliation

[7]organization=Graduate School of Artificial Intelligence and Science, Rikkyo University, addressline=3-34-1 Nishi Ikebukuro, Toshima-ku, city=Tokyo, postcode=171-8501, country=Japan

\affiliation

[8]organization=Faculty of Engineering Sciences, addressline=Ghulam Ishaq Khan Institute of Engineering Sciences and Technology, city=Topi, postcode=23640, country=KP, Pakistan

\affiliation

[9]organization=Graduate School of Engineering,Gifu University,, addressline=1-1 Yanagido, city=Gifu, postcode=501-1193, country=Japan

\affiliation

[10]organization=Faculty of Education, Gifu University, addressline=1-1 Yanagido, city=Gifu, postcode=501-1193, country=Japan

\affiliation

[11]organization=GSI Helmholtz Centre for Heavy Ion Research, addressline=Planckstrasse 1, D-64291, city=Darmstadt, country=Germany

\affiliation

[12]organization=Department of Physics, Saitama University, city=Saitama, postcode=338-8570, country=Japan

\affiliation

[13]organization=Department of physics, Tohoku University, address=Aramaki, Aoba-ku, city=Sendai, country=Japan

\affiliation

[14]organization=School of Physics, Xi’an Jiaotong University, city=Xi’an, shaanxi, country=China

1 Introduction

Studies on hypernuclei that contain one or more hyperons in their subatomic structure have extended our understanding of the nuclear force to the general baryon-baryon interaction under flavored SU(3) symmetry [1, 2]. Hyperons, which are baryons with strange quarks, introduce a strangeness degree of freedom (S𝑆Sitalic_S) into the nucleus. A comprehensive understanding of baryon-baryon interactions involving hyperons in dense nuclear matter is crucial to elucidate the internal structure of neutron stars [3]. Hypernuclear investigations are the only approach to probe baryon-baryon interactions involving hyperons in nuclear matter. However, experimental observations on hypernuclei remain quite limited. Approximately 40 single-strangeness hypernuclei (S=1𝑆1S=-1italic_S = - 1) have been observed. Particularly, experimental information on the double-strangeness (S=2𝑆2S=-2italic_S = - 2) sector is scarce. To date, only a few double-strangeness hypernuclei have been discovered [4, 5, 6, 7, 8, 9, 10, 11, 12]. Among these, only the Nagara event [8] was uniquely identified as a double-ΛΛ\Lambdaroman_Λ hypernucleus, HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He (ΛΛ+αΛΛ𝛼\Lambda\Lambda+\alpharoman_Λ roman_Λ + italic_α) in 2001 through a hybrid-emulsion experiment, whereas all other discovered double-ΛΛ\Lambdaroman_Λ hypernuclei have ambiguous interpretations.

Experimental studies of double-ΛΛ\Lambdaroman_Λ hypernuclei, where two ΛΛ\Lambdaroman_Λ hyperons are bound in a nucleus, are an effective approach to gain insight into the ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ interaction. The Nagara event is an epoch-making event in the study of double-ΛΛ\Lambdaroman_Λ hypernuclei, and provides a new and solid foundation for understanding the ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ interaction. Even today, it plays a decisive role in determining the strength of ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ interaction. Despite limited data, the binding energy of two ΛΛ\Lambdaroman_Λ hyperons in the discovered double-ΛΛ\Lambdaroman_Λ hypernuclei appears to exhibit a linear dependence on the mass number of the double-ΛΛ\Lambdaroman_Λ hypernuclei [13]. However, no conclusion can be drawn because of the lack of systematic studies on double-ΛΛ\Lambdaroman_Λ hypernuclei. Therefore, observations of various double-ΛΛ\Lambdaroman_Λ hypernuclei with high accuracy are strongly awaited. Moreover, the enhancement of the ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ bonding energy in double-ΛΛ\Lambdaroman_Λ hypernuclei owing to the three-body force represented by the ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ-ΞNΞ𝑁\Xi Nroman_Ξ italic_N mixing effect has been highlighted by several theoretical calculations [14, 15].

Nuclear emulsion experiments are one of the most efficient methods to identify double-ΛΛ\Lambdaroman_Λ hypernuclei by mass measurement because they make the decay chain of a double-ΛΛ\Lambdaroman_Λ hypernucleus visible in an emulsion with sub-μm𝜇m\rm{\mu}mitalic_μ roman_m spatial resolution [16]. Based on the accuracy of the emulsion at the micrometer scale, it is feasible to analyze the production and sequential decays of double-ΛΛ\Lambdaroman_Λ hypernuclear events recorded in emulsion sheets, enabling the identification of nuclides event-by-event.

Two events displaying a “three-vertex” topology of sequential decays in nuclear emulsion were first reported as double-ΛΛ\Lambdaroman_Λ hypernuclei by Danysz et al. [17, 18] and Prowse [19] in the 1960s. Both events were initiated by ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT hyperons captured at rest by one of the nuclei in the emulsion. However, the ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT hyperon in the first event was not identified, and no photograph of the second event was reported. Approximately 30 years later, another double-ΛΛ\Lambdaroman_Λ hypernuclei event showing a clear sequential decay topology was observed in the KEK-PS E176 experiment using the hybrid emulsion method after following approximately 80 ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT hyperons stopped in the emulsion [6, 4, 5, 20]. Although the nuclear species of this event were not uniquely identified, the existence of double-ΛΛ\Lambdaroman_Λ hypernuclei was first clarified using the hybrid-emulsion method.

Following the E176 experiment, the KEK-PS E373 experiment using the hybrid-emulsion method was designed to detect ten times more double-ΛΛ\Lambdaroman_Λ hypernuclear events than the E176 experiment. Finally, among nine events with sequential decay topology [7, 21, 13], the most known event, the Nagara event [8] was discovered after tracking approximately 103 ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT hyperons stopped in the emulsion. From the Nagara event, the ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ interaction was first confirmed to be weakly attractive. The observation of HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He in the ground state also imposes strict restrictions on the potential existence of H-dibaryon [22]. For HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He in which two protons and two neutrons occupy the 0s0𝑠0s0 italic_s shell, ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ-ΞNΞ𝑁\Xi Nroman_Ξ italic_N mixing is Pauli-suppressed. In contrast to HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He, HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H may have a significantly tighter ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ interaction strength because of ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ-ΞNΞ𝑁\Xi Nroman_Ξ italic_N mixing [14, 15]. However, HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H has not yet been discovered experimentally.

The J-PARC E07 experiment [23], conducted recently at the Japan Proton Accelerator Research Complex (J-PARC), is the latest and most updated hybrid-emulsion experiment, and is expected to detect approximately 100 double-ΛΛ\Lambdaroman_Λ hypernuclei events. It was proposed to provide an opportunity to gather more abundant nuclear information related to strangeness as a greater variety of double-ΛΛ\Lambdaroman_Λ hypernuclei species.

In the E07 experiment, ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT hyperons produced by the (K,K+)superscript𝐾superscript𝐾(K^{-},K^{+})( italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT , italic_K start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ) reaction were stopped and captured by the nuclei in the emulsion stacks. Using the hybrid-emulsion method, the position of ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT was tracked using other real-time detectors. However, the detection efficiency of the hybrid-emulsion method for all double-strangeness hypernuclear events recorded in E07 emulsion sheets was estimated to be approximately 10%percent\%% only [24, 25]. Owing to the limitations of spectrometer acceptance and tracking, approximately 70%percent\%% of (K,K+)superscript𝐾superscript𝐾(K^{-},K^{+})( italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT , italic_K start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT ) events were not tagged. Additionally, besides the triggered events, the ‘n’(K,K0)Ξsuperscript𝐾superscript𝐾0superscriptΞ(K^{-},K^{0})\Xi^{-}( italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT , italic_K start_POSTSUPERSCRIPT 0 end_POSTSUPERSCRIPT ) roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT reaction, which may occur at a higher rate cannot be detected with the hybrid-emulsion method [26]. Although 33 candidates for double-strangeness hypernuclear events have already been detected by following the triggered ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT hyperons, only three events, named Mino [9], Ibuki [10], and Irrawaddy [12] were identified. Therefore, it is necessary and worthwhile to develop a new detection method to achieve a significantly higher efficiency.

Approximately 1300 nuclear emulsion sheets were used in the J-PARC E07 experiment irradiated with Ksuperscript𝐾K^{-}italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT beams. To detect all the latent double-strangeness hypernuclear events that cannot be detected ising the hybrid-emulsion method, complete scanning of the entire nuclear emulsion sheets is necessary. Recently, an overall scanning method [27] that uses high-speed microscopes to capture images of an emulsion was developed. However, there are approximately 1.4 billion images per emulsion sheet for visual inspection, which would require over 500 years to analyze all the emulsion sheets [28]. Therefore, image recognition utilizing machine learning techniques for object detection is one of the most effective approaches for reducing the analysis time. Image recognition methods using machine learning techniques have already been applied to search for alpha-decay events of natural isotopes [29] and hypertriton events [30] in the emulsion sheets of the J-PARC E07 experiment successfully. In the present study, we first applied machine learning techniques to detect double-ΛΛ\Lambdaroman_Λ hypernuclear events.

Section 2 describes the development procedures for both the generation of simulated double-ΛΛ\Lambdaroman_Λ hypernuclear events and the training of the object detection model. Section 3 describes the performance of the proposed method. Section 4 presents the results of the detection of double-ΛΛ\Lambdaroman_Λ hypernuclear events in E07 emulsion data.

Refer to caption
Fig. 1: The images of double-ΛΛ\Lambdaroman_Λ hypernuclear event generated with Geant4 simulation and image processing. Panel (a) shows the trajectories and decay mode of the event. In panel (b) the trajectory information is converted to three-dimension images while RGB channels of the image represent different focus planes, the green color is related to the optimal focus plane and red and blue represent the shallower and deeper plane, respectively.

2 Method

In the present work, we employed a state-of-the-art machine-learning-based object detection model, the mask region-based convolutional neural network (Mask R-CNN) [31]. For double-ΛΛ\Lambdaroman_Λ hypernuclear events, there is insufficient data to train the model, as only one event has been uniquely identified to date. Therefore, we employed Geant4 Monte Carlo simulations [32] to generate double-ΛΛ\Lambdaroman_Λ hypernuclear events. After event generation, training data containing double-ΛΛ\Lambdaroman_Λ hypernuclear events were produced by image processing and image style transformation, pix2pix [33] using generative adversarial networks (GANs) [34]. The Mask-R CNN model was trained using the produced training data. After training, we evaluated the model, which showed sufficient efficiency in detecting double-ΛΛ\Lambdaroman_Λ hypernuclear events in the produced images, as discussed later in this paper. Additionally, the model accurately detected the Nagara event and successfully segmented its topology.

2.1 Data preparation

As double-ΛΛ\Lambdaroman_Λ hypernuclear data were insufficient to train the Mask R-CNN model, images containing double-ΛΛ\Lambdaroman_Λ hypernuclear events and background events were generated for training and evaluating the model by utilizing Geant4 Monte Carlo simulations, image processing, and image-style transformation. In the Geant4 Monte Carlo simulations, the composition of the nuclear emulsion was replicated by referring to the emulsion layer of the J-PARC E07 experiment [23]. For double-ΛΛ\Lambdaroman_Λ hypernuclear events generated in Geant4 Monte Carlo simulations, we first considered the case of HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He, and applied the decay sequence presented in Eq. (1).

Ξ+12Csuperscript12superscriptΞCabsent\displaystyle\Xi^{-}+^{12}\rm{C}\rightarrowroman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT + start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_C → HeΛΛ6+α+tsuperscriptsubscriptHeΛΛ6𝛼t\displaystyle\prescript{6\ }{\Lambda\Lambda}{\rm{He}}+\alpha+tstart_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He + italic_α + roman_t (1)
HeΛ5+p+πabsentsuperscriptsubscriptHeΛ5psuperscript𝜋\displaystyle\quad\!\hookrightarrow\prescript{5\ }{\Lambda}{\rm{He}}+p+\pi^{-}↪ start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He + roman_p + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT
α+p+πabsent𝛼𝑝superscript𝜋\displaystyle\qquad\quad\;\hookrightarrow\alpha+p+\pi^{-}↪ italic_α + italic_p + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT

One of the HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events is shown in Fig. 1 (a). As shown in the figure, HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He is produced by ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT capture of C12superscriptC12\rm{}^{12}Cstart_FLOATSUPERSCRIPT 12 end_FLOATSUPERSCRIPT roman_C in the nuclear emulsion at vertex A. We assumed that ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT is bound in the 3D atomic orbit of C12superscriptC12\rm{}^{12}Cstart_FLOATSUPERSCRIPT 12 end_FLOATSUPERSCRIPT roman_C with a binding energy of 0.13 MeV [35]. From the capture point vertex A in Fig. 1 (a), decays of HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He and HeΛ5superscriptsubscriptHeΛ5\prescript{5}{\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He occurred at vertices B and C, respectively. The decay modes of HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He and HeΛ5superscriptsubscriptHeΛ5\prescript{5}{\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He in Eq. (1) were chosen as mesonic decay with πsuperscript𝜋\pi^{-}italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT emission, because non-mesonic decay may induce more ambiguous interpretations for identifying events. During the generation of an event, the lifetimes of hypernuclei in the production and decay of Eq. (1) were assumed to be approximately 200 psps\rm{ps}roman_ps because the proposed method is not sensitive to the lifetime. In addition, the mass of HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He was defined assuming ΛΛΛΛ\Lambda\Lambdaroman_Λ roman_Λ interaction strength is zero.

When the charged particles of double-ΛΛ\Lambdaroman_Λ hypernuclear events undergo nuclear emulsion, the number of grains generated along the trajectory of the nuclear emulsion is correlated with their energy loss, velocity, and zenith angle [36]. In Fig. 1 (a), the thickness of the track that is related to the grain density was calculated and reproduced corresponding to the velocity and angle at each step for various tracks [30]. As the tracks in the nuclear emulsion were recorded with three dimensional information, the trajectories shown in panel (a) of Fig. 1 were converted to a colored image as shown in panel (b). RGB colors were employed to represent different focal planes, where green indicates tracks in the optimal focus plane, and red and blue signify tracks in the shallower and deeper planes, respectively.

Refer to caption
Fig. 2: Panel (a) shows the colored image with RGB channels, including background and a double-ΛΛ\Lambdaroman_Λ hypernuclear event. Panel (b) depicts the surrogate image resembling a real emulsion image converted by the pix2pix model from the colored image, while Panel (c) shows the associated mask image, wherein only the double-ΛΛ\Lambdaroman_Λ hypernuclear event is marked as an object in the training data.

In the nuclear emulsion of the E07 experiment, Ksuperscript𝐾K^{-}italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT beam interaction events were the main background noise that produced tracks similar to the events of interest. To achieve an accurate classification and detection performance, negative samples [37], Ksuperscript𝐾K^{-}italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT beam interaction events, were generated as background events. To simulate the interaction of a Ksuperscript𝐾K^{-}italic_K start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT beam at 1.8 GeV/c with the nuclides in the nuclear emulsion of the E07 experiment, the JAM package [38], based on data from the hadron scattering experiment was used. The tracks of particles from the beam interaction were visualized with the same Geant4 framework, image processing, and image-style transformation method employed in the generation of the double-ΛΛ\Lambdaroman_Λ hypernuclear image data. Fig. 2 (a) shows examples of a double-ΛΛ\Lambdaroman_Λ hypernuclear event (marked by a solid white rectangle) and a beam interaction event (marked by a dashed circle). Additional background tracks were extracted from the actual microscopic images of E07 emulsion data using an image filter and binarization. Three types of depth information for the background tracks were encoded using RGB channels, which was consistent with the method employed for the simulated images.

After the creation of the RGB image shown in Fig. 2 (a), image style transfer using GANs [34] was applied to generate emulsion images that closely mimicked real emulsion data. Based on the capabilities of GANs, the pix2pix model [33] was employed to convert the RGB image, shown in Fig. 2 (a) into an image similar to a real emulsion image as shown in Fig. 2 (b). The parameters for training the pix2pix model in this study are aligned with those specified in [30]. The image produced by the trained pix2pix model in Fig. 2 (b), combined with the corresponding mask images in Fig. 2 (c), served as training data for the object detection model described in the following section.

Table 1: Hyperparameters for Mask R-CNN model training
      Parameters       Value
      Backbone       ResNet50 [39]
      Batch size       8
      Initial learning rate       0.02
      Learning rate gamma       0.1
      Learning rate step       80, 90, 100, 110, 120
      momentum       0.9
      Total epochs       200

2.2 Model training

The proposed method employs the Mask R-CNN object detection model [31], which is a widely adopted architecture for detection and segmentation tasks [40] owing to its simplicity and flexibility in network design and hyperparameter tuning. The model can not only detect objects of interest, but can also precisely delineate their boundaries at the pixel level, assigning confidence scores between zero and one. A score closer to one is considered to be better.

The training data for the Mask R-CNN model comprised input images paired with the corresponding mask images that labeled double-ΛΛ\Lambdaroman_Λ hypernuclear events as objects of interest. The images in Figs. 2 (b) and 2 (c) are examples of the input image and corresponding mask image, respectively. The mask image, outlining the shape and position of the object event can be generated using a Geant4 simulation. A double-ΛΛ\Lambdaroman_Λ hypernuclear event typically displays a “three-vertex” characterized by its sequential decay. To ensure that the produced images maintained a clear “three-vertex” topology for double-ΛΛ\Lambdaroman_Λ hypernuclear events, a cut condition of 2 µm was applied to the projected length of hypernuclear trajectories parallel to the image plane during data preparation. In addition to the cut condition for the range of hypernuclear tracks, double-ΛΛ\Lambdaroman_Λ hypernuclear events with projected angles greater than 45 between HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He and HeΛ5superscriptsubscriptHeΛ5\prescript{5}{\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He were also selected. For the angles between the daughter particles and hypernuclei at the three vertices, a cut condition of 30 was applied. In particular, the angles between the particles and HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He from vertex A were constrained to be greater than 30. Similarly, for the particles emitted from vertex B, the angles between the particles and both HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He and HeΛ5superscriptsubscriptHeΛ5\prescript{5}{\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He were also required to be greater than 30. After applying these cut conditions to both the range and angles of the particles of double-ΛΛ\Lambdaroman_Λ hypernuclear events, a total of 18354 images were generated for training the model, with 80%percent\%% allocated to the training set and the remaining 20%percent\%% for validation.

In this study, the Mask R-CNN model was implemented using the PyTorch framework (https://github.com/multimodallearning/pytorch-mask-rcnn). The hyperparameters applied for the model training are summarized in Table 1. The smoothed validation loss [29] was utilized with the following recurrence formula to define the best epoch:

S0=V0subscript𝑆0subscript𝑉0\displaystyle S_{0}=V_{0}italic_S start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_V start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (2)
Si=wSi1+(1.0w)Visubscript𝑆𝑖𝑤subscript𝑆𝑖11.0𝑤subscript𝑉𝑖\displaystyle S_{i}=wS_{i-1}+(1.0-w)V_{i}italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_w italic_S start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT + ( 1.0 - italic_w ) italic_V start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT

where, Sisubscript𝑆𝑖S_{i}italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and Visubscript𝑉𝑖V_{i}italic_V start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are the i𝑖iitalic_ith smoothed and original values of the validation loss, respectively. Parameter w𝑤witalic_w is a weight set as 0.9, indicating the degree of smoothing. Fig. 3 shows the training and validation losses during model training, represented by blue and orange lines, respectively. Epoch 119, characterized by the lowest smoothed loss, was determined to be the optimal epoch for subsequent inference.

Refer to caption
Fig. 3: Plots of loss for training and validation data as functions of the number of epochs. The orange and blue lines show the loss for training and validation datasets. The epoch with the smallest validation loss value in the curve defined by Eq. (2) was determined to be the best epoch, and the model at epoch = 119 was employed for inference.

3 Model performance

After training the model, we first evaluated its performance by analyzing the produced images. To verify whether the model can detect double-ΛΛ\Lambdaroman_Λ hypernuclear events, four datasets containing 500 images each were employed for evaluation. These images included both double-ΛΛ\Lambdaroman_Λ hypernuclear and background events. These events were generated using the same procedure as that used to produce the training and validation datasets. The first group of images featured HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events with the decay mode described by Eq. (1), which was identical to that of the training dataset. Although the model was trained exclusively on HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events with the decay mode shown in Eq. (1), double-ΛΛ\Lambdaroman_Λ hypernuclear events with other decay modes described below were also used for evaluation. The decay mode of HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events in the second group is as follows:

Ξ+12CHeΛΛ6+α+tHeΛ5+p+πt+p+n\begin{split}\Xi^{-}+^{12}\rm{C}\rightarrow&\prescript{6\ }{\Lambda\Lambda}{% \rm{He}}+\alpha+t\\ &\quad\!\hookrightarrow\prescript{5\,}{\Lambda}{\rm{He}}+p+\pi^{-}\\ &\qquad\quad\;\hookrightarrow t+p+n\end{split}start_ROW start_CELL roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT + start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_C → end_CELL start_CELL start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He + italic_α + roman_t end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ↪ start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He + roman_p + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL ↪ italic_t + italic_p + italic_n end_CELL end_ROW (3)

In addition to HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events, HeΛΛ5superscriptsubscriptHeΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events with the following two decay modes for the remaining two groups of images were used for model evaluation.

Ξ+12Csuperscript12superscriptΞCabsent\displaystyle\quad\Xi^{-}+^{12}\rm{C}\rightarrowroman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT + start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_C → HΛΛ5+α+αsuperscriptsubscriptHΛΛ5𝛼𝛼\displaystyle\prescript{5\ }{\Lambda\Lambda}{\rm{H}}+\alpha+\alphastart_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H + italic_α + italic_α (4)
HeΛ5+πabsentsuperscriptsubscriptHeΛ5superscript𝜋\displaystyle\quad\!\hookrightarrow\prescript{5\,}{\Lambda}{\rm{He}}+\pi^{-}↪ start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT
α+p+πabsent𝛼𝑝superscript𝜋\displaystyle\qquad\quad\;\hookrightarrow\alpha+p+\pi^{-}↪ italic_α + italic_p + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT
Ξ+12Csuperscript12superscriptΞCabsent\displaystyle\qquad\Xi^{-}+^{12}\rm{C}\rightarrowroman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT + start_POSTSUPERSCRIPT 12 end_POSTSUPERSCRIPT roman_C → HΛΛ5+α+αsuperscriptsubscriptHΛΛ5𝛼𝛼\displaystyle\prescript{5\ }{\Lambda\Lambda}{\rm{H}}+\alpha+\alphastart_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H + italic_α + italic_α (5)
HeΛ4+n+πabsentsuperscriptsubscriptHeΛ4nsuperscript𝜋\displaystyle\quad\!\hookrightarrow\prescript{4\,}{\Lambda}{\rm{He}}+n+\pi^{-}↪ start_FLOATSUPERSCRIPT 4 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He + roman_n + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT
He3+p+πabsentsuperscriptHe3psuperscript𝜋\displaystyle\qquad\quad\;\hookrightarrow\rm{{}^{3}He}+p+\pi^{-}↪ start_FLOATSUPERSCRIPT 3 end_FLOATSUPERSCRIPT roman_He + roman_p + italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT

When the four groups of images were input into the model, double-ΛΛ\Lambdaroman_Λ hypernuclear events were successfully detected, as indicated by the orange box in the mask image in Fig. 4. However, the model also produced some misdetections, primarily due to beam interactions crossing other tracks, such as the object highlighted by the red box in the mask image in Fig. 4.

Refer to caption
Fig. 4: The example of the object detected by the model for produced images. While the model detected double-ΛΛ\Lambdaroman_Λ hypernuclear events, misdetection occurred during evaluation with test dataset. The misdetections are primarily caused by beam interactions that create tracks crossing with other tracks as illustrated in this figure. Panel (a) is the produced emulsion images and panel (b) shows the mask image highlighting the objects detected by the developed model. The objects in the masks image are detected double-ΛΛ\Lambdaroman_Λ hypernuclear event and misdetection indicated by the orange and red boxes, respectively.
Refer to caption
Fig. 5: Score distributions of the detected double-ΛΛ\Lambdaroman_Λ hypernuclear events and misdetections for the four test datasets. Each dataset contains 500 images, and each image includes double-ΛΛ\Lambdaroman_Λ hypernuclear event with a specific decay mode. Double-ΛΛ\Lambdaroman_Λ hypernuclear events in panel (a) and (b) are HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events with decay mode Eq. (1) and Eq. (3), and double-ΛΛ\Lambdaroman_Λ hypernuclear events in (c) and (d) are HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H with decay mode Eq. (4) and Eq. (5). Orange lines represent the score distributions of detected double-ΛΛ\Lambdaroman_Λ hypernuclear events and blue lines show the score distribution of misdetections. With a score threshold of 0.8 as shown with black dash lines, the detection efficiency and purity of the model for test datasets were calculated with Eq. (6) and Eq. (7), respectively. The results of calculation are listed in Table. 2.
Refer to caption
Fig. 6: Detection reslut of Nagara event [8] by the developed model. Panel (a) shows the emulsion image captured under microscope with a 20×\times× objective lens. Panel (b) shows the mask image detected by the model. White pixels are detected HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He event. The mask image clearly visualizes both the production and decay vertices of the event with a score of 0.974, and all emitted particle tracks are segmented precisely.

Fig. 5 shows the score distributions of the test results for the four datasets. Each dataset consisted of 500 images, including double-ΛΛ\Lambdaroman_Λ hypernuclear events with a specific decay mode: Eq. (1) in (a), Eq. (3) in (b), Eq. (4) in (c), and Eq. (5) in (d). The blue line represents the distribution of misdetections, whereas the orange line shows the score distribution of the detected double-ΛΛ\Lambdaroman_Λ hypernuclear events. Using a score threshold of 0.8, the detection efficiency and purity of the model for the test datasets were calculated as

efficiency=Ndetected-doubleNtotal-doubleefficiencysubscriptNdetected-doublesubscriptNtotal-double\rm efficiency=\frac{N_{detected\text{-}double}}{N_{total\text{-}double}}roman_efficiency = divide start_ARG roman_N start_POSTSUBSCRIPT roman_detected - roman_double end_POSTSUBSCRIPT end_ARG start_ARG roman_N start_POSTSUBSCRIPT roman_total - roman_double end_POSTSUBSCRIPT end_ARG (6)
purity=Ndetected-doubleNdetected-double+NmisdetectionpuritysubscriptNdetected-doublesubscriptNdetected-doublesubscriptNmisdetection\rm purity=\frac{N_{detected\text{-}double}}{N_{detected\text{-}double}+N_{% misdetection}}roman_purity = divide start_ARG roman_N start_POSTSUBSCRIPT roman_detected - roman_double end_POSTSUBSCRIPT end_ARG start_ARG roman_N start_POSTSUBSCRIPT roman_detected - roman_double end_POSTSUBSCRIPT + roman_N start_POSTSUBSCRIPT roman_misdetection end_POSTSUBSCRIPT end_ARG (7)

Ndetected-doublesubscriptNdetected-double\rm{N_{detected\text{-}double}}roman_N start_POSTSUBSCRIPT roman_detected - roman_double end_POSTSUBSCRIPT represents the number of detected double-ΛΛ\Lambdaroman_Λ hypernuclear events, Ntotal-doublesubscriptNtotal-double\rm{N_{total\text{-}double}}roman_N start_POSTSUBSCRIPT roman_total - roman_double end_POSTSUBSCRIPT is the total number of double-ΛΛ\Lambdaroman_Λ hypernuclear events in the test dataset, and NmisdetectionsubscriptNmisdetection\rm{N_{misdetection}}roman_N start_POSTSUBSCRIPT roman_misdetection end_POSTSUBSCRIPT is the number of misdetections. The efficiencies and purities of the four test datasets are presented in Table 2.

For the decay modes described in Eq. (1) and Eq. (3), HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He is produced by ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT capture at rest by C12superscriptC12\rm{}^{12}Cstart_FLOATSUPERSCRIPT 12 end_FLOATSUPERSCRIPT roman_C and decays with πsuperscript𝜋\pi^{-}italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT emission. The primary distinction between the two decay modes lies in the subsequent decay of a single-ΛΛ\Lambdaroman_Λ hypernucleus.

  • 1.

    in Eq. (1), HeΛ5superscriptsubscriptHeΛ5\prescript{5}{\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He decays with πsuperscript𝜋\pi^{-}italic_π start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT emission.

  • 2.

    in Eq. (3), non-mesonic decay occurs for HeΛ5superscriptsubscriptHeΛ5\prescript{5}{\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT roman_He, as observed in the Nagara event [8].

For HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events with the decay modes described by Eq. (1) and Eq. (3), the model achieved detection efficiencies of 94.8 %percent\%% and 93.0 %percent\%%, respectively, along with purities of 98.5 %percent\%% and 98.1 %percent\%%. On average, the model achieved a detection efficiency of 93.9%percent\%% and purity of 98.3%percent\%% for HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He.

In addition, HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H events with decay modes described by Eqs. (4) and (5) were used to further evaluate the model performance. HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H was produced by ΞsuperscriptΞ\Xi^{-}roman_Ξ start_POSTSUPERSCRIPT - end_POSTSUPERSCRIPT capture at rest by C12superscriptC12\rm{}^{12}Cstart_FLOATSUPERSCRIPT 12 end_FLOATSUPERSCRIPT roman_C, followed by sequential mesonic decay. The distinction between Eq. (4) and Eq. (5) lies within the neutron emission during the HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H decay.

  • 1.

    in Eq. (4), HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H decays without neutron emission.

  • 2.

    in Eq. (5), HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H decays with neutron emission.

Although trained only on HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events, even for HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H, the model exhibited a high detection efficiency of 81.4%percent\%% and purity of 98.6%percent\%% for the decay mode Eq. (4), and 81.6%percent\%% efficiency and 98.1%percent\%% purity for the decay mode Eq. (5). On average, the model achieved a detection efficiency of 81.5%percent\%% and a purity of 98.4%percent\%% for HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H.

After evaluating the model using the generated images, we tested it using the Nagara event [8]. Fig. 6 presents the detection results. Panel (a) shows a microscopic image of the Nagara event in the nuclear emulsion captured with a 20×\times× objective lens, and panel (b) displays the mask image predicted by the model, highlighting the detected HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He event. The model successfully detected the Nagara event with a score of 0.974. The corresponding mask image clearly visualizes both the production and decay vertices, accurately segmenting all the emitted particle tracks.

Table 2: Detection efficiency and purity of the model for HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He events with decay mode Eq. (1) and Eq. (3), and HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H events with decay mode Eq. (4) and Eq. (5) with a score threshold 0.8. Ndetected-doublesubscriptNdetected-double\rm{N_{detected\text{-}double}}roman_N start_POSTSUBSCRIPT roman_detected - roman_double end_POSTSUBSCRIPT is the number of the double-ΛΛ\Lambdaroman_Λ hypernuclear events detected by the model, and NmisdetectionsubscriptNmisdetection\rm{N_{misdetection}}roman_N start_POSTSUBSCRIPT roman_misdetection end_POSTSUBSCRIPT is the number of the misdetections. Efficiency and purity are calculated by the Eq. 6 and Eq. 7.
Double-ΛΛ\Lambdaroman_Λ hypernucleaus Decay mode Ndetected-doublesubscriptNdetected-double\rm{N_{detected\text{-}double}}roman_N start_POSTSUBSCRIPT roman_detected - roman_double end_POSTSUBSCRIPT NmisdetectionsubscriptNmisdetection\rm{N_{misdetection}}roman_N start_POSTSUBSCRIPT roman_misdetection end_POSTSUBSCRIPT Efficiency Purity
HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He Eq. (1) 474 7 94.8%percent\%% 98.5%percent\%%
HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He Eq. (3) 464 10 93.0%percent\%% 98.1%percent\%%
HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H Eq. (4) 407 6 81.4%percent\%% 98.6%percent\%%
HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H Eq. (5) 413 8 81.6%percent\%% 98.1%percent\%%

4 Results and discussions

Following the evaluation of the produced images, the model performance was evaluated on actual emulsion data. The evaluation employed 2.4 million microscopic images acquired from a volume of approximately 25cm2×0.025cm25superscriptcm20.025cm\rm{25\ cm^{2}\times 0.025\ cm}25 roman_cm start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT × 0.025 roman_cm on the emulsion sheet in the J-PARC E07 experiment. From these images, the model detected 8336 images that exhibited characteristics similar to the “three-vertex” topology.

The emulsion images used as inputs for the model were captured by optical scanning. The emulsion sheet was scanned using a microscope with a 20×\times× objective lens, moving in the horizontal and vertical directions to capture images from different regions. To acquire images from different focal depths, the stage was moved perpendicular to the emulsion sheet in approximately 3 μm𝜇m\rm{\mu m}italic_μ roman_m steps. This scanning process can counted an event multiple times if it appears across multiple focal planes. To address this issue, duplicate events were removed based on the object positions predicted by the model. If the distance between the positions of the objects detected in adjacent focal planes was less than 30 μm𝜇𝑚\mu mitalic_μ italic_m, the latter object was considered as a duplicate and was removed. Additionally, the model detected the dust captured in the emulsion, which were removed based on the number of black pixels in the mask images detected by the model [30]. From the initial 8336 detected images, 3343 objects were duplicates and 1091 containing dust were excluded. Utimately, 4177 objects remained in 3902 images for further visual inspection.

Refer to caption
Fig. 7: Examples and score distributions of the detected objects by the proposed model. Each pair of images in left part displays the emulsion images (left) and corresponding mask images (right). Panel (a) demonstrates a positive detection of the “three-vertex” event, and panel (b), (c), (d) are examples of the alpha decay, cross and beam interaction events, respectively. The right panel presents histograms of the score distributions for the four detected objects categories in (e), (f), (g) and (h), corresponding to the categories shown in (a), (b), (c), and (d).

Fig. 7 shows examples (left) and score distributions (right) of the four object categories detected using the developed model. Each pair of example images displays the emulsion images captured under a 20×\times× objective lens and mask images from the model detection. Panel (a) shows an example of positive detection of the “three-vertex” event. In total, there were 56 positive objects of “three-vertex” events from 4177 detected objects. Beyond the “three-vertex” events, the model detected additional events with the following categories:

  • 1.

    152 alpha decay events as shown in Fig. 7 (b);

  • 2.

    993 objects with at least two vertices caused by cross tracks in Fig. 7 (c);

  • 3.

    1355 beam interaction events in Fig. 7 (d).

The right panel of Fig. 7 displays the score distribution of these four categories of objects on a logarithmic scale. The remaining objects detected were dust and duplicates.

The developed model reduced the number of background images from 2.4 million to 4177, which were retained for visual inspection, representing a reduction factor of 1.7×1031.7superscript1031.7\times 10^{-3}1.7 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT. This time consumption is 500 times shorter than the 500 years required for manual visual inspection of the entire nuclear emulsion, as discussed in Section 1. Consequently, it is feasible to visually inspect all images of the entire J-PARC E07 nuclear emulsion within one year.

After the evaluation, we applied the model to approximately 0.2%percent\%% of the entire E07 emulsion dataset captured from a volume of 4800cm2× 0.025cm4800superscriptcm20.025cm\rm{4800\ cm^{2}\ \times\ 0.025\ cm}4800 roman_cm start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT × 0.025 roman_cm of the emulsion sheets. In total, 12962 “three-vertex” objects were detected, and from these objects, six double-ΛΛ\Lambdaroman_Λ hypernuclear candidates were observed.

Fig. 8 (a-f) show images of the six candidates. The left panels of the images in each group show the detection results of the developed model. The rightmost images depict the event topology under a microscope with a 90×\times× objective lens. The blue arrows in the rightmost images in panels (a), (b), (c), and (d) indicate the incoming particles of the events. The incoming particles were captured at the first vertex A, and sequential decay occurred at vertices B and C. In panels (e) and (f), vertex A represents the beam interaction, followed by cascade decay at vertices B and C. These six candidates of double-strangeness hypernuclear events showed clear “three-vertex” topology and cascade decays occurred. The number of detected candidates suggests that more than two thousand double-strangeness hypernuclei were recorded in the entire dataset. Further kinematic analyses are required to definitively identify these events. Detailed analyses of these events are currently underway.

Refer to caption
Fig. 8: The images of six candidates detected by the developed model. Panel (a-f) presents six groups of images representing six candidates. The left-hand images in each group, used as model input, were captured under a 20×\times× objective lens. The middle images display the results of detection by the developed model, emphasizing event topology. The rightmost images present the event topology captured with a 90×\times× objective lens. The blue arrows in the right images of panels (a), (b), (c) and (d) show the incoming particle of the events. And the incoming particles were captured at the first vertex A. After capture, the sequential decay occurred at vertex B and C, respectively. In panel (e) and (f), vertex A represents a beam interaction followed by cascade decays at vertex B and C.

5 Summary

In this study, we developed a novel method utilizing Geant4 Monte Carlo simulations, image processing, and image-style transformation with GANs to detect double-ΛΛ\Lambdaroman_Λ hypernuclear events in nuclear emulsion sheets of the J-PARC E07 experiment. The proposed method can detect double-ΛΛ\Lambdaroman_Λ hypernuclear events in both produced and actual emulsion images. For the produced images, the method achieved detection efficiencies of 93.9%percent\%% and 81.5%percent\%% for HeΛΛ6superscriptsubscriptHeΛΛ6\prescript{6\ }{\Lambda\Lambda}{\rm{He}}start_FLOATSUPERSCRIPT 6 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_He and HΛΛ5superscriptsubscriptHΛΛ5\prescript{5\ }{\Lambda\Lambda}{\rm{H}}start_FLOATSUPERSCRIPT 5 end_FLOATSUPERSCRIPT start_POSTSUBSCRIPT roman_Λ roman_Λ end_POSTSUBSCRIPT roman_H, respectively, with corresponding purities of 98.3%percent\%% and 98.4%percent\%%. In addition to the produced images, the proposed method successfully detected the Nagara event with a confidence score of 0.974. When applied to E07 emulsion images, the method drastically reduced the background images by a factor of 0.0017 and successfully detected six candidates of double-ΛΛ\Lambdaroman_Λ hypernuclear events over 0.2%percent\%% of the full nuclear emulsion dataset of the E07 experiment. The number of detected candidates suggests that more than 2000 double-strangeness hypernuclear events were recorded in the entire dataset. The proposed method shows considerable promise for application across the entire E07 nuclear emulsion dataset, potentially enhancing visual inspection efficiency by approximately 500 times.

CRediT authorship contribution statement

Yan He: Conceptualization, Methodology, Software, Validation, Analysis, Investigation, Data curation, Writing – original draft, Writing – review & editing, Visualization. Vasyl Drozd: Methodology, Analysis, Writing – review & editing. Hiroyuki Ekawa: Methodology, Software, Analysis, Investigation, Writing – review & editing. Samuel Escrig: Methodology, Writing – review & editing. Yiming Gao: Methodology, Writing – review & editing. Ayumi Kasagi: Conceptualization, Methodology, Software, Validation, Analysis, Investigation, Data curation, Writing – review & editing, Visualization. Enqiang Liu: Methodology, Software, Analysis, Investigation, Writing – review & editing. Abdul Muneem: Methodology, Investigation, Writing – review & editing. Manami Nakagawa: Conceptualization, Methodology, Software, Validation, Analysis, Investigation, Data curation, Writing – review & editing. Kazuma Nakazawa: Conceptualization, Methodology, Analysis, Investigation, Writing – review & editing, Resources, Funding acquisition. Christophe Rappold: Methodology, Analysis, Investigation, Writing – review & editing. Nami Saito: Conceptualization, Methodology, Software, Investigation, Writing – review & editing. Takehiko R. Saito: Conceptualization, Methodology, Writing – original draft, Writing – review & editing, Project administration, Supervision, Resources, Funding acquisition. Shohei Sugimoto: Methodology, Writing – review & editing. Masato Taki: Conceptualization, Methodology, Software, Investigation, Writing – review & editing. Yoshiki K. Tanaka: Methodology, Investigation, Writing – review & editing. He Wang: Methodology, Investigation, Writing – review & editing. Ayari Yanai: Methodology, Writing – review & editing. Junya Yoshida: Conceptualization, Methodology, Software, Validation, Analysis, Investigation, Data curation, Writing – review & editing. Hongfei Zhang: Methodology, Writing – review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Data availability

Data will be made available on request.

Acknowledgments

This work was supported by JSPS KAKENHI Grant Numbers JP16H02180, JP20H00155, JP18H05403, and JP19H05147 (Grant-in-Aid for Scientific Research on Innovative Areas 6005). A.K was supported by JSPS KAKENHI Grant Numbers JP23K19051 (Grant-in-Aid for Research Activity Start-up). The authors thank the J-PARC E07 collaboration for providing the emulsion sheets. The authors thank Risa Kobayashi, Michi Ando, Chiho Harisaki and Hanako Kubota of RIKEN and Yoko Tsuchii of Gifu University for their technical support in mining events in the J-PARC E07 nuclear emulsions. The authors thank Yukiko Kurakata of RIKEN including the administrative works.

References