Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

[a]Abha R. Khakurdikar

Search for UHE neutrinos in the background of cosmic rays

   Washington R. Carvalho Jr    Jörg R. Hörandel
Abstract

The main challenge in detecting ultra-high energy (UHE) neutrinos is discriminating a neutrino-induced shower in the background of showers initiated by ultra-high energy nuclei. The resulting shower development from neutrinos exhibits different characteristics from hadron-induced showers because neutrinos penetrate the atmosphere more deeply than hadrons.

This study focuses on simulations of highly inclined neutrino-induced showers above 75superscript7575^{\circ}75 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT zenith angles, exploring an extensive energy range from 1EeV1EeV1\text{EeV}1 EeV to 120EeV120EeV120\text{EeV}120 EeV. These simulated showers have different ranges of interaction depths corresponding to each zenith angle, presenting diverse detection challenges.

Our methodology utilises timing data from radio antennas for the shower front calculation for extensive air showers induced by neutrinos and nuclei. Furthermore, we incorporate signals obtained from Water Cherenkov detectors and the spatial distribution of stations registering signals in both Water Cherenkov detectors and radio antennas. We aim to classify neutrino-induced showers and background events stemming from nuclei by harnessing a decision tree classifier employing the Gini impurity method. Our framework yields excellent accuracy for separating the neutrinos from the background.

The findings of this study offer significant advancements in the domain of UHE neutrino detection, shedding light on astrophysical phenomena associated with these elusive particles amidst the complex background of UHE nuclei.

1 Introduction

The primary challenge in detecting ultra--high energy neutrinos is distinguishing a shower induced by neutrinos from the background of showers caused by UHE cosmic rays, which can be protons, heavy nuclei, and, to a much lesser extent, even photons. The simulations for downgoing neutrinos and hadrons--induced showers are produced at energies between 1018superscript101810^{18}10 start_POSTSUPERSCRIPT 18 end_POSTSUPERSCRIPT and 1020superscript102010^{20}10 start_POSTSUPERSCRIPT 20 end_POSTSUPERSCRIPT eV and at zenith angles between 758575superscript8575-85^{\circ}75 - 85 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. Neutrinos interact deep into the atmosphere. Hence, neutrino showers are generally much closer to the ground; hence, the shower wavefront is more curved than hadron--induced showers. We develop a method to reconstruct the shower fronts and fit the showerfront structure. Our methodology utilises timing data from radio antennas for the shower front calculation for extensive air showers induced by neutrinos and nuclei. The method is explained in Sec.2. Furthermore, we incorporate signals obtained from Water Cherenkov detectors and the spatial distribution of stations registering signals in both Water Cherenkov detectors and radio antennas. We aim to classify neutrino--induced showers and background events stemming from nuclei by harnessing a random forest classifier employing the Gini impurity method, explained in Sec.4. Our framework yields excellent accuracy for separating the neutrinos from the background. The findings of this study offer significant advancements in the domain of UHE neutrino detection, shedding light on astrophysical phenomena associated with these elusive particles amidst the complex background of UHE nuclei.

2 Showerfront reconstruction for neutrinos

Due to their weak interaction with matter, neutrinos can penetrate the Earth’s atmosphere deep before producing extensive air showers. These neutrino-induced showers differ from those caused by hadronic cosmic rays as they occur closer to the ground, resulting in a more curved shower wavefront than the flatter wavefronts of hadron-induced showers. We reconstruct the shower front structure using the timing information from antenna signals. Geometrically, the arrival of air shower particles at lateral distances from the shower axis is delayed compared to a planar front. The time delay, calculated as dtna=(rprojsin(θMC))/c𝑑𝑡𝑛𝑎subscript𝑟projsubscript𝜃MC𝑐dtna=-(r_{\text{proj}}\sin(\theta_{\text{MC}}))/citalic_d italic_t italic_n italic_a = - ( italic_r start_POSTSUBSCRIPT proj end_POSTSUBSCRIPT roman_sin ( italic_θ start_POSTSUBSCRIPT MC end_POSTSUBSCRIPT ) ) / italic_c, where rprojsubscript𝑟projr_{\text{proj}}italic_r start_POSTSUBSCRIPT proj end_POSTSUBSCRIPT is the projected distance to the shower axis, helps derive the corrected time signal for shower front reconstruction.

Refer to caption
(a)
Refer to caption
(b)
Figure 1: (a) Signal time measured with respect to the distance of the antenna for proton and neutrino events. (b) Corrected time signal with respect to the distance of the antenna.

Fig.1(a) shows the initial signal time tosubscript𝑡𝑜t_{o}italic_t start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT as a function of distance r𝑟ritalic_r to the shower axis for proton and neutrino-induced showers, revealing no clear distinction between events. After correcting the time delay, Fig.1(b) presents the corrected time as a function of r𝑟ritalic_r, showing a more curved structure for the neutrino event due to its deeper interaction depth of 1000 g/cm21000superscript g/cm21000\text{ g/cm}^{2}1000 g/cm start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Both the plots have an antenna spacing of 500 m500 m500\text{ m}500 m. This greater curvature results from neutrino-induced showers starting deeper in the atmosphere than proton events, which develop higher up, leading to less curved wavefronts.

We also produced hadron and neutrino simulations with a 1.5km1.5km1.5\text{km}1.5 km spacing. The antenna response was unfolded in the simulations to reconstruct the electric field generated by the air shower. Partially cleaned noise and time jitter (σt=5 nssubscript𝜎𝑡5 ns\sigma_{t}=5\text{ ns}italic_σ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = 5 ns) were added to the simulations. We calculated the time delay dtna𝑑𝑡𝑛𝑎dtnaitalic_d italic_t italic_n italic_a and corrected the time signal. The neutrino shower, developing deep into the atmosphere, exhibits a curved structure compared to the proton shower formed higher up. The fitting process involves modelling the relationship between the spatial coordinates (xi,yi)subscript𝑥𝑖subscript𝑦𝑖(x_{i},y_{i})( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) and the corrected time ti=todtnasubscript𝑡𝑖subscript𝑡𝑜𝑑𝑡𝑛𝑎t_{i}=t_{o}-dtnaitalic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_t start_POSTSUBSCRIPT italic_o end_POSTSUBSCRIPT - italic_d italic_t italic_n italic_a using an elliptical paraboloid equation: ax2+by2+cxy+dx+ey+f𝑎superscript𝑥2𝑏superscript𝑦2𝑐𝑥𝑦𝑑𝑥𝑒𝑦𝑓ax^{2}+by^{2}+cxy+dx+ey+fitalic_a italic_x start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_b italic_y start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + italic_c italic_x italic_y + italic_d italic_x + italic_e italic_y + italic_f. The goal is to determine the parameters (a,b,c,d,e,f)𝑎𝑏𝑐𝑑𝑒𝑓(a,b,c,d,e,f)( italic_a , italic_b , italic_c , italic_d , italic_e , italic_f ) that minimize the difference between the observed and predicted corrected time values. Fig.2 shows the fitted structure, where the neutrino shower (purple) with an interaction depth of 1500 g/cm21500superscript g/cm21500\text{ g/cm}^{2}1500 g/cm start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT has a more curved front than the proton shower, despite both having the same primary energy and arrival direction. We obtained fit parameters for hadron (proton, helium, nitrogen, and iron) and neutrino simulations over the energy range of 1120 EeV1120 EeV1-120\text{ EeV}1 - 120 EeV and zenith angles of 758575superscript8575-85^{\circ}75 - 85 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT.

Refer to caption
Figure 2: Corrected time signal with respect to the distance of the antenna position. The data points are fitted using the elliptical paraboloid equation.

3 Features for classification

The shower front reconstruction and fitting method are described in Sec.2. We obtain fit parameters for the hadron and neutrino events for the energy range of 1120 EeV1120 EeV1-120\text{ EeV}1 - 120 EeV and zenith angle range of 758575superscript8575-85^{\circ}75 - 85 start_POSTSUPERSCRIPT ∘ end_POSTSUPERSCRIPT. Fig.3 shows the fit parameters a,b,c𝑎𝑏𝑐a,b,citalic_a , italic_b , italic_c, from top left to right and d,e,f𝑑𝑒𝑓d,e,fitalic_d , italic_e , italic_f, from bottom left to right for proton and neutrino events over the energy range. We can see the separation in some parameters for neutrino and proton events.

Refer to caption
Figure 3: Fit parameters for the proton (orange) and neutrino (purple) events over the logarithmic energy range of 1018.41020.1 eVsuperscript1018.4superscript1020.1 eV10^{18.4}-10^{20.1}\text{ eV}10 start_POSTSUPERSCRIPT 18.4 end_POSTSUPERSCRIPT - 10 start_POSTSUPERSCRIPT 20.1 end_POSTSUPERSCRIPT eV.

We also use the observable Sbsubscript𝑆𝑏S_{b}italic_S start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT, the total muon signal in an event. It is defined as,

S=iSi×(RiRref)b𝑆subscript𝑖subscript𝑆𝑖superscriptsubscript𝑅𝑖subscript𝑅ref𝑏S=\sum_{i}{S_{i}}\times\left(\frac{R_{i}}{R_{\text{ref}}}\right)^{b}italic_S = ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × ( divide start_ARG italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_R start_POSTSUBSCRIPT ref end_POSTSUBSCRIPT end_ARG ) start_POSTSUPERSCRIPT italic_b end_POSTSUPERSCRIPT (1)

where the sum runs over the triggered stations, Sisubscript𝑆𝑖S_{i}italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the recorded signal in the ilimit-from𝑖i-italic_i -th station at a distance Risubscript𝑅𝑖R_{i}italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT from the reconstructed axis and Rrefsubscript𝑅refR_{\text{ref}}italic_R start_POSTSUBSCRIPT ref end_POSTSUBSCRIPT is a reference distance equal to 3500 m3500 m3500\text{ m}3500 m for this analysis for the horizontal footprint of the inclined showers. The exponent b𝑏bitalic_b is chosen equal to 4444 for maximizing the separation power between photons and hadrons. Fig.4(a) illustrates the distribution of Sbsubscript𝑆𝑏S_{b}italic_S start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT over the total energy range. The inclined hadron showers have significant muonic components at the detector level compared to neutrinos. Hence, the muon signal is higher for hadron showers. The signal stations measured by muon signal and radio emission are used for the analysis. The events go through shower front reconstruction, and the events with the minimum number of 6666 signal stations with SNR 7.227.227.227.22 are used for the classification. Fig.4(b) shows the number of stations with muon signal and radio emission over the energy range 1018.41020.1superscript1018.4superscript1020.110^{18.4}-10^{20.1}10 start_POSTSUPERSCRIPT 18.4 end_POSTSUPERSCRIPT - 10 start_POSTSUPERSCRIPT 20.1 end_POSTSUPERSCRIPT eV.

Refer to caption
(a) Total muon signal in an event over the energy range. Each data point represents one reconstructed event.
Refer to caption
(b) Number of stations with muon signal and radio signal over the energy range of 1018.41020.1 eVsuperscript1018.4superscript1020.1 eV10^{18.4}-10^{20.1}\text{ eV}10 start_POSTSUPERSCRIPT 18.4 end_POSTSUPERSCRIPT - 10 start_POSTSUPERSCRIPT 20.1 end_POSTSUPERSCRIPT eV.
Figure 4: Comparison of muon signal and number of stations over different energy ranges.

4 Machine Learning for the Classification of neutrinos and background

Refer to caption
(a) An example of a tree for tree depth 6 with Gini impurity--based data split. The data is divided into two classes, ’Neutrino’ and ’Hadron’, based on the feature threshold value and the Gini impurity.
Refer to caption
(b) Model Accuracy as a function of maximum tree depth.
Figure 5: Comparison of tree structure and model accuracy as a function of tree depth in a binary classification task.

The dataset contains the primary energy, zenith angle, fit parameters, muon Signal, Number of stations and primary particle (neutrino or background (p, fe, he, n, photons)). After processing the data, the next step is to select the features and the target label for model training. The fit parameters( ’a’, ’b’, ’c’, ’d’, ’e’, ’f’), Total muon Signal Sbsubscript𝑆𝑏S_{b}italic_S start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT, Number of stations nstatsubscript𝑛statn_{\text{stat}}italic_n start_POSTSUBSCRIPT stat end_POSTSUBSCRIPT are chosen as the input features for the model. These features were chosen because they were relevant in distinguishing neutrinos and hadrons. The data is split into training and testing sets. Splitting the data into training and testing sets allows the model to be trained on one portion of the data (80%) and tested on another portion (20%).

Hyperparameters are the parameters of a machine--learning model that need to be set before training begins. For Random Forest models, these hyperparameters define how the individual decision trees are built and how the ensemble of trees operates. We define a grid of hyperparameters for the Random Forest model, including options for the number of trees (n_estimators)n_estimators(\text{n\_estimators})( n_estimators ), maximum depth (max_depth)max_depth(\text{max\_depth})( max_depth ), and minimum samples required to split a node (min_samples_split)min_samples_split(\text{min\_samples\_split})( min_samples_split ) and to be at a leaf node (min_samples_leaf)min_samples_leaf(\text{min\_samples\_leaf})( min_samples_leaf ). This cross--validation technique is used to find the best combination of hyperparameters by testing different combinations on the training data and selecting the one with the highest accuracy. The n_estimators hyperparameter specifies the number of decision trees in the ensemble. More trees generally improve performance because the model averages over more trees, reducing variance and improving stability. As more trees typically lead to better accuracy, increasing the number of trees makes the model more computationally expensive and time--consuming to train and predict. The numbers used in the initial grid search were 50,1005010050,10050 , 100 and 200.200200.200 .

Decision trees evaluate split quality using metrics like Gini impurity and Entropy. Gini Impurity [1] measures node impurity, aiming to minimize it for effective splits. The tree algorithm selects features and thresholds to reduce impurity, splitting nodes based on whether feature values meet certain criteria. A split with a high Gini gain is highly effective, making it desirable for tree construction. Entropy [2], derived from information theory, measures uncertainty and quantifies class label disorder. It calculates information gain to select the feature that reduces uncertainty the most. Although Gini impurity is computationally simpler, entropy can sometimes yield better splits due to its theoretical foundation.

The depth of a tree indicates its complexity and is defined as the maximum number of nodes from the root to the farthest leaf. As shown in Fig.5(b), accuracy often improves with increasing depth, but deeper trees are prone to overfitting. Hyperparameters like max_depth𝑚𝑎𝑥_𝑑𝑒𝑝𝑡max\_depthitalic_m italic_a italic_x _ italic_d italic_e italic_p italic_t italic_h, min_samples_split𝑚𝑖𝑛_𝑠𝑎𝑚𝑝𝑙𝑒𝑠_𝑠𝑝𝑙𝑖𝑡min\_samples\_splititalic_m italic_i italic_n _ italic_s italic_a italic_m italic_p italic_l italic_e italic_s _ italic_s italic_p italic_l italic_i italic_t, and min_samples_leaf𝑚𝑖𝑛_𝑠𝑎𝑚𝑝𝑙𝑒𝑠_𝑙𝑒𝑎𝑓min\_samples\_leafitalic_m italic_i italic_n _ italic_s italic_a italic_m italic_p italic_l italic_e italic_s _ italic_l italic_e italic_a italic_f control tree growth, affecting model generalization. Pruning, governed by the cost complexity criterion Rα(T)=R(T)+α|T|subscript𝑅𝛼𝑇𝑅𝑇𝛼𝑇R_{\alpha}(T)=R(T)+\alpha\cdot|T|italic_R start_POSTSUBSCRIPT italic_α end_POSTSUBSCRIPT ( italic_T ) = italic_R ( italic_T ) + italic_α ⋅ | italic_T |, helps prevent overfitting by removing branches that offer minimal impurity reduction. After training, the model reveals the importance of each feature, which indicates how much each feature contributes to the classification. For the Gini method, feature importance is calculated by assessing how much each feature reduces impurity across all decision nodes. The resulting decision tree, built from these features and thresholds, provides decision rules that predict the primary_particle class labels for new instances.

4.1 Model Evaluation

Refer to caption
Figure 6: Model accuracy for different energy bins. Hadron events are classified with good accuracy across all energy ranges. Ultra-high energy neutrino events are also classified accurately. Neutrino events at low energy have low classification accuracy due to low statistics and the indistinguishable shower geometry from lower energy hadrons.

Model evaluation is essential in machine learning to assess how well a trained model performs on unseen data, ensuring its ability to generalize beyond the training set. The Random Forest model we developed was evaluated using multiple metrics, with accuracy being a simple yet widely used metric that measures overall correctness. The best model from the grid search was used to make predictions, achieving an accuracy of 0.96380.96380.96380.9638 for elliptical parabola fits in the ground plane. Different fitting equations were also tested, with parabola and hyperbola fits yielding lower accuracies of 0.8610.8610.8610.861 and 0.8320.8320.8320.832, respectively, as the shower plane geometry removes asymmetry structure, reducing classification accuracy. The accuracy was calculated as Accuracy=Number of Correct PredictionsTotal Number of PredictionsAccuracyNumber of Correct PredictionsTotal Number of Predictions\text{Accuracy}=\frac{\text{Number of Correct Predictions}}{\text{Total Number% of Predictions}}Accuracy = divide start_ARG Number of Correct Predictions end_ARG start_ARG Total Number of Predictions end_ARG, and was optimized through an ensemble of 100100100100 trees and hyperparameter tuning. We further evaluated the model by splitting data into energy bins and calculating accuracy for each class (Neutrino and Hadron) separately. Lower accuracy for low-energy neutrinos was attributed to low statistics and the similar geometry of low-energy hadrons and high-interaction depth neutrino showers.

Refer to caption
(a) Confusion matrix for a binary classification for Neutrino and Hadron, showing the number of correct and incorrect predictions for each class.
Refer to caption
(b) ROC Curve for the Random Forest Classifier performance for the binary classification of Neutrino and Hadron events. The AUC is 0.990.990.990.99, indicating the classifier’s good performance. The dotted line indicates the random classifier with an AUC of 0.50.50.50.5.
Figure 7: Comparison of classifier performance using confusion matrix and ROC curve for binary classification of Neutrino and Hadron events.

Fig.6 shows model accuracy across energy bins, highlighting good classification accuracy for ultra-high energy neutrino events but challenges at lower energies. A confusion matrix (Fig.7(a)) provides detailed insights into classification performance, showing an overall accuracy of 99.73%percent99.7399.73\%99.73 % for neutrino and proton classification. Additionally, ROC curves (Fig.7(b)) demonstrate the model’s performance, with an AUC of 0.990.990.990.99, indicating strong classification capability.

Refer to caption
(a)
Model Log Loss
Random Forest Classifier 0.0344
Decision Tree (Gini Impurity) 0.3186
Logistic Regression 0.4885
Random Classifier 1.0128
Majority Class Classifier 12.8247
(b)
Figure 8: (a) Log Loss calculations for different classifiers. The lower the Log Loss, the better the model’s performance. (b) Log Loss values for different classifiers.

Log Loss (Fig.8(a) and Table 8(b)) was also calculated, with the Random Forest achieving the lowest value of 0.034360.034360.034360.03436. In a further test using a set of 2222 neutrino and 1056105610561056 hadron events, the model achieved 98.02%percent98.0298.02\%98.02 % accuracy, with a log loss of 0.11750.11750.11750.1175. Fig.9(a) and Fig.9(b) illustrate these events’ confusion matrix and probability distribution, showing that the model effectively classified the rare neutrino events with high confidence.

Refer to caption
(a)
Refer to caption
(b)
Figure 9: (a) Actual and predicted hadron and neutrino events represented using Confusion matrix for the test set of 2222 neutrino and 1056105610561056 hadron events. (b) Probability distribution of neutrino and hadron events. The class with the highest average probability is selected as the final prediction for that sample. 2222 neutrino events have an average probability of 00, and the average probability for hadron events is 0.940.940.940.94.

References

  • [1] Breiman, Leo, Friedman, Jerome, Olshen, Richard A., & Stone, Charles J. (1984). Classification and Regression Trees.
  • [2] Shannon, Claude E. (1948). A Mathematical Theory of Communication. Bell System Technical Journal, 27(3), 379–423.
  • [3] Hastie, Trevor, Tibshirani, Robert, and Friedman, Jerome (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction.
  • [4] Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning.