[a]Abha R. Khakurdikar
Search for UHE neutrinos in the background of cosmic rays
Abstract
The main challenge in detecting ultra-high energy (UHE) neutrinos is discriminating a neutrino-induced shower in the background of showers initiated by ultra-high energy nuclei. The resulting shower development from neutrinos exhibits different characteristics from hadron-induced showers because neutrinos penetrate the atmosphere more deeply than hadrons.
This study focuses on simulations of highly inclined neutrino-induced showers above zenith angles, exploring an extensive energy range from to . These simulated showers have different ranges of interaction depths corresponding to each zenith angle, presenting diverse detection challenges.
Our methodology utilises timing data from radio antennas for the shower front calculation for extensive air showers induced by neutrinos and nuclei. Furthermore, we incorporate signals obtained from Water Cherenkov detectors and the spatial distribution of stations registering signals in both Water Cherenkov detectors and radio antennas. We aim to classify neutrino-induced showers and background events stemming from nuclei by harnessing a decision tree classifier employing the Gini impurity method. Our framework yields excellent accuracy for separating the neutrinos from the background.
The findings of this study offer significant advancements in the domain of UHE neutrino detection, shedding light on astrophysical phenomena associated with these elusive particles amidst the complex background of UHE nuclei.
1 Introduction
The primary challenge in detecting ultrahigh energy neutrinos is distinguishing a shower induced by neutrinos from the background of showers caused by UHE cosmic rays, which can be protons, heavy nuclei, and, to a much lesser extent, even photons. The simulations for downgoing neutrinos and hadronsinduced showers are produced at energies between and eV and at zenith angles between . Neutrinos interact deep into the atmosphere. Hence, neutrino showers are generally much closer to the ground; hence, the shower wavefront is more curved than hadroninduced showers. We develop a method to reconstruct the shower fronts and fit the showerfront structure. Our methodology utilises timing data from radio antennas for the shower front calculation for extensive air showers induced by neutrinos and nuclei. The method is explained in Sec.2. Furthermore, we incorporate signals obtained from Water Cherenkov detectors and the spatial distribution of stations registering signals in both Water Cherenkov detectors and radio antennas. We aim to classify neutrinoinduced showers and background events stemming from nuclei by harnessing a random forest classifier employing the Gini impurity method, explained in Sec.4. Our framework yields excellent accuracy for separating the neutrinos from the background. The findings of this study offer significant advancements in the domain of UHE neutrino detection, shedding light on astrophysical phenomena associated with these elusive particles amidst the complex background of UHE nuclei.
2 Showerfront reconstruction for neutrinos
Due to their weak interaction with matter, neutrinos can penetrate the Earth’s atmosphere deep before producing extensive air showers. These neutrino-induced showers differ from those caused by hadronic cosmic rays as they occur closer to the ground, resulting in a more curved shower wavefront than the flatter wavefronts of hadron-induced showers. We reconstruct the shower front structure using the timing information from antenna signals. Geometrically, the arrival of air shower particles at lateral distances from the shower axis is delayed compared to a planar front. The time delay, calculated as , where is the projected distance to the shower axis, helps derive the corrected time signal for shower front reconstruction.
Fig.1(a) shows the initial signal time as a function of distance to the shower axis for proton and neutrino-induced showers, revealing no clear distinction between events. After correcting the time delay, Fig.1(b) presents the corrected time as a function of , showing a more curved structure for the neutrino event due to its deeper interaction depth of . Both the plots have an antenna spacing of . This greater curvature results from neutrino-induced showers starting deeper in the atmosphere than proton events, which develop higher up, leading to less curved wavefronts.
We also produced hadron and neutrino simulations with a spacing. The antenna response was unfolded in the simulations to reconstruct the electric field generated by the air shower. Partially cleaned noise and time jitter () were added to the simulations. We calculated the time delay and corrected the time signal. The neutrino shower, developing deep into the atmosphere, exhibits a curved structure compared to the proton shower formed higher up. The fitting process involves modelling the relationship between the spatial coordinates and the corrected time using an elliptical paraboloid equation: . The goal is to determine the parameters that minimize the difference between the observed and predicted corrected time values. Fig.2 shows the fitted structure, where the neutrino shower (purple) with an interaction depth of has a more curved front than the proton shower, despite both having the same primary energy and arrival direction. We obtained fit parameters for hadron (proton, helium, nitrogen, and iron) and neutrino simulations over the energy range of and zenith angles of .
3 Features for classification
The shower front reconstruction and fitting method are described in Sec.2. We obtain fit parameters for the hadron and neutrino events for the energy range of and zenith angle range of . Fig.3 shows the fit parameters , from top left to right and , from bottom left to right for proton and neutrino events over the energy range. We can see the separation in some parameters for neutrino and proton events.
We also use the observable , the total muon signal in an event. It is defined as,
(1) |
where the sum runs over the triggered stations, is the recorded signal in the th station at a distance from the reconstructed axis and is a reference distance equal to for this analysis for the horizontal footprint of the inclined showers. The exponent is chosen equal to for maximizing the separation power between photons and hadrons. Fig.4(a) illustrates the distribution of over the total energy range. The inclined hadron showers have significant muonic components at the detector level compared to neutrinos. Hence, the muon signal is higher for hadron showers. The signal stations measured by muon signal and radio emission are used for the analysis. The events go through shower front reconstruction, and the events with the minimum number of signal stations with SNR are used for the classification. Fig.4(b) shows the number of stations with muon signal and radio emission over the energy range eV.
4 Machine Learning for the Classification of neutrinos and background
The dataset contains the primary energy, zenith angle, fit parameters, muon Signal, Number of stations and primary particle (neutrino or background (p, fe, he, n, photons)). After processing the data, the next step is to select the features and the target label for model training. The fit parameters( ’a’, ’b’, ’c’, ’d’, ’e’, ’f’), Total muon Signal , Number of stations are chosen as the input features for the model. These features were chosen because they were relevant in distinguishing neutrinos and hadrons. The data is split into training and testing sets. Splitting the data into training and testing sets allows the model to be trained on one portion of the data (80%) and tested on another portion (20%).
Hyperparameters are the parameters of a machinelearning model that need to be set before training begins. For Random Forest models, these hyperparameters define how the individual decision trees are built and how the ensemble of trees operates. We define a grid of hyperparameters for the Random Forest model, including options for the number of trees , maximum depth , and minimum samples required to split a node and to be at a leaf node . This crossvalidation technique is used to find the best combination of hyperparameters by testing different combinations on the training data and selecting the one with the highest accuracy. The n_estimators hyperparameter specifies the number of decision trees in the ensemble. More trees generally improve performance because the model averages over more trees, reducing variance and improving stability. As more trees typically lead to better accuracy, increasing the number of trees makes the model more computationally expensive and timeconsuming to train and predict. The numbers used in the initial grid search were and
Decision trees evaluate split quality using metrics like Gini impurity and Entropy. Gini Impurity [1] measures node impurity, aiming to minimize it for effective splits. The tree algorithm selects features and thresholds to reduce impurity, splitting nodes based on whether feature values meet certain criteria. A split with a high Gini gain is highly effective, making it desirable for tree construction. Entropy [2], derived from information theory, measures uncertainty and quantifies class label disorder. It calculates information gain to select the feature that reduces uncertainty the most. Although Gini impurity is computationally simpler, entropy can sometimes yield better splits due to its theoretical foundation.
The depth of a tree indicates its complexity and is defined as the maximum number of nodes from the root to the farthest leaf. As shown in Fig.5(b), accuracy often improves with increasing depth, but deeper trees are prone to overfitting. Hyperparameters like , , and control tree growth, affecting model generalization. Pruning, governed by the cost complexity criterion , helps prevent overfitting by removing branches that offer minimal impurity reduction. After training, the model reveals the importance of each feature, which indicates how much each feature contributes to the classification. For the Gini method, feature importance is calculated by assessing how much each feature reduces impurity across all decision nodes. The resulting decision tree, built from these features and thresholds, provides decision rules that predict the primary_particle class labels for new instances.
4.1 Model Evaluation
Model evaluation is essential in machine learning to assess how well a trained model performs on unseen data, ensuring its ability to generalize beyond the training set. The Random Forest model we developed was evaluated using multiple metrics, with accuracy being a simple yet widely used metric that measures overall correctness. The best model from the grid search was used to make predictions, achieving an accuracy of for elliptical parabola fits in the ground plane. Different fitting equations were also tested, with parabola and hyperbola fits yielding lower accuracies of and , respectively, as the shower plane geometry removes asymmetry structure, reducing classification accuracy. The accuracy was calculated as , and was optimized through an ensemble of trees and hyperparameter tuning. We further evaluated the model by splitting data into energy bins and calculating accuracy for each class (Neutrino and Hadron) separately. Lower accuracy for low-energy neutrinos was attributed to low statistics and the similar geometry of low-energy hadrons and high-interaction depth neutrino showers.
Fig.6 shows model accuracy across energy bins, highlighting good classification accuracy for ultra-high energy neutrino events but challenges at lower energies. A confusion matrix (Fig.7(a)) provides detailed insights into classification performance, showing an overall accuracy of for neutrino and proton classification. Additionally, ROC curves (Fig.7(b)) demonstrate the model’s performance, with an AUC of , indicating strong classification capability.
Model | Log Loss |
---|---|
Random Forest Classifier | 0.0344 |
Decision Tree (Gini Impurity) | 0.3186 |
Logistic Regression | 0.4885 |
Random Classifier | 1.0128 |
Majority Class Classifier | 12.8247 |
Log Loss (Fig.8(a) and Table 8(b)) was also calculated, with the Random Forest achieving the lowest value of . In a further test using a set of neutrino and hadron events, the model achieved accuracy, with a log loss of . Fig.9(a) and Fig.9(b) illustrate these events’ confusion matrix and probability distribution, showing that the model effectively classified the rare neutrino events with high confidence.
References
- [1] Breiman, Leo, Friedman, Jerome, Olshen, Richard A., & Stone, Charles J. (1984). Classification and Regression Trees.
- [2] Shannon, Claude E. (1948). A Mathematical Theory of Communication. Bell System Technical Journal, 27(3), 379–423.
- [3] Hastie, Trevor, Tibshirani, Robert, and Friedman, Jerome (2009). The Elements of Statistical Learning: Data Mining, Inference, and Prediction.
- [4] Bishop, Christopher M. (2006). Pattern Recognition and Machine Learning.