-
Markov Chain Variance Estimation: A Stochastic Approximation Approach
Authors:
Shubhada Agrawal,
Prashanth L. A.,
Siva Theja Maguluri
Abstract:
We consider the problem of estimating the asymptotic variance of a function defined on a Markov chain, an important step for statistical inference of the stationary mean. We design a novel recursive estimator that requires $O(1)$ computation at each step, does not require storing any historical samples or any prior knowledge of run-length, and has optimal $O(\frac{1}{n})$ rate of convergence for t…
▽ More
We consider the problem of estimating the asymptotic variance of a function defined on a Markov chain, an important step for statistical inference of the stationary mean. We design a novel recursive estimator that requires $O(1)$ computation at each step, does not require storing any historical samples or any prior knowledge of run-length, and has optimal $O(\frac{1}{n})$ rate of convergence for the mean-squared error (MSE) with provable finite sample guarantees. Here, $n$ refers to the total number of samples generated. Our estimator is based on linear stochastic approximation of an equivalent formulation of the asymptotic variance in terms of the solution of the Poisson equation.
We generalize our estimator in several directions, including estimating the covariance matrix for vector-valued functions, estimating the stationary variance of a Markov chain, and approximately estimating the asymptotic variance in settings where the state space of the underlying Markov chain is large. We also show applications of our estimator in average reward reinforcement learning (RL), where we work with asymptotic variance as a risk measure to model safety-critical applications. We design a temporal-difference type algorithm tailored for policy evaluation in this context. We consider both the tabular and linear function approximation settings. Our work paves the way for developing actor-critic style algorithms for variance-constrained RL.
△ Less
Submitted 22 September, 2024; v1 submitted 9 September, 2024;
originally announced September 2024.
-
Graph Classification with GNNs: Optimisation, Representation and Inductive Bias
Authors:
P. Krishna Kumar a,
Harish G. Ramaswamy
Abstract:
Theoretical studies on the representation power of GNNs have been centered around understanding the equivalence of GNNs, using WL-Tests for detecting graph isomorphism. In this paper, we argue that such equivalence ignores the accompanying optimization issues and does not provide a holistic view of the GNN learning process. We illustrate these gaps between representation and optimization with exam…
▽ More
Theoretical studies on the representation power of GNNs have been centered around understanding the equivalence of GNNs, using WL-Tests for detecting graph isomorphism. In this paper, we argue that such equivalence ignores the accompanying optimization issues and does not provide a holistic view of the GNN learning process. We illustrate these gaps between representation and optimization with examples and experiments. We also explore the existence of an implicit inductive bias (e.g. fully connected networks prefer to learn low frequency functions in their input space) in GNNs, in the context of graph classification tasks. We further prove theoretically that the message-passing layers in the graph, have a tendency to search for either discriminative subgraphs, or a collection of discriminative nodes dispersed across the graph, depending on the different global pooling layers used. We empirically verify this bias through experiments over real-world and synthetic datasets. Finally, we show how our work can help in incorporating domain knowledge via attention based architectures, and can evince their capability to discriminate coherent subgraphs.
△ Less
Submitted 23 August, 2024; v1 submitted 17 August, 2024;
originally announced August 2024.
-
Kinetics of vapor-liquid transition of active matter system under quasi one-dimensional confinement
Authors:
Parameshwaran A,
Bhaskar Sen Gupta
Abstract:
We study the kinetics of vapor-liquid phase separation in a quasi one-dimensional confined active matter system using molecular dynamics simulations. Activity is invoked via the Vicsek rule, while passive interaction follows the Lennard-Jones potential. With the system density near the vapor branch, the evolution morphology features disconnected liquid clusters. In the passive limit, coarsening be…
▽ More
We study the kinetics of vapor-liquid phase separation in a quasi one-dimensional confined active matter system using molecular dynamics simulations. Activity is invoked via the Vicsek rule, while passive interaction follows the Lennard-Jones potential. With the system density near the vapor branch, the evolution morphology features disconnected liquid clusters. In the passive limit, coarsening begins with nucleation, followed by an evaporation-condensation growth mechanism, leading to a metastable state without complete phase separation. We aim to understand the impact of Vicsek-like self-propulsion on the structure and growth of these clusters. Our key finding is that Vicsek activity results in a distinct growth mechanism, notably rapid cluster growth and the breakdown of the metastable state through ballistic aggregation. Relevant growth laws are analyzed and explained using appropriate theoretical models.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
EnterpriseEM: Fine-tuned Embeddings for Enterprise Semantic Search
Authors:
Kamalkumar Rathinasamy,
Jayarama Nettar,
Amit Kumar,
Vishal Manchanda,
Arun Vijayakumar,
Ayush Kataria,
Venkateshprasanna Manjunath,
Chidambaram GS,
Jaskirat Singh Sodhi,
Shoeb Shaikh,
Wasim Akhtar Khan,
Prashant Singh,
Tanishq Dattatray Ige,
Vipin Tiwari,
Rajab Ali Mondal,
Harshini K,
S Reka,
Chetana Amancharla,
Faiz ur Rahman,
Harikrishnan P A,
Indraneel Saha,
Bhavya Tiwary,
Navin Shankar Patel,
Pradeep T S,
Balaji A J
, et al. (2 additional authors not shown)
Abstract:
Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components.…
▽ More
Enterprises grapple with the significant challenge of managing proprietary unstructured data, hindering efficient information retrieval. This has led to the emergence of AI-driven information retrieval solutions, designed to adeptly extract relevant insights to address employee inquiries. These solutions often leverage pre-trained embedding models and generative models as foundational components. While pre-trained embeddings may exhibit proximity or disparity based on their original training objectives, they might not fully align with the unique characteristics of enterprise-specific data, leading to suboptimal alignment with the retrieval goals of enterprise environments. In this paper, we propose a comprehensive methodology for contextualizing pre-trained embedding models to enterprise environments, covering the entire process from data preparation to model fine-tuning and evaluation. By adapting the embeddings to better suit the retrieval tasks prevalent in enterprises, we aim to enhance the performance of information retrieval solutions. We discuss the process of fine-tuning, its effect on retrieval accuracy, and the potential benefits for enterprise information management. Our findings demonstrate the efficacy of fine-tuned embedding models in improving the precision and relevance of search results in enterprise settings.
△ Less
Submitted 27 September, 2024; v1 submitted 18 May, 2024;
originally announced June 2024.
-
Kinetic temperature of massive star-forming molecular clumps measured with formaldehyde V. The massive filament DR21
Authors:
X. Zhao,
X. D. Tang,
C. Henkel,
Y. Gong,
Y. Lin,
D. L. Li,
Y. X. He,
Y. P. Ao,
X. Lu,
T. Liu,
Y. Sun,
K. Wang,
X. P. Chen,
J. Esimbek,
J. J. Zhou,
J. W. Wu,
J. J. Qiu,
X. W. Zheng,
J. S. Li,
C. S. Luo,
Q. Zhao
Abstract:
The kinetic temperature structure of the massive filament DR21 has been mapped using the IRAM 30 m telescope. This mapping employed the para-H$_2$CO triplet ($J_{\rm K_aK_c}$ = 3$_{03}$--2$_{02}$, 3$_{22}$--2$_{21}$, and 3$_{21}$--2$_{20}$) on a scale of $\sim$0.1 pc. By modeling the averaged line ratios of para-H$_{2}$CO with RADEX under non-LTE assumptions, the kinetic temperature of the dense g…
▽ More
The kinetic temperature structure of the massive filament DR21 has been mapped using the IRAM 30 m telescope. This mapping employed the para-H$_2$CO triplet ($J_{\rm K_aK_c}$ = 3$_{03}$--2$_{02}$, 3$_{22}$--2$_{21}$, and 3$_{21}$--2$_{20}$) on a scale of $\sim$0.1 pc. By modeling the averaged line ratios of para-H$_{2}$CO with RADEX under non-LTE assumptions, the kinetic temperature of the dense gas was derived at a density of $n$(H$_{2}$) = 10$^{5}$ cm$^{-3}$. The para-H$_2$CO lines reveal significantly higher temperatures than NH$_3$ (1,1)/(2,2) and FIR wavelengths. The dense clumps appear to correlate with the notable kinetic temperature. Among the four dense cores (N44, N46, N48, and N54), temperature gradients are observed on a scale of $\sim$0.1-0.3 pc. This suggests that the warm dense gas is influenced by internal star formation activity. With the exception of N54, the temperature profiles of these cores were fitted with power-law indices ranging from $-$0.3 to $-$0.5. This indicates that the warm dense gas is heated by radiation emitted from internally embedded protostar(s) and/or clusters. While there is no direct evidence supporting the idea that the dense gas is heated by shocks resulting from a past explosive event in the DR21 region, our measurements toward the DR21W1 region provide compelling evidence that the dense gas is indeed heated by shocks originating from the western DR21 flow. Higher temperatures appear to be associated with turbulence. The physical parameters of the dense gas in the DR21 filament exhibit a remarkable similarity to the results obtained in OMC-1 and N113. This may imply that the physical mechanisms governing the dynamics and thermodynamics of dense gas traced by H$_{2}$CO in diverse star formation regions may be dominated by common underlying principles despite variations in specific environmental conditions. (abbreviated)
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
The light quantum mechanism of PCR efficiency oscillation with gold nanoparticle concentration
Authors:
Huan-Huan Fang,
Yong-Cong Chen,
Ze-Fei Liu,
Xiao-Mei Zhu,
Ping Ao
Abstract:
The widespread application of nanomaterials in polymerase chain reaction (PCR) technology has opened new avenues for improving detection methods in the biomedical field. Recent experiments (Chem. Eur. J. 2023, e202203513) have revealed oscillatory behavior between PCR efficiency and the concentration of gold nanoparticles in the pM range, potentially linked to the long-range Coulomb interactions a…
▽ More
The widespread application of nanomaterials in polymerase chain reaction (PCR) technology has opened new avenues for improving detection methods in the biomedical field. Recent experiments (Chem. Eur. J. 2023, e202203513) have revealed oscillatory behavior between PCR efficiency and the concentration of gold nanoparticles in the pM range, potentially linked to the long-range Coulomb interactions among charged colloidal particles and the quantum size effect of nanoparticle electronic states. Through Monte Carlo simulation, we discovered that the radial distribution function of gold nanoparticles in solution gradually exhibits peak characteristics with increasing charge, triggering coherent photon behavior in Rayleigh scattering within the solution, thereby influencing the efficiency of reusing released photons in the PCR chain reaction. The study demonstrates that the oscillation period aligns with the wavelength of downstream reaction photons, while their energy matches the width of energy levels near the Fermi level of gold nanoparticles. The latter can absorb and store electron states internally, promoting upstream PCR reactions through subsequent re-release, and compensating for energy deficiencies through the Boltzmann distribution of electrons. This work is poised to advance the application of PCR-specific precise detection methods in the field of quantum biotechnology.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Observations of the Crab Nebula with MACE (Major Atmospheric Cherenkov Experiment)
Authors:
Borwankar C.,
Sharma M.,
Hariharan J.,
Venugopal K.,
Godambe S.,
Mankuzhyil N.,
Chandra P.,
Khurana M.,
Pathania A.,
Chouhan N.,
Dhar V. K.,
Thubstan R.,
Norlha S.,
Keshavananda,
Sarkar D.,
Dar Z. A.,
Kotwal S. V.,
Godiyal S.,
Kushwaha C. P.,
Singh K. K.,
Das M. P.,
Tolamatti A.,
Ghosal B.,
Chanchalani K.,
Pandey P.
, et al. (10 additional authors not shown)
Abstract:
The Major Atmospheric Cherenkov Experiment (MACE) is a large size (21m) Imaging Atmospheric Cherenkov Telescope (IACT) installed at an altitude of 4270m above sea level at Hanle, Ladakh in northern India. Here we report the detection of Very High Energy (VHE) gamma-ray emission from Crab Nebula above 80 GeV. We analysed ~15 hours of data collected at low zenith angle between November 2022 and Febr…
▽ More
The Major Atmospheric Cherenkov Experiment (MACE) is a large size (21m) Imaging Atmospheric Cherenkov Telescope (IACT) installed at an altitude of 4270m above sea level at Hanle, Ladakh in northern India. Here we report the detection of Very High Energy (VHE) gamma-ray emission from Crab Nebula above 80 GeV. We analysed ~15 hours of data collected at low zenith angle between November 2022 and February 2023. The energy spectrum is well described by a log-parabola function with a flux of ~(3.46 +/- 0.26stat) x 10-10 TeV-1 cm-2 s-1, at 400 GeV with spectral index of 2.09 +/- 0.06stat and a curvature parameter of 0.08 +/- 0.07stat. The gamma-rays are detected in an energy range spanning from 80 GeV to ~5 TeV. The energy resolution improves from ~34% at an analysis energy threshold of 80 GeV to ~21% above 1 TeV. The daily light curve and the spectral energy distribution obtained for the Crab Nebula is in agreement with previous measurements, considering statistical and systematic uncertainties.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Entangled biphoton generation in myelin sheath
Authors:
Zefei Liu,
Yong-Cong Chen,
Ping Ao
Abstract:
Consciousness within the brain hinges on the synchronized activities of millions of neurons, but the mechanism responsible for orchestrating such synchronization remains elusive. In this study, we employ cavity quantum electrodynamics (cQED) to explore entangled biphoton generation through cascade emission in the vibration spectrum of C-H bonds within the lipid molecules' tails. The results indica…
▽ More
Consciousness within the brain hinges on the synchronized activities of millions of neurons, but the mechanism responsible for orchestrating such synchronization remains elusive. In this study, we employ cavity quantum electrodynamics (cQED) to explore entangled biphoton generation through cascade emission in the vibration spectrum of C-H bonds within the lipid molecules' tails. The results indicate that the cylindrical cavity formed by a myelin sheath can facilitate spontaneous photon emission from the vibrational modes and generate a significant number of entangled photon pairs. The abundance of C-H bond vibration units in neurons can therefore serve as a source of quantum entanglement resources for the nervous system. The finding may offer insight into the brain's ability to leverage these resources for quantum information transfer, thereby elucidating a potential source for the synchronized activity of neurons.
△ Less
Submitted 21 August, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
Radiation resistance of fine-grained ceramics Y2.5Nd0.5Al5O12 under Xe-ions irradiation
Authors:
Alekseeva L. S.,
Nokhrin A. V.,
Yunin P. A.,
Nazarov A. A.,
Orlova A. I.,
Skuratov V. A.,
Issatov A. T.,
Kovylin R. S.,
Murashov A. A.,
Boldin M. S.,
Voronin A. V.,
Chuvil'deev V. N.,
Zotov D. A.
Abstract:
Oxide Y2.5Nd0.5Al5O12 (YAG:Nd) with garnet structure was synthesized in the powder and ceramics forms. Fine-grained YAG:Nd ceramics with a relative density of ~99% were obtained by the Spark Plasma Sintering method (SPS). The radiation resistance of ceramics was studied under irradiation with swift Xe-ions (E = 146 MeV). A gradient defect structure is formed in irradiated ceramics, varying from la…
▽ More
Oxide Y2.5Nd0.5Al5O12 (YAG:Nd) with garnet structure was synthesized in the powder and ceramics forms. Fine-grained YAG:Nd ceramics with a relative density of ~99% were obtained by the Spark Plasma Sintering method (SPS). The radiation resistance of ceramics was studied under irradiation with swift Xe-ions (E = 146 MeV). A gradient defect structure is formed in irradiated ceramics, varying from layer to layer. The strained YAG phase formed as a result of Xe ions irradiation is localized in a near-surface layer with a thickness of ~5 μm. Full amorphization of the samples was observed under irradiation with a fluence of 1x10^13 cm-2. The calculated critical fluence was 6.5x10^12 cm-2, which corresponded to 0.03 dpa. The microhardness of the surface layers of irradiated ceramics is less than the central layers, and, in general, decreases with increasing ion fluence.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Solar Flare Prediction and Feature Selection using Light Gradient Boosting Machine Algorithm
Authors:
Vysakh P. A.,
Prateek Mayank
Abstract:
Solar flares are among the most severe space weather phenomena, and they have the capacity to generate radiation storms and radio disruptions on Earth. The accurate prediction of solar flare events remains a significant challenge, requiring continuous monitoring and identification of specific features that can aid in forecasting this phenomenon, particularly for different classes of solar flares.…
▽ More
Solar flares are among the most severe space weather phenomena, and they have the capacity to generate radiation storms and radio disruptions on Earth. The accurate prediction of solar flare events remains a significant challenge, requiring continuous monitoring and identification of specific features that can aid in forecasting this phenomenon, particularly for different classes of solar flares. In this study, we aim to forecast C and M class solar flares utilising a machine-learning algorithm, namely the Light Gradient Boosting Machine. We have utilised a dataset spanning 9 years, obtained from the Space-weather Helioseismic and Magnetic Imager Active Region Patches (SHARP), with a temporal resolution of 1 hour. A total of 37 flare features were considered in our analysis, comprising of 25 active region parameters and 12 flare history features. To address the issue of class imbalance in solar flare data, we employed the Synthetic Minority Oversampling Technique (SMOTE). We used two labeling approaches in our study: a fixed 24-hour window label and a varying window that considers the changing nature of solar activity. Then, the developed machine learning algorithm was trained and tested using forecast verification metrics, with an emphasis on evaluating the true skill statistic (TSS). Furthermore, we implemented a feature selection algorithm to determine the most significant features from the pool of 37 features that could distinguish between flaring and non-flaring active regions. We found that utilising a limited set of useful features resulted in improved prediction performance. For the 24-hour prediction window, we achieved a TSS of 0.63 (0.69) and accuracy of 0.90 (0.97) for $\geq$C ($\geq$M) class solar flares.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Optimization of utility-based shortfall risk: A non-asymptotic viewpoint
Authors:
Sumedh Gupte,
Prashanth L. A.,
Sanjay P. Bhat
Abstract:
We consider the problems of estimation and optimization of utility-based shortfall risk (UBSR), which is a popular risk measure in finance. In the context of UBSR estimation, we derive a non-asymptotic bound on the mean-squared error of the classical sample average approximation (SAA) of UBSR. Next, in the context of UBSR optimization, we derive an expression for the UBSR gradient under a smooth p…
▽ More
We consider the problems of estimation and optimization of utility-based shortfall risk (UBSR), which is a popular risk measure in finance. In the context of UBSR estimation, we derive a non-asymptotic bound on the mean-squared error of the classical sample average approximation (SAA) of UBSR. Next, in the context of UBSR optimization, we derive an expression for the UBSR gradient under a smooth parameterization. This expression is a ratio of expectations, both of which involve the UBSR. We use SAA for the numerator as well as denominator in the UBSR gradient expression to arrive at a biased gradient estimator. We derive non-asymptotic bounds on the estimation error, which show that our gradient estimator is asymptotically unbiased. We incorporate the aforementioned gradient estimator into a stochastic gradient (SG) algorithm for UBSR optimization. Finally, we derive non-asymptotic bounds that quantify the rate of convergence of our SG algorithm for UBSR optimization.
△ Less
Submitted 30 March, 2024; v1 submitted 28 October, 2023;
originally announced October 2023.
-
Malware Classification using Deep Neural Networks: Performance Evaluation and Applications in Edge Devices
Authors:
Akhil M R,
Adithya Krishna V Sharma,
Harivardhan Swamy,
Pavan A,
Ashray Shetty,
Anirudh B Sathyanarayana
Abstract:
With the increasing extent of malware attacks in the present day along with the difficulty in detecting modern malware, it is necessary to evaluate the effectiveness and performance of Deep Neural Networks (DNNs) for malware classification. Multiple DNN architectures can be designed and trained to detect and classify malware binaries. Results demonstrate the potential of DNNs in accurately classif…
▽ More
With the increasing extent of malware attacks in the present day along with the difficulty in detecting modern malware, it is necessary to evaluate the effectiveness and performance of Deep Neural Networks (DNNs) for malware classification. Multiple DNN architectures can be designed and trained to detect and classify malware binaries. Results demonstrate the potential of DNNs in accurately classifying malware with high accuracy rates observed across different malware types. Additionally, the feasibility of deploying these DNN models on edge devices to enable real-time classification, particularly in resource-constrained scenarios proves to be integral to large IoT systems. By optimizing model architectures and leveraging edge computing capabilities, the proposed methodologies achieve efficient performance even with limited resources. This study contributes to advancing malware detection techniques and emphasizes the significance of integrating cybersecurity measures for the early detection of malware and further preventing the adverse effects caused by such attacks. Optimal considerations regarding the distribution of security tasks to edge devices are addressed to ensure that the integrity and availability of large scale IoT systems are not compromised due to malware attacks, advocating for a more resilient and secure digital ecosystem.
△ Less
Submitted 21 August, 2023;
originally announced October 2023.
-
Sulfur isotope ratios in the Large Magellanic Cloud
Authors:
Y. Gong,
C. Henkel,
K. M. Menten,
C. -H. R. Chen,
Z. Y. Zhang,
Y. T. Yan,
A. Weiss,
N. Langer,
J. Z. Wang,
R. Q. Mao,
X. D. Tang,
W. Yang,
Y. P. Ao,
M. Wang
Abstract:
Sulfur isotope ratios have emerged as a promising tool for tracing stellar nucleosynthesis, quantifying stellar populations, and investigating the chemical evolution of galaxies. While extensively studied in the Milky Way, in extragalactic environments they remain largely unexplored. We focus on investigating the sulfur isotope ratios in the Large Magellanic Cloud (LMC) to gain insights into sulfu…
▽ More
Sulfur isotope ratios have emerged as a promising tool for tracing stellar nucleosynthesis, quantifying stellar populations, and investigating the chemical evolution of galaxies. While extensively studied in the Milky Way, in extragalactic environments they remain largely unexplored. We focus on investigating the sulfur isotope ratios in the Large Magellanic Cloud (LMC) to gain insights into sulfur enrichment in this nearby system and to establish benchmarks for such ratios in metal-poor galaxies. We conducted pointed observations of CS and its isotopologues toward N113, one of the most prominent star-formation regions in the LMC, utilizing the Atacama Pathfinder EXperiment 12~m telescope. We present the first robust detection of C$^{33}$S in the LMC by successfully identifying two C$^{33}$S transitions on a large scale of $\sim$5 pc. Our measurements result in an accurate determination of the $^{34}$S/$^{33}$S isotope ratio, which is 2.0$\pm$0.2. Our comparative analysis indicates that the $^{32}$S/$^{33}$S and $^{34}$S/$^{33}$S isotope ratios are about a factor of 2 lower in the LMC than in the Milky Way. Our findings suggest that the low $^{34}$S/$^{33}$S isotope ratio in the LMC can be attributed to a combination of the age effect, low metallicity, and star formation history.
△ Less
Submitted 18 October, 2023; v1 submitted 26 September, 2023;
originally announced September 2023.
-
Sparse electrophysiological source imaging predicts aging-related gait speed slowing
Authors:
Vega-Hernandez,
Mayrim,
Galan-Garcia,
Lidice,
Perez-Hidalgo-Gato,
Jhoanna,
Ontivero-Ortega,
Marlis,
Garcia-Agustin,
Daysi,
Garcia-Reyes,
Ronaldo,
Bosch-Bayard,
Jorge,
Marinazzo,
Daniele,
Martinez-Montes,
Eduardo,
Valdes-Sosa,
Pedro A
Abstract:
Objective: We seek stable Electrophysiological Source Imaging (ESI) biomarkers associated with Gait Speed (GS) as a measure of functional decline. Towards this end we determine the predictive value of ESI activation and connectivity patterns of resting-state EEG Theta rhythm on physical performance decline measured by a slowing GS in aging individuals. Methods: As potential biomarkers related to G…
▽ More
Objective: We seek stable Electrophysiological Source Imaging (ESI) biomarkers associated with Gait Speed (GS) as a measure of functional decline. Towards this end we determine the predictive value of ESI activation and connectivity patterns of resting-state EEG Theta rhythm on physical performance decline measured by a slowing GS in aging individuals. Methods: As potential biomarkers related to GS changes, we estimate ESI using flexible sparse/smooth/non-negative models (NN-SLASSO), from which activation ESI (aESI) and connectivity ESI (cESI) features are selected using the Stable Sparse Classifier method. Results and Conclusions: Novel sparse aESI models outperformed traditional methods such as the LORETA family. The models combining aESI and cESI features improved the predictability of GS changes. Selected biomarkers from activation/connectivity patterns were localized to orbitofrontal and temporal cortical regions. Significance: The proposed methodology contributes to understanding the activation and connectivity of ESI complex patterns related to GS, providing potential biomarker features for GS slowing. Given the known relationship between GS decline and cognitive impairment, this preliminary work suggests it might be applied to other, more complex measures of healthy and pathological aging. Importantly, it might allow an ESI-based evaluation of rehabilitation programs.
△ Less
Submitted 22 August, 2023; v1 submitted 20 July, 2023;
originally announced July 2023.
-
Hall anomaly by vacancies vs fragments of vortex lattice: Quantitative analyses of new evidences
Authors:
Ruonan Guo,
Yong-Cong Chen,
Da Jiang,
Ping Ao
Abstract:
Despite numerous recent studies on the Hall anomaly following the discovery of cuprate superconductivity, the origin of this phenomenon remains contentious. We demonstrate that a previously proposed mechanism, in which vacancy-on-fragment of the flux-line crystal, provides an alternative explanation for the observations of $\rm{Bi_{2}Sr_{2}CaCu_{2}O_{x}}$ thin films made by Nitzav and Kanigel [Phy…
▽ More
Despite numerous recent studies on the Hall anomaly following the discovery of cuprate superconductivity, the origin of this phenomenon remains contentious. We demonstrate that a previously proposed mechanism, in which vacancy-on-fragment of the flux-line crystal, provides an alternative explanation for the observations of $\rm{Bi_{2}Sr_{2}CaCu_{2}O_{x}}$ thin films made by Nitzav and Kanigel [Phys. Rev. B. 107, 094516 (2023)], without the need for adjustable parameters. Specifically, we show that the power-law behavior of $ρ_{xy}$ over $ρ_{xx}$, with and without sign reversal, is consistent with the picture of vacancies versus fragments. Interestingly, we find that the effective length of vortex lines is consistently 1.5 unit cells (UC) across different experiments, independent of film thickness.
△ Less
Submitted 27 November, 2023; v1 submitted 16 June, 2023;
originally announced June 2023.
-
A Cubic-regularized Policy Newton Algorithm for Reinforcement Learning
Authors:
Mizhaan Prajit Maniyar,
Akash Mondal,
Prashanth L. A.,
Shalabh Bhatnagar
Abstract:
We consider the problem of control in the setting of reinforcement learning (RL), where model information is not available. Policy gradient algorithms are a popular solution approach for this problem and are usually shown to converge to a stationary point of the value function. In this paper, we propose two policy Newton algorithms that incorporate cubic regularization. Both algorithms employ the…
▽ More
We consider the problem of control in the setting of reinforcement learning (RL), where model information is not available. Policy gradient algorithms are a popular solution approach for this problem and are usually shown to converge to a stationary point of the value function. In this paper, we propose two policy Newton algorithms that incorporate cubic regularization. Both algorithms employ the likelihood ratio method to form estimates of the gradient and Hessian of the value function using sample trajectories. The first algorithm requires an exact solution of the cubic regularized problem in each iteration, while the second algorithm employs an efficient gradient descent-based approximation to the cubic regularized problem. We establish convergence of our proposed algorithms to a second-order stationary point (SOSP) of the value function, which results in the avoidance of traps in the form of saddle points. In particular, the sample complexity of our algorithms to find an $ε$-SOSP is $O(ε^{-3.5})$, which is an improvement over the state-of-the-art sample complexity of $O(ε^{-4.5})$.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
The decay $τ\to 3πν_τ$ and axial-vector meson $a_1$ in the NJL model
Authors:
Volkov M. K.,
Nurlan K.,
Pivovarov A. A
Abstract:
The branching fractions of $τ\to π^+ π^-π^- ν_τ$ and $τ\to π^- 2π^0ν_τ$ are calculated within the chiral NJL model. Features of the axial-vector $a_1$ meson which plays an important role in describing the $τ$ decays are discussed. Permissible values for the mass and width of the $a_1$ meson are considered in accordance with the latest experiments.
The branching fractions of $τ\to π^+ π^-π^- ν_τ$ and $τ\to π^- 2π^0ν_τ$ are calculated within the chiral NJL model. Features of the axial-vector $a_1$ meson which plays an important role in describing the $τ$ decays are discussed. Permissible values for the mass and width of the $a_1$ meson are considered in accordance with the latest experiments.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
Description of the decay $τ\to K ππν_τ$ in the NJL type chiral quark model
Authors:
Volkov M. K.,
Pivovarov A. A.,
Nurlan K
Abstract:
The branching fractions of the decays $τ^- \to K^- π^+ π^- ν_τ$, $τ^- \to K^- π^0 π^0 ν_τ$ and $τ^- \to \bar{K}^0 π^- π^0ν_τ$ are calculated in the $U(3)\times U(3)$ Nambu--Jona-Lasinio type chiral model. Four intermediate channels are considered: the contact channel and the channels with the intermediate axial vector, vector and pseudoscalar mesons. The additional resonance states $K^* π$ and…
▽ More
The branching fractions of the decays $τ^- \to K^- π^+ π^- ν_τ$, $τ^- \to K^- π^0 π^0 ν_τ$ and $τ^- \to \bar{K}^0 π^- π^0ν_τ$ are calculated in the $U(3)\times U(3)$ Nambu--Jona-Lasinio type chiral model. Four intermediate channels are considered: the contact channel and the channels with the intermediate axial vector, vector and pseudoscalar mesons. The additional resonance states $K^* π$ and $K ρ$ are taken into account in all channels. It is shown that the main contributions to the widths of these decays are given by the axial vector channel. In the axial vector channel, the intermediate resonances $K_1(1270)$ and $K_1(1400)$ are taken into account. The obtained results are in satisfactory agreement with the known experimental data.
△ Less
Submitted 5 March, 2023;
originally announced March 2023.
-
Impact of the Design Parameters on the Microwave Displacement Sensor Performance
Authors:
Premsai Regalla,
Praveenkumar A V
Abstract:
The investigations have been conducted on the involved design parameters to analyze the behavior of the microwave displacement sensor output characteristics. To implement this, a dielectric resonator loaded to a reflection mode operated microstrip line circuit is proposed. For the proposed reflection mode sensor, the sensor features like resonant frequency, impedance matching position, sensitivity…
▽ More
The investigations have been conducted on the involved design parameters to analyze the behavior of the microwave displacement sensor output characteristics. To implement this, a dielectric resonator loaded to a reflection mode operated microstrip line circuit is proposed. For the proposed reflection mode sensor, the sensor features like resonant frequency, impedance matching position, sensitivity, and dynamic range are sensitive to the displacement of DR to microstrip line. The impact of the substrate shape and size, and resonators geometrical properties are numerically analyzed and experimentally validated by using VNA. The sensor analysis shows good matching between both HFSS simulations and VNA measurements.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Multi Mode (Reflection and Transmission) Operated Dielectric Resonator based Displacement Sensor
Authors:
Premsai Regalla,
Praveen Kumar A. V
Abstract:
The authors propose a multi mode operated dielectric resonator based displacement sensor. A cylindrical dielectric resonator is coupled to two 50 ohm microstrip lines to enable the multi mode operating feature. The parameters like S11 in reflection mode and S21 in transmission mode are sensitive to DR displacements w.r.to transmission lines. The HFSS is carried out for numerical analysis and the f…
▽ More
The authors propose a multi mode operated dielectric resonator based displacement sensor. A cylindrical dielectric resonator is coupled to two 50 ohm microstrip lines to enable the multi mode operating feature. The parameters like S11 in reflection mode and S21 in transmission mode are sensitive to DR displacements w.r.to transmission lines. The HFSS is carried out for numerical analysis and the fabricated sensor prototype is realized with VNA measurement. Both the HFSS and VNA responses are in good agreement with each other for this multi mode operation. This reveals the selection of mode freedom to the choice of operation.
△ Less
Submitted 23 February, 2023;
originally announced February 2023.
-
Progression of Digital-Receiver Architecture: From MWA to SKA1-Low,and beyond
Authors:
Girish B. S.,
Harshavardhan Reddy S.,
Shiv Sethi,
Srivani K. S.,
Abhishek R.,
Ajithkumar B.,
Sahana Bhattramakki,
Kaushal Buch,
Sandeep Chaudhuri,
Yashwant Gupta,
Kamini P. A.,
Sanjay Kudale,
Madhavi S.,
Mekhala Muley,
Prabu T.,
Raghunathan A.,
Shelton G. J
Abstract:
Backed by advances in digital electronics, signal processing, computation, and storage technologies, aperture arrays, which had strongly influenced the design of telescopes in the early years of radio astronomy, have made a comeback. Amid all these developments, an international effort to design and build the world's largest radio telescope, the Square Kilometre Array (SKA), is ongoing. With its v…
▽ More
Backed by advances in digital electronics, signal processing, computation, and storage technologies, aperture arrays, which had strongly influenced the design of telescopes in the early years of radio astronomy, have made a comeback. Amid all these developments, an international effort to design and build the world's largest radio telescope, the Square Kilometre Array (SKA), is ongoing. With its vast collecting area of 1 sq-km, the SKA is envisaged to provide unsurpassed sensitivity and leverage technological advances to implement a complex receiver to provide a large field of view through multiple beams on the sky. Many pathfinders and precursor aperture array telescopes for the SKA, operating in the frequency range of 10-300 MHz, have been constructed and operationalized to obtain valuable feedback on scientific, instrumental, and functional aspects. This review article looks explicitly into the progression of digital-receiver architecture from the Murchison Widefield Array (precursor) to the SKA1-Low. It highlights the technological advances in analog-to-digital converters (ADCs),field-programmable gate arrays (FPGAs), and central processing unit-graphics processing unit (CPU-GPU) hybrid platforms around which complex digital signal processing systems implement efficient channelizers, beamformers, and correlators. The article concludes with a preview of the design of a new generation signal processing platform based on radio frequency system-on-chip (RFSoC).
△ Less
Submitted 17 January, 2023;
originally announced January 2023.
-
The 100-month Swift catalogue of supergiant fast X-ray transients II. SFXT diagnostics from outburst properties
Authors:
Romano P.,
Evans P. A.,
Bozzo E.,
Mangano V.,
Vercellone S.,
Guidorzi C.,
Ducci L.,
Kennea J. A.,
Barthelmy S. D.,
Palmer D. M.,
Krimm H. A.,
Cenko B.
Abstract:
Supergiant Fast X-ray Transients (SFXT) are High Mass X-ray Binaries displaying X-ray outbursts reaching peak luminosities of 10$^{38}$ erg/s and spend most of their life in more quiescent states with luminosities as low as 10$^{32}$-10$^{33}$ erg/s. The main goal of our comprehensive and uniform analysis of the SFXT Swift triggers is to provide tools to predict whether a transient which has no kn…
▽ More
Supergiant Fast X-ray Transients (SFXT) are High Mass X-ray Binaries displaying X-ray outbursts reaching peak luminosities of 10$^{38}$ erg/s and spend most of their life in more quiescent states with luminosities as low as 10$^{32}$-10$^{33}$ erg/s. The main goal of our comprehensive and uniform analysis of the SFXT Swift triggers is to provide tools to predict whether a transient which has no known X-ray counterpart may be an SFXT candidate. These tools can be exploited for the development of future missions exploring the variable X-ray sky through large FoV instruments. We examined all available data on outbursts of SFXTs that triggered the Swift/BAT collected between 2005-08-30 and 2014-12-31, in particular those for which broad-band data, including the Swift/XRT ones, are also available. We processed all BAT and XRT data uniformly with the Swift Burst Analyser to produce spectral evolution dependent flux light curves for each outburst. The BAT data allowed us to infer useful diagnostics to set SFXT triggers apart from the general GRB population, showing that SFXTs give rise uniquely to image triggers and are simultaneously very long, faint, and `soft' hard-X-ray transients. The BAT data alone can discriminate very well the SFXTs from other fast transients such as anomalous X-ray pulsars and soft gamma repeaters. However, to distinguish SFXTs from, for instance, accreting millisecond X-ray pulsars and jetted tidal disruption events, the XRT data collected around the time of the BAT triggers are decisive. The XRT observations of 35/52 SFXT BAT triggers show that in the soft X-ray energy band, SFXTs display a decay in flux from the peak of the outburst of at least 3 orders of magnitude within a day and rarely undergo large re-brightening episodes, favouring in most cases a rapid decay down to the quiescent level within 3-5 days (at most). [Abridged]
△ Less
Submitted 9 December, 2022;
originally announced December 2022.
-
The MUSE second-generation VLT instrument
Authors:
Bacon R.,
Accardo M.,
Adjali L.,
Anwand H.,
Bauer S.,
Biswas I.,
Blaizot J.,
Boudon D.,
Brau-Nogue S.,
Brinchmann J.,
Caillier P.,
Capoani L.,
Carollo C. M.,
Contini T.,
Couderc P.,
Daguise E.,
Deiries S.,
Delabre B.,
Dreizler S.,
Dubois J. P.,
Dupieux M.,
Dupuy C.,
Emsellem E.,
Fechner T.,
Fleischmann A.
, et al. (43 additional authors not shown)
Abstract:
The Multi Unit Spectroscopic Explorer (MUSE) is a second-generation VLT panoramic integral-field spectrograph currently in manufacturing, assembly and integration phase. MUSE has a field of 1x1 arcmin2 sampled at 0.2x0.2 arcsec2 and is assisted by the VLT ground layer adaptive optics ESO facility using four laser guide stars. The instrument is a large assembly of 24 identical high performance inte…
▽ More
The Multi Unit Spectroscopic Explorer (MUSE) is a second-generation VLT panoramic integral-field spectrograph currently in manufacturing, assembly and integration phase. MUSE has a field of 1x1 arcmin2 sampled at 0.2x0.2 arcsec2 and is assisted by the VLT ground layer adaptive optics ESO facility using four laser guide stars. The instrument is a large assembly of 24 identical high performance integral field units, each one composed of an advanced image slicer, a spectrograph and a 4kx4k detector. In this paper we review the progress of the manufacturing and report the performance achieved with the first integral field unit.
△ Less
Submitted 30 November, 2022;
originally announced November 2022.
-
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
Authors:
Gandharv Patil,
Prashanth L. A.,
Dheeraj Nagaraj,
Doina Precup
Abstract:
We study the finite-time behaviour of the popular temporal difference (TD) learning algorithm when combined with tail-averaging. We derive finite time bounds on the parameter error of the tail-averaged TD iterate under a step-size choice that does not require information about the eigenvalues of the matrix underlying the projected TD fixed point. Our analysis shows that tail-averaged TD converges…
▽ More
We study the finite-time behaviour of the popular temporal difference (TD) learning algorithm when combined with tail-averaging. We derive finite time bounds on the parameter error of the tail-averaged TD iterate under a step-size choice that does not require information about the eigenvalues of the matrix underlying the projected TD fixed point. Our analysis shows that tail-averaged TD converges at the optimal $O\left(1/t\right)$ rate, both in expectation and with high probability. In addition, our bounds exhibit a sharper rate of decay for the initial error (bias), which is an improvement over averaging all iterates. We also propose and analyse a variant of TD that incorporates regularisation. From analysis, we conclude that the regularised version of TD is useful for problems with ill-conditioned features.
△ Less
Submitted 19 September, 2024; v1 submitted 12 October, 2022;
originally announced October 2022.
-
Chromosome Segmentation Analysis Using Image Processing Techniques and Autoencoders
Authors:
Amritha S Pallavoor,
Prajwal A,
Sundareshan TS,
Sreekanth K Pallavoor
Abstract:
Chromosome analysis and identification from metaphase images is a critical part of cytogenetics based medical diagnosis. It is mainly used for identifying constitutional, prenatal and acquired abnormalities in the diagnosis of genetic diseases and disorders. The process of identification of chromosomes from metaphase is a tedious one and requires trained personnel and several hours to perform. Cha…
▽ More
Chromosome analysis and identification from metaphase images is a critical part of cytogenetics based medical diagnosis. It is mainly used for identifying constitutional, prenatal and acquired abnormalities in the diagnosis of genetic diseases and disorders. The process of identification of chromosomes from metaphase is a tedious one and requires trained personnel and several hours to perform. Challenge exists especially in handling touching, overlapping and clustered chromosomes in metaphase images, which if not segmented properly would result in wrong classification. We propose a method to automate the process of detection and segmentation of chromosomes from a given metaphase image, and in using them to classify through a Deep CNN architecture to know the chromosome type. We have used two methods to handle the separation of overlapping chromosomes found in metaphases - one method involving watershed algorithm followed by autoencoders and the other a method purely based on watershed algorithm. These methods involve a combination of automation and very minimal manual effort to perform the segmentation, which produces the output. The manual effort ensures that human intuition is taken into consideration, especially in handling touching, overlapping and cluster chromosomes. Upon segmentation, individual chromosome images are then classified into their respective classes with 95.75\% accuracy using a Deep CNN model. Further, we impart a distribution strategy to classify these chromosomes from the given output (which typically could consist of 46 individual images in a normal scenario for human beings) into its individual classes with an accuracy of 98\%. Our study helps conclude that pure manual effort involved in chromosome segmentation can be automated to a very good level through image processing techniques to produce reliable and satisfying results.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
The ASTRI Mini-Array of Cherenkov Telescopes at the Observatorio del Teide
Authors:
Scuderi S.,
Giuliani A.,
Pareschi G.,
Tosti G.,
Catalano O.,
Amato E.,
Antonelli L. A.,
Becerra Gonzáles J.,
Bellassai G.,
Bigongiari,
C.,
Biondo B.,
Böttcher M.,
Bonanno G.,
Bonnoli G.,
Bruno P.,
Bulgarelli A.,
Canestrari R.,
Capalbi M.,
Caraveo P.,
Cardillo M.,
Conforti V.,
Contino G.,
Corpora M.,
Costa A.
, et al. (73 additional authors not shown)
Abstract:
The ASTRI Mini-Array (MA) is an INAF project to build and operate a facility to study astronomical sources emitting at very high-energy in the TeV spectral band. The ASTRI MA consists of a group of nine innovative Imaging Atmospheric Cherenkov telescopes. The telescopes will be installed at the Teide Astronomical Observatory of the Instituto de Astrofisica de Canarias (IAC) in Tenerife (Canary Isl…
▽ More
The ASTRI Mini-Array (MA) is an INAF project to build and operate a facility to study astronomical sources emitting at very high-energy in the TeV spectral band. The ASTRI MA consists of a group of nine innovative Imaging Atmospheric Cherenkov telescopes. The telescopes will be installed at the Teide Astronomical Observatory of the Instituto de Astrofisica de Canarias (IAC) in Tenerife (Canary Islands, Spain) on the basis of a host agreement with INAF. Thanks to its expected overall performance, better than those of current Cherenkov telescopes' arrays for energies above \sim 5 TeV and up to 100 TeV and beyond, the ASTRI MA will represent an important instrument to perform deep observations of the Galactic and extra-Galactic sky at these energies.
△ Less
Submitted 9 August, 2022;
originally announced August 2022.
-
A Gradient Smoothed Functional Algorithm with Truncated Cauchy Random Perturbations for Stochastic Optimization
Authors:
Akash Mondal,
Prashanth L. A.,
Shalabh Bhatnagar
Abstract:
In this paper, we present a stochastic gradient algorithm for minimizing a smooth objective function that is an expectation over noisy cost samples, and only the latter are observed for any given parameter. Our algorithm employs a gradient estimation scheme with random perturbations, which are formed using the truncated Cauchy distribution from the delta sphere. We analyze the bias and variance of…
▽ More
In this paper, we present a stochastic gradient algorithm for minimizing a smooth objective function that is an expectation over noisy cost samples, and only the latter are observed for any given parameter. Our algorithm employs a gradient estimation scheme with random perturbations, which are formed using the truncated Cauchy distribution from the delta sphere. We analyze the bias and variance of the proposed gradient estimator. Our algorithm is found to be particularly useful in the case when the objective function is non-convex, and the parameter dimension is high. From an asymptotic convergence analysis, we establish that our algorithm converges almost surely to the set of stationary points of the objective function and obtains the asymptotic convergence rate. We also show that our algorithm avoids unstable equilibria, implying convergence to local minima. Further, we perform a non-asymptotic convergence analysis of our algorithm. In particular, we establish here a non-asymptotic bound for finding an epsilon-stationary point of the non-convex objective function. Finally, we demonstrate numerically through simulations that the performance of our algorithm outperforms GSF, SPSA, and RDSA by a significant margin over a few non-convex settings and further validate its performance over convex (noisy) objectives.
△ Less
Submitted 30 June, 2023; v1 submitted 30 July, 2022;
originally announced August 2022.
-
A review of Deep learning Techniques for COVID-19 identification on Chest CT images
Authors:
Briskline Kiruba S,
Petchiammal A,
D. Murugan
Abstract:
The current COVID-19 pandemic is a serious threat to humanity that directly affects the lungs. Automatic identification of COVID-19 is a challenge for health care officials. The standard gold method for diagnosing COVID-19 is Reverse Transcription Polymerase Chain Reaction (RT-PCR) to collect swabs from affected people. Some limitations encountered while collecting swabs are related to accuracy an…
▽ More
The current COVID-19 pandemic is a serious threat to humanity that directly affects the lungs. Automatic identification of COVID-19 is a challenge for health care officials. The standard gold method for diagnosing COVID-19 is Reverse Transcription Polymerase Chain Reaction (RT-PCR) to collect swabs from affected people. Some limitations encountered while collecting swabs are related to accuracy and longtime duration. Chest CT (Computed Tomography) is another test method that helps healthcare providers quickly identify the infected lung areas. It was used as a supporting tool for identifying COVID-19 in an earlier stage. With the help of deep learning, the CT imaging characteristics of COVID-19. Researchers have proven it to be highly effective for COVID-19 CT image classification. In this study, we review the recent deep learning techniques that can use to detect the COVID-19 disease. Relevant studies were collected by various databases such as Web of Science, Google Scholar, and PubMed. Finally, we compare the results of different deep learning models, and CT image analysis is discussed.
△ Less
Submitted 6 August, 2022; v1 submitted 29 July, 2022;
originally announced August 2022.
-
Paddy Leaf diseases identification on Infrared Images based on Convolutional Neural Networks
Authors:
Petchiammal A,
Briskline Kiruba S,
D. Murugan
Abstract:
Agriculture is the mainstay of human society because it is an essential need for every organism. Paddy cultivation is very significant so far as humans are concerned, largely in the Asian continent, and it is one of the staple foods. However, plant diseases in agriculture lead to depletion in productivity. Plant diseases are generally caused by pests, insects, and pathogens that decrease productiv…
▽ More
Agriculture is the mainstay of human society because it is an essential need for every organism. Paddy cultivation is very significant so far as humans are concerned, largely in the Asian continent, and it is one of the staple foods. However, plant diseases in agriculture lead to depletion in productivity. Plant diseases are generally caused by pests, insects, and pathogens that decrease productivity to a large scale if not controlled within a particular time. Eventually, one cannot see an increase in paddy yield. Accurate and timely identification of plant diseases can help farmers mitigate losses due to pests and diseases. Recently, deep learning techniques have been used to identify paddy diseases and overcome these problems. This paper implements a convolutional neural network (CNN) based on a model and tests a public dataset consisting of 636 infrared image samples with five paddy disease classes and one healthy class. The proposed model proficiently identified and classified paddy diseases of five different types and achieved an accuracy of 88.28%
△ Less
Submitted 6 August, 2022; v1 submitted 29 July, 2022;
originally announced August 2022.
-
Stochastic Gradient Descent and Anomaly of Variance-flatness Relation in Artificial Neural Networks
Authors:
Xia Xiong,
Yong-Cong Chen,
Chunxiao Shi,
Ping Ao
Abstract:
Stochastic gradient descent (SGD), a widely used algorithm in deep-learning neural networks has attracted continuing studies for the theoretical principles behind its success. A recent work reports an anomaly (inverse) relation between the variance of neural weights and the landscape flatness of the loss function driven under SGD [Feng & Tu, PNAS 118, 0027 (2021)]. To investigate this seemingly vi…
▽ More
Stochastic gradient descent (SGD), a widely used algorithm in deep-learning neural networks has attracted continuing studies for the theoretical principles behind its success. A recent work reports an anomaly (inverse) relation between the variance of neural weights and the landscape flatness of the loss function driven under SGD [Feng & Tu, PNAS 118, 0027 (2021)]. To investigate this seemingly violation of statistical physics principle, the properties of SGD near fixed points are analysed via a dynamic decomposition method. Our approach recovers the true "energy" function under which the universal Boltzmann distribution holds. It differs from the cost function in general and resolves the paradox raised by the the anomaly. The study bridges the gap between the classical statistical mechanics and the emerging discipline of artificial intelligence, with potential for better algorithms to the latter.
△ Less
Submitted 12 June, 2023; v1 submitted 11 July, 2022;
originally announced July 2022.
-
COVID-19 Disease Identification on Chest-CT images using CNN and VGG16
Authors:
Briskline Kiruba S,
Petchiammal A,
D. Murugan
Abstract:
A newly identified coronavirus disease called COVID-19 mainly affects the human respiratory system. COVID-19 is an infectious disease caused by a virus originating in Wuhan, China, in December 2019. Early diagnosis is the primary challenge of health care providers. In the earlier stage, medical organizations were dazzled because there were no proper health aids or medicine to detect a COVID-19. A…
▽ More
A newly identified coronavirus disease called COVID-19 mainly affects the human respiratory system. COVID-19 is an infectious disease caused by a virus originating in Wuhan, China, in December 2019. Early diagnosis is the primary challenge of health care providers. In the earlier stage, medical organizations were dazzled because there were no proper health aids or medicine to detect a COVID-19. A new diagnostic tool RT-PCR (Reverse Transcription Polymerase Chain Reaction), was introduced. It collects swab specimens from the patient's nose or throat, where the COVID-19 virus gathers. This method has some limitations related to accuracy and testing time. Medical experts suggest an alternative approach called CT (Computed Tomography) that can quickly diagnose the infected lung areas and identify the COVID-19 in an earlier stage. Using chest CT images, computer researchers developed several deep learning models identifying the COVID-19 disease. This study presents a Convolutional Neural Network (CNN) and VGG16-based model for automated COVID-19 identification on chest CT images. The experimental results using a public dataset of 14320 CT images showed a classification accuracy of 96.34% and 96.99% for CNN and VGG16, respectively.
△ Less
Submitted 9 July, 2022;
originally announced July 2022.
-
Topology, Vorticity and Limit Cycle in a Stabilized Kuramoto-Sivashinsky Equation
Authors:
Yong-Cong Chen,
Chunxiao Shi,
J. M. Kosterlitz,
Xiaomei Zhu,
Ping Ao
Abstract:
A noisy stabilized Kuramoto-Sivashinsky equation is analyzed by stochastic decomposition. For values of control parameter for which periodic stationary patterns exist, the dynamics can be decomposed into diffusive and transverse parts which act on a stochastic potential. The relative positions of stationary states in the stochastic global potential landscape can be obtained from the topology spann…
▽ More
A noisy stabilized Kuramoto-Sivashinsky equation is analyzed by stochastic decomposition. For values of control parameter for which periodic stationary patterns exist, the dynamics can be decomposed into diffusive and transverse parts which act on a stochastic potential. The relative positions of stationary states in the stochastic global potential landscape can be obtained from the topology spanned by the low-lying eigenmodes which inter-connect them. Numerical simulations confirm the predicted landscape. The transverse component also predicts a universal class of vortex like circulations around fixed points. These drive nonlinear drifting and limit cycle motion of the underlying periodic structure in certain regions of parameter space. Our findings might be relevant in studies of other nonlinear systems such as deep learning neural networks.
△ Less
Submitted 6 July, 2022;
originally announced July 2022.
-
A Quantitative Analysis of Dynamic Mechanisms Regulating HIV Latency and Activation
Authors:
Ruiqi Xiong,
Yang Su,
Ping Ao
Abstract:
Objective: The reservoir of human immunodeficiency virus (HIV) latently infected cells is the major obstacle for eradication of acquired immunodeficiency syndrome (AIDS). Due to the noisy environment and multiple influencing factors in the organism, current dynamical models cannot reach a common understanding of the molecular mechanism of HIV latency. In this work, through a new dynamical structur…
▽ More
Objective: The reservoir of human immunodeficiency virus (HIV) latently infected cells is the major obstacle for eradication of acquired immunodeficiency syndrome (AIDS). Due to the noisy environment and multiple influencing factors in the organism, current dynamical models cannot reach a common understanding of the molecular mechanism of HIV latency. In this work, through a new dynamical structure decomposition, the deterministic part of the equation can be separated from the stochastic noise. Thus, the fixed-point analysis of ordinary differential equation is enough to obtain the different steady states of the system. Methods: We established a dynamical model of HIV transcription process by using continuous stochastic differential equations, which simplifies the dimensions of equations needed to describe the system and increases the explorable space of the model. Different states between latency and activation of virus and their relations can be intuitively represented by potential functions and probability distribution functions. Results: Based on our model, the influence of different dynamical parameters on stability is quantitatively analyzed, the parameter ranges of the system in bistable and monostable states are obtained respectively. The theoretical basis of this work is verified by comparing the effects of different factors on dynamic bifurcation with the results of biological experiments. Conclusion: This paper goes beyond previous discrete stochastic methods, and can quantitatively analyze the dynamic mechanism of HIV transcriptional regulation through ordinary differential equations, which is beneficial to the promotion to deal with the high-dimensional situation, and further study the occurrence and development of AIDS in vivo, so as to guide the design of experiments and search for clinical treatment.
△ Less
Submitted 22 June, 2022; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Hall anomaly by vacancies in pinned lattice of vortices: A quantitative analysis on the thin-film data of BSCCO
Authors:
Ruonan Guo,
Yong-Cong Chen,
Ping Ao
Abstract:
Hall anomaly, as appears in the mixed-state Hall resistivity of type-II superconductors, has had numerous theories but yet a consensus on its origin. In this work, we conducted a quantitative analysis of the magnetotransport measurements on BSCCO thin films by Zhao et al. [Phys. Rev. Lett. 122, 247001 (2019)] and validate a previously proposed vacancy mechanism [cf. J. Phys. Condens. Matter. 10, L…
▽ More
Hall anomaly, as appears in the mixed-state Hall resistivity of type-II superconductors, has had numerous theories but yet a consensus on its origin. In this work, we conducted a quantitative analysis of the magnetotransport measurements on BSCCO thin films by Zhao et al. [Phys. Rev. Lett. 122, 247001 (2019)] and validate a previously proposed vacancy mechanism [cf. J. Phys. Condens. Matter. 10, L677 (1998)] with many-body vortex correlations for the phenomenon. The model attributes the Hall anomaly to the motion of vacancies in pinned fragments of vortex lattice. Its validity is first examined by an exploration on the vortex states near the Kosterlitz-Thouless transition on the vortex crystal. Comparisons are then carried out between the measured activation energies with the calculated creation energy of the vortex-anti-vortex pair and the vacancy energy on the flux-line lattice, with no adjustable parameter. Our analysis elucidates the theoretical basis and prerequisites of the vacancy model. In particular, the vacancy activation energies are an order of magnitude smaller than that of a sole vortex line. The proposed mechanism may provide a macro-theoretical framework for other studies.
△ Less
Submitted 7 June, 2022;
originally announced June 2022.
-
Paddy Doctor: A Visual Image Dataset for Automated Paddy Disease Classification and Benchmarking
Authors:
Petchiammal A,
Briskline Kiruba S,
D. Murugan,
Pandarasamy A
Abstract:
One of the critical biotic stress factors paddy farmers face is diseases caused by bacteria, fungi, and other organisms. These diseases affect plants' health severely and lead to significant crop loss. Most of these diseases can be identified by regularly observing the leaves and stems under expert supervision. In a country with vast agricultural regions and limited crop protection experts, manual…
▽ More
One of the critical biotic stress factors paddy farmers face is diseases caused by bacteria, fungi, and other organisms. These diseases affect plants' health severely and lead to significant crop loss. Most of these diseases can be identified by regularly observing the leaves and stems under expert supervision. In a country with vast agricultural regions and limited crop protection experts, manual identification of paddy diseases is challenging. Thus, to add a solution to this problem, it is necessary to automate the disease identification process and provide easily accessible decision support tools to enable effective crop protection measures. However, the lack of availability of public datasets with detailed disease information limits the practical implementation of accurate disease detection systems. This paper presents \emph{Paddy Doctor}, a visual image dataset for identifying paddy diseases. Our dataset contains 16,225 annotated paddy leaf images across 13 classes (12 diseases and normal leaf). We benchmarked the \emph{Paddy Doctor} dataset using a Convolutional Neural Network (CNN) and four transfer learning based models (VGG16, MobileNet, Xception, and ResNet34). The experimental results showed that ResNet34 achieved the highest F1-score of 97.50%. We release our dataset and reproducible code in the open source for community use.
△ Less
Submitted 25 November, 2022; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Three new brown dwarfs and a massive hot Jupiter revealed by TESS around early-type stars
Authors:
Psaridi A.,
Bouchy F.,
Lendl M.,
Grieves N.,
Stassun K. G.,
Carmichael T.,
Gill S.,
Peña Rojas P. A.,
Gan T.,
Shporer A.,
Bieryla A.,
Christiansen J. L,
Crossfield I. J. M,
Galland F. Hooton M. J. Jenkins J. M,
Jenkins J. S,
Latham D. W,
Lund M. B,
Rodriguez J. E,
Ting E. B,
Udry S. Ulmer-Moll S. Wittenmyer R. A,
Yanzhe Zhang Y.,
Zhou G.,
Addison B.,
Cointepas M.,
Collins K. A.
, et al. (18 additional authors not shown)
Abstract:
The detection and characterization of exoplanets and brown dwarfs (BDs) around massive AF-type stars is essential to investigate and constrain the impact of stellar mass on planet properties. However, such targets are still poorly explored in radial velocity (RV) surveys because they only feature a small number of stellar lines and those are usually broadened and blended by stellar rotation as wel…
▽ More
The detection and characterization of exoplanets and brown dwarfs (BDs) around massive AF-type stars is essential to investigate and constrain the impact of stellar mass on planet properties. However, such targets are still poorly explored in radial velocity (RV) surveys because they only feature a small number of stellar lines and those are usually broadened and blended by stellar rotation as well as stellar jitter. As a result, the available information about the formation and evolution of planets and BDs around hot stars is limited. We aim to increase the sample and precisely measure the masses and eccentricities of giant planets and BDs transiting AF-type stars detected by the Transiting Exoplanet Survey Satellite (TESS). We followed bright (V < 12 mag) stars with $T_{\mathrm{eff}}$ > 6200 K that host giant companions (R > 7 $\mathrm{R_{\rm \oplus}}$) using ground-based photometric observations as well as high precision RV measurements from the CORALIE, CHIRON, TRES, FEROS, and MINERVA-Australis spectrographs. In the context, we present the discovery of three BD companions, TOI-629b, TOI-1982b, and TOI-2543b, and one massive planet, TOI-1107b. From the joint analysis we find the BDs have masses between 66 and 68 $\mathrm{M_{\rm Jup}}$, periods between 7.54 and 17.17 days, and radii between 0.95 and 1.11 $\mathrm{R_{\rm Jup}}$. The hot Jupiter TOI-1107b has an orbital period of 4.08 days, a radius of 1.30 $\mathrm{R_{\rm Jup}}$, and a mass of 3.35 $\mathrm{M_{\rm Jup}}$. As a by-product of this program, we identified four low-mass eclipsing components (TOI-288b, TOI-446b, TOI-478b, and TOI-764b). Both TOI-1107b and TOI-1982b present an anomalously inflated radius with respect to the age of these systems. TOI-629 is among the hottest stars with a known transiting brown dwarf. TOI-629b and TOI-1982b are among the most eccentric brown dwarfs.
△ Less
Submitted 22 May, 2022;
originally announced May 2022.
-
A Survey of Risk-Aware Multi-Armed Bandits
Authors:
Vincent Y. F. Tan,
Prashanth L. A.,
Krishna Jagannathan
Abstract:
In several applications such as clinical trials and financial portfolio optimization, the expected value (or the average reward) does not satisfactorily capture the merits of a drug or a portfolio. In such applications, risk plays a crucial role, and a risk-aware performance measure is preferable, so as to capture losses in the case of adverse events. This survey aims to consolidate and summarise…
▽ More
In several applications such as clinical trials and financial portfolio optimization, the expected value (or the average reward) does not satisfactorily capture the merits of a drug or a portfolio. In such applications, risk plays a crucial role, and a risk-aware performance measure is preferable, so as to capture losses in the case of adverse events. This survey aims to consolidate and summarise the existing research on risk measures, specifically in the context of multi-armed bandits. We review various risk measures of interest, and comment on their properties. Next, we review existing concentration inequalities for various risk measures. Then, we proceed to defining risk-aware bandit problems, We consider algorithms for the regret minimization setting, where the exploration-exploitation trade-off manifests, as well as the best-arm identification setting, which is a pure exploration problem -- both in the context of risk-sensitive measures. We conclude by commenting on persisting challenges and fertile areas for future research.
△ Less
Submitted 11 May, 2022;
originally announced May 2022.
-
Investigation of the effect of the grain sizes on the dynamic strength of the fine-grained alumina ceramics obtained by Spark Plasma Sintering
Authors:
Melekhin N. V.,
Boldin M. S.,
Bragov A. M.,
Filippov A. R.,
Popov A. A.,
Shotin S. V.,
Nokhrin A. V.,
Chuvil'deev V. N.,
Murashov A. A.,
Tabachkova N. Yu.
Abstract:
The results of dynamic strength tests of the alumina ceramics with various grain sizes are presented. The ceramics were obtained by Spark Plasma Sintering (SPS) of industrial submicron and fine Al2O3 powders. The heating up was performed with the rate of 10 oC/min; the grain sizes in the ceramics was controlled by varying the SPS temperature and the heating rate as well as by varying the initial s…
▽ More
The results of dynamic strength tests of the alumina ceramics with various grain sizes are presented. The ceramics were obtained by Spark Plasma Sintering (SPS) of industrial submicron and fine Al2O3 powders. The heating up was performed with the rate of 10 oC/min; the grain sizes in the ceramics was controlled by varying the SPS temperature and the heating rate as well as by varying the initial sizes of the Al2O3 particles in the powders. The ceramics had a high density (over 98%) and a uniform fine-grained microstructure (the mean grain sizes varied from 0.8 to 13.4 mkm). The dynamic compressing tests were carried out by modified Kolsky method with using split Hopkinson pressure bar. The tests were performed at room temperature using a 20-mm PG-20 gas gun with the strain rate of ~10^3 s-1. The dependence of the dynamic ultimate strength of alumina on the grain size was found for the first time to have a non-monotonous character (with a maximum). The maximum value of the dynamic ultimate compression strength (SY = 1060 MPa) was provided at the mean grain size of ~2.9-3 mkm. The reduction of SY for alumina in the range of submicron grain sizes was shown to originate from the reduction of the relative density of the ceramics sintered at lower SPS temperatures.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
A policy gradient approach for optimization of smooth risk measures
Authors:
Nithia Vijayan,
Prashanth L. A
Abstract:
We propose policy gradient algorithms for solving a risk-sensitive reinforcement learning (RL) problem in on-policy as well as off-policy settings. We consider episodic Markov decision processes, and model the risk using the broad class of smooth risk measures of the cumulative discounted reward. We propose two template policy gradient algorithms that optimize a smooth risk measure in on-policy an…
▽ More
We propose policy gradient algorithms for solving a risk-sensitive reinforcement learning (RL) problem in on-policy as well as off-policy settings. We consider episodic Markov decision processes, and model the risk using the broad class of smooth risk measures of the cumulative discounted reward. We propose two template policy gradient algorithms that optimize a smooth risk measure in on-policy and off-policy RL settings, respectively. We derive non-asymptotic bounds that quantify the rate of convergence of our proposed algorithms to a stationary point of the smooth risk measure. As special cases, we establish that our algorithms apply to optimization of mean-variance and distortion risk measures, respectively.
△ Less
Submitted 23 June, 2024; v1 submitted 22 February, 2022;
originally announced February 2022.
-
Performance Enhancement of C-V2X Mode 4 Utilizing Multiple Candidate Single-subframe Resources
Authors:
G. P. Wijesiri N. B. A,
T. Samarasinghe,
J. Haapola
Abstract:
Prioritization of data streams in cellular vehicle-to-everything (C-V2X) may lead to unfavorable packet delays in low priority streams. This paper studies the allocation of multiple candidate single-subframe resources (CSRs) per vehicle as a solution. It proposes a methodology to determine the number of CSRs for each vehicle based on the number of total vehicles, and to assign the multiple data st…
▽ More
Prioritization of data streams in cellular vehicle-to-everything (C-V2X) may lead to unfavorable packet delays in low priority streams. This paper studies the allocation of multiple candidate single-subframe resources (CSRs) per vehicle as a solution. It proposes a methodology to determine the number of CSRs for each vehicle based on the number of total vehicles, and to assign the multiple data streams among them for simultaneous transmission. The numerical results highlight the achievable delay gains of the proposed approach, and its negligible impact on packet collisions.
△ Less
Submitted 22 February, 2022;
originally announced February 2022.
-
Optimizing Neural Network for Computer Vision task in Edge Device
Authors:
Ranjith M S,
S Parameshwara,
Pavan Yadav A,
Shriganesh Hegde
Abstract:
The field of computer vision has grown very rapidly in the past few years due to networks like convolution neural networks and their variants. The memory required to store the model and computational expense are very high for such a network limiting it to deploy on the edge device. Many times, applications rely on the cloud but that makes it hard for working in real-time due to round-trip delays.…
▽ More
The field of computer vision has grown very rapidly in the past few years due to networks like convolution neural networks and their variants. The memory required to store the model and computational expense are very high for such a network limiting it to deploy on the edge device. Many times, applications rely on the cloud but that makes it hard for working in real-time due to round-trip delays. We overcome these problems by deploying the neural network on the edge device itself. The computational expense for edge devices is reduced by reducing the floating-point precision of the parameters in the model. After this the memory required for the model decreases and the speed of the computation increases where the performance of the model is least affected. This makes an edge device to predict from the neural network all by itself.
△ Less
Submitted 2 October, 2021;
originally announced October 2021.
-
Exploring Water Governing System Fit Through a Statistical Mechanics Approach
Authors:
Peyman Arjomandi A.,
Seyedalireza Seyedi,
Ehsan Tavakoli Nabavi,
Saeid Alikhani
Abstract:
Water governing systems are twisted with complex interplays among levels and scales which embody their structures. Typically, the mismatch between human-generated and natural systems produces externalities and inefficiencies reflectable in spatial scales. The largely known problem of fit in water governance is investigated to detect the issues of fit between administrative/institutional scales and…
▽ More
Water governing systems are twisted with complex interplays among levels and scales which embody their structures. Typically, the mismatch between human-generated and natural systems produces externalities and inefficiencies reflectable in spatial scales. The largely known problem of fit in water governance is investigated to detect the issues of fit between administrative/institutional scales and the hydrological one in a lake basin. To implement the idea, constraining the level of analysis interlinked to the concentrated levels of administration in spatial scales, the fit of the governing system was analyzed by means of statistical mechanics. Modeling the structure of water demand/supply governing system in a given region through the Curie-Weiss Mean Field approximation, the system cost in relation to its structure and fit was appraised and compared with two other conceptual structures in the Urmia Lake Basin in Iran. The methodology articulated an analysis framework for exploring the effectiveness of the formulated water demand/supply governing system and its fit to the relevant hydrological system. The findings of this study may help developing strategies to encourage adaptations, rescaling/reforms for effective watershed management.
△ Less
Submitted 13 August, 2021;
originally announced September 2021.
-
Kinetic temperature of massive star-forming molecular clumps measured with formaldehyde IV. The ALMA view of N113 and N159W in the LMC
Authors:
X. D. Tang,
C. Henkel,
K. M. Menten,
Y. Gong,
C. -H. R. Chen,
D. L. Li,
M. -Y. Lee,
J. G. Mangum,
Y. P. Ao,
S. Mühle,
S. Aalto,
S. García-Burillo,
S. Martín,
S. Viti,
S. Muller,
F. Costagliola,
H. Asiri,
S. A. Levshakov,
M. Spaans,
J. Ott,
C. M. V. Impellizzeri,
Y. Fukui,
Y. X. He,
J. Esimbek,
J. J. Zhou
, et al. (3 additional authors not shown)
Abstract:
We mapped the kinetic temperature structure of two massive star-forming regions, N113 and N159W, in the Large Magellanic Cloud (LMC). We have used $\sim$1\hbox{$\,.\!\!^{\prime\prime}$}6\,($\sim$0.4\,pc) resolution measurements of the para-H$_2$CO\,$J_{\rm K_ aK_c}$\,=\,3$_{03}$--2$_{02}$, 3$_{22}$--2$_{21}$, and 3$_{21}$--2$_{20}$ transitions near 218.5\,GHz to constrain RADEX non-LTE models of t…
▽ More
We mapped the kinetic temperature structure of two massive star-forming regions, N113 and N159W, in the Large Magellanic Cloud (LMC). We have used $\sim$1\hbox{$\,.\!\!^{\prime\prime}$}6\,($\sim$0.4\,pc) resolution measurements of the para-H$_2$CO\,$J_{\rm K_ aK_c}$\,=\,3$_{03}$--2$_{02}$, 3$_{22}$--2$_{21}$, and 3$_{21}$--2$_{20}$ transitions near 218.5\,GHz to constrain RADEX non-LTE models of the physical conditions. The gas kinetic temperatures derived from the para-H$_2$CO line ratios 3$_{22}$--2$_{21}$/3$_{03}$--2$_{02}$ and 3$_{21}$--2$_{20}$/3$_{03}$--2$_{02}$ range from 28 to 105\,K in N113 and 29 to 68\,K in N159W. Distributions of the dense gas traced by para-H$_2$CO agree with those of the 1.3\,mm dust and \emph{Spitzer}\,8.0\,$μ$m emission, but do not significantly correlate with the H$α$ emission. The high kinetic temperatures ($T_{\rm kin}$\,$\gtrsim$\,50\,K) of the dense gas traced by para-H$_2$CO appear to be correlated with the embedded infrared sources inside the clouds and/or YSOs in the N113 and N159W regions. The lower temperatures ($T_{\rm kin}$\,$<$\,50\,K) are measured at the outskirts of the H$_2$CO-bearing distributions of both N113 and N159W. It seems that the kinetic temperatures of the dense gas traced by para-H$_2$CO are weakly affected by the external sources of the H$α$ emission. The non-thermal velocity dispersions of para-H$_2$CO are well correlated with the gas kinetic temperatures in the N113 region, implying that the higher kinetic temperature traced by para-H$_2$CO is related to turbulence on a $\sim$0.4\,pc scale. The dense gas heating appears to be dominated by internal star formation activity, radiation, and/or turbulence. It seems that the mechanism heating the dense gas of the star-forming regions in the LMC is consistent with that in Galactic massive star-forming regions located in the Galactic plane.
△ Less
Submitted 24 August, 2021;
originally announced August 2021.
-
Policy Gradient Methods for Distortion Risk Measures
Authors:
Nithia Vijayan,
Prashanth L. A
Abstract:
We propose policy gradient algorithms which learn risk-sensitive policies in a reinforcement learning (RL) framework. Our proposed algorithms maximize the distortion risk measure (DRM) of the cumulative reward in an episodic Markov decision process in on-policy and off-policy RL settings, respectively. We derive a variant of the policy gradient theorem that caters to the DRM objective, and integra…
▽ More
We propose policy gradient algorithms which learn risk-sensitive policies in a reinforcement learning (RL) framework. Our proposed algorithms maximize the distortion risk measure (DRM) of the cumulative reward in an episodic Markov decision process in on-policy and off-policy RL settings, respectively. We derive a variant of the policy gradient theorem that caters to the DRM objective, and integrate it with a likelihood ratio-based gradient estimation scheme. We derive non-asymptotic bounds that establish the convergence of our proposed algorithms to an approximate stationary point of the DRM objective.
△ Less
Submitted 4 February, 2024; v1 submitted 9 July, 2021;
originally announced July 2021.
-
Space Photometry with BRITE-Constellation
Authors:
Weiss W. W,
Zwintz K.,
Kuschnig R.,
Handler G.,
Moffat A. F. J.,
Baade D.,
Bowman D. M.,
Granzer T.,
Kallinger T.,
Koudelka O. F.,
Lovekin C. C.,
Neiner C.,
Pablo H.,
Pigulski A.,
Popowicz A.,
Ramiaramanantsoa T.,
Rucinski S. M.,
Strassmeier K. G.,
Wade G. A
Abstract:
BRITE-Constellation is devoted to high-precision optical photometric monitoring of bright stars, distributed all over the Milky Way, in red and/or blue passbands. Photometry from space avoids the turbulent and absorbing terrestrial atmosphere and allows for very long and continuous observing runs with high time resolution and thus provides the data necessary for understanding various processes ins…
▽ More
BRITE-Constellation is devoted to high-precision optical photometric monitoring of bright stars, distributed all over the Milky Way, in red and/or blue passbands. Photometry from space avoids the turbulent and absorbing terrestrial atmosphere and allows for very long and continuous observing runs with high time resolution and thus provides the data necessary for understanding various processes inside stars (e.g., asteroseismology) and in their immediate environment. While the first astronomical observations from space focused on the spectral regions not accessible from the ground it soon became obvious around 1970 that avoiding the turbulent terrestrial atmosphere significantly improved the accuracy of photometry and satellites explicitly dedicated to high-quality photometry were launched. A perfect example is BRITE-Constellation, which is the result of a very successful cooperation between Austria, Canada and Poland. Research highlights for targets distributed nearly over the entire HRD are presented, but focus primarily on massive and hot stars.
△ Less
Submitted 24 June, 2021;
originally announced June 2021.
-
Perfect optical coherence lattices
Authors:
Liang Chunhao,
Liu Xin,
Xu Zhiheng,
Wang Fei,
Ponomarenko Sergey A.,
Cai Yangjian,
Pujuan Ma
Abstract:
We advance and experimentally implement a protocol to generate perfect optical coherence lattices (OCL) that are not modulated by an envelope field. Structuring the amplitude and phase of an input partially coherent beam in a Fourier plane of an imaging system lies at the heart of our protocol. In the proposed approach, the OCL node profile depends solely on the degree of coherence (DOC) of the in…
▽ More
We advance and experimentally implement a protocol to generate perfect optical coherence lattices (OCL) that are not modulated by an envelope field. Structuring the amplitude and phase of an input partially coherent beam in a Fourier plane of an imaging system lies at the heart of our protocol. In the proposed approach, the OCL node profile depends solely on the degree of coherence (DOC) of the input beam such that, in principle, any lattice structure can be attained via proper manipulations in the Fourier plane. Moreover, any genuine partially coherent source can serve as an input to our lattice generating imaging system. Our results are anticipated to find applications to optical field engineering and multi-target probing among others.
△ Less
Submitted 12 June, 2021;
originally announced June 2021.
-
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
Authors:
Nithia Vijayan,
Prashanth L. A
Abstract:
We propose two policy gradient algorithms for solving the problem of control in an off-policy reinforcement learning (RL) context. Both algorithms incorporate a smoothed functional (SF) based gradient estimation scheme. The first algorithm is a straightforward combination of importance sampling-based off-policy evaluation with SF-based gradient estimation. The second algorithm, inspired by the sto…
▽ More
We propose two policy gradient algorithms for solving the problem of control in an off-policy reinforcement learning (RL) context. Both algorithms incorporate a smoothed functional (SF) based gradient estimation scheme. The first algorithm is a straightforward combination of importance sampling-based off-policy evaluation with SF-based gradient estimation. The second algorithm, inspired by the stochastic variance-reduced gradient (SVRG) algorithm, incorporates variance reduction in the update iteration. For both algorithms, we derive non-asymptotic bounds that establish convergence to an approximate stationary point. From these results, we infer that the first algorithm converges at a rate that is comparable to the well-known REINFORCE algorithm in an off-policy RL context, while the second algorithm exhibits an improved rate of convergence.
△ Less
Submitted 23 June, 2024; v1 submitted 6 January, 2021;
originally announced January 2021.
-
NIT COVID-19 at WNUT-2020 Task 2: Deep Learning Model RoBERTa for Identify Informative COVID-19 English Tweets
Authors:
Jagadeesh M S,
Alphonse P J A
Abstract:
This paper presents the model submitted by the NIT_COVID-19 team for identified informative COVID-19 English tweets at WNUT-2020 Task2. This shared task addresses the problem of automatically identifying whether an English tweet related to informative (novel coronavirus) or not. These informative tweets provide information about recovered, confirmed, suspected, and death cases as well as the locat…
▽ More
This paper presents the model submitted by the NIT_COVID-19 team for identified informative COVID-19 English tweets at WNUT-2020 Task2. This shared task addresses the problem of automatically identifying whether an English tweet related to informative (novel coronavirus) or not. These informative tweets provide information about recovered, confirmed, suspected, and death cases as well as the location or travel history of the cases. The proposed approach includes pre-processing techniques and pre-trained RoBERTa with suitable hyperparameters for English coronavirus tweet classification. The performance achieved by the proposed model for shared task WNUT 2020 Task2 is 89.14% in the F1-score metric.
△ Less
Submitted 11 November, 2020;
originally announced November 2020.
-
LCA-Net: Light Convolutional Autoencoder for Image Dehazing
Authors:
Pavan A,
Adithya Bennur,
Mohit Gaggar,
Shylaja S S
Abstract:
Image dehazing is a crucial image pre-processing task aimed at removing the incoherent noise generated by haze to improve the visual appeal of the image. The existing models use sophisticated networks and custom loss functions which are computationally inefficient and requires heavy hardware to run. Time is of the essence in image pre-processing since real time outputs can be obtained instantly. T…
▽ More
Image dehazing is a crucial image pre-processing task aimed at removing the incoherent noise generated by haze to improve the visual appeal of the image. The existing models use sophisticated networks and custom loss functions which are computationally inefficient and requires heavy hardware to run. Time is of the essence in image pre-processing since real time outputs can be obtained instantly. To overcome these problems, our proposed generic model uses a very light convolutional encoder-decoder network which does not depend on any atmospheric models. The network complexity-image quality trade off is handled well in this neural network and the performance of this network is not limited by low-spec systems. This network achieves optimum dehazing performance at a much faster rate, on several standard datasets, comparable to the state-of-the-art methods in terms of image quality.
△ Less
Submitted 24 August, 2020;
originally announced August 2020.
-
2kenize: Tying Subword Sequences for Chinese Script Conversion
Authors:
Pranav A,
Isabelle Augenstein
Abstract:
Simplified Chinese to Traditional Chinese character conversion is a common preprocessing step in Chinese NLP. Despite this, current approaches have poor performance because they do not take into account that a simplified Chinese character can correspond to multiple traditional characters. Here, we propose a model that can disambiguate between mappings and convert between the two scripts. The model…
▽ More
Simplified Chinese to Traditional Chinese character conversion is a common preprocessing step in Chinese NLP. Despite this, current approaches have poor performance because they do not take into account that a simplified Chinese character can correspond to multiple traditional characters. Here, we propose a model that can disambiguate between mappings and convert between the two scripts. The model is based on subword segmentation, two language models, as well as a method for mapping between subword sequences. We further construct benchmark datasets for topic classification and script conversion. Our proposed method outperforms previous Chinese Character conversion approaches by 6 points in accuracy. These results are further confirmed in a downstream application, where 2kenize is used to convert pretraining dataset for topic classification. An error analysis reveals that our method's particular strengths are in dealing with code-mixing and named entities.
△ Less
Submitted 7 May, 2020;
originally announced May 2020.