Search | arXiv e-print repository

Approximate Bayesian inference for high-resolution spatial disaggregation using alternative data sources

Authors: Anis Pakrashi, Arnab Hazra, Sooraj M Raveendran, Krishnachandran Balakrishnan

Abstract: This paper addresses the challenge of obtaining precise demographic information at a fine-grained spatial level, a necessity for planning localized public services such as water distribution networks, or understanding local human impacts on the ecosystem. While population sizes are commonly available for large administrative areas, such as wards in India, practical applications often demand knowle… ▽ More This paper addresses the challenge of obtaining precise demographic information at a fine-grained spatial level, a necessity for planning localized public services such as water distribution networks, or understanding local human impacts on the ecosystem. While population sizes are commonly available for large administrative areas, such as wards in India, practical applications often demand knowledge of population density at smaller spatial scales. We explore the integration of alternative data sources, specifically satellite-derived products, including land cover, land use, street density, building heights, vegetation coverage, and drainage density. Using a case study focused on Bangalore City, India, with a ward-level population dataset for 198 wards and satellite-derived sources covering 786,702 pixels at a resolution of 30mX30m, we propose a semiparametric Bayesian spatial regression model for obtaining pixel-level population estimates. Given the high dimensionality of the problem, exact Bayesian inference is deemed impractical; we discuss an approximate Bayesian inference scheme based on the recently proposed max-and-smooth approach, a combination of Laplace approximation and Markov chain Monte Carlo. A simulation study validates the reasonable performance of our inferential approach. Mapping pixel-level estimates to the ward level demonstrates the effectiveness of our method in capturing the spatial distribution of population sizes. While our case study focuses on a demographic application, the methodology developed here readily applies to count-type spatial datasets from various scientific disciplines, where high-resolution alternative data sources are available. △ Less

Submitted 15 July, 2024; originally announced July 2024.

Comments: 30 pages, 7 figures, 3 tables

arXiv:2404.12669 [pdf]

Comparison of Two-Moment and Three-Moment Bulk Microphysics Schemes in Thunderstorm Simulations over Indian Subcontinent

Authors: Chandrima Mallick, Ushnanshu Dutta, Moumita Bhowmik, Greeshma M. Mohan, Anupam Hazra, S. D. Pawar, Jen-Ping Chen

Abstract: We have performed three-dimensional thunderstorm real simulations using the two-moment and three-moment bulk microphysics schemes in the Weather Research and Forecasting (WRF) model. We have analyzed three cases to understand the potential differences between the double-moment (Morrison-2M) and National Taiwan University triple-moment (NTU-3M) microphysics parameterizations in capturing the charac… ▽ More We have performed three-dimensional thunderstorm real simulations using the two-moment and three-moment bulk microphysics schemes in the Weather Research and Forecasting (WRF) model. We have analyzed three cases to understand the potential differences between the double-moment (Morrison-2M) and National Taiwan University triple-moment (NTU-3M) microphysics parameterizations in capturing the characteristics of lightning events over the Indian subcontinent. Despite general resemblances in these schemes, the simulations reveal distinct differences in storm structure, cloud hydrometeors formation, and precipitation. The lightning flash counts from the in situ lightning detection network (LDN) are also used to compare the simulation of storms. The Lightning Potential Index (LPI) is computed for Morrison-2M and NTU-3M microphysics schemes and compared it with the Lightning Detection Network (LDN) observation. In most cases, the Morrison-2M shows more LPI than the NTU-3M scheme. Both the schemes also differ in simulating rainfall and other thermodynamical, dynamical, and microphysical parameters in the model. Here, we have attempted to identify the basic differences between these two schemes, which may be responsible for the discrepancies in the simulations. In particular, the Morrison-2M produced much higher surface precipitation rates. The effects on the size distributions cloud hydrometeors between two microphysical schemes are important to simulate the biases in the precipitation and lightning flash counts. The inclusions of ice crystal shapes are responsible for many of the key differences between the two microphysics simulations. Different approaches in treating cloud ice, snow, and graupel may have an impact on the simulation of lightning and precipitation. Results show that the simulation of lightning events is sensitive to microphysical parameterization schemes in NWP models. △ Less

Submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.00734 [pdf, other]

Weak decays of $\pmb{B_c}$ involving vector mesons in self-consistent covariant light-front approach

Authors: Thejus Mary S., Avijit Hazra, Neelesh Sharma, Rohit Dhir

Abstract: We present a comprehensive analysis of weak transition form factors, semileptonic decays, and nonleptonic decays of $B_c$ meson involving pseudoscalar ($P$) and vector ($V$) meson for bottom-conserving and bottom-changing decay modes. We employ self-consistent covariant light-front quark model (CLFQM), termed as Type-II correspondence, to calculate the $B_c$ to $P(V)$ transition form factors. The… ▽ More We present a comprehensive analysis of weak transition form factors, semileptonic decays, and nonleptonic decays of $B_c$ meson involving pseudoscalar ($P$) and vector ($V$) meson for bottom-conserving and bottom-changing decay modes. We employ self-consistent covariant light-front quark model (CLFQM), termed as Type-II correspondence, to calculate the $B_c$ to $P(V)$ transition form factors. The Type-II correspondence in the CLF approach gives self-consistent results associated with the $B^{(i)}_j$ functions, which vanish numerically after the replacement $M^{\prime(\prime\prime)} \to M_0^{\prime(\prime\prime)}$ in traditional Type-I correspondence, and the covariance of the matrix elements is also restored. We investigate these effects on bottom conserving $B_c \to P(V)$ form factors that have not yet been studied in CLFQM Type-II correspondence. In addition, we quantify the implications of self-consistency propagating to weak decays involving both bottom-conserving and bottom-changing $B_c$ transition form factors. We use two different parameterizations, the usual three-parameter function of $q^2$ and the model-independent $z$-series expansion, to establish a clear understanding of $q^2$ dependence. Using the numerical values of the form factors, we predict the branching ratios and other physical observables, such as, forward-backward asymmetries, polarization fractions, etc., of the semileptonic $B_c$ decays. We extend our analysis to predict the branching ratios of two-body nonleptonic weak decays using the factorization hypothesis in self-consistent CLFQM. We also compare our results with those of other theoretical studies. △ Less

Submitted 31 March, 2024; originally announced April 2024.

Comments: 57 pages, 11 figures

arXiv:2403.15670 [pdf, other]

Computationally Scalable Bayesian SPDE Modeling for Censored Spatial Responses

Authors: Indranil Sahoo, Suman Majumder, Arnab Hazra, Ana G. Rappold, Dipankar Bandyopadhyay

Abstract: Observations of groundwater pollutants, such as arsenic or Perfluorooctane sulfonate (PFOS), are riddled with left censoring. These measurements have impact on the health and lifestyle of the populace. Left censoring of these spatially correlated observations are usually addressed by applying Gaussian processes (GPs), which have theoretical advantages. However, this comes with a challenging comput… ▽ More Observations of groundwater pollutants, such as arsenic or Perfluorooctane sulfonate (PFOS), are riddled with left censoring. These measurements have impact on the health and lifestyle of the populace. Left censoring of these spatially correlated observations are usually addressed by applying Gaussian processes (GPs), which have theoretical advantages. However, this comes with a challenging computational complexity of $\mathcal{O}(n^3)$, which is impractical for large datasets. Additionally, a sizable proportion of the data being left-censored creates further bottlenecks, since the likelihood computation now involves an intractable high-dimensional integral of the multivariate Gaussian density. In this article, we tackle these two problems simultaneously by approximating the GP with a Gaussian Markov random field (GMRF) approach that exploits an explicit link between a GP with Matérn correlation function and a GMRF using stochastic partial differential equations (SPDEs). We introduce a GMRF-based measurement error into the model, which alleviates the likelihood computation for the censored data, drastically improving the speed of the model while maintaining admirable accuracy. Our approach demonstrates robustness and substantial computational scalability, compared to state-of-the-art methods for censored spatial responses across various simulation settings. Finally, the fit of this fully Bayesian model to the concentration of PFOS in groundwater available at 24,959 sites across California, where 46.62\% responses are censored, produces prediction surface and uncertainty quantification in real time, thereby substantiating the applicability and scalability of the proposed method. Code for implementation is made available via GitHub. △ Less

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2401.09243 [pdf, other]

DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning

Authors: Sabariswaran Mani, Sreyas Venkataraman, Abhranil Chandra, Adyan Rizvi, Yash Sirvi, Soumojit Bhattacharya, Aritra Hazra

Abstract: Robot learning tasks are extremely compute-intensive and hardware-specific. Thus the avenues of tackling these challenges, using a diverse dataset of offline demonstrations that can be used to train robot manipulation agents, is very appealing. The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also be… ▽ More Robot learning tasks are extremely compute-intensive and hardware-specific. Thus the avenues of tackling these challenges, using a diverse dataset of offline demonstrations that can be used to train robot manipulation agents, is very appealing. The Train-Offline-Test-Online (TOTO) Benchmark provides a well-curated open-source dataset for offline training comprised mostly of expert data and also benchmark scores of the common offline-RL and behaviour cloning agents. In this paper, we introduce DiffClone, an offline algorithm of enhanced behaviour cloning agent with diffusion-based policy learning, and measured the efficacy of our method on real online physical robots at test time. This is also our official submission to the Train-Offline-Test-Online (TOTO) Benchmark Challenge organized at NeurIPS 2023. We experimented with both pre-trained visual representation and agent policies. In our experiments, we find that MOCO finetuned ResNet50 performs the best in comparison to other finetuned representations. Goal state conditioning and mapping to transitions resulted in a minute increase in the success rate and mean-reward. As for the agent policy, we developed DiffClone, a behaviour cloning agent improved using conditional diffusion. △ Less

Submitted 23 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

Comments: NeurIPS 2023 Train Offline Test Online Workshop and Competition (Best Paper Oral Presentation / Winning Competition Submission)

arXiv:2401.06346 [pdf]

Towards better representation of Indian summer monsoon rainfall in CMIP6 models: Evaluation of MISO and MJO simulation

Authors: Ushnanshu Dutta, Moumita Bhowmik, Anupam Hazra, Chein-Jung Shiu, Jen-Ping Chen

Abstract: The seasonal prediction of the Indian summer monsoon (ISM) and Monsoon Intraseasonal Oscillations (MISO), as well as the Madden Julian Oscillations (MJO) that strongly modulate MISO, is important to the country for water and crop management. We have analyzed the precipitation, convection, and total cloud fraction (TCF) in the sixth Coupled Model Intercomparison Projects (CMIP6). This study highlig… ▽ More The seasonal prediction of the Indian summer monsoon (ISM) and Monsoon Intraseasonal Oscillations (MISO), as well as the Madden Julian Oscillations (MJO) that strongly modulate MISO, is important to the country for water and crop management. We have analyzed the precipitation, convection, and total cloud fraction (TCF) in the sixth Coupled Model Intercomparison Projects (CMIP6). This study highlights the significant differences in simulating MISO and MJO between the two groups of selected CMIP6 models and physical reasons behind them. The mean and intraseasonal features of MISO and MJO varied significantly in CMIP6 models, which are linked with a better depiction of convection and total cloud fraction. The probability distributions of rainfall and OLR in CMIP6 models indicate significant variations in simulating ISM precipitating clouds. TaiESM1 and IITM-ESM demonstrate improvements in capturing the MISO features. TaiESM1 depicts better eastward propagation of MJO during both summer and winter. The biases in OLR and TCF are also less in the IITM-ESM and TaiESM1 models than in CanESM5 and FGOALS-g3. The results demonstrate the importance of cloud and convection in CMIP6 models to depict realistic MISO and MJO and provide a road map for improving ISM climate prediction and projections. Keywords: ISM rainfall, CMIP6 models, MISO, MJO △ Less

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2312.13517 [pdf, other]

An utopic adventure in the modelling of conditional univariate and multivariate extremes

Authors: Léo R. Belzile, Arnab Hazra, Rishikesh Yadav

Abstract: The EVA 2023 data competition consisted of four challenges, ranging from interval estimation for very high quantiles of univariate extremes conditional on covariates, point estimation of unconditional return levels under a custom loss function, to estimation of the probabilities of tail events for low and high-dimensional multivariate data. We tackle these tasks by revisiting the current and exist… ▽ More The EVA 2023 data competition consisted of four challenges, ranging from interval estimation for very high quantiles of univariate extremes conditional on covariates, point estimation of unconditional return levels under a custom loss function, to estimation of the probabilities of tail events for low and high-dimensional multivariate data. We tackle these tasks by revisiting the current and existing literature on conditional univariate and multivariate extremes. We propose new cross-validation methods for covariate-dependent models, validation metrics for exchangeable multivariate models, formulae for the joint probability of exceedance for multivariate generalized Pareto vectors and a composition sampling algorithm for generating multivariate tail events for the latter. We highlight overarching themes ranging from model validation at extremely high quantile levels to building custom estimation strategies that leverage model assumptions. △ Less

Submitted 20 December, 2023; originally announced December 2023.

Comments: 42 pages, 9 figures, 10 tables

arXiv:2312.11181 [pdf, ps, other]

Anomalous relaxation and hyperuniform fluctuations in center-of-mass conserving systems with broken time-reversal symmetry

Authors: Anirban Mukherjee, Dhiraj Tapader, Animesh Hazra, Punyabrata Pradhan

Abstract: We study a paradigmatic model of absorbing-phase transition - the Oslo model - on a one-dimensional ring of $L$ sites with a fixed global density $\barρ$; notably, microscopic dynamics conserve both mass and \textit{center of mass (CoM), but lacks time-reversal symmetry}. Despite having highly constrained dynamics due to CoM conservation, the system exhibits diffusive relaxation away from critical… ▽ More We study a paradigmatic model of absorbing-phase transition - the Oslo model - on a one-dimensional ring of $L$ sites with a fixed global density $\barρ$; notably, microscopic dynamics conserve both mass and \textit{center of mass (CoM), but lacks time-reversal symmetry}. Despite having highly constrained dynamics due to CoM conservation, the system exhibits diffusive relaxation away from criticality and superdiffusive relaxation near criticality. Furthermore, the CoM conservation severely restricts particle movement, rendering the mobility to vanish exactly. Indeed the temporal growth of current fluctuation is qualitatively different from that observed in diffusive systems with a single conservation law. Away from criticality, steady-state fluctuation $\langle \mathcal{Q}_i^2(T,Δ) \rangle$ of current $\mathcal{Q}_i$ across $i$th bond up to time $T$ \textit{saturates} as $\langle \mathcal{Q}_i^2 \rangle \simeq Σ_Q^2(Δ) - {\rm const.} T^{-1/2}$; near criticality, it grows subdiffusively as $\langle \mathcal{Q}_i^2 \rangle \sim T^α$, with $0 < α< 1/2$, and eventually \textit{saturates} to $Σ_Q^2(Δ)$. The asymptotic current fluctuation $Σ_Q^2(Δ)$ is a \textit{nonmonotonic} function of $Δ$: It diverges as $Σ_Q^2(Δ) \sim Δ^2$ for $Δ\gg ρ_c$ and $Σ_Q^2(Δ) \sim Δ^{-δ}$, with $δ> 0$, for $Δ\to 0^+$. By using a mass-conservation principle, we exactly determine the exponents $δ= 2(1-1/ν_\perp)/ν_\perp$ and $α= δ/z ν_\perp$ via the correlation-length and dynamic exponents, $ν_\perp$ and $z$, respectively. Finally, we show that, in the steady state, the self-diffusion coefficient $\mathcal{D}_s(\barρ)$ of tagged particles is connected to activity by $\mathcal{D}_s(\barρ) = a(\barρ) / \barρ$. △ Less

Submitted 14 February, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

Comments: 27 pages, 14 figures, some small changes in the text and figures

arXiv:2311.17471 [pdf, other]

Distributed AI in Zero-touch Provisioning for Edge Networks: Challenges and Research Directions

Authors: Abhishek Hazra, Andrea Morichetta, Ilir Murturi, Lauri Lovén, Chinmaya Kumar Dehury, Victor Casamayor Pujol, Praveen Kumar Donta, Schahram Dustdar

Abstract: Zero-touch network is anticipated to inaugurate the generation of intelligent and highly flexible resource provisioning strategies where multiple service providers collaboratively offer computation and storage resources. This transformation presents substantial challenges to network administration and service providers regarding sustainability and scalability. This article combines Distributed Art… ▽ More Zero-touch network is anticipated to inaugurate the generation of intelligent and highly flexible resource provisioning strategies where multiple service providers collaboratively offer computation and storage resources. This transformation presents substantial challenges to network administration and service providers regarding sustainability and scalability. This article combines Distributed Artificial Intelligence (DAI) with Zero-touch Provisioning (ZTP) for edge networks. This combination helps to manage network devices seamlessly and intelligently by minimizing human intervention. In addition, several advantages are also highlighted that come with incorporating Distributed AI into ZTP in the context of edge networks. Further, we draw potential research directions to foster novel studies in this field and overcome the current limitations. △ Less

Submitted 29 November, 2023; originally announced November 2023.

arXiv:2309.14705 [pdf, ps, other]

Dynamic fluctuations of current and mass in nonequilibrium mass transport processes

Authors: Animesh Hazra, Anirban Mukherjee, Punyabrata Pradhan

Abstract: We study steady-state dynamic fluctuations of current and mass, as well as the corresponding power spectra, in conserved-mass transport processes on a ring of $L$ sites; these processes violate detailed balance, have nontrivial spatial structures, and their steady states are not described by the Boltzmann-Gibbs distribution. We exactly calculate, for all times $T$, the fluctuations… ▽ More We study steady-state dynamic fluctuations of current and mass, as well as the corresponding power spectra, in conserved-mass transport processes on a ring of $L$ sites; these processes violate detailed balance, have nontrivial spatial structures, and their steady states are not described by the Boltzmann-Gibbs distribution. We exactly calculate, for all times $T$, the fluctuations $\langle \mathcal{Q}_i^2(T) \rangle$ and $\langle \mathcal{Q}_{sub}^2(l, T) \rangle$ of the cumulative currents upto time $T$ across $i$th bond and across a subsystem of size $l$ (summed over bonds in the subsystem), respectively; we also calculate the (two-point) dynamic correlation function for subsystem mass. In particular, we show that, for large $L \gg 1$, the bond-current fluctuation grows linearly for $T \sim {\cal O}(1)$, subdiffusively for $T \ll L^2$ and then again linearly for $T \gg L^2$. The scaled subsystem current fluctuation $\lim_{l \rightarrow \infty, T \rightarrow \infty} \langle \mathcal{Q}^2_{sub}(l, T) \rangle/2lT$ converges to the density-dependent particle mobility $χ$ when the large subsystem size limit is taken first, followed by the large time limit. Remarkably, the scaled current fluctuation $D \langle \mathcal{Q}_i^2(T)\rangle/2 χL \equiv {\cal W}(y)$ as a function of scaled time $y=DT/L^2$ is expressed in terms of a universal scaling function ${\cal W}(y)$, where $D$ is the bulk-diffusion coefficient. Similarly, the power spectra for current and mass time series are characterized by the respective universal scaling functions, which are calculated exactly. We provide a microscopic derivation of equilibrium-like Green-Kubo and Einstein relations, that connect the steady-state current fluctuations to the response to an external force and to mass fluctuation, respectively. △ Less

Submitted 28 February, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

Comments: 24 pages, 13 figures

arXiv:2309.03655 [pdf, other]

$B_c$ to $A$ Transition Form Factors and Semileptonic Decays in Self-consistent Covariant Light-front Approach

Authors: Avijit Hazra, Thejus Mary S., Neelesh Sharma, Rohit Dhir

Abstract: We present a comprehensive analysis of the semileptonic weak decays of $B_c$ meson decaying to axial-vector ($A$) mesons for bottom-conserving and bottom-changing decay modes. We employ self-consistent covariant light-front quark model (CLFQM) that uses type-II correspondence to eliminate inconsistencies in the traditional type-I CLFQM. As a fresh attempt, we test the self-consistency in CLFQM thr… ▽ More We present a comprehensive analysis of the semileptonic weak decays of $B_c$ meson decaying to axial-vector ($A$) mesons for bottom-conserving and bottom-changing decay modes. We employ self-consistent covariant light-front quark model (CLFQM) that uses type-II correspondence to eliminate inconsistencies in the traditional type-I CLFQM. As a fresh attempt, we test the self-consistency in CLFQM through type-II correspondence for $B_c \to A$ meson transition form factors. We establish that in type-II correspondence the form factors for longitudinal and transverse polarization states are numerically equal and are free from zero-mode contributions, which confirms the self-consistency of type-II correspondence for $B_c \to A$ transition form factors. Furthermore, we ascertain that the problems of inconsistency and violation of covariance of CLFQM within the type-I correspondence are resolved in type-II correspondence for $B_c \to A$ transitions. We thoroughly investigate the effects of self-consistency between type-I and type-II schemes using a comparative analysis. We also study the $q^2$ dependence of the form factors in weak hadronic currents for the whole accessible kinematic range $0 \leqslant q^2 \leqslant q^2_{max}$ for both bottom-conserving as well as bottom-changing transitions. In addition, we extend our analysis to predict the branching ratios of the semileptonic weak decays of $B_c$ meson involving axial-vector meson in the final state to quantify the effects of self-consistency in these decays that were not studied before. We evaluate the lepton mass effect on these branching ratios and various other important physical observables, such as forward-backward asymmetries, lepton-side convexity parameter, asymmetry parameter, and longitudinal polarization asymmetries and fractions. Finally, we obtain the lepton flavor universality ratios for various decays. △ Less

Submitted 12 May, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: 68 pages, 16 figures, 8 tables. Revised manuscript

arXiv:2309.03165 [pdf, other]

A Semiparametric Generalized Exponential Regression Model with a Principled Distance-based Prior for Analyzing Trends in Rainfall

Authors: Arijit Dey, Arnab Hazra

Abstract: The Western Ghats mountain range holds critical importance in regulating monsoon rainfall across Southern India, with a profound impact on regional agriculture. Here, we analyze daily wet-day rainfall data for the monsoon months between 1901-2022 for the Northern, Middle, and Southern Western Ghats regions. Motivated by an exploratory data analysis, we introduce a semiparametric Bayesian generaliz… ▽ More The Western Ghats mountain range holds critical importance in regulating monsoon rainfall across Southern India, with a profound impact on regional agriculture. Here, we analyze daily wet-day rainfall data for the monsoon months between 1901-2022 for the Northern, Middle, and Southern Western Ghats regions. Motivated by an exploratory data analysis, we introduce a semiparametric Bayesian generalized exponential (GE) regression model; despite the underlying GE distribution assumption being well-known in the literature, including in the context of rainfall analysis, no research explored it in a regression setting, as of our knowledge. Our proposed approach involves modeling the GE rate parameter within a generalized additive model framework. An important feature is the integration of a principled distance-based prior for the GE shape parameter; this allows the model to shrink to an exponential regression model that retains the advantages of the exponential family. We draw inferences using the Markov chain Monte Carlo algorithm. Extensive simulations demonstrate that the proposed model outperforms simpler alternatives. Applying the model to analyze the rainfall data over 122 years provides insights into model parameters, temporal patterns, and the impact of climate change. We observe a significant decreasing trend in wet-day rainfall for the Southern Western Ghats region. △ Less

Submitted 6 September, 2023; originally announced September 2023.

Comments: 24 pages, 8 figures

MSC Class: 62P12; 62F15; 62G08; 62J12

arXiv:2308.13895 [pdf, other]

Estimating Changepoints in Extremal Dependence, Applied to Aviation Stock Prices During COVID-19 Pandemic

Authors: Arnab Hazra, Shiladitya Bose

Abstract: The dependence in the tails of the joint distribution of two random variables is generally assessed using $χ$-measure, the limiting conditional probability of one variable being extremely high given the other variable is also extremely high. This work is motivated by the structural changes in $χ$-measure between the daily rate of return (RoR) of the two Indian airlines, IndiGo and SpiceJet, during… ▽ More The dependence in the tails of the joint distribution of two random variables is generally assessed using $χ$-measure, the limiting conditional probability of one variable being extremely high given the other variable is also extremely high. This work is motivated by the structural changes in $χ$-measure between the daily rate of return (RoR) of the two Indian airlines, IndiGo and SpiceJet, during the COVID-19 pandemic. We model the daily maximum and minimum RoR vectors (potentially transformed) using the bivariate Hüsler-Reiss (BHR) distribution. To estimate the changepoint in the $χ$-measure of the BHR distribution, we explore two changepoint detection procedures based on the Likelihood Ratio Test (LRT) and Modified Information Criterion (MIC). We obtain critical values and power curves of the LRT and MIC test statistics for low through high values of $χ$-measure. We also explore the consistency of the estimators of the changepoint based on LRT and MIC numerically. In our data application, for RoR maxima and minima, the most prominent changepoints detected by LRT and MIC are close to the announcement of the first phases of lockdown and unlock, respectively, which are realistic; thus, our study would be beneficial for portfolio optimization in the case of future pandemic situations. △ Less

Submitted 14 June, 2024; v1 submitted 26 August, 2023; originally announced August 2023.

Comments: 30 pages, 6 figures, 1 table

MSC Class: 62P05; 60G70

arXiv:2308.05812 [pdf, other]

Exploring the Efficacy of Statistical and Deep Learning Methods for Large Spatial Datasets: A Case Study

Authors: Arnab Hazra, Pratik Nag, Rishikesh Yadav, Ying Sun

Abstract: Increasingly large and complex spatial datasets pose massive inferential challenges due to high computational and storage costs. Our study is motivated by the KAUST Competition on Large Spatial Datasets 2023, which tasked participants with estimating spatial covariance-related parameters and predicting values at testing sites, along with uncertainty estimates. We compared various statistical and d… ▽ More Increasingly large and complex spatial datasets pose massive inferential challenges due to high computational and storage costs. Our study is motivated by the KAUST Competition on Large Spatial Datasets 2023, which tasked participants with estimating spatial covariance-related parameters and predicting values at testing sites, along with uncertainty estimates. We compared various statistical and deep learning approaches through cross-validation and ultimately selected the Vecchia approximation technique for model fitting. To overcome the constraints in the R package GpGp, which lacked support for fitting zero-mean Gaussian processes and direct uncertainty estimation-two things that are necessary for the competition, we developed additional \texttt{R} functions. Besides, we implemented certain subsampling-based approximations and parametric smoothing for skewed sampling distributions of the estimators. Our team DesiBoys secured victory in two out of four sub-competitions, validating the effectiveness of our proposed strategies. Moreover, we extended our evaluation to a large real spatial satellite-derived dataset on total precipitable water, where we compared the predictive performances of different models using multiple diagnostics. △ Less

Submitted 10 August, 2023; originally announced August 2023.

Comments: 34 pages, 3 figures, 3 tables

arXiv:2308.03870 [pdf, other]

Spatial wildfire risk modeling using mixtures of tree-based multivariate Pareto distributions

Authors: Daniela Cisneros, Arnab Hazra, Raphaël Huser

Abstract: Wildfires pose a severe threat to the ecosystem and economy, and risk assessment is typically based on fire danger indices such as the McArthur Forest Fire Danger Index (FFDI) used in Australia. Studying the joint tail dependence structure of high-resolution spatial FFDI data is thus crucial for estimating current and future extreme wildfire risk. However, existing likelihood-based inference appro… ▽ More Wildfires pose a severe threat to the ecosystem and economy, and risk assessment is typically based on fire danger indices such as the McArthur Forest Fire Danger Index (FFDI) used in Australia. Studying the joint tail dependence structure of high-resolution spatial FFDI data is thus crucial for estimating current and future extreme wildfire risk. However, existing likelihood-based inference approaches are computationally prohibitive in high dimensions due to the need to censor observations in the bulk of the distribution. To address this, we construct models for spatial FFDI extremes by leveraging the sparse conditional independence structure of Hüsler--Reiss-type generalized Pareto processes defined on trees. These models allow for a simplified likelihood function that is computationally efficient. Our framework involves a mixture of tree-based multivariate Pareto distributions with randomly generated tree structures, resulting in a flexible model that can capture nonstationary spatial dependence structures. We fit the model to summer FFDI data from different spatial clusters in Mainland Australia and 14 decadal windows between 1999--2022 to study local spatiotemporal variability with respect to the magnitude and extent of extreme wildfires. Our results demonstrate that our proposed method fits the margins and spatial tail dependence structure adequately, and is helpful to provide extreme wildfire risk measures. △ Less

Submitted 7 August, 2023; originally announced August 2023.

arXiv:2303.00989 [pdf, other]

Role of modified cloud microphysics parameterization in coupled climate model for studying ISM rainfall: small-scale cloud model and climate model work better together

Authors: Moumita Bhowmik, Anupam Hazra, Ankur Srivastava, Dipjyoti Mudiar, Hemantkumar S. Chaudhari, Suryachandra A. Rao, Lian-Ping Wang

Abstract: An unresolved problem of present generation coupled climate models is the realistic distribution of rainfall over Indian monsoon region, which is also related to the persistent dry bias over Indian land mass. Therefore, quantitative prediction of the intensity of rainfall events has remained a challenge for the state-of-the-art global coupled models. Guided by the observation, it is hypothesized t… ▽ More An unresolved problem of present generation coupled climate models is the realistic distribution of rainfall over Indian monsoon region, which is also related to the persistent dry bias over Indian land mass. Therefore, quantitative prediction of the intensity of rainfall events has remained a challenge for the state-of-the-art global coupled models. Guided by the observation, it is hypothesized that insufficient growth of cloud droplets and processes responsible for the cloud to rain water conversion are key components to distinguish between shallow to convective clouds. The new diffusional growth rates and relative dispersion based autoconversion from the Eulerian-Lagrangian particleby-particle based small-scale model provide a pathway to revisit the parameterizations in climate models for monsoon clouds. The realistic information of cloud drop size distribution is incorporated in the microphysical parameterization scheme of climate model. Two sensitivity simulations are conducted using coupled forecast system (CFSv2) model. When our physically based small-scale derived modified parameterization is used, a coupled climate model simulates the probability distribution (PDF) of rainfall and accompanying specific humidity, liquid water content, and outgoing long-wave radiation (OLR) with increasing accuracy. The improved simulation of rainfall PDF appears to have been aided by much improved simulation of OLR and resulted better simulation of the ISM rainfall. △ Less

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2303.00987 [pdf, other]

Eulerian-Lagrangian particle-based model for diffusional growth for the better parameterization of ISM clouds: A road map for improving climate model through small-scale model using observations

Authors: Moumita Bhowmik, Anupam Hazra, Suryachandra A. Rao, Lian-Ping Wang

Abstract: The quantitative prediction of the intensity of rainfall events (light or heavy) has remained a challenge in Numerical Weather Prediction (NWP) models. For the first time the mean coefficient of diffusional growth rates are calculated using an Eulerian-Lagrangian particle-based small-scale model on in situ airborne measurement data of Cloud Aerosol Interaction and Precipitation Enhancement Experim… ▽ More The quantitative prediction of the intensity of rainfall events (light or heavy) has remained a challenge in Numerical Weather Prediction (NWP) models. For the first time the mean coefficient of diffusional growth rates are calculated using an Eulerian-Lagrangian particle-based small-scale model on in situ airborne measurement data of Cloud Aerosol Interaction and Precipitation Enhancement Experiment (CAIPEEX) during monsoon over Indian sub-continent. The results show that diffusional growth rates varies in the range of 0.00025 - 0.0015(cm/s). The generic problem of the overestimation of light rain in NWP models might be related with the choice of cm in the model. It is also shown from DNS experiment using Eulerian-Lagrangian particle-based small-scale model that the relative dispersion is constrained with average values in the range of ~ 0.2 - 0.37 (~ 0.1- 0.26) in less humid (more humid) conditions. This is in agreement with in situ airborne observation (dispersion ~ 0.36) and previous study over Indian sub-continent. The linear relationship between relative dispersion and cloud droplet number concentration (NC) is obtained from this study using CAIPEEX observation over Indian subcontinent. The dispersion based autoconversion-scheme for Indian region must be useful for the Indian summer monsoon precipitation calculation in the general circulation model. The present study also provide valuable guidance for the parameterization of effective radius, important for radiation scheme. △ Less

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2302.04914 [pdf, other]

doi 10.1039/D4DD00016A

Flexible, Model-Agnostic Method for Materials Data Extraction from Text Using General Purpose Language Models

Authors: Maciej P. Polak, Shrey Modi, Anna Latosinska, Jinming Zhang, Ching-Wen Wang, Shaonan Wang, Ayan Deep Hazra, Dane Morgan

Abstract: Accurate and comprehensive material databases extracted from research papers are crucial for materials science and engineering, but their development requires significant human effort. With large language models (LLMs) transforming the way humans interact with text, LLMs provide an opportunity to revolutionize data extraction. In this study, we demonstrate a simple and efficient method for extract… ▽ More Accurate and comprehensive material databases extracted from research papers are crucial for materials science and engineering, but their development requires significant human effort. With large language models (LLMs) transforming the way humans interact with text, LLMs provide an opportunity to revolutionize data extraction. In this study, we demonstrate a simple and efficient method for extracting materials data from full-text research papers leveraging the capabilities of LLMs combined with human supervision. This approach is particularly suitable for mid-sized databases and requires minimal to no coding or prior knowledge about the extracted property. It offers high recall and nearly perfect precision in the resulting database. The method is easily adaptable to new and superior language models, ensuring continued utility. We show this by evaluating and comparing its performance on GPT-3 and GPT-3.5/4 (which underlie ChatGPT), as well as free alternatives such as BART and DeBERTaV3. We provide a detailed analysis of the method's performance in extracting sentences containing bulk modulus data, achieving up to 90% precision at 96% recall, depending on the amount of human effort involved. We further demonstrate the method's broader effectiveness by developing a database of critical cooling rates for metallic glasses over twice the size of previous human curated databases. △ Less

Submitted 12 June, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

Comments: 13 pages, 4 figures

Journal ref: Digital Discovery, 2024, 3, 1221-1235

arXiv:2211.16418 [pdf, ps, other]

doi 10.1103/PhysRevD.106.113007

Screening of quark charge and mixing effects on transition moments and M1 decay widths of baryons

Authors: Binesh Mohan, Thejus Mary S., Avijit Hazra, Rohit Dhir

Abstract: Motivated by the precision measurements of heavy flavor baryon masses, we analyze the modification of quark charge by employing the screening effect inside the baryon. In addition, we calculate the isospin mass splitting up to charmed baryons employing isospin symmetry breaking. Consequently, we obtain the masses, magnetic moments, and transition moments of $J^P=\frac{1}{2}^+$ and $\frac{3}{2}^+$… ▽ More Motivated by the precision measurements of heavy flavor baryon masses, we analyze the modification of quark charge by employing the screening effect inside the baryon. In addition, we calculate the isospin mass splitting up to charmed baryons employing isospin symmetry breaking. Consequently, we obtain the masses, magnetic moments, and transition moments of $J^P=\frac{1}{2}^+$ and $\frac{3}{2}^+$ baryons to predict radiative decay widths for $\frac{1}{2}^{\prime +} \to \frac{1}{2}^+$ and $\frac{3}{2}^+\to \frac{1}{2}^{(\prime)+}$ transitions. Finally, we include the effects of state mixing in flavor degenerate baryon magnetic and transition moments, as well as M1 transition decay widths. △ Less

Submitted 29 November, 2022; originally announced November 2022.

Comments: 43 pages, to appear in Phys. Rev. D

arXiv:2211.09016 [pdf, other]

Multidimensional Generalized Riemann Problem Solver for Maxwell's Equations

Authors: Arijit Hazra, Dinshaw S. Balsara, Praveen Chandrashekar, Sudip K. Garain

Abstract: Approximate multidimensional Riemann solvers are essential building blocks in designing globally constraint-preserving finite volume time domain (FVTD) and discontinuous Galerkin time domain (DGTD) schemes for computational electrodynamics (CED). In those schemes, we can achieve high-order temporal accuracy with the help of Runge-Kutta or ADER time-stepping. This paper presents the design of a mul… ▽ More Approximate multidimensional Riemann solvers are essential building blocks in designing globally constraint-preserving finite volume time domain (FVTD) and discontinuous Galerkin time domain (DGTD) schemes for computational electrodynamics (CED). In those schemes, we can achieve high-order temporal accuracy with the help of Runge-Kutta or ADER time-stepping. This paper presents the design of a multidimensional approximate Generalized Riemann Problem (GRP) solver for the first time. The multidimensional Riemann solver accepts as its inputs the four states surrounding an edge on a structured mesh, and its output consists of a resolved state and its associated fluxes. In contrast, the multidimensional GRP solver accepts as its inputs the four states and their gradients in all directions; its output consists of the resolved state and its corresponding fluxes and the gradients of the resolved state. The gradients can then be used to extend the solution in time. As a result, we achieve second-order temporal accuracy in a single step. In this work, the formulation is optimized for linear hyperbolic systems with stiff, linear source terms because such a formulation will find maximal use in CED. Our formulation produces an overall constraint-preserving time-stepping strategy based on the GRP that is provably L-stable in the presence of stiff source terms. We present several stringent test problems, showing that the multidimensional GRP solver for CED meets its design accuracy and performs stably with optimal time steps. The test problems include cases with high conductivity, showing that the beneficial L-stability is indeed realized in practical applications. △ Less

Submitted 30 April, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

Comments: 38 Pages, 10 figures

MSC Class: 78M12; 65M08; 35L65

arXiv:2210.05792 [pdf, other]

Flexible Modeling of Nonstationary Extremal Dependence using Spatially-Fused LASSO and Ridge Penalties

Authors: Xuanjie Shao, Arnab Hazra, Jordan Richards, Raphaël Huser

Abstract: Statistical modeling of a nonstationary spatial extremal dependence structure is challenging. Max-stable processes are common choices for modeling spatially-indexed block maxima, where an assumption of stationarity is usual to make inference feasible. However, this assumption is often unrealistic for data observed over a large or complex domain. We propose a computationally-efficient method for es… ▽ More Statistical modeling of a nonstationary spatial extremal dependence structure is challenging. Max-stable processes are common choices for modeling spatially-indexed block maxima, where an assumption of stationarity is usual to make inference feasible. However, this assumption is often unrealistic for data observed over a large or complex domain. We propose a computationally-efficient method for estimating extremal dependence using a globally nonstationary, but locally-stationary, max-stable process by exploiting nonstationary kernel convolutions. We divide the spatial domain into a fine grid of subregions, assign each of them its own dependence parameters, and use LASSO ($L_1$) or ridge ($L_2$) penalties to obtain spatially-smooth parameter estimates. We then develop a novel data-driven algorithm to merge homogeneous neighboring subregions. The algorithm facilitates model parsimony and interpretability. To make our model suitable for high-dimensional data, we exploit a pairwise likelihood to draw inferences and discuss computational and statistical efficiency. An extensive simulation study demonstrates the superior performance of our proposed model and the subregion-merging algorithm over the approaches that either do not model nonstationarity or do not update the domain partition. We apply our proposed method to model monthly maximum temperatures at over 1400 sites in Nepal and the surrounding Himalayan and sub-Himalayan regions; we again observe significant improvements in model fit compared to a stationary process and a nonstationary process without subregion-merging. Furthermore, we demonstrate that the estimated merged partition is interpretable from a geographic perspective and leads to better model diagnostics by adequately reducing the number of subregion-specific parameters. △ Less

Submitted 30 April, 2024; v1 submitted 11 October, 2022; originally announced October 2022.

arXiv:2208.13838 [pdf, other]

Towards Adversarial Purification using Denoising AutoEncoders

Authors: Dvij Kalaria, Aritra Hazra, Partha Pratim Chakrabarti

Abstract: With the rapid advancement and increased use of deep learning models in image identification, security becomes a major concern to their deployment in safety-critical systems. Since the accuracy and robustness of deep learning models are primarily attributed from the purity of the training samples, therefore the deep learning architectures are often susceptible to adversarial attacks. Adversarial a… ▽ More With the rapid advancement and increased use of deep learning models in image identification, security becomes a major concern to their deployment in safety-critical systems. Since the accuracy and robustness of deep learning models are primarily attributed from the purity of the training samples, therefore the deep learning architectures are often susceptible to adversarial attacks. Adversarial attacks are often obtained by making subtle perturbations to normal images, which are mostly imperceptible to humans, but can seriously confuse the state-of-the-art machine learning models. We propose a framework, named APuDAE, leveraging Denoising AutoEncoders (DAEs) to purify these samples by using them in an adaptive way and thus improve the classification accuracy of the target classifier networks that have been attacked. We also show how using DAEs adaptively instead of using them directly, improves classification accuracy further and is more robust to the possibility of designing adaptive attacks to fool them. We demonstrate our results over MNIST, CIFAR-10, ImageNet dataset and show how our framework (APuDAE) provides comparable and in most cases better performance to the baseline methods in purifying adversaries. We also design adaptive attack specifically designed to attack our purifying model and demonstrate how our defense is robust to that. △ Less

Submitted 29 August, 2022; originally announced August 2022.

Comments: Submitted to AAAI 2023

arXiv:2206.08216 [pdf, other]

Minimum Density Power Divergence Estimation for the Generalized Exponential Distribution

Authors: Arnab Hazra

Abstract: Statistical modeling of rainfall data is an active research area in agro-meteorology. The most common models fitted to such datasets are exponential, gamma, log-normal, and Weibull distributions. As an alternative to some of these models, the generalized exponential (GE) distribution was proposed by Gupta and Kundu (2001, Exponentiated Exponential Family: An Alternative to Gamma and Weibull Distri… ▽ More Statistical modeling of rainfall data is an active research area in agro-meteorology. The most common models fitted to such datasets are exponential, gamma, log-normal, and Weibull distributions. As an alternative to some of these models, the generalized exponential (GE) distribution was proposed by Gupta and Kundu (2001, Exponentiated Exponential Family: An Alternative to Gamma and Weibull Distributions, Biometrical Journal). Rainfall (specifically for short periods) datasets often include outliers, and thus, a proper robust parameter estimation procedure is necessary. Here, we use the popular minimum density power divergence estimation (MDPDE) procedure developed by Basu et al. (1998, Robust and Efficient Estimation by Minimising a Density Power Divergence, Biometrika) for estimating the GE parameters. We derive the analytical expressions for the estimating equations and asymptotic distributions. We analytically compare MDPDE with maximum likelihood estimation in terms of robustness, through an influence function analysis. Besides, we study the asymptotic relative efficiency of MDPDE analytically for different parameter settings. We apply the proposed technique to some simulated datasets and two rainfall datasets from Texas, United States. The results indicate superior performance of MDPDE compared to the other existing estimation techniques in most of the scenarios. △ Less

Submitted 7 February, 2024; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: 23 pages, 7 figures

arXiv:2205.14586 [pdf, ps, other]

Formal Methods for Characterization and Analysis of Quality Specifications in Component-based Systems

Authors: Aritra Hazra

Abstract: Component-based design paradigm is of paramount importance due to prolific growth in the complexity of modern-day systems. Since the components are developed primarily by multi-party vendors and often assembled to realize the overall system, it is an onus of the designer to certify both the functional and non-functional requirements of such systems. Several of the earlier works concentrated on for… ▽ More Component-based design paradigm is of paramount importance due to prolific growth in the complexity of modern-day systems. Since the components are developed primarily by multi-party vendors and often assembled to realize the overall system, it is an onus of the designer to certify both the functional and non-functional requirements of such systems. Several of the earlier works concentrated on formally analyzing the behavioral correctness, safety, security, reliability and robustness of such compositional systems. However, the assurance for quality measures of such systems is also considered as an important parameter for their acceptance. Formalization of quality measures is still at an immature state and often dictated by the user satisfaction. This paper presents a novel compositional framework for reliable quality analysis of component-based systems from the formal quality specifications of its constituent components. The proposed framework enables elegant and generic computation methods for quality attributes of various component-based system structures. In addition to this, we provide a formal query-driven quality assessment and design exploration framework which enables the designer to explore various component structures and operating setups and finally converge into better acceptable systems. A detailed case-study is presented over a component-based system structure to show the efficacy and practicality of our proposed framework. △ Less

Submitted 29 May, 2022; originally announced May 2022.

Comments: 27 pages

arXiv:2202.10449 [pdf, other]

Optimal Multi-Agent Path Finding for Precedence Constrained Planning Tasks

Authors: Kushal Kedia, Rajat Kumar Jenamani, Aritra Hazra, Partha Pratim Chakrabarti

Abstract: Multi-Agent Path Finding (MAPF) is the problem of finding collision-free paths for multiple agents from their start locations to end locations. We consider an extension to this problem, Precedence Constrained Multi-Agent Path Finding (PC-MAPF), wherein agents are assigned a sequence of planning tasks that contain precedence constraints between them. PC-MAPF has various applications, for example in… ▽ More Multi-Agent Path Finding (MAPF) is the problem of finding collision-free paths for multiple agents from their start locations to end locations. We consider an extension to this problem, Precedence Constrained Multi-Agent Path Finding (PC-MAPF), wherein agents are assigned a sequence of planning tasks that contain precedence constraints between them. PC-MAPF has various applications, for example in multi-agent pickup and delivery problems where some objects might require multiple agents to collaboratively pickup and move them in unison. Precedence constraints also arise in warehouse assembly problems where before a manufacturing task can begin, its input resources must be manufactured and delivered. We propose a novel algorithm, Precedence Constrained Conflict Based Search (PC-CBS), which finds makespan-optimal solutions for this class of problems. PC-CBS utilizes a Precedence-Constrained Task-Graph to define valid intervals for each planning task and updates them when precedence conflicts are encountered. We benchmark the performance of this algorithm over various warehouse assembly, and multi-agent pickup and delivery tasks, and use it to evaluate the sub-optimality of a recently proposed efficient baseline. △ Less

Submitted 8 February, 2022; originally announced February 2022.

arXiv:2112.14920 [pdf, other]

doi 10.1007/s10687-022-00460-8

A combined statistical and machine learning approach for spatial prediction of extreme wildfire frequencies and sizes

Authors: Daniela Cisneros, Yan Gong, Rishikesh Yadav, Arnab Hazra, Raphael Huser

Abstract: Motivated by the Extreme Value Analysis 2021 (EVA 2021) data challenge we propose a method based on statistics and machine learning for the spatial prediction of extreme wildfire frequencies and sizes. This method is tailored to handle large datasets, including missing observations. Our approach relies on a four-stage high-dimensional bivariate sparse spatial model for zero-inflated data, which is… ▽ More Motivated by the Extreme Value Analysis 2021 (EVA 2021) data challenge we propose a method based on statistics and machine learning for the spatial prediction of extreme wildfire frequencies and sizes. This method is tailored to handle large datasets, including missing observations. Our approach relies on a four-stage high-dimensional bivariate sparse spatial model for zero-inflated data, which is developed using stochastic partial differential equations(SPDE). In Stage 1, the observations are categorized in zero/nonzero categories and are modeled using a two-layered hierarchical Bayesian sparse spatial model to estimate the probabilities of these two categories. In Stage 2, before modeling the positive observations using spatially-varying coefficients, smoothed parameter surfaces are obtained from empirical estimates using fixed rank kriging. This approximate Bayesian method inference was employed to avoid the high computational burden of large spatial data modeling using spatially-varying coefficients. In Stage 3, the standardized log-transformed positive observations from the second stage are further modeled using a sparse bivariate spatial Gaussian process. The Gaussian distribution assumption for wildfire counts developed in the third stage is computationally effective but erroneous. Thus in Stage 4, the predicted values are rectified using Random Forests. The posterior inference is drawn for Stages 1 and 3 using Markov chain Monte Carlo (MCMC) sampling. A cross-validation scheme is then created for the artificially generated gaps, and the EVA 2021 prediction scores of the proposed model are compared to those obtained using certain natural competitors. △ Less

Submitted 29 December, 2021; originally announced December 2021.

Comments: 49 pages, 11 figures

Journal ref: 2023

arXiv:2112.10248 [pdf, other]

Efficient Modeling of Spatial Extremes over Large Geographical Domains

Authors: Arnab Hazra, Raphaël Huser, David Bolin

Abstract: Various natural phenomena exhibit spatial extremal dependence at short spatial distances. However, existing models proposed in the spatial extremes literature often assume that extremal dependence persists across the entire domain. This is a strong limitation when modeling extremes over large geographical domains, and yet it has been mostly overlooked in the literature. We here develop a more real… ▽ More Various natural phenomena exhibit spatial extremal dependence at short spatial distances. However, existing models proposed in the spatial extremes literature often assume that extremal dependence persists across the entire domain. This is a strong limitation when modeling extremes over large geographical domains, and yet it has been mostly overlooked in the literature. We here develop a more realistic Bayesian framework based on a novel Gaussian scale mixture model, with the Gaussian process component defined by a stochastic partial differential equation yielding a sparse precision matrix, and the random scale component modeled as a low-rank Pareto-tailed or Weibull-tailed spatial process determined by compactly-supported basis functions. We show that our proposed model is approximately tail-stationary and that it can capture a wide range of extremal dependence structures. Its inherently sparse structure allows fast Bayesian computations in high spatial dimensions based on a customized Markov chain Monte Carlo algorithm prioritizing calibration in the tail. We fit our model to analyze heavy monsoon rainfall data in Bangladesh. Our study shows that our model outperforms natural competitors and that it fits precipitation extremes well. We finally use the fitted model to draw inference on long-term return levels for marginal precipitation and spatial aggregates. △ Less

Submitted 30 April, 2024; v1 submitted 19 December, 2021; originally announced December 2021.

arXiv:2111.15518 [pdf, other]

Detecting Adversaries, yet Faltering to Noise? Leveraging Conditional Variational AutoEncoders for Adversary Detection in the Presence of Noisy Images

Authors: Dvij Kalaria, Aritra Hazra, Partha Pratim Chakrabarti

Abstract: With the rapid advancement and increased use of deep learning models in image identification, security becomes a major concern to their deployment in safety-critical systems. Since the accuracy and robustness of deep learning models are primarily attributed from the purity of the training samples, therefore the deep learning architectures are often susceptible to adversarial attacks. Adversarial a… ▽ More With the rapid advancement and increased use of deep learning models in image identification, security becomes a major concern to their deployment in safety-critical systems. Since the accuracy and robustness of deep learning models are primarily attributed from the purity of the training samples, therefore the deep learning architectures are often susceptible to adversarial attacks. Adversarial attacks are often obtained by making subtle perturbations to normal images, which are mostly imperceptible to humans, but can seriously confuse the state-of-the-art machine learning models. What is so special in the slightest intelligent perturbations or noise additions over normal images that it leads to catastrophic classifications by the deep neural networks? Using statistical hypothesis testing, we find that Conditional Variational AutoEncoders (CVAE) are surprisingly good at detecting imperceptible image perturbations. In this paper, we show how CVAEs can be effectively used to detect adversarial attacks on image classification networks. We demonstrate our results over MNIST, CIFAR-10 dataset and show how our method gives comparable performance to the state-of-the-art methods in detecting adversaries while not getting confused with noisy images, where most of the existing methods falter. △ Less

Submitted 9 December, 2021; v1 submitted 28 November, 2021; originally announced November 2021.

Comments: Accepted at Adversarial Machine Learning (AdvML) workshop, AAAI 2022

arXiv:2110.03956 [pdf]

doi 10.1029/2021GL096489

Seasonal Predictability of Lightning over the Global Hotspot Regions

Authors: Chandrima Mallick, Anupam Hazra, Subodh K. Saha, Hemantkumar S. Chaudhari, Samir Pokhrel, Mahen Konwar, Ushnanshu Dutta, Greeshma M. Mohan, K. Gayatri Vani

Abstract: Skillful seasonal prediction of lightning is crucial over several global hotspot regions, as it causes severe damages to infrastructures and losses of human life. While major emphasis has been given for predicting rainfall, prediction of lightning in one season advance remained uncommon, owing to the nature of the problem, which is short-lived local phenomenon. Here we show that on the seasonal ti… ▽ More Skillful seasonal prediction of lightning is crucial over several global hotspot regions, as it causes severe damages to infrastructures and losses of human life. While major emphasis has been given for predicting rainfall, prediction of lightning in one season advance remained uncommon, owing to the nature of the problem, which is short-lived local phenomenon. Here we show that on the seasonal time scale, lightning over the major global hot-spot regions is strongly tied with slowly varying global predictors (e.g., El Nino and Southern Oscillation). Moreover, the sub-seasonal variance of lightning is highly correlated with global predictors, suggesting a seminal role played by the global climate mode in shaping the local land-atmosphere interactions, which eventually affects seasonal lightning variability. It is shown that the seasonal predictability of lightning over the hotspot is comparable to that of seasonal rainfall, which opens up an avenue for reliable seasonal forecasting of lightning for special awareness and preventive measures. Keywords: Lightning, Seasonal forecasting, SST, Global predictors △ Less

Submitted 8 October, 2021; originally announced October 2021.

arXiv:2110.02680 [pdf, other]

Latent Gaussian Models for High-Dimensional Spatial Extremes

Authors: Arnab Hazra, Raphaël Huser, Árni V. Jóhannesson

Abstract: In this chapter, we show how to efficiently model high-dimensional extreme peaks-over-threshold events over space in complex non-stationary settings, using extended latent Gaussian Models (LGMs), and how to exploit the fitted model in practice for the computation of long-term return levels. The extended LGM framework assumes that the data follow a specific parametric distribution, whose unknown pa… ▽ More In this chapter, we show how to efficiently model high-dimensional extreme peaks-over-threshold events over space in complex non-stationary settings, using extended latent Gaussian Models (LGMs), and how to exploit the fitted model in practice for the computation of long-term return levels. The extended LGM framework assumes that the data follow a specific parametric distribution, whose unknown parameters are transformed using a multivariate link function and are then further modeled at the latent level in terms of fixed and random effects that have a joint Gaussian distribution. In the extremal context, we here assume that the data level distribution is described in terms of a Poisson point process likelihood, motivated by asymptotic extreme-value theory, and which conveniently exploits information from all threshold exceedances. This contrasts with the more common data-wasteful approach based on block maxima, which are typically modeled with the generalized extreme-value (GEV) distribution. When conditional independence can be assumed at the data level and latent random effects have a sparse probabilistic structure, fast approximate Bayesian inference becomes possible in very high dimensions, and we here present the recently proposed inference approach called "Max-and-Smooth", which provides exceptional speed-up compared to alternative methods. The proposed methodology is illustrated by application to satellite-derived precipitation data over Saudi Arabia, obtained from the Tropical Rainfall Measuring Mission, with 2738 grid cells and about 20 million spatio-temporal observations in total. Our fitted model captures the spatial variability of extreme precipitation satisfactorily and our results show that the most intense precipitation events are expected near the south-western part of Saudi Arabia, along the Red Sea coastline. △ Less

Submitted 6 October, 2021; originally announced October 2021.

Comments: This paper (after peer-review) will be a book chapter of the forthcoming book entitled "Statistical modeling using latent Gaussian models - with applications in geophysics and environmental sciences", expected to be published by Springer in 2022

arXiv:2109.07122 [pdf]

doi 10.1016/j.gloplacha.2022.103873

Unraveling the Global Teleconnections of Indian Summer Monsoon Clouds: Expedition from CMIP5 to CMIP6

Authors: Ushnanshu Dutta, Anupam Hazra, Hemantkumar S. Chaudhari, Subodh Kumar Saha, Samir Pokhrel, Utkarsh Verma

Abstract: We have analyzed the teleconnection of total cloud fraction (TCF) with global sea surface temperature (SST) in multi-model ensembles (MME) of the fifth and sixth Coupled Model Intercomparison Projects (CMIP5 and CMIP6). CMIP6-MME has a more robust and realistic teleconnection (TCF and global SST) pattern over the extra-tropics (R ~0.43) and North Atlantic (R ~0.39) region, which in turn resulted i… ▽ More We have analyzed the teleconnection of total cloud fraction (TCF) with global sea surface temperature (SST) in multi-model ensembles (MME) of the fifth and sixth Coupled Model Intercomparison Projects (CMIP5 and CMIP6). CMIP6-MME has a more robust and realistic teleconnection (TCF and global SST) pattern over the extra-tropics (R ~0.43) and North Atlantic (R ~0.39) region, which in turn resulted in improvement of rainfall bias over the Asian summer monsoon (ASM) region. CMIP6-MME can better reproduce the mean TCF and have reduced dry (wet) rainfall bias on land (ocean) over the ASM region. CMIP6-MME has improved the biases of seasonal mean rainfall, TCF, and outgoing longwave radiation (OLR) over the Indian Summer Monsoon (ISM) region by ~40%, ~45%, and ~31%, respectively, than CMIP5-MME and demonstrates better spatial correlation with observation/reanalysis. Results establish the credibility of the CMIP6 models and provide a scientific basis for improving the seasonal prediction of ISM. △ Less

Submitted 20 September, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

Comments: 12 pages, 4 main figures, 2 supplementary figures

arXiv:2108.01840 [pdf, ps, other]

doi 10.1103/PhysRevD.104.053002

Radiative M1 transitions of heavy baryons: Effective Quark Mass Scheme

Authors: Avijit Hazra, Saheli Rakshit, Rohit Dhir

Abstract: We calculate the magnetic moments of ground state $J^P=\frac{1}{2}^+$ and $J^P=\frac{3}{2}^+$ heavy flavor charm and bottom baryon states employing the concept of effective mass based on single gluon exchange interaction coupling to the spectator quarks in the non-relativistic quark model. We exploit the current experimental information in the heavy flavor sector to estimate the interaction contri… ▽ More We calculate the magnetic moments of ground state $J^P=\frac{1}{2}^+$ and $J^P=\frac{3}{2}^+$ heavy flavor charm and bottom baryon states employing the concept of effective mass based on single gluon exchange interaction coupling to the spectator quarks in the non-relativistic quark model. We exploit the current experimental information in the heavy flavor sector to estimate the interaction contributions to get the effective masses of the quarks inside the baryons. We study the spin $\frac{1}{2}^{'+} \rightarrow \frac{1}{2}^+$, $\frac{3}{2}^+ \rightarrow \frac{1}{2}^+$, and $\frac{3}{2}^+ \rightarrow \frac{1}{2}^{'+}$ transition moments for these baryons. We make robust predictions of the radiative M1 decay widths of singly, doubly, and triply heavy flavored baryons. △ Less

Submitted 4 August, 2021; originally announced August 2021.

Comments: 32 pages, 15 Tables

Journal ref: Phys. Rev. D 104, 053002 (2021)

arXiv:2106.15730 [pdf, other]

Contamination mapping in Bangladesh using a multivariate spatial Bayesian model for left-censored data

Authors: Indranil Sahoo, Arnab Hazra

Abstract: Arsenic (As) and other toxic elements contamination of groundwater in Bangladesh poses a major threat to millions of people on a daily basis. Understanding complex relationships between arsenic and other elements can provide useful insights for mitigating arsenic poisoning in drinking water and requires multivariate modeling of the elements. However, environmental monitoring of such contaminants o… ▽ More Arsenic (As) and other toxic elements contamination of groundwater in Bangladesh poses a major threat to millions of people on a daily basis. Understanding complex relationships between arsenic and other elements can provide useful insights for mitigating arsenic poisoning in drinking water and requires multivariate modeling of the elements. However, environmental monitoring of such contaminants often involves a substantial proportion of left-censored observations falling below a minimum detection limit (MDL). This problem motivates us to propose a multivariate spatial Bayesian model for left-censored data for investigating the abundance of arsenic in Bangladesh groundwater and for creating spatial maps of the contaminants. Inference about the model parameters is drawn using an adaptive Markov Chain Monte Carlo (MCMC) sampling. The computation time for the proposed model is of the same order as a multivariate Gaussian process model that does not impute the censored values. The proposed method is applied to the arsenic contamination dataset made available by the Bangladesh Water Development Board (BWDB). Spatial maps of arsenic, barium (Ba), and calcium (Ca) concentrations in groundwater are prepared using the posterior predictive means calculated on a fine lattice over Bangladesh. Our results indicate that Chittagong and Dhaka divisions suffer from excessive concentrations of arsenic and only the divisions of Rajshahi and Rangpur have safe drinking water based on recommendations by the World Health Organization (WHO). △ Less

Submitted 25 November, 2021; v1 submitted 29 June, 2021; originally announced June 2021.

arXiv:2104.14785 [pdf, other]

Methodology for Biasing Random Simulation for Rapid Coverage of Corner Cases in AMS Designs

Authors: Sayandeep Sanyal, Ayan Chakraborty, Pallab Dasgupta, Aritra Hazra

Abstract: Exploring the limits of an Analog and Mixed Signal (AMS) circuit by driving appropriate inputs has been a serious challenge to the industry. Doing an exhaustive search of the entire input state space is a time-consuming exercise and the returns to efforts ratio is quite low. In order to meet time-to-market requirements, often suboptimal coverage results of an integrated circuit (IC) are leveraged.… ▽ More Exploring the limits of an Analog and Mixed Signal (AMS) circuit by driving appropriate inputs has been a serious challenge to the industry. Doing an exhaustive search of the entire input state space is a time-consuming exercise and the returns to efforts ratio is quite low. In order to meet time-to-market requirements, often suboptimal coverage results of an integrated circuit (IC) are leveraged. Additionally, no standards have been defined which can be used to identify a target in the continuous state space of analog domain such that the searching algorithm can be guided with some heuristics. In this report, we elaborate on two approaches for tackling this challenge - one is based on frequency domain analysis of the circuit, while the other applies the concept of Bayesian optimization. We have also presented our results by applying the two approaches on an industrial LDO and a few AMS benchmark circuits. △ Less

Submitted 30 April, 2021; originally announced April 2021.

arXiv:2101.05623 [pdf, other]

Design of borehole resistivity measurement acquisition systems using deep learning

Authors: M. Shahriari, A. Hazra, D. Pardo

Abstract: Borehole resistivity measurements recorded with logging-while-drilling (LWD) instruments are widely used for characterizing the earth's subsurface properties. They facilitate the extraction of natural resources such as oil and gas. LWD instruments require real-time inversions of electromagnetic measurements to estimate the electrical properties of the earth's subsurface near the well and possibly… ▽ More Borehole resistivity measurements recorded with logging-while-drilling (LWD) instruments are widely used for characterizing the earth's subsurface properties. They facilitate the extraction of natural resources such as oil and gas. LWD instruments require real-time inversions of electromagnetic measurements to estimate the electrical properties of the earth's subsurface near the well and possibly correct the well trajectory. Deep Neural Network (DNN)-based methods are suitable for the rapid inversion of borehole resistivity measurements as they approximate the forward and inverse problem offline during the training phase and they only require a fraction of a second for the evaluation (aka prediction). However, the inverse problem generally admits multiple solutions. DNNs with traditional loss functions based on data misfit are ill-equipped for solving an inverse problem. This can be partially overcome by adding regularization terms to a loss function specifically designed for encoder-decoder architectures. But adding regularization seriously limits the number of possible solutions to a set of a priori desirable physical solutions. To avoid this, we use a two-step loss function without any regularization. In addition, to guarantee an inverse solution, we need a carefully selected measurement acquisition system with a sufficient number of measurements. In this work, we propose a DNN-based iterative algorithm for designing such a measurement acquisition system. We illustrate our DNN-based iterative algorithm via several synthetic examples. Numerical results show that the obtained measurement acquisition system is sufficient to identify and characterize both resistive and conductive layers above and below the logging instrument. Numerical results are promising, although further improvements are required to make our method amenable for industrial purposes. △ Less

Submitted 12 January, 2021; originally announced January 2021.

arXiv:2101.04521 [pdf]

Examining the variability of cloud hydrometeors and its importance on the Indian summer monsoon rainfall predictability

Authors: Ushnanshu Dutta, Anupam Hazra, Subodh Kumar Saha, Hemantkumar S. Chaudhari, Samir Pokhrel, Mahen Konwar

Abstract: Skilful prediction of the seasonal Indian summer monsoon (ISM) rainfall (ISMR) at least one season in advance has great socio-economic value. It represents a lifeline for about a sixth of the world's population. The ISMR prediction remained a challenging problem with the sub-critical skills of the dynamical models attributable to limited understanding of the interaction among clouds, convection, a… ▽ More Skilful prediction of the seasonal Indian summer monsoon (ISM) rainfall (ISMR) at least one season in advance has great socio-economic value. It represents a lifeline for about a sixth of the world's population. The ISMR prediction remained a challenging problem with the sub-critical skills of the dynamical models attributable to limited understanding of the interaction among clouds, convection, and circulation. The variability of cloud hydrometeors (cloud ice and cloud water) in different time scales (3-7 days, 10-20 days and 30-60 days bands) are examined from re-analysis data during Indian summer monsoon (ISM). Here, we also show that the 'internal' variability of cloud hydrometeors (particularly cloud ice) associated with the ISM sub-seasonal (synoptic + intra-seasonal) fluctuations is partly predictable as they are found to be tied with slowly varying forcing (e.g., El Niño and Southern Oscillation). The representation of deep convective clouds, which involve ice phase processes in a coupled climate model, strongly modulates ISMR variability in association with global predictors. The results from the two sensitivity simulations using coupled global climate model (CGCM) are provided to demonstrate the importance of the cloud hydrometeors on ISM rainfall predictability. Therefore, this study provides a scientific basis for improving the simulation of the seasonal ISMR by improving the physical processes of the cloud on a sub-seasonal time scale and motivating further research in this direction. △ Less

Submitted 12 January, 2021; originally announced January 2021.

Comments: 36 Pages, 14 figures

arXiv:2005.00995 [pdf, ps, other]

Early-Stage Resource Estimation from Functional Reliability Specification in Embedded Cyber-Physical Systems

Authors: Ginju V. George, Aritra Hazra, Pallab Dasgupta, Partha Pratim Chakrabarti

Abstract: Reliability and fault tolerance are critical attributes of embedded cyber-physical systems that require a high safety-integrity level. For such systems, the use of formal functional safety specifications has been strongly advocated in most industrial safety standards, but reliability and fault tolerance have traditionally been treated as platform issues. We believe that addressing reliability and… ▽ More Reliability and fault tolerance are critical attributes of embedded cyber-physical systems that require a high safety-integrity level. For such systems, the use of formal functional safety specifications has been strongly advocated in most industrial safety standards, but reliability and fault tolerance have traditionally been treated as platform issues. We believe that addressing reliability and fault tolerance at the functional safety level widens the scope for resource optimization, targeting those functionalities that are safety-critical, rather than the entire platform. Moreover, for software based control functionalities, temporal redundancies have become just as important as replication of physical resources, and such redundancies can be modeled at the functional specification level. The ability to formally model functional reliability at a specification level enables early estimation of physical resources and computation bandwidth requirements. In this paper we propose, for the first time, a resource estimation methodology from a formal functional safety specification augmented by reliability annotations. The proposed reliability specification is overlaid on the safety-critical functional specification and our methodology extracts a constraint satisfaction problem for determining the optimal set of resources for meeting the reliability target for the safety-critical behaviors. We use SMT (Satisfiability Modulo Theories) / ILP (Integer Linear Programming) solvers at the back end to solve the optimization problem, and demonstrate the feasibility of our methodology on a Satellite Launch Vehicle Navigation, Guidance and Control (NGC) System. △ Less

Submitted 3 May, 2020; originally announced May 2020.

Comments: 23 pages

ACM Class: B.8; F.3.1

arXiv:2004.08888 [pdf]

Electrical Route to Realising Intensity Simulation of Heavy Rain Events in Tropics

Authors: Dipjyoti Mudiar, Anupam Hazra, S. D. Pawar, Rama Krishna Karumuri, Mahen Konwar, Subrata Mukherjee, M. K. Srivastava, B. N. Goswami

Abstract: In the backdrop of a revolution in weather prediction by Numerical Weather Prediction (NWP) models, quantitative prediction of intensity of heavy rainfall events and associated disasters has remained a challenge. Encouraged by compelling evidence of electrical influences on cloud/rain microphysical processes, here we propose a hypothesis that modification of raindrop size distribution (RDSD) towar… ▽ More In the backdrop of a revolution in weather prediction by Numerical Weather Prediction (NWP) models, quantitative prediction of intensity of heavy rainfall events and associated disasters has remained a challenge. Encouraged by compelling evidence of electrical influences on cloud/rain microphysical processes, here we propose a hypothesis that modification of raindrop size distribution (RDSD) towards larger drop sizes through enhanced collision-coalescence facilitated by cloud electric fields could be one of the factors responsible for intensity errors in weather/climate models. The robustness of the hypothesis is confirmed through a series of simulations of strongly electrified (SE) rain events and weakly electrified (WE) events with a convection-permitting weather prediction model incorporating the electrically modified RDSD parameters in the model physics. Our results indicate a possible roadmap for improving hazard prediction associated with extreme rainfall events in weather prediction models and climatological dry bias of precipitation simulation in many climate models. △ Less

Submitted 19 April, 2020; originally announced April 2020.

arXiv:1912.05657 [pdf, other]

Estimating high-resolution Red Sea surface temperature hotspots, using a low-rank semiparametric spatial model

Authors: Arnab Hazra, Raphaël Huser

Abstract: In this work, we estimate extreme sea surface temperature (SST) hotspots, i.e., high threshold exceedance regions, for the Red Sea, a vital region of high biodiversity. We analyze high-resolution satellite-derived SST data comprising daily measurements at 16703 grid cells across the Red Sea over the period 1985-2015. We propose a semiparametric Bayesian spatial mixed-effects linear model with a fl… ▽ More In this work, we estimate extreme sea surface temperature (SST) hotspots, i.e., high threshold exceedance regions, for the Red Sea, a vital region of high biodiversity. We analyze high-resolution satellite-derived SST data comprising daily measurements at 16703 grid cells across the Red Sea over the period 1985-2015. We propose a semiparametric Bayesian spatial mixed-effects linear model with a flexible mean structure to capture spatially-varying trend and seasonality, while the residual spatial variability is modeled through a Dirichlet process mixture (DPM) of low-rank spatial Student-$t$ processes (LTPs). By specifying cluster-specific parameters for each LTP mixture component, the bulk of the SST residuals influence tail inference and hotspot estimation only moderately. Our proposed model has a nonstationary mean, covariance and tail dependence, and posterior inference can be drawn efficiently through Gibbs sampling. In our application, we show that the proposed method outperforms some natural parametric and semiparametric alternatives. Moreover, we show how hotspots can be identified and we estimate extreme SST hotspots for the whole Red Sea, projected until the year 2100, based on the Representative Concentration Pathways 4.5 and 8.5. The estimated 95\% credible region for joint high threshold exceedances include large areas covering major endangered coral reefs in the southern Red Sea. △ Less

Submitted 18 October, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

arXiv:1909.08035 [pdf, other]

doi 10.1007/s13571-024-00324-0

Robust statistical modeling of monthly rainfall: The minimum density power divergence approach

Authors: Arnab Hazra, Abhik Ghosh

Abstract: Statistical modeling of monthly, seasonal, or annual rainfall data is an important research area in meteorology. These models play a crucial role in rainfed agriculture, where a proper assessment of the future availability of rainwater is necessary. The rainfall amount during a rainy month or a whole rainy season} can take any positive value and some simple (one or two-parameter) probability model… ▽ More Statistical modeling of monthly, seasonal, or annual rainfall data is an important research area in meteorology. These models play a crucial role in rainfed agriculture, where a proper assessment of the future availability of rainwater is necessary. The rainfall amount during a rainy month or a whole rainy season} can take any positive value and some simple (one or two-parameter) probability models supported over the positive real line that are generally used for rainfall modeling are exponential, gamma, Weibull, lognormal, Pearson Type-V/VI, log-logistic, etc., where the unknown model parameters are routinely estimated using the maximum likelihood estimator (MLE). However, the presence of outliers or extreme observations is a common issue in rainfall data and the MLEs being highly sensitive to them often leads to spurious inference. Here, we discuss a robust parameter estimation approach based on the minimum density power divergence estimator (MDPDE). We fit the above four parametric models to the detrended areally-weighted monthly rainfall data from the 36 meteorological subdivisions of India for the years 1951-2014 and compare the fits based on MLE and the proposed optimum MDPDE; the superior performance of MDPDE is showcased for several cases. For all month-subdivision combinations, we discuss the best-fit models and median rainfall amounts. △ Less

Submitted 3 March, 2024; v1 submitted 17 September, 2019; originally announced September 2019.

Comments: 32 pages, 6 Tables, 7 Figures

MSC Class: 62P12

arXiv:1812.11704 [pdf, other]

A multivariate spatial skew-t process for joint modeling of extreme precipitation indexes

Authors: Arnab Hazra, Brian J. Reich, Ana-Maria Staicu

Abstract: To study trends in extreme precipitation across US over the years 1951-2017, we consider 10 climate indexes that represent extreme precipitation, such as annual maximum of daily precipitation, annual maximum of consecutive 5-day average precipitation, which exhibit spatial correlation as well as mutual dependence. We consider the gridded data, produced by the CLIMDEX project (http://www.climdex.or… ▽ More To study trends in extreme precipitation across US over the years 1951-2017, we consider 10 climate indexes that represent extreme precipitation, such as annual maximum of daily precipitation, annual maximum of consecutive 5-day average precipitation, which exhibit spatial correlation as well as mutual dependence. We consider the gridded data, produced by the CLIMDEX project (http://www.climdex.org/gewocs.html), constructed using daily precipitation data. In this paper, we propose a multivariate spatial skew-t process for joint modeling of extreme precipitation indexes and discuss its theoretical properties. The model framework allows Bayesian inference while maintaining a computational time that is competitive with common multivariate geostatistical approaches. In a numerical study, we find that the proposed model outperforms multivariate spatial Gaussian processes, multivariate spatial t-processes including their univariate alternatives in terms of various model selection criteria. We apply the proposed model to estimate the average decadal change in the extreme precipitation indexes throughout the United States and find several significant local changes. △ Less

Submitted 31 December, 2018; originally announced December 2018.

Comments: 23 pages, 6 Figures

arXiv:1812.11699 [pdf, other]

A semiparametric spatiotemporal Bayesian model for the bulk and extremes of the Fosberg Fire Weather Index

Authors: Arnab Hazra, Brian J. Reich, Benjamin A. Shaby, Ana-Maria Staicu

Abstract: Large wildfires pose a major environmental concern, and precise maps of fire risk can improve disaster relief planning. Fosberg Fire Weather Index (FFWI) is often used to measure wildfire risk; FFWI exhibits non-Gaussian marginal distributions as well as strong spatiotemporal extremal dependence and thus, modeling FFWI using geostatistical models like Gaussian processes is questionable. Extreme va… ▽ More Large wildfires pose a major environmental concern, and precise maps of fire risk can improve disaster relief planning. Fosberg Fire Weather Index (FFWI) is often used to measure wildfire risk; FFWI exhibits non-Gaussian marginal distributions as well as strong spatiotemporal extremal dependence and thus, modeling FFWI using geostatistical models like Gaussian processes is questionable. Extreme value theory (EVT)-driven models like max-stable processes are theoretically appealing but are computationally demanding and applicable only for threshold exceedances or block maxima. Disaster management policies often consider moderate-to-extreme quantiles of climate parameters and hence, joint modeling of the bulk and the tail of the data is required. In this paper, we consider a Dirichlet process mixture of spatial skew-t processes that can flexibly model the bulk as well as the tail. The proposed model has nonstationary mean and covariance structure, and also nonzero spatiotemporal extremal dependence. A simulation study demonstrates that the proposed model has better spatial prediction performance compared to some competing models. We develop spatial maps of FFWI medians and extremes, and discuss the wildfire risk throughout the Santa Ana region of California. △ Less

Submitted 16 November, 2020; v1 submitted 31 December, 2018; originally announced December 2018.

Comments: 69 pages, 16 Figures

arXiv:1810.08821 [pdf, other]

Testability Analysis of PUFs Leveraging Correlation-Spectra in Boolean Functions

Authors: Durba Chatterjee, Aritra Hazra, Debdeep Mukhopadhyay

Abstract: Testability of digital ICs rely on the principle of controllability and observability. Adopting conventional techniques like scan-chains open up avenues for attacks, and hence cannot be adopted in a straight-forward manner for security chips. Furthermore, testing becomes incredibly challenging for the promising class of hardware security primitives, called PUFs, which offer unique properties like… ▽ More Testability of digital ICs rely on the principle of controllability and observability. Adopting conventional techniques like scan-chains open up avenues for attacks, and hence cannot be adopted in a straight-forward manner for security chips. Furthermore, testing becomes incredibly challenging for the promising class of hardware security primitives, called PUFs, which offer unique properties like unclonability, unpredictibility, uniformity, uniqueness, and yet easily computable. However, the definition of PUF itself poses a challenge on test engineers, simply because it has no golden response for a given input, often called challenge. In this paper, we develop a novel test strategy considering that the fabrication of a batch of $N>1$ PUFs is equivalent to drawing random instances of Boolean mappings. We hence model the PUFs as black-box Boolean functions of dimension $m\times1$, and show combinatorially that random designs of such functions exhibit correlation-spectra which can be used to characterize random and thus {\em good} designs of PUFs. We first develop theoretical results to quantize the correlation values, and subsequently the expected number of pairs of such Boolean functions which should belong to a given spectra. In addition to this, we show through extensive experimental results that a randomly chosen sample of such PUFs also resemble the correlation-spectra property of the overall PUF population. Interestingly, we show through experimental results on $50$ FPGAs that when the PUFs are infected by faults the usual randomness tests for the PUF outputs such as uniformity, fail to detect any aberration. However, the spectral-pattern is clearly shown to get affected, which we demonstrate by standard statistical tools. We finally propose a systematic testing framework for the evaluation of PUFs by observing the correlation-spectra of the PUF instances under test. △ Less

Submitted 20 October, 2018; originally announced October 2018.

arXiv:1809.03816 [pdf, other]

doi 10.1016/j.jcp.2019.06.003

Globally constraint-preserving FR/DG scheme for Maxwell's equations at all orders

Authors: Arijit Hazra, Praveen Chandrashekar, Dinshaw S. Balsara

Abstract: Computational electrodynamics (CED), the numerical solution of Maxwell's equations, plays an incredibly important role in several problems in science and engineering. High accuracy solutions are desired, and the discontinuous Galerkin (DG) method is one of the better ways of delivering high accuracy in CED. Maxwell's equations have a pair of involution constraints for which mimetic schemes that gl… ▽ More Computational electrodynamics (CED), the numerical solution of Maxwell's equations, plays an incredibly important role in several problems in science and engineering. High accuracy solutions are desired, and the discontinuous Galerkin (DG) method is one of the better ways of delivering high accuracy in CED. Maxwell's equations have a pair of involution constraints for which mimetic schemes that globally satisfy the constraints at a discrete level are highly desirable. Balsara and Kappeli presented a von Neumann stability analysis of globally constraint-preserving DG schemes for CED up to 4'th order which was focused on developing the theory and documenting the superior dissipation and dispersion of DGTD schemes in media with constant permittivity and permeability. In this paper we present DGTD schemes for CED that go up to 5'th order of accuracy and analyze their performance when permittivity and permeability vary strongly in space. Our DGTD schemes achieve constraint preservation by collocating the electric displacement and magnetic induction as well as their higher order modes in the faces of the mesh. Our first finding is that at 4'th and higher orders, one has to evolve some zone-centered modes in addition to the face-centered modes. It is well-known that the limiting step in DG schemes causes a reduction of the optimal accuracy of the scheme. In this paper we document simulations where permittivity and permeability vary by almost an order of magnitude without requiring any limiting of the DG scheme. This very favorable finding ensures that DGTD schemes retain optimal accuracy even in the presence of large spatial variations in permittivity/permeability. Our third finding shows that the electromagnetic energy is conserved very well even when permittivity and permeability vary strongly in space; as long as the conductivity is zero. △ Less

Submitted 11 September, 2018; originally announced September 2018.

arXiv:1809.00878 [pdf, other]

doi 10.1029/2018JD030082

Unraveling the Mystery of Indian Summer Monsoon Prediction: Improved Estimate of Predictability Limit

Authors: Subodh Kumar Saha, Anupam Hazra, Samir Pokhrel, Hemantkumar S. Chaudhari, K. Sujith, Archana Rai, Hasibur Rahaman, B. N. Goswami

Abstract: Large socio-economic impact of the Indian Summer Monsoon (ISM) extremes motivated numerous attempts at its long range prediction over the past century. However, a rather estimated low potential predictability limit (PPL) of seasonal prediction of the ISM, contributed significantly by 'internal' interannual variability was considered insurmountable. Here we show that the 'internal' variability cont… ▽ More Large socio-economic impact of the Indian Summer Monsoon (ISM) extremes motivated numerous attempts at its long range prediction over the past century. However, a rather estimated low potential predictability limit (PPL) of seasonal prediction of the ISM, contributed significantly by 'internal' interannual variability was considered insurmountable. Here we show that the 'internal' variability contributed by the ISM sub-seasonal (synoptic + intra-seasonal) fluctuations, so far considered chaotic, is partly predictable as found to be tied to slowly varying forcing (e.g. El Nino and Southern Oscillation). This provides a scientific basis for predictability of the ISM rainfall beyond the conventional estimates of PPL. We establish a much higher actual limit of predictability (r~0.82) through an extensive re-forecast experiment (1920 years of simulation) by improving two major physics in a global coupled climate model, which raises a hope for a very reliable dynamical seasonal ISM forecasting in the near future. △ Less

Submitted 4 September, 2018; originally announced September 2018.

arXiv:1708.03972 [pdf, other]

Analysis of Annual Cyclone Frequencies over Bay of Bengal: Effect of 2004 Indian Ocean Tsunami

Authors: Arnab Hazra

Abstract: This paper discusses the time series trend and variability of the cyclone frequencies over Bay of Bengal, particularly in order to conclude if there is any significant difference in the pattern visible before and after the disastrous 2004 Indian ocean tsunami based on the observed annual cyclone frequency data obtained by India Meteorological Department over the years 1891-2015. Three different ca… ▽ More This paper discusses the time series trend and variability of the cyclone frequencies over Bay of Bengal, particularly in order to conclude if there is any significant difference in the pattern visible before and after the disastrous 2004 Indian ocean tsunami based on the observed annual cyclone frequency data obtained by India Meteorological Department over the years 1891-2015. Three different categories of cyclones- depression (<34 knots), cyclonic storm (34-47 knots) and severe cyclonic storm (>47 knots) have been analyzed separately using a non-homogeneous Poisson process approach. The estimated intensity functions of the Poisson processes along with their first two derivatives are discussed and all three categories show decreasing trend of the intensity functions after the tsunami. Using an exact change-point analysis, we show that the drops in mean intensity functions are significant for all three categories. As of author's knowledge, no study so far have discussed the relation between cyclones and tsunamis. Bay of Bengal is surrounded by one of the most densely populated areas of the world and any kind of significant change in tropical cyclone pattern has a large impact in various ways, for example, disaster management planning and our study is immensely important from that perspective. △ Less

Submitted 13 August, 2017; originally announced August 2017.

Comments: 14 pages, 5 figures

arXiv:1703.05985 [pdf, ps, other]

Numerical Simulation of Bloch Equations for Dynamic Magnetic Resonance Imaging

Authors: Arijit Hazra, Gert Lube, Hans-Georg Raumer

Abstract: Magnetic Resonance Imaging (MRI) is a widely applied non-invasive imaging modality based on non-ionizing radiation which gives excellent images and soft tissue contrast of living tissues. We consider the modified Bloch problem as a model of MRI for flowing spins in an incompressible flow field. After establishing the well-posedness of the corresponding evolution problem, we analyze its spatial sem… ▽ More Magnetic Resonance Imaging (MRI) is a widely applied non-invasive imaging modality based on non-ionizing radiation which gives excellent images and soft tissue contrast of living tissues. We consider the modified Bloch problem as a model of MRI for flowing spins in an incompressible flow field. After establishing the well-posedness of the corresponding evolution problem, we analyze its spatial semidiscretization using discontinuous Galerkin methods. The high frequency time evolution requires a proper explicit and adaptive temporal discretization. The applicability of the approach is shown for basic examples. △ Less

Submitted 10 September, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

Comments: Section 1 is improved with a description of the state-of-art. Some simulation results are added in section 5. Mathematical theorems and estimates are re-arranged

arXiv:1512.02149 [pdf, ps, other]

A Time-varying Parameter Based Seasonally-adjusted Bayesian State-space Model for Forecasting

Authors: Arnab Hazra

Abstract: In this paper, we develop a time-varying parameter based seasonally-adjusted Bayesian state-space model for non-stationary time series datasets where both the trend and seasonal components are present and it is the general scenario for most of the real datasets in various scientific disciplines. In spite of removing such terms using some do-and-check procedure to make the data stationary, our mode… ▽ More In this paper, we develop a time-varying parameter based seasonally-adjusted Bayesian state-space model for non-stationary time series datasets where both the trend and seasonal components are present and it is the general scenario for most of the real datasets in various scientific disciplines. In spite of removing such terms using some do-and-check procedure to make the data stationary, our model directly fits a dataset and forecasts a number of future observations. For a specific prior construction we have considered, every parameter update is one-dimensional so that we don't need to invert any matrix and also we overcome the difficulty of Metropolis-Hastings steps simply by Gibbs sampling which is another advantage of this model. It can handle missing data as well which occurs very often in time series contexts. We implement it on the sufficiently large (24 years of monthly average temperature series, i.e. the number of observations =288) for 57 meteorological stations across India and show that for most of the cases, our method forecasts quite accurately for the months of the 25-th year. △ Less

Submitted 7 December, 2015; originally announced December 2015.

Comments: 15 pages

arXiv:1405.7206 [pdf, ps, other]

A Note on the Misuse of the Variance Test in Meteorological Studies

Authors: Arnab Hazra, Sourabh Bhattacharya, Sabyasachi Bhattacharya, Pabitra Banik

Abstract: The erroneous assumption "for all distributions for which the theoretical variance can be computed independently from parameters estimated by any method different from the method of moments" has been used in the case of fitting the gamma distribution to a rainfall data by Mooley (1973) which was followed by several researchers. We show that the asymptotic distribution of the test statistic is gene… ▽ More The erroneous assumption "for all distributions for which the theoretical variance can be computed independently from parameters estimated by any method different from the method of moments" has been used in the case of fitting the gamma distribution to a rainfall data by Mooley (1973) which was followed by several researchers. We show that the asymptotic distribution of the test statistic is generally not even comparable to any central chi-square distribution. We also describe a method for checking the validity of the asymptotic distribution for a class of distributions. △ Less

Submitted 28 May, 2014; originally announced May 2014.

arXiv:1405.2400 [pdf, other]

doi 10.1103/PhysRevA.90.022303

Estimating Franck-Condon factors using an NMR quantum processor

Authors: Sharad Joshi, Abhishek Shukla, Hemant Katiyar, Anirban Hazra, T. S. Mahesh

Abstract: Interaction of molecules with light may lead to electronic transitions and simultaneous vibrational excitations. Franck-Condon factors (FCFs) play an important role in quantifying the intensities of such vibronic transitions occurring during molecular photo-excitations. In this article, we describe a general method for estimating FCFs using a quantum information processor. The method involves the… ▽ More Interaction of molecules with light may lead to electronic transitions and simultaneous vibrational excitations. Franck-Condon factors (FCFs) play an important role in quantifying the intensities of such vibronic transitions occurring during molecular photo-excitations. In this article, we describe a general method for estimating FCFs using a quantum information processor. The method involves the application of a translation operator followed by the measurement of certain projections. We also illustrate the method by experimentally estimating FCFs with the help of a three-qubit NMR quantum information processor. We describe two methods for the measurement of projections - (i) using diagonal tomography and (ii) using Moussa protocol. The experimental results agree fairly well with the theory. △ Less

Submitted 10 May, 2014; originally announced May 2014.

Comments: 6 pages, 7 figures

Showing 1–50 of 59 results for author: Hazra, A