Search | arXiv e-print repository

arXiv:2406.18597 [pdf, other]

Relative Measurement and Extrapolation of the Scintillation Quenching Factor of $α$-Particles in Liquid Argon using DEAP-3600 Data

Authors: The DEAP Collaboration, P. Adhikari, M. Alpízar-Venegas, P. -A. Amaudruz, J. Anstey, D. J. Auty, M. Batygov, B. Beltran, C. E. Bina, W. Bonivento, M. G. Boulay, J. F. Bueno, B. Cai, M. Cárdenas-Montes, S. Choudhary, B. T. Cleveland, R. Crampton, S. Daugherty, P. DelGobbo, P. Di Stefano, G. Dolganov, L. Doria, F. A. Duncan, M. Dunford, E. Ellingwood , et al. (73 additional authors not shown)

Abstract: The knowledge of scintillation quenching of $α$-particles plays a paramount role in understanding $α$-induced backgrounds and improving the sensitivity of liquid argon-based direct detection of dark matter experiments. We performed a relative measurement of scintillation quenching in the MeV energy region using radioactive isotopes ($^{222}$Rn, $^{218}$Po and $^{214}$Po isotopes) present in trace… ▽ More The knowledge of scintillation quenching of $α$-particles plays a paramount role in understanding $α$-induced backgrounds and improving the sensitivity of liquid argon-based direct detection of dark matter experiments. We performed a relative measurement of scintillation quenching in the MeV energy region using radioactive isotopes ($^{222}$Rn, $^{218}$Po and $^{214}$Po isotopes) present in trace amounts in the DEAP-3600 detector and quantified the uncertainty of extrapolating the quenching factor to the low-energy region. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: Submitted to Eur. Phys. J. C

arXiv:2406.15387 [pdf, ps, other]

On Profinite Quandles

Authors: On Profinite Quandles Alexander W. Byard, Brian Cai, Nathan P. Jones, Lucy H. Vuong, David N. Yetter

Abstract: We undertake the study of profinite quandles. We provide several constructions of profinite quandles from profinite groups, and from other profinite quandle. We characterize which subquandles of profinite quandles are again profinite. Finally, we provide a characterization of algebraically connected profinite quandles in terms of the profinite completion of their inner automorphism groups… ▽ More We undertake the study of profinite quandles. We provide several constructions of profinite quandles from profinite groups, and from other profinite quandle. We characterize which subquandles of profinite quandles are again profinite. Finally, we provide a characterization of algebraically connected profinite quandles in terms of the profinite completion of their inner automorphism groups $\widehat{\Inn(Q)}$. It is anticipated that the results herein will find applications to the étale homotopy theory of number fields. △ Less

Submitted 22 April, 2024; originally announced June 2024.

MSC Class: 08A99; 57K12; 20E18

arXiv:2406.05025 [pdf, ps, other]

Unraveling Trace Anomaly of Supradense Matter via Neutron Star Compactness Scaling

Authors: Bao-Jun Cai, Bao-An Li

Abstract: The trace anomaly $Δ\equiv 1/3-P/\varepsilon$ quantifies the possibly broken conformal symmetry in supradense matter under pressure $P$ at energy density $\varepsilon$. Perturbative QCD (pQCD) predicts a vanishing $Δ$ at extremely high energy or baryon densities when the conformal symmetry is realized but its behavior at intermediate densities reachable in neutron stars (NSs) are still very uncert… ▽ More The trace anomaly $Δ\equiv 1/3-P/\varepsilon$ quantifies the possibly broken conformal symmetry in supradense matter under pressure $P$ at energy density $\varepsilon$. Perturbative QCD (pQCD) predicts a vanishing $Δ$ at extremely high energy or baryon densities when the conformal symmetry is realized but its behavior at intermediate densities reachable in neutron stars (NSs) are still very uncertain. The extraction of $Δ$ from NS observations strongly depends on the employed model for nuclear Equation of State (EOS). Based on the analytical results from perturbatively analyzing the dimensionless Tolman-Oppenheimer-Volkoff (TOV) equations that are further verified numerically by using $10^5$ EOSs generated randomly with a meta-model in a very broad EOS parameter space constrained by terrestrial nuclear experiments and astrophysical observations, here we first show that the compactness $ξ\equiv M_{\rm{NS}}/R$ of a NS with mass $M_{\rm{NS}}$ and radius $R$ scales very accurately with $Π_{\rm{c}}\equiv\mathrm{X}/(1+3\mathrm{X}^2+4\mathrm{X})$ where $\mathrm{X}\equiv P_{\rm{c}}/\varepsilon_{\rm{c}}$ is the ratio of pressure over energy density at NS centers. The scaling of NS compactness thus enables one to readily read off the central trace anomaly $Δ_{\rm{c}}=1/3-\mathrm{X}$ directly from the observational data of either the mass-radius or red-shift measurements. We then demonstrate indeed that the available NS data themselves from recent X-ray and gravitational wave observations can determine model-independently the trace anomaly as a function of energy density in NS cores, providing a stringent test of existing NS models and a clear guidance in a new direction for further understanding the nature and EOS of supradense matter. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 5 pages of main text with 4 pages of supplementary materials

arXiv:2406.04070 [pdf, other]

Batch-in-Batch: a new adversarial training framework for initial perturbation and sample selection

Authors: Yinting Wu, Pai Peng, Bo Cai, Le Li, .

Abstract: Adversarial training methods commonly generate independent initial perturbation for adversarial samples from a simple uniform distribution, and obtain the training batch for the classifier without selection. In this work, we propose a simple yet effective training framework called Batch-in-Batch (BB) to enhance models robustness. It involves specifically a joint construction of initial values that… ▽ More Adversarial training methods commonly generate independent initial perturbation for adversarial samples from a simple uniform distribution, and obtain the training batch for the classifier without selection. In this work, we propose a simple yet effective training framework called Batch-in-Batch (BB) to enhance models robustness. It involves specifically a joint construction of initial values that could simultaneously generates $m$ sets of perturbations from the original batch set to provide more diversity for adversarial samples; and also includes various sample selection strategies that enable the trained models to have smoother losses and avoid overconfident outputs. Through extensive experiments on three benchmark datasets (CIFAR-10, SVHN, CIFAR-100) with two networks (PreActResNet18 and WideResNet28-10) that are used in both the single-step (Noise-Fast Gradient Sign Method, N-FGSM) and multi-step (Projected Gradient Descent, PGD-10) adversarial training, we show that models trained within the BB framework consistently have higher adversarial accuracy across various adversarial settings, notably achieving over a 13% improvement on the SVHN dataset with an attack radius of 8/255 compared to the N-FGSM baseline model. Furthermore, experimental analysis of the efficiency of both the proposed initial perturbation method and sample selection strategies validates our insights. Finally, we show that our framework is cost-effective in terms of computational resources, even with a relatively large value of $m$. △ Less

Submitted 6 June, 2024; originally announced June 2024.

Comments: 29 pages, 11 figures

arXiv:2405.15038 [pdf, other]

Preferential Latent Space Models for Networks with Textual Edges

Authors: Maoyu Zhang, Biao Cai, Dong Li, Xiaoyue Niu, Jingfei Zhang

Abstract: Many real-world networks contain rich textual information in the edges, such as email networks where an edge between two nodes is an email exchange. Other examples include co-author networks and social media networks. The useful textual information carried in the edges is often discarded in most network analyses, resulting in an incomplete view of the relationships between nodes. In this work, we… ▽ More Many real-world networks contain rich textual information in the edges, such as email networks where an edge between two nodes is an email exchange. Other examples include co-author networks and social media networks. The useful textual information carried in the edges is often discarded in most network analyses, resulting in an incomplete view of the relationships between nodes. In this work, we propose to represent the text document between each pair of nodes as a vector counting the appearances of keywords extracted from the corpus, and introduce a new and flexible preferential latent space network model that can offer direct insights on how contents of the textual exchanges modulate the relationships between nodes. We establish identifiability conditions for the proposed model and tackle model estimation with a computationally efficient projected gradient descent algorithm. We further derive the non-asymptotic error bound of the estimator from each step of the algorithm. The efficacy of our proposed method is demonstrated through simulations and an analysis of the Enron email network. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 30 pages

MSC Class: G.3; F.2 ACM Class: G.3

arXiv:2405.01113 [pdf, other]

Domain-Transferred Synthetic Data Generation for Improving Monocular Depth Estimation

Authors: Seungyeop Lee, Knut Peterson, Solmaz Arezoomandan, Bill Cai, Peihan Li, Lifeng Zhou, David Han

Abstract: A major obstacle to the development of effective monocular depth estimation algorithms is the difficulty in obtaining high-quality depth data that corresponds to collected RGB images. Collecting this data is time-consuming and costly, and even data collected by modern sensors has limited range or resolution, and is subject to inconsistencies and noise. To combat this, we propose a method of data g… ▽ More A major obstacle to the development of effective monocular depth estimation algorithms is the difficulty in obtaining high-quality depth data that corresponds to collected RGB images. Collecting this data is time-consuming and costly, and even data collected by modern sensors has limited range or resolution, and is subject to inconsistencies and noise. To combat this, we propose a method of data generation in simulation using 3D synthetic environments and CycleGAN domain transfer. We compare this method of data generation to the popular NYUDepth V2 dataset by training a depth estimation model based on the DenseDepth structure using different training sets of real and simulated data. We evaluate the performance of the models on newly collected images and LiDAR depth data from a Husky robot to verify the generalizability of the approach and show that GAN-transformed data can serve as an effective alternative to real-world data, particularly in depth estimation. △ Less

Submitted 2 May, 2024; originally announced May 2024.

arXiv:2404.06224 [pdf, other]

Low-Cost Generation and Evaluation of Dictionary Example Sentences

Authors: Bill Cai, Clarence Boon Liang Ng, Daniel Tan, Shelvia Hotama

Abstract: Dictionary example sentences play an important role in illustrating word definitions and usage, but manually creating quality sentences is challenging. Prior works have demonstrated that language models can be trained to generate example sentences. However, they relied on costly customized models and word sense datasets for generation and evaluation of their work. Rapid advancements in foundationa… ▽ More Dictionary example sentences play an important role in illustrating word definitions and usage, but manually creating quality sentences is challenging. Prior works have demonstrated that language models can be trained to generate example sentences. However, they relied on costly customized models and word sense datasets for generation and evaluation of their work. Rapid advancements in foundational models present the opportunity to create low-cost, zero-shot methods for the generation and evaluation of dictionary example sentences. We introduce a new automatic evaluation metric called OxfordEval that measures the win-rate of generated sentences against existing Oxford Dictionary sentences. OxfordEval shows high alignment with human judgments, enabling large-scale automated quality evaluation. We experiment with various LLMs and configurations to generate dictionary sentences across word classes. We complement this with a novel approach of using masked language models to identify and select sentences that best exemplify word meaning. The eventual model, FM-MLM, achieves over 85.1% win rate against Oxford baseline sentences according to OxfordEval, compared to 39.8% win rate for prior model-generated sentences. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.01677 [pdf, other]

Towards Generalizable and Faithful Logic Reasoning over Natural Language via Resolution Refutation

Authors: Zhouhao Sun, Xiao Ding, Li Du, Bibo Cai, Jinglong Gao, Ting Liu, Qin Bing

Abstract: Large language models (LLMs) have achieved significant performance in various natural language reasoning tasks. However, they still struggle with performing first-order logic reasoning over formal logical theories expressed in natural language. This is because the previous LLMs-based reasoning systems have the theoretical incompleteness issue. As a result, it can only address a limited set of simp… ▽ More Large language models (LLMs) have achieved significant performance in various natural language reasoning tasks. However, they still struggle with performing first-order logic reasoning over formal logical theories expressed in natural language. This is because the previous LLMs-based reasoning systems have the theoretical incompleteness issue. As a result, it can only address a limited set of simple reasoning problems, which significantly decreases their generalization ability. To address this issue, we propose a novel framework, named Generalizable and Faithful Reasoner (GFaiR), which introduces the paradigm of resolution refutation. Resolution refutation has the capability to solve all first-order logic reasoning problems by extending reasoning rules and employing the principle of proof by contradiction, so our system's completeness can be improved by introducing resolution refutation. Experimental results demonstrate that our system outperforms previous works by achieving state-of-the-art performances in complex scenarios while maintaining performances in simple scenarios. Besides, we observe that GFaiR is faithful to its reasoning process. △ Less

Submitted 3 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

Comments: LREC-Coling 2024

arXiv:2403.11450 [pdf, other]

Zero-shot Compound Expression Recognition with Visual Language Model at the 6th ABAW Challenge

Authors: Jiahe Wang, Jiale Huang, Bingzhao Cai, Yifan Cao, Xin Yun, Shangfei Wang

Abstract: Conventional approaches to facial expression recognition primarily focus on the classification of six basic facial expressions. Nevertheless, real-world situations present a wider range of complex compound expressions that consist of combinations of these basics ones due to limited availability of comprehensive training datasets. The 6th Workshop and Competition on Affective Behavior Analysis in-t… ▽ More Conventional approaches to facial expression recognition primarily focus on the classification of six basic facial expressions. Nevertheless, real-world situations present a wider range of complex compound expressions that consist of combinations of these basics ones due to limited availability of comprehensive training datasets. The 6th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW) offered unlabeled datasets containing compound expressions. In this study, we propose a zero-shot approach for recognizing compound expressions by leveraging a pretrained visual language model integrated with some traditional CNN networks. △ Less

Submitted 17 March, 2024; originally announced March 2024.

Comments: USTC-AC's paper for Compound Expression (CE) Recognition Challenge in 6th Workshop and Competition on Affective Behavior Analysis in-the-wild (ABAW)

arXiv:2403.09825 [pdf, other]

Field Line Curvature Scattering in the Dayside Off-Equatorial Minima Regions

Authors: Bin Cai, Hui Zhu

Abstract: Magnetic field line curvature (FLC) scattering is an effective mechanism for collisionless particle scattering. In the terrestrial magnetosphere, the FLC scattering plays an essential role in shaping the outer boundary of protons radiation belt, the rapid decay of ring current, and the formation of proton isotropic boundary (IB). However, previous studies have yet to adequately investigate the inf… ▽ More Magnetic field line curvature (FLC) scattering is an effective mechanism for collisionless particle scattering. In the terrestrial magnetosphere, the FLC scattering plays an essential role in shaping the outer boundary of protons radiation belt, the rapid decay of ring current, and the formation of proton isotropic boundary (IB). However, previous studies have yet to adequately investigate the influence of FLC scattering on charged particles in the Earth's dayside magnetosphere, particularly in the off-equatorial magnetic minima regions. This study employs T89 magnetic field model to investigate the impacts of FLC scattering on ring current protons in the dayside magnetosphere, with a specific focus on the off-equatorial minimum regions. We analyze the spatial distributions of single and dual magnetic minima regions, adiabatic parameter, and pitch angle diffusion coefficients due to FLC scattering as functions of $Kp$. The results show that the effects of FLC scattering are significant not only on the dusk and dawn sides but also in the off-equatorial minima regions on the noon. Additionally, we investigate the role of dipole tilt angle in the hemispheric asymmetry of FLC scattering effects. The dipole tilt angle controls the overall displacement of the dayside magnetosphere, resulting in different FLC scattering effects in the two hemispheres. Our study holds significance for understanding the FLC scattering effects in the off-equatorial region of Earth's dayside magnetosphere and for constructing a more accurate dynamic model of particles. △ Less

Submitted 14 March, 2024; originally announced March 2024.

arXiv:2403.03730 [pdf, other]

Learning 3D object-centric representation through prediction

Authors: John Day, Tushar Arora, Jirui Liu, Li Erran Li, Ming Bo Cai

Abstract: As part of human core knowledge, the representation of objects is the building block of mental representation that supports high-level concepts and symbolic reasoning. While humans develop the ability of perceiving objects situated in 3D environments without supervision, models that learn the same set of abilities with similar constraints faced by human infants are lacking. Towards this end, we de… ▽ More As part of human core knowledge, the representation of objects is the building block of mental representation that supports high-level concepts and symbolic reasoning. While humans develop the ability of perceiving objects situated in 3D environments without supervision, models that learn the same set of abilities with similar constraints faced by human infants are lacking. Towards this end, we developed a novel network architecture that simultaneously learns to 1) segment objects from discrete images, 2) infer their 3D locations, and 3) perceive depth, all while using only information directly available to the brain as training data, namely: sequences of images and self-motion. The core idea is treating objects as latent causes of visual input which the brain uses to make efficient predictions of future scenes. This results in object representations being learned as an essential byproduct of learning to predict. △ Less

Submitted 6 March, 2024; originally announced March 2024.

Comments: 21 pages, 11 figures. Project webpage can be found at https://jday54.github.io/opple_site/

ACM Class: I.2.10; I.4.8; I.4.6; I.4.10; I.2.6

arXiv:2312.14399 [pdf, ps, other]

Quantum multigraph states and multihypergraph states

Authors: Xiao-Dong Zhang, Bin-Bin Cai, Song Lin

Abstract: We proposed two classes of multiparticle entangled states, the multigraph states and multihypergraph states, defined by unique operations on the edges and hyperedges. A key discovery is the one-to-one correspondence between the proposed multihypergraph states and the generalized real equally weighted states when d is prime. While for composite d, multihypergraph states form a subset of the general… ▽ More We proposed two classes of multiparticle entangled states, the multigraph states and multihypergraph states, defined by unique operations on the edges and hyperedges. A key discovery is the one-to-one correspondence between the proposed multihypergraph states and the generalized real equally weighted states when d is prime. While for composite d, multihypergraph states form a subset of the generalized real equally weighted states. Meanwhile, we detailed a method for constructing real equally weighted states from hypergraph states and revealed the generalized real equally weighted states which cannot be generated from d-dimensional hypergraph states. △ Less

Submitted 21 December, 2023; originally announced December 2023.

arXiv:2312.07031 [pdf, other]

doi 10.1016/j.physletb.2023.138435

Bayesian model averaging for nuclear symmetry energy from effective proton-neutron chemical potential difference of neutron-rich nuclei

Authors: Mengying Qiu, Bao-Jun Cai, Lie-Wen Chen, Cen-Xi Yuan, Zhen Zhang

Abstract: The data-driven Bayesian model averaging is a rigorous statistical approach to combining multiple models for a unified prediction. Compared with the individual model, it provides more reliable information, especially for problems involving apparent model dependence. In this work, within both the non-relativistic Skyrme energy density functional and the nonlinear relativistic mean field model, the… ▽ More The data-driven Bayesian model averaging is a rigorous statistical approach to combining multiple models for a unified prediction. Compared with the individual model, it provides more reliable information, especially for problems involving apparent model dependence. In this work, within both the non-relativistic Skyrme energy density functional and the nonlinear relativistic mean field model, the effective proton-neutron chemical potential difference $Δμ^*_{\rm{pn}}$ of neutron-rich nuclei is found to be strongly sensitive to the symmetry energy $E_{\rm{sym}}(ρ)$ around $2ρ_0/3$, with $ρ_0$ being the nuclear saturation density. Given discrepancies on the $Δμ^*_{\rm{pn}}$-$E_{\rm{sym}}(2ρ_0/3)$ correlations between the two models, we carry out a Bayesian model averaging analysis based on Gaussian process emulators to extract the symmetry energy around $2ρ_0/3$ from the measured $Δμ^*_{\rm{pn}}$ of 5 doubly magic nuclei $^{48}$Ca, $^{68}$Ni, $^{88}$Sr, $^{132}$Sn and $^{208}$Pb. Specifically, the $E_{\mathrm{sym}}(2ρ_0/3)$ is inferred to be $E_{\mathrm{sym}}(2ρ_0/3) = 25.6_{-1.3}^{+1.4}\,\mathrm{MeV}$ at $1σ$ confidence level. The obtained constraints on the $E_{\mathrm{sym}}(ρ)$ around $2ρ_0/3$ agree well with microscopic predictions and results from other isovector indicators. △ Less

Submitted 18 January, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Comments: 6 pages, 4 figures; published version

Journal ref: Phys. Lett. B 849 (2024) 138435

arXiv:2311.13037 [pdf, ps, other]

doi 10.1103/PhysRevD.109.083015

Strong Gravity Extruding Peaks in Speed of Sound Profiles of Massive Neutron Stars

Authors: Bao-Jun Cai, Bao-An Li

Abstract: The speed of sound squared (SSS) $s^2$ in massive neutron stars (NSs) characterizes not only the stiffness of supradense neutron-rich matter within but also equivalently properties of the curved geometry due to the strong-field gravity and matter-geometry coupling. A peaked density or radius profile of $s^2$ has been predicted for massive NSs using various NS Equation of State (EOS) models. Howeve… ▽ More The speed of sound squared (SSS) $s^2$ in massive neutron stars (NSs) characterizes not only the stiffness of supradense neutron-rich matter within but also equivalently properties of the curved geometry due to the strong-field gravity and matter-geometry coupling. A peaked density or radius profile of $s^2$ has been predicted for massive NSs using various NS Equation of State (EOS) models. However, the nature, cause, location and size of the peak in $s^2$ profiles are still very EOS model dependent. In this work, we investigate systematically $s^2$ profiles in massive NSs in a new approach that is independent of the nuclear EOS model and without any presumption about the NS structure and/or composition. In terms of the small quantities (reduced radius, the energy density and pressure scaled by their central values), we perform double-element perturbative expansions in solving perturbatively the scaled Tolman--Oppenheimer--Volkoff (TOV) equations and analyzing $s^2$ profiles from the Newtonian limit to the general relativistic (GR) case. The GR term in the TOV equations plays a twofold role: it compresses NS matter and modifies the pressure/energy density ratio from small values in Newtonian stars showing no $s^2$ peak to large ones for massive NSs possessing a peak in their $s^2$ profiles, and eventually takes away the peak in extremely compact/massive NSs approaching the causality limit. {In particular, the peaked behavior in $s^2$ is expected to emerge near the center of massive NSs like PSR J0740+6620, while a sharp phase transition is unlikely to occur there.} These features revealed from our analyses are universal as they are intrinsic properties of the GR stellar structure equations independent of the still very uncertain EOS of supradense neutron-rich matter in NSs. △ Less

Submitted 15 March, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

Comments: Added/revised some discussions. Phys. Rev. D in press

Journal ref: Phys. Rev. D 109, 083015 (2024)

arXiv:2311.01167 [pdf, ps, other]

Modulation Design and Optimization for RIS-Assisted Symbiotic Radios

Authors: Hu Zhou, Bowen Cai, Qianqian Zhang, Ruizhe Long, Yiyang Pei, Ying-Chang Liang

Abstract: In reconfigurable intelligent surface (RIS)-assisted symbiotic radio (SR), the RIS acts as a secondary transmitter by modulating its information bits over the incident primary signal and simultaneously assists the primary transmission, then a cooperative receiver is used to jointly decode the primary and secondary signals. Most existing works of SR focus on using RIS to enhance the reflecting link… ▽ More In reconfigurable intelligent surface (RIS)-assisted symbiotic radio (SR), the RIS acts as a secondary transmitter by modulating its information bits over the incident primary signal and simultaneously assists the primary transmission, then a cooperative receiver is used to jointly decode the primary and secondary signals. Most existing works of SR focus on using RIS to enhance the reflecting link while ignoring the ambiguity problem for the joint detection caused by the multiplication relationship of the primary and secondary signals. Particularly, in case of a blocked direct link, joint detection will suffer from severe performance loss due to the ambiguity, when using the conventional on-off keying and binary phase shift keying modulation schemes for RIS. To address this issue, we propose a novel modulation scheme for RIS-assisted SR that divides the phase-shift matrix into two components: the symbol-invariant and symbol-varying components, which are used to assist the primary transmission and carry the secondary signal, respectively. To design these two components, we focus on the detection of the composite signal formed by the primary and secondary signals, through which a problem of minimizing the bit error rate (BER) of the composite signal is formulated to improve both the BER performance of the primary and secondary ones. By solving the problem, we derive the closed-form solution of the optimal symbol-invariant and symbol-varying components, which is related to the channel strength ratio of the direct link to the reflecting link. Moreover, theoretical BER performance is analyzed. Finally, simulation results show the superiority of the proposed modulation scheme over its conventional counterpart. △ Less

Submitted 26 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: 16 pages,16 figures

arXiv:2310.11295 [pdf, other]

CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation

Authors: Zhaojie Chu, Kailing Guo, Xiaofen Xing, Yilin Lan, Bolun Cai, Xiangmin Xu

Abstract: Speech-driven 3D facial animation is a challenging cross-modal task that has attracted growing research interest. During speaking activities, the mouth displays strong motions, while the other facial regions typically demonstrate comparatively weak activity levels. Existing approaches often simplify the process by directly mapping single-level speech features to the entire facial animation, which… ▽ More Speech-driven 3D facial animation is a challenging cross-modal task that has attracted growing research interest. During speaking activities, the mouth displays strong motions, while the other facial regions typically demonstrate comparatively weak activity levels. Existing approaches often simplify the process by directly mapping single-level speech features to the entire facial animation, which overlook the differences in facial activity intensity leading to overly smoothed facial movements. In this study, we propose a novel framework, CorrTalk, which effectively establishes the temporal correlation between hierarchical speech features and facial activities of different intensities across distinct regions. A novel facial activity intensity metric is defined to distinguish between strong and weak facial activity, obtained by computing the short-time Fourier transform of facial vertex displacements. Based on the variances in facial activity, we propose a dual-branch decoding framework to synchronously synthesize strong and weak facial activity, which guarantees wider intensity facial animation synthesis. Furthermore, a weighted hierarchical feature encoder is proposed to establish temporal correlation between hierarchical speech features and facial activity at different intensities, which ensures lip-sync and plausible facial expressions. Extensive qualitatively and quantitatively experiments as well as a user study indicate that our CorrTalk outperforms existing state-of-the-art methods. The source code and supplementary video are publicly available at: https://zjchu.github.io/projects/CorrTalk/ △ Less

Submitted 17 October, 2023; originally announced October 2023.

arXiv:2310.07729 [pdf, other]

Energy-Aware Routing Algorithm for Mobile Ground-to-Air Charging

Authors: Bill Cai, Fei Lu, Lifeng Zhou

Abstract: We investigate the problem of energy-constrained planning for a cooperative system of an Unmanned Ground Vehicles (UGV) and an Unmanned Aerial Vehicle (UAV). In scenarios where the UGV serves as a mobile base to ferry the UAV and as a charging station to recharge the UAV, we formulate a novel energy-constrained routing problem. To tackle this problem, we design an energy-aware routing algorithm, a… ▽ More We investigate the problem of energy-constrained planning for a cooperative system of an Unmanned Ground Vehicles (UGV) and an Unmanned Aerial Vehicle (UAV). In scenarios where the UGV serves as a mobile base to ferry the UAV and as a charging station to recharge the UAV, we formulate a novel energy-constrained routing problem. To tackle this problem, we design an energy-aware routing algorithm, aiming to minimize the overall mission duration under the energy limitations of both vehicles. The algorithm first solves a Traveling Salesman Problem (TSP) to generate a guided tour. Then, it employs the Monte-Carlo Tree Search (MCTS) algorithm to refine the tour and generate paths for the two vehicles. We evaluate the performance of our algorithm through extensive simulations and a proof-of-concept experiment. The results show that our algorithm consistently achieves near-optimal mission time and maintains fast running time across a wide range of problem instances. △ Less

Submitted 6 August, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

arXiv:2308.00458 [pdf, other]

Center Contrastive Loss for Metric Learning

Authors: Bolun Cai, Pengfei Xiong, Shangxuan Tian

Abstract: Contrastive learning is a major studied topic in metric learning. However, sampling effective contrastive pairs remains a challenge due to factors such as limited batch size, imbalanced data distribution, and the risk of overfitting. In this paper, we propose a novel metric learning function called Center Contrastive Loss, which maintains a class-wise center bank and compares the category centers… ▽ More Contrastive learning is a major studied topic in metric learning. However, sampling effective contrastive pairs remains a challenge due to factors such as limited batch size, imbalanced data distribution, and the risk of overfitting. In this paper, we propose a novel metric learning function called Center Contrastive Loss, which maintains a class-wise center bank and compares the category centers with the query data points using a contrastive loss. The center bank is updated in real-time to boost model convergence without the need for well-designed sample mining. The category centers are well-optimized classification proxies to re-balance the supervisory signal of each class. Furthermore, the proposed loss combines the advantages of both contrastive and classification methods by reducing intra-class variations and enhancing inter-class differences to improve the discriminative power of embeddings. Our experimental results, as shown in Figure 1, demonstrate that a standard network (ResNet50) trained with our loss achieves state-of-the-art performance and faster convergence. △ Less

Submitted 1 August, 2023; originally announced August 2023.

Comments: 12 pages, 6 figures

arXiv:2307.15223 [pdf, ps, other]

doi 10.1103/PhysRevD.108.103041

Central Speed of Sound, Trace Anomaly and Observables of Neutron Stars from Perturbative Analyses of Scaled TOV Equations

Authors: Bao-Jun Cai, Bao-An Li, Zhen Zhang

Abstract: The central speed of sound (SS) measures the stiffness of the Equation of State (EOS) of superdense neutron star (NS) matter. Its variations with density and radial coordinate in NSs in conventional analyses often suffer from uncertainties of the specific nuclear EOSs used. Using the central SS and NS mass/radius scaling obtained from solving perturbatively the scaled Tolman-Oppenheimer-Volkoff (T… ▽ More The central speed of sound (SS) measures the stiffness of the Equation of State (EOS) of superdense neutron star (NS) matter. Its variations with density and radial coordinate in NSs in conventional analyses often suffer from uncertainties of the specific nuclear EOSs used. Using the central SS and NS mass/radius scaling obtained from solving perturbatively the scaled Tolman-Oppenheimer-Volkoff (TOV) equations, we study the variations of SS, trace anomaly and several closely related properties of NSs in an EOS-model independent manner. We find that the SS increases with the reduced central pressure $\widehat{P}_{\rm{c}}\equiv P_{\rm{c}}/\varepsilon_{\rm{c}}$ (scaled by the central energy density $\varepsilon_{\rm{c}}$), and the conformal bound for SS tends to break down for NSs with masses higher than about 1.9$M_{\odot}$. The ratio $P/\varepsilon$ is upper bounded as $P/\varepsilon\lesssim0.374$ around the centers of stable NSs. We demonstrate that it is an intrinsic property of strong-field gravity and is more relevant than the perturbative QCD bound on it. While a sharp phase transition at high densities characterized by a sudden vanishing of SS in cores of massive NSs are basically excluded, the probability for a continuous crossover signaled by a peaked radial profile of SS is found to be enhanced as $\widehat{P}_{\rm{c}}$ decreases, implying it likely happens near the centers of massive NSs. Moreover, a new and more stringent causality boundary as $R_{\max}/\rm{km}\gtrsim 4.73M_{\rm{NS}}^{\max}/M_{\odot}+1.14$ for NS M-R curve is found to be excellently consistent with observational data on NS masses and radii. Furthermore, new constraints on the ultimate energy density and pressure allowed in NSs before collapsing into black holes are obtained and compared with earlier predictions in the literature. △ Less

Submitted 23 October, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

Comments: Phys. Rev. D (2023) in press

Journal ref: Phys. Rev. D 108, 103041 (2023)

arXiv:2307.00266 [pdf, other]

Hierarchical Pretraining for Biomedical Term Embeddings

Authors: Bryan Cai, Sihang Zeng, Yucong Lin, Zheng Yuan, Doudou Zhou, Lu Tian

Abstract: Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly… ▽ More Electronic health records (EHR) contain narrative notes that provide extensive details on the medical condition and management of patients. Natural language processing (NLP) of clinical notes can use observed frequencies of clinical terms as predictive features for downstream applications such as clinical decision making and patient trajectory prediction. However, due to the vast number of highly similar and related clinical concepts, a more effective modeling strategy is to represent clinical terms as semantic embeddings via representation learning and use the low dimensional embeddings as feature vectors for predictive modeling. To achieve efficient representation, fine-tuning pretrained language models with biomedical knowledge graphs may generate better embeddings for biomedical terms than those from standard language models alone. These embeddings can effectively discriminate synonymous pairs of from those that are unrelated. However, they often fail to capture different degrees of similarity or relatedness for concepts that are hierarchical in nature. To overcome this limitation, we propose HiPrBERT, a novel biomedical term representation model trained on additionally complied data that contains hierarchical structures for various biomedical terms. We modify an existing contrastive loss function to extract information from these hierarchies. Our numerical experiments demonstrate that HiPrBERT effectively learns the pair-wise distance from hierarchical information, resulting in a substantially more informative embeddings for further biomedical applications △ Less

Submitted 1 July, 2023; originally announced July 2023.

arXiv:2307.00260 [pdf, other]

Bootstrapping the Cross-Validation Estimate

Authors: Bryan Cai, Fabio Pellegrini, Menglan Pang, Carl de Moor, Changyu Shen, Vivek Charu, Lu Tian

Abstract: Cross-validation is a widely used technique for evaluating the performance of prediction models. It helps avoid the optimism bias in error estimates, which can be significant for models built using complex statistical learning algorithms. However, since the cross-validation estimate is a random value dependent on observed data, it is essential to accurately quantify the uncertainty associated with… ▽ More Cross-validation is a widely used technique for evaluating the performance of prediction models. It helps avoid the optimism bias in error estimates, which can be significant for models built using complex statistical learning algorithms. However, since the cross-validation estimate is a random value dependent on observed data, it is essential to accurately quantify the uncertainty associated with the estimate. This is especially important when comparing the performance of two models using cross-validation, as one must determine whether differences in error estimates are a result of chance fluctuations. Although various methods have been developed for making inferences on cross-validation estimates, they often have many limitations, such as stringent model assumptions This paper proposes a fast bootstrap method that quickly estimates the standard error of the cross-validation estimate and produces valid confidence intervals for a population parameter measuring average model performance. Our method overcomes the computational challenge inherent in bootstrapping the cross-validation estimate by estimating the variance component within a random effects model. It is just as flexible as the cross-validation procedure itself. To showcase the effectiveness of our approach, we employ comprehensive simulations and real data analysis across three diverse applications. △ Less

Submitted 1 July, 2023; originally announced July 2023.

arXiv:2306.08202 [pdf, ps, other]

doi 10.3847/1538-4357/acdef0

Core States of Neutron Stars from Anatomizing their Scaled Structure Equations

Authors: Bao-Jun Cai, Bao-An Li, Zhen Zhang

Abstract: Given an Equation of State (EOS) for neutron star (NS) matter, there is a unique mass-radius sequence characterized by a maximum mass $M_{\rm{NS}}^{\max}$ at radius $R_{\max}$. We first show analytically that the $M_{\rm{NS}}^{\max}$ and $R_{\max}$ scale linearly with two different combinations of NS central pressure $P_{\rm{c}}$ and energy density $\varepsilon_{\rm{c}}$ by dissecting perturbative… ▽ More Given an Equation of State (EOS) for neutron star (NS) matter, there is a unique mass-radius sequence characterized by a maximum mass $M_{\rm{NS}}^{\max}$ at radius $R_{\max}$. We first show analytically that the $M_{\rm{NS}}^{\max}$ and $R_{\max}$ scale linearly with two different combinations of NS central pressure $P_{\rm{c}}$ and energy density $\varepsilon_{\rm{c}}$ by dissecting perturbatively the dimensionless Tolman-Oppenheimer-Volkoff (TOV) equations governing NS internal variables. The scaling relations are then verified via 87 widely used and rather diverse phenomenological as well as 17 microscopic NS EOSs with/without considering hadron-quark phase transitions and hyperons by solving numerically the original TOV equations. The EOS of densest NS matter allowed before it collapses into a black hole (BH) is then obtained. Using the universal $M_{\rm{NS}}^{\max}$ and $R_{\max}$ scalings and NICER (Neutron Star Interior Composition Explorer) and XMM-Newton mass-radius observational data for PSR J0740+6620, a very narrow constraining band on the NS central EOS is extracted directly from the data for the first time without using any specific input EOS model. △ Less

Submitted 13 June, 2023; originally announced June 2023.

Comments: APJ in press

Journal ref: ApJ 952, 147 (2023)

arXiv:2306.06986 [pdf, other]

Multilevel leapfrogging initialization for quantum approximate optimization algorithm

Authors: Xiao-Hui Ni, Bin-Bin Cai, Hai-Ling Liu, Su-Juan Qin, Fei Gao, Qiao-Yan Wen

Abstract: Recently, Zhou et al. have proposed a novel Interpolation-based (INTERP) strategy to generate the initial parameters for the Parameterized Quantum Circuit (PQC) in Quantum Approximate Optimization Algorithm (QAOA). INTERP produces the guess of the initial parameters at level $i+1$ by applying linear interpolation to the optimized parameters at level $i$, achieving better performance than random in… ▽ More Recently, Zhou et al. have proposed a novel Interpolation-based (INTERP) strategy to generate the initial parameters for the Parameterized Quantum Circuit (PQC) in Quantum Approximate Optimization Algorithm (QAOA). INTERP produces the guess of the initial parameters at level $i+1$ by applying linear interpolation to the optimized parameters at level $i$, achieving better performance than random initialization (RI). Nevertheless, INTERP consumes extensive running costs for deep QAOA because it necessitates optimization at each level of the PQC. To address this problem, a Multilevel Leapfrogging Interpolation (MLI) strategy is proposed. MLI can produce the guess of the initial parameters from level $i+1$ to $i+l$ ($l>1$) at level $i$, omitting the optimization rounds from level $i+1$ to $(i+l-1)$. The final result is that MLI executes optimization at few levels rather than each level, and this operation is referred to as Multilevel Leapfrogging optimization (M-Leap). The performance of MLI is investigated on the Maxcut problem. Compared with INTERP, MLI reduces most optimization rounds. Remarkably, the simulation results demonstrate that MLI can achieve the same quasi-optima as INTERP while consuming only 1/2 of the running costs required by INTERP. In addition, for MLI, where there is no RI except for level $1$, the greedy-MLI strategy is presented. The simulation results suggest that greedy-MLI has better stability (i.e., a higher average approximation ratio) than INTERP and MLI beyond obtaining the same quasi-optima as INTERP. According to the efficiency of finding the quasi-optima, the idea of M-Leap might be extended to other training tasks, especially those requiring numerous optimizations, such as training adaptive quantum circuits. △ Less

Submitted 12 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

arXiv:2305.14895 [pdf, other]

doi 10.1088/1674-4527/acd593

The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. Jin, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan , et al. (101 additional authors not shown)

Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo… ▽ More The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (FoV) of 346 square degrees (18.6 degrees * 18.6 degrees) of the X-ray imager is realized. An optical assembly composed of 36 MPO chips is used to focus incident X-ray photons, and four large-format complementary metal-oxide semiconductor (CMOS) sensors, each of 6 cm * 6 cm, are used as the focal plane detectors. The instrument has an angular resolution of 4 - 8 arcmin (in FWHM) for the central focal spot of the point spread function, and an effective area of 2 - 3 cm2 at 1 keV in essentially all the directions within the field of view. The detection passband is 0.5 - 4 keV in the soft X-rays and the sensitivity is 2 - 3 * 10-11 erg s-1 cm-2 (about 1 mini-Crab) at 1,000 second observation. The total weight of LEIA is 56 kg and the power is 85 W. The satellite, with a design lifetime of 2 years, operates in a Sun-synchronous orbit of 500 km with an orbital period of 95 minutes. LEIA is paving the way for future missions by verifying in flight the technologies of both novel focusing imaging optics and CMOS sensors for X-ray observation, and by optimizing the working setups of the instrumental parameters. In addition, LEIA is able to carry out scientific observations to find new transients and to monitor known sources in the soft X-ray band, albeit limited useful observing time available. △ Less

Submitted 24 May, 2023; originally announced May 2023.

Comments: Accepted by RAA

arXiv:2305.05209 [pdf, ps, other]

Decay of superheavy nuclei based on the random forest algorithm

Authors: Boshuai Cai, Cenxi Yuan

Abstract: How nuclides decay in the superheavy region is key information for investigating new elements beyond oganesson and the island of stability. The Random Forest algorithm is applied to study the competition between different decay modes in the superheavy region, including $α$ decay, $β^-$ decay, $β^+$ decay, electron capture and spontaneous fission. The observed half-lives and dominant decay mode are… ▽ More How nuclides decay in the superheavy region is key information for investigating new elements beyond oganesson and the island of stability. The Random Forest algorithm is applied to study the competition between different decay modes in the superheavy region, including $α$ decay, $β^-$ decay, $β^+$ decay, electron capture and spontaneous fission. The observed half-lives and dominant decay mode are well reproduced. The dominant decay mode of 96.9 % nuclei beyond $^{212}$Po is correctly described. $α$ decay is predicted to be the dominant decay mode for isotopes in new elements $Z = 119 - 122$, except for spontaneous fission in some even-even ones because of the odd-even staggering effect. The predicted half-lives show the existence of a long-lived spontaneous fission island at the southwest of $^{298}$Fl caused by the competition of nuclear deformation and Coulomb repulsion. More understanding of spontaneous fission, especially beyond $^{286}$Fl, is crucial to search for new elements and the island of stability. △ Less

Submitted 9 May, 2023; originally announced May 2023.

arXiv:2304.03292 [pdf, other]

SE-shapelets: Semi-supervised Clustering of Time Series Using Representative Shapelets

Authors: Borui Cai, Guangyan Huang, Shuiqiao Yang, Yong Xiang, Chi-Hung Chi

Abstract: Shapelets that discriminate time series using local features (subsequences) are promising for time series clustering. Existing time series clustering methods may fail to capture representative shapelets because they discover shapelets from a large pool of uninformative subsequences, and thus result in low clustering accuracy. This paper proposes a Semi-supervised Clustering of Time Series Using Re… ▽ More Shapelets that discriminate time series using local features (subsequences) are promising for time series clustering. Existing time series clustering methods may fail to capture representative shapelets because they discover shapelets from a large pool of uninformative subsequences, and thus result in low clustering accuracy. This paper proposes a Semi-supervised Clustering of Time Series Using Representative Shapelets (SE-Shapelets) method, which utilizes a small number of labeled and propagated pseudo-labeled time series to help discover representative shapelets, thereby improving the clustering accuracy. In SE-Shapelets, we propose two techniques to discover representative shapelets for the effective clustering of time series. 1) A \textit{salient subsequence chain} ($SSC$) that can extract salient subsequences (as candidate shapelets) of a labeled/pseudo-labeled time series, which helps remove massive uninformative subsequences from the pool. 2) A \textit{linear discriminant selection} ($LDS$) algorithm to identify shapelets that can capture representative local features of time series in different classes, for convenient clustering. Experiments on UCR time series datasets demonstrate that SE-shapelets discovers representative shapelets and achieves higher clustering accuracy than counterpart semi-supervised time series clustering methods. △ Less

Submitted 14 November, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

arXiv:2303.12816 [pdf, other]

From Wide to Deep: Dimension Lifting Network for Parameter-efficient Knowledge Graph Embedding

Authors: Borui Cai, Yong Xiang, Longxiang Gao, Di Wu, He Zhang, Jiong Jin, Tom Luan

Abstract: Knowledge graph embedding (KGE) that maps entities and relations into vector representations is essential for downstream applications. Conventional KGE methods require high-dimensional representations to learn the complex structure of knowledge graph, but lead to oversized model parameters. Recent advances reduce parameters by low-dimensional entity representations, while developing techniques (e.… ▽ More Knowledge graph embedding (KGE) that maps entities and relations into vector representations is essential for downstream applications. Conventional KGE methods require high-dimensional representations to learn the complex structure of knowledge graph, but lead to oversized model parameters. Recent advances reduce parameters by low-dimensional entity representations, while developing techniques (e.g., knowledge distillation or reinvented representation forms) to compensate for reduced dimension. However, such operations introduce complicated computations and model designs that may not benefit large knowledge graphs. To seek a simple strategy to improve the parameter efficiency of conventional KGE models, we take inspiration from that deeper neural networks require exponentially fewer parameters to achieve expressiveness comparable to wider networks for compositional structures. We view all entity representations as a single-layer embedding network, and conventional KGE methods that adopt high-dimensional entity representations equal widening the embedding network to gain expressiveness. To achieve parameter efficiency, we instead propose a deeper embedding network for entity representations, i.e., a narrow entity embedding layer plus a multi-layer dimension lifting network (LiftNet). Experiments on three public datasets show that by integrating LiftNet, four conventional KGE methods with 16-dimensional representations achieve comparable link prediction accuracy as original models that adopt 512-dimensional representations, saving 68.4% to 96.9% parameters. △ Less

Submitted 1 September, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

arXiv:2303.12677 [pdf, other]

Learning Brain Connectivity in Social Cognition with Dynamic Network Regression

Authors: Maoyu Zhang, Biao Cai, Wenlin Dai, Dehan Kong, Hongyu Zhao, Jingfei Zhang

Abstract: Dynamic networks have been increasingly used to characterize brain connectivity that varies during resting and task states. In such characterizations, a connectivity network is typically measured at each time point for a subject over a common set of nodes representing brain regions, together with rich subject-level information. A common approach to analyzing such data is an edge-based method that… ▽ More Dynamic networks have been increasingly used to characterize brain connectivity that varies during resting and task states. In such characterizations, a connectivity network is typically measured at each time point for a subject over a common set of nodes representing brain regions, together with rich subject-level information. A common approach to analyzing such data is an edge-based method that models the connectivity between each pair of nodes separately. However, such approach may have limited performance when the noise level is high and the number of subjects is limited, as it does not take advantage of the inherent network structure. To better understand if and how the subject-level covariates affect the dynamic brain connectivity, we introduce a semi-parametric dynamic network response regression that relates a dynamic brain connectivity network to a vector of subject-level covariates. A key advantage of our method is to exploit the structure of dynamic imaging coefficients in the form of high-order tensors. We develop an efficient estimation algorithm and evaluate the efficacy of our approach through simulation studies. Finally, we present our results on the analysis of a task-related study on social cognition in the Human Connectome Project, where we identify known sex-specific effects on brain connectivity that cannot be inferred using alternative methods. △ Less

Submitted 22 March, 2023; originally announced March 2023.

arXiv:2303.07048 [pdf, other]

doi 10.1016/j.knosys.2023.111079

Hybrid Variational Autoencoder for Time Series Forecasting

Authors: Borui Cai, Shuiqiao Yang, Longxiang Gao, Yong Xiang

Abstract: Variational autoencoders (VAE) are powerful generative models that learn the latent representations of input data as random variables. Recent studies show that VAE can flexibly learn the complex temporal dynamics of time series and achieve more promising forecasting results than deterministic models. However, a major limitation of existing works is that they fail to jointly learn the local pattern… ▽ More Variational autoencoders (VAE) are powerful generative models that learn the latent representations of input data as random variables. Recent studies show that VAE can flexibly learn the complex temporal dynamics of time series and achieve more promising forecasting results than deterministic models. However, a major limitation of existing works is that they fail to jointly learn the local patterns (e.g., seasonality and trend) and temporal dynamics of time series for forecasting. Accordingly, we propose a novel hybrid variational autoencoder (HyVAE) to integrate the learning of local patterns and temporal dynamics by variational inference for time series forecasting. Experimental results on four real-world datasets show that the proposed HyVAE achieves better forecasting results than various counterpart methods, as well as two HyVAE variants that only learn the local patterns or temporal dynamics of time series, respectively. △ Less

Submitted 13 March, 2023; originally announced March 2023.

Journal ref: Knowledge-Based Systems. 281 (2023) 111079

arXiv:2303.06687 [pdf, other]

Quantifying the Effects of Magnetic Field Line Curvature Scattering on Radiation Belt and Ring Current Particles

Authors: Bin Cai, Hanlin Li, Yifan Wu, Xin Tao

Abstract: Magnetic field line curvature (FLC) scattering is a collisionless scattering mechanism that arises when a particle's gyro-radius is comparable to the magnetic field line's curvature radius, resulting in the breaking of the conservation of the first adiabatic invariant. Studies in recent years have explored the implications of FLC scattering on the precipitation of both ring current ions and radiat… ▽ More Magnetic field line curvature (FLC) scattering is a collisionless scattering mechanism that arises when a particle's gyro-radius is comparable to the magnetic field line's curvature radius, resulting in the breaking of the conservation of the first adiabatic invariant. Studies in recent years have explored the implications of FLC scattering on the precipitation of both ring current ions and radiation belt electrons. In this work, we first compare two previous FLC scattering coefficients using test particle calculations. Then, we systematically calculate diffusion coefficients from FLC scattering in radial and MLT directions for particles of various energy levels, as well as its sensitivity to the $Kp$ index. We find that the timescale of FLC scattering is sufficient to account for the sudden loss of MeV electrons near the geostationary orbit during disturbed times. Additionally, the decay time of ring current protons is on the order of hours to minutes, providing an explanation for the ring current decay throughout the recovery phase of magnetic storms. Lastly, we compare the effects of wave-particle resonant scattering and FLC scattering in the vicinity of the midnight equator. Our findings suggest that the impacts of FLC scattering on MeV electrons or hundreds keV protons with smaller pitch angle is comparable to, or even more significant than, the effects of whistler mode or EMIC wave resonant scattering. Our quantitative results should be useful to evaluate the importance of the effects of FLC scattering while modeling the dynamics of radiation belt and ring current. △ Less

Submitted 15 December, 2023; v1 submitted 12 March, 2023; originally announced March 2023.

arXiv:2303.02375 [pdf, other]

NeuDA: Neural Deformable Anchor for High-Fidelity Implicit Surface Reconstruction

Authors: Bowen Cai, Jinchi Huang, Rongfei Jia, Chengfei Lv, Huan Fu

Abstract: This paper studies implicit surface reconstruction leveraging differentiable ray casting. Previous works such as IDR and NeuS overlook the spatial context in 3D space when predicting and rendering the surface, thereby may fail to capture sharp local topologies such as small holes and structures. To mitigate the limitation, we propose a flexible neural implicit representation leveraging hierarchica… ▽ More This paper studies implicit surface reconstruction leveraging differentiable ray casting. Previous works such as IDR and NeuS overlook the spatial context in 3D space when predicting and rendering the surface, thereby may fail to capture sharp local topologies such as small holes and structures. To mitigate the limitation, we propose a flexible neural implicit representation leveraging hierarchical voxel grids, namely Neural Deformable Anchor (NeuDA), for high-fidelity surface reconstruction. NeuDA maintains the hierarchical anchor grids where each vertex stores a 3D position (or anchor) instead of the direct embedding (or feature). We optimize the anchor grids such that different local geometry structures can be adaptively encoded. Besides, we dig into the frequency encoding strategies and introduce a simple hierarchical positional encoding method for the hierarchical anchor structure to flexibly exploit the properties of high-frequency and low-frequency geometry and appearance. Experiments on both the DTU and BlendedMVS datasets demonstrate that NeuDA can produce promising mesh surfaces. △ Less

Submitted 26 March, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

Comments: Accepted to CVPR 2023, project page: https://3d-front-future.github.io/neuda

arXiv:2303.00501 [pdf, other]

OmniForce: On Human-Centered, Large Model Empowered and Cloud-Edge Collaborative AutoML System

Authors: Chao Xue, Wei Liu, Shuai Xie, Zhenfang Wang, Jiaxing Li, Xuyang Peng, Liang Ding, Shanshan Zhao, Qiong Cao, Yibo Yang, Fengxiang He, Bohua Cai, Rongcheng Bian, Yiyan Zhao, Heliang Zheng, Xiangyang Liu, Dongkai Liu, Daqing Liu, Li Shen, Chang Li, Shijin Zhang, Yukang Zhang, Guanpu Chen, Shixiang Chen, Yibing Zhan , et al. (3 additional authors not shown)

Abstract: Automated machine learning (AutoML) seeks to build ML models with minimal human effort. While considerable research has been conducted in the area of AutoML in general, aiming to take humans out of the loop when building artificial intelligence (AI) applications, scant literature has focused on how AutoML works well in open-environment scenarios such as the process of training and updating large m… ▽ More Automated machine learning (AutoML) seeks to build ML models with minimal human effort. While considerable research has been conducted in the area of AutoML in general, aiming to take humans out of the loop when building artificial intelligence (AI) applications, scant literature has focused on how AutoML works well in open-environment scenarios such as the process of training and updating large models, industrial supply chains or the industrial metaverse, where people often face open-loop problems during the search process: they must continuously collect data, update data and models, satisfy the requirements of the development and deployment environment, support massive devices, modify evaluation metrics, etc. Addressing the open-environment issue with pure data-driven approaches requires considerable data, computing resources, and effort from dedicated data engineers, making current AutoML systems and platforms inefficient and computationally intractable. Human-computer interaction is a practical and feasible way to tackle the problem of open-environment AI. In this paper, we introduce OmniForce, a human-centered AutoML (HAML) system that yields both human-assisted ML and ML-assisted human techniques, to put an AutoML system into practice and build adaptive AI in open-environment scenarios. Specifically, we present OmniForce in terms of ML version management; pipeline-driven development and deployment collaborations; a flexible search strategy framework; and widely provisioned and crowdsourced application algorithms, including large models. Furthermore, the (large) models constructed by OmniForce can be automatically turned into remote services in a few minutes; this process is dubbed model as a service (MaaS). Experimental results obtained in multiple search spaces and real-world use cases demonstrate the efficacy and efficiency of OmniForce. △ Less

Submitted 8 July, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

arXiv:2302.14639 [pdf, other]

doi 10.1140/epjc/s10052-023-11678-6

Precision Measurement of the Specific Activity of $^{39}$Ar in Atmospheric Argon with the DEAP-3600 Detector

Authors: P. Adhikari, R. Ajaj, M. Alpízar-Venegas, P. -A. Amaudruz, J. Anstey, G. R. Araujo, D. J. Auty, M. Baldwin, M. Batygov, B. Beltran, H. Benmansour, C. E. Bina, J. Bonatt, W. Bonivento, M. G. Boulay, B. Broerman, J. F. Bueno, P. M. Burghardt, A. Butcher, M. Cadeddu, B. Cai, M. Cárdenas-Montes, S. Cavuoti, M. Chen, Y. Chen , et al. (125 additional authors not shown)

Abstract: The specific activity of the beta decay of $^{39}$Ar in atmospheric argon is measured using the DEAP-3600 detector. DEAP-3600, located 2 km underground at SNOLAB, uses a total of (3269 $\pm$ 24) kg of liquid argon distilled from the atmosphere to search for dark matter. This detector with very low background uses pulseshape discrimination to differentiate between nuclear recoils and electron recoi… ▽ More The specific activity of the beta decay of $^{39}$Ar in atmospheric argon is measured using the DEAP-3600 detector. DEAP-3600, located 2 km underground at SNOLAB, uses a total of (3269 $\pm$ 24) kg of liquid argon distilled from the atmosphere to search for dark matter. This detector with very low background uses pulseshape discrimination to differentiate between nuclear recoils and electron recoils and is well-suited to measure the decay of $^{39}$Ar. With 167 live-days of data, the measured specific activity at the time of atmospheric extraction is [0.964 $\pm$ 0.001 (stat) $\pm$ 0.024 (sys)] Bq/kg$_{\rm atmAr}$ which is consistent with results from other experiments. A cross-check analysis using different event selection criteria provides a consistent result. △ Less

Submitted 10 October, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

Journal ref: Eur. Phys. J. C 83, 642 (2023)

arXiv:2302.02553 [pdf, ps, other]

A Correction-Based Dynamic Enhancement Framework towards Underwater Detection

Authors: Yanling Qiu, Qianxue Feng, Boqin Cai, Hongan Wei, Weiling Chen

Abstract: To assist underwater object detection for better performance, image enhancement technology is often used as a pre-processing step. However, most of the existing enhancement methods tend to pursue the visual quality of an image, instead of providing effective help for detection tasks. In fact, image enhancement algorithms should be optimized with the goal of utility improvement. In this paper, to a… ▽ More To assist underwater object detection for better performance, image enhancement technology is often used as a pre-processing step. However, most of the existing enhancement methods tend to pursue the visual quality of an image, instead of providing effective help for detection tasks. In fact, image enhancement algorithms should be optimized with the goal of utility improvement. In this paper, to adapt to the underwater detection tasks, we proposed a lightweight dynamic enhancement algorithm using a contribution dictionary to guide low-level corrections. Dynamic solutions are designed to capture differences in detection preferences. In addition, it can also balance the inconsistency between the contribution of correction operations and their time complexity. Experimental results in real underwater object detection tasks show the superiority of our proposed method in both generalization and real-time performance. △ Less

Submitted 5 February, 2023; originally announced February 2023.

arXiv:2302.02305 [pdf]

doi 10.1109/LED.2023.3285525

Probabilistic-Bits based on Ferroelectric Field-Effect Transistors for Stochastic Computing

Authors: Sheng Luo, Yihan He, Baofang Cai, Xiao Gong, Gengchiau Liang

Abstract: A probabilistic-bit (p-bit) is the fundamental building block in the circuit network of a stochastic computing, and it could produce a continuous random bit-stream with tunable probability. Utilizing the stochasticity in few-domain ferroelectric material(FE), we propose for the first time, the p-bits based on ferroelectric FET. The stochasticity of the FE p-bits stems from the thermal noise-induce… ▽ More A probabilistic-bit (p-bit) is the fundamental building block in the circuit network of a stochastic computing, and it could produce a continuous random bit-stream with tunable probability. Utilizing the stochasticity in few-domain ferroelectric material(FE), we propose for the first time, the p-bits based on ferroelectric FET. The stochasticity of the FE p-bits stems from the thermal noise-induced lattice vibration, which renders dipole fluctuations and is tunable by an external electric field. The impact of several key FE parameters on p-bits' stochasticity is evaluated, where the domain properties are revealed to play crucial roles. Furthermore, the integer factorization based on FE p-bits circuit network is performed to verify its functionality, and the accuracy is found to depend on FE p-bits' stochasticity. The proposed FE p-bits possess the advantages of both extremely low hardware coast and the compatibility with CMOS-technology, rendering it a promising candidate for stochastic computing applications. △ Less

Submitted 5 February, 2023; originally announced February 2023.

Comments: 23 pages, 7 figures and supplementary materials with 3 notes

Journal ref: IEEE Electron Device Letters

arXiv:2302.00258 [pdf, ps, other]

doi 10.1016/j.physletb.2023.137740

Extended R-matrix description of two-proton radioactivity

Authors: Zhaozhan Zhang, Cenxi Yuan, Chong Qi, Boshuai Cai, Xinxing Xu

Abstract: Two-proton ($2p$) radioactivity provides fundamental knowledge on the three-body decay mechanism and the residual nuclear interaction. In this work, we propose decay width formulae in the extended R-matrix framework for different decay mechanisms, including sequential $2p$ decay, diproton decay, tri-body decay, and sequential two-diproton decay. The diproton and tri-body formulae, combined with in… ▽ More Two-proton ($2p$) radioactivity provides fundamental knowledge on the three-body decay mechanism and the residual nuclear interaction. In this work, we propose decay width formulae in the extended R-matrix framework for different decay mechanisms, including sequential $2p$ decay, diproton decay, tri-body decay, and sequential two-diproton decay. The diproton and tri-body formulae, combined with information on the two-nucleon transfer amplitude and Wigner single-particle reduced width, can reproduce well experimental $2p$ radioactivity half-lives. For the case of $^{67}$Kr, theoretical predictions for direct $2p$ decay give much larger half-lives than the recent measurement from RIKEN. A combination of direct and sequential $2p$ emission is analyzed by considering a small negative one-proton separation energy and a possible enhanced contribution from the $p$-wave component. The present method predicts that $^{71}$Sr and $^{74}$Zr may be the most promising candidates for future study on $2p$ radioactivity. Our model gives an upper limit of 55(4) keV for the decay width of $4p$ emission in recently found four-proton resonant nuclide, $^{18}$Mg, which agrees with the observed width of 115(100) keV. △ Less

Submitted 2 February, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

Comments: version accepted for publication in Physics Letters B

arXiv:2301.03836 [pdf, other]

doi 10.1103/PhysRevLett.130.172501

Multiple Mechanisms in Proton-Induced Nucleon Removal at $\sim$100 MeV/Nucleon

Authors: T. Pohl, Y. L. Sun, A. Obertelli, J. Lee, M. Gomez-Ramos, K. Ogata, K. Yoshida, B. S. Cai, C. X. Yuan, B. A. Brown, H. Baba, D. Beaumel, A. Corsi, J. Gao, J. Gibelin, A. Gillibert, K. I. Hahn, T. Isobe, D. Kim, Y. Kondo, T. Kobayashi, Y. Kubota, P. Li, P. Liang, H. N. Liu , et al. (26 additional authors not shown)

Abstract: We report on the first proton-induced single proton- and neutron-removal reactions from the neutron-deficient $^{14}$O nucleus with large Fermi-surface asymmetry $S_n-S_p$ = 18.6 MeV at $\sim$100 MeV/nucleon, a widely used energy regime for rare-isotope studies. The measured inclusive cross sections and parallel momentum distributions of the $^{13}$N and $^{13}$O residues are compared to the state… ▽ More We report on the first proton-induced single proton- and neutron-removal reactions from the neutron-deficient $^{14}$O nucleus with large Fermi-surface asymmetry $S_n-S_p$ = 18.6 MeV at $\sim$100 MeV/nucleon, a widely used energy regime for rare-isotope studies. The measured inclusive cross sections and parallel momentum distributions of the $^{13}$N and $^{13}$O residues are compared to the state-of-the-art reaction models, with nuclear structure inputs from many-body shell-model calculations. Our results provide the first quantitative contributions of multiple reaction mechanisms including the quasifree knockout, inelastic scattering and nucleon transfer processes. It is shown that the inelastic scattering and nucleon transfer, usually neglected at such energy regime, contribute about 50% and 30% to the loosely bound proton and deeply bound neutron removal, respectively. These multiple reaction mechanisms should be considered in analyses of inclusive one-nucleon removal cross sections measured at intermediate energies for quantitative investigation of single-particle strengths and correlations in atomic nuclei. △ Less

Submitted 27 April, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

Journal ref: Physical Review Letters 130, 172501 (2023)

arXiv:2211.14823 [pdf, other]

3D Scene Creation and Rendering via Rough Meshes: A Lighting Transfer Avenue

Authors: Bowen Cai, Yujie Li, Yuqin Liang, Rongfei Jia, Binqiang Zhao, Mingming Gong, Huan Fu

Abstract: This paper studies how to flexibly integrate reconstructed 3D models into practical 3D modeling pipelines such as 3D scene creation and rendering. Due to the technical difficulty, one can only obtain rough 3D models (R3DMs) for most real objects using existing 3D reconstruction techniques. As a result, physically-based rendering (PBR) would render low-quality images or videos for scenes that are c… ▽ More This paper studies how to flexibly integrate reconstructed 3D models into practical 3D modeling pipelines such as 3D scene creation and rendering. Due to the technical difficulty, one can only obtain rough 3D models (R3DMs) for most real objects using existing 3D reconstruction techniques. As a result, physically-based rendering (PBR) would render low-quality images or videos for scenes that are constructed by R3DMs. One promising solution would be representing real-world objects as Neural Fields such as NeRFs, which are able to generate photo-realistic renderings of an object under desired viewpoints. However, a drawback is that the synthesized views through Neural Fields Rendering (NFR) cannot reflect the simulated lighting details on R3DMs in PBR pipelines, especially when object interactions in the 3D scene creation cause local shadows. To solve this dilemma, we propose a lighting transfer network (LighTNet) to bridge NFR and PBR, such that they can benefit from each other. LighTNet reasons about a simplified image composition model, remedies the uneven surface issue caused by R3DMs, and is empowered by several perceptual-motivated constraints and a new Lab angle loss which enhances the contrast between lighting strength and colors. Comparisons demonstrate that LighTNet is superior in synthesizing impressive lighting, and is promising in pushing NFR further in practical 3D modeling workflows. △ Less

Submitted 19 March, 2024; v1 submitted 27 November, 2022; originally announced November 2022.

Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), project page: http://3d-front-future.github.io/LighTNet

arXiv:2211.12247 [pdf]

doi 10.1103/PhysRevB.106.184419

Spatially Nonuniform Oscillations in Ferrimagnets Based on an Atomistic Model

Authors: Xue Zhang, Baofang Cai, Jie Ren, Zhengping Yuan, Zhengde Xu, Yumeng Yang, Gengchiau Liang, Zhifeng Zhu

Abstract: The ferrimagnets, such as GdxFeCo(1-x), can produce ultrafast magnetic switching and oscillation due to the strong exchange field. The two-sublattices macrospin model has been widely used to explain the experimental results. However, it fails in describing the spatial nonuniform magnetic dynamics which gives rises to many important phenomenons such as the domain walls and skyrmions. Here we develo… ▽ More The ferrimagnets, such as GdxFeCo(1-x), can produce ultrafast magnetic switching and oscillation due to the strong exchange field. The two-sublattices macrospin model has been widely used to explain the experimental results. However, it fails in describing the spatial nonuniform magnetic dynamics which gives rises to many important phenomenons such as the domain walls and skyrmions. Here we develop the two-dimensional atomistic model and provide a torque analysis method to study the ferrimagnetic oscillation. Under the spin-transfer torque, the magnetization oscillates in the exchange mode or the flipped exchange mode. When the Gd composition is increased, the exchange mode firstly disappears, and then appears again as the magnetization compensation point is reached. We show that these results can only be explained by analyzing the spatial distribution of magnetization and effective fields. In particular, when the sample is small, a spatial nonuniform oscillation is also observed in the square film. Our work reveals the importance of spatial magnetic distributions in understanding the ferrimagnetic dynamics. The method developed in this paper provides an important tool to gain a deeper understanding of ferrimagnets and antiferromagnets. The observed ultrafast dynamics can also stimulate the development of THz oscillators. △ Less

Submitted 22 November, 2022; originally announced November 2022.

Comments: 17 pages, 4 figures

Journal ref: Phys. Rev. B 106,184419(2022)

arXiv:2211.06608 [pdf, other]

doi 10.3847/1538-3881/aca098

Li-rich Giants Identified from LAMOST DR8 Low-Resolution Survey

Authors: BeiChen Cai, XiaoMing Kong, JianRong Shi, Qi Gao, Yude Bu, Zhenping Yi

Abstract: A small fraction of giants possess photospheric lithium(Li) abundance higher than the value predicted by the standard stellar evolution models, and the detailed mechanisms of Li enhancement are complicated and lack a definite conclusion. In order to better understand the Li enhancement behaviors, a large and homogeneous Li-rich giants sample is needed. In this study, we designed a modified convolu… ▽ More A small fraction of giants possess photospheric lithium(Li) abundance higher than the value predicted by the standard stellar evolution models, and the detailed mechanisms of Li enhancement are complicated and lack a definite conclusion. In order to better understand the Li enhancement behaviors, a large and homogeneous Li-rich giants sample is needed. In this study, we designed a modified convolutional neural network model called Coord-DenseNet to determine the A(Li) of Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) low-resolution survey (LRS) giant spectra. The precision is good on the test set: MAE=0.15 dex, and σ=0.21 dex. We used this model to predict the Li abundance of more than 900,000 LAMOST DR8 LRS giant spectra and identified 7,768 Li-rich giants with Li abundances ranging from 2.0 to 5.4 dex, accounting for about 1.02% of all giants. We compared the Li abundance estimated by our work with those derived from high-resolution spectra. We found that the consistency was good if the overall deviation of 0.27 dex between them was not considered. The analysis shows that the difference is mainly due to the high A(Li) from the medium-resolution spectra in the training set. This sample of Li-rich giants dramatically expands the existing sample size of Li-rich giants and provides us with more samples to further study the formation and evolution of Li-rich giants. △ Less

Submitted 12 November, 2022; originally announced November 2022.

Comments: 14 pages,13 figures

arXiv:2210.16541 [pdf, other]

Entity-centered Cross-document Relation Extraction

Authors: Fengqi Wang, Fei Li, Hao Fei, Jingye Li, Shengqiong Wu, Fangfang Su, Wenxuan Shi, Donghong Ji, Bo Cai

Abstract: Relation Extraction (RE) is a fundamental task of information extraction, which has attracted a large amount of research attention. Previous studies focus on extracting the relations within a sentence or document, while currently researchers begin to explore cross-document RE. However, current cross-document RE methods directly utilize text snippets surrounding target entities in multiple given do… ▽ More Relation Extraction (RE) is a fundamental task of information extraction, which has attracted a large amount of research attention. Previous studies focus on extracting the relations within a sentence or document, while currently researchers begin to explore cross-document RE. However, current cross-document RE methods directly utilize text snippets surrounding target entities in multiple given documents, which brings considerable noisy and non-relevant sentences. Moreover, they utilize all the text paths in a document bag in a coarse-grained way, without considering the connections between these text paths.In this paper, we aim to address both of these shortages and push the state-of-the-art for cross-document RE. First, we focus on input construction for our RE model and propose an entity-based document-context filter to retain useful information in the given documents by using the bridge entities in the text paths. Second, we propose a cross-document RE model based on cross-path entity relation attention, which allow the entity relations across text paths to interact with each other. We compare our cross-document RE method with the state-of-the-art methods in the dataset CodRED. Our method outperforms them by at least 10% in F1, thus demonstrating its effectiveness. △ Less

Submitted 29 October, 2022; originally announced October 2022.

Comments: This paper was accepted by EMNLP 2022 conference

arXiv:2210.10924 [pdf, ps, other]

Nuclear Equation of State and Single-nucleon Potential from Gogny-like Energy Density Functionals Encapsulating Effects of Nucleon-nucleon Short-range Correlations

Authors: Bao-Jun Cai, Bao-An Li

Abstract: Nucleon-nucleon short-range correlations (SRCs) induce a high momentum tail (HMT) in the single-nucleon momentum distribution function $n_{ǩ}^J(ρ,δ)$ in cold neutron-rich matter. While there are clear experimental evidences that the SRC/HMT effects are different for neutrons and protons and their strengths depend strongly on the isospin asymmetry of finite nuclei mostly based on electron-nucleus s… ▽ More Nucleon-nucleon short-range correlations (SRCs) induce a high momentum tail (HMT) in the single-nucleon momentum distribution function $n_{ǩ}^J(ρ,δ)$ in cold neutron-rich matter. While there are clear experimental evidences that the SRC/HMT effects are different for neutrons and protons and their strengths depend strongly on the isospin asymmetry of finite nuclei mostly based on electron-nucleus scattering experiments, much less is known experimentally about the SRC/HMT effects in the dense neutron-rich matter. To facilitate further explorations of SRC/HMT effects in dense neutron-rich matter especially with heavy-ion reactions involving high-energy radioactive beams as well as multimessenger observations of neutron stars and their mergers, by incorporating the SRC-induced HMT in $n_{ǩ}^J(ρ,δ)$ into a Gogny-like energy density functional we study SRC/HMT effects on the equation of state (EOS) especially its symmetry energy term and single-nucleon potential in the dense asymmetric nucleonic matter (ANM). Using a parametrization as a surrogate for the momentum-dependent kernel in the Gogny-like energy density functional (EDF) we derive analytical expressions for all components of the ANM EOS and their characteristics (e.g., magnitude, slope and curvature as well as nucleon effective mass) at saturation density $ρ_0$ as well as the momentum-dependent single-nucleon optical potential in neutron-rich matter using parameters characterizing nuclear interactions as well as the size, shape and isospin dependence of the HMT at $ρ_0$. Some consequences of the SRC/HMT effects on properties of neutron stars are also studied. △ Less

Submitted 19 October, 2022; originally announced October 2022.

Comments: 35 pages including 16 figures

arXiv:2210.00085 [pdf]

Development of a Full Monte Carlo Therapeutic Dose Calculation Toolkit for Halcyon Using Geant4

Authors: Ruirui Liu, Zhen Ji, Xiandong Zhao, Tianyu Zhao, Abhishek Sethi, Daren Sawkey, Bin Cai

Abstract: Purpose: To develop a Monte Carlo (MC) therapeutic dose calculation toolkit of a recently released ring gantry linac in Geant4 (Version 10.7) for secondary dose validation of radiotherapy plan. Methods: For the Halcyon (Varian Medical Systems), the DSMLC was modeled and radiation transport in DSMLC and patient phantom was simulated using Geant4. Radiation source was sampled from a phase space file… ▽ More Purpose: To develop a Monte Carlo (MC) therapeutic dose calculation toolkit of a recently released ring gantry linac in Geant4 (Version 10.7) for secondary dose validation of radiotherapy plan. Methods: For the Halcyon (Varian Medical Systems), the DSMLC was modeled and radiation transport in DSMLC and patient phantom was simulated using Geant4. Radiation source was sampled from a phase space file for linac head above the DSMLC. The phase space file was obtained using a cloud-based Monte Carlo (MC) simulator, VirtuaLinac (VL) provide by Varian. Dosimetric profiles for different square field widths (2x2, 4x4, 6x6, 8x8, 10x10, 20x20, and 28x28 cm2), i.e., percent depth dose (PDD) curves and lateral profiles are simulated and compared against the experimental profiles. IMRT (intensity modulated radiation therapy) plans in two anatomical sites (prostate and brain) were also calculated using the developed toolkit and compared against the TPS calculated dose (Acuros, Eclipse 15.6). 3D dose difference and 3D gamma analysis were used to evaluate the simulation accuracy compared against the TPS calculated dose. Results: The simulated lateral dose profiles and PDD curves in water phantom match well with the measured ones for all the simulated field sizes with relative difference +-2%. For the prostate and brain IMRT plans, the simulated dose showed a good agreement with the TPS calculated dose. The 3D gamma pass rate (3%/3mm) are 98.08% and 95.4% for the two prostate and brain plans, respectively. Conclusion: The developed full MC dose calculation toolkit for Halcyon performs well in dose calculations in water phantom and patient CT phantom. The developed toolkit shows promising possibility for future secondary dose calculation for IMRT and serve as clinical quality assurance (QA) tool for Halcyon. △ Less

Submitted 30 September, 2022; originally announced October 2022.

arXiv:2209.04038 [pdf, other]

Statistical Inference of Cell-type Proportions Estimated from Bulk Expression Data

Authors: Biao Cai, Jingfei Zhang, Hongyu Li, Chang Su, Hongyu Zhao

Abstract: There is a growing interest in cell-type-specific analysis from bulk samples with a mixture of different cell types. A critical first step in such analyses is the accurate estimation of cell-type proportions in a bulk sample. Although many methods have been proposed recently, quantifying the uncertainties associated with the estimated cell-type proportions has not been well studied. Lack of consid… ▽ More There is a growing interest in cell-type-specific analysis from bulk samples with a mixture of different cell types. A critical first step in such analyses is the accurate estimation of cell-type proportions in a bulk sample. Although many methods have been proposed recently, quantifying the uncertainties associated with the estimated cell-type proportions has not been well studied. Lack of consideration of these uncertainties can lead to missed or false findings in downstream analyses. In this article, we introduce a flexible statistical deconvolution framework that allows a general and subject-specific covariance of bulk gene expressions. Under this framework, we propose a decorrelated constrained least squares method called DECALS that estimates cell-type proportions as well as the sampling distribution of the estimates. Simulation studies demonstrate that DECALS can accurately quantify the uncertainties in the estimated proportions whereas other methods fail. Applying DECALS to analyze bulk gene expression data of post mortem brain samples from the ROSMAP and GTEx projects, we show that taking into account the uncertainties in the estimated cell-type proportions can lead to more accurate identifications of cell-type-specific differentially expressed genes and transcripts between different subject groups, such as between Alzheimer's disease patients and controls and between males and females. △ Less

Submitted 8 September, 2022; originally announced September 2022.

arXiv:2208.10438 [pdf, ps, other]

doi 10.1103/PhysRevC.106.044319

High-order isospin-dependent surface tension contribution to the fourth-order symmetry energy of finite nuclei

Authors: Bao-Jun Cai, Rui Wang, Zhen Zhang, Lie-Wen Chen

Abstract: The relation between the fourth-order symmetry energy $E_{\rm{sym,4}}(ρ_0)$ of nuclear matter at saturation density $ρ_0$ and its counterpart $a_{\rm{sym,4}}(A)$ of finite nuclei in a semiempirical nuclear mass formula is revisited by considering the high-order isospin-dependent surface tension contribution to the latter. We derive the full expression of $a_{\rm{sym,4}}(A)$, which includes explici… ▽ More The relation between the fourth-order symmetry energy $E_{\rm{sym,4}}(ρ_0)$ of nuclear matter at saturation density $ρ_0$ and its counterpart $a_{\rm{sym,4}}(A)$ of finite nuclei in a semiempirical nuclear mass formula is revisited by considering the high-order isospin-dependent surface tension contribution to the latter. We derive the full expression of $a_{\rm{sym,4}}(A)$, which includes explicitly the high-order isospin-dependent surface tension effects, and find that the value of $E_{\rm{sym,4}}(ρ_0)$ cannot be extracted from the measured $a_{\rm{sym,4}}(A)$ before the high-order surface tension is well constrained. Our results imply that a large $a_{\rm{sym,4}}(A)$ value of several MeVs obtained from analyzing nuclear masses can nicely agree with the empirical constraint of $E_{\rm{sym,4}}(ρ_0)\lesssim 2$ MeV from mean-field models and does not necessarily lead to a large $E_{\rm{sym,4}}(ρ_0)$ value of $\approx 20$ MeV obtained previously without considering the high-order surface tension. Furthermore, we also give the expression for the sixth-order symmetry energy $a_{\rm{sym,6}}(A)$ of finite nuclei, which involves more nuclear matter bulk parameters and the higher-order isospin-dependent surface tension. △ Less

Submitted 18 October, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

Comments: 7 pages, 2 figures. Minor corrections. Published version in PRC

Journal ref: Phys. Rev. C 106, 044319 (2022)

arXiv:2208.03922 [pdf, other]

CSSAM:Code Search via Attention Matching of Code Semantics and Structures

Authors: Yi Hu, Bo Cai, Yaoxiang Yu

Abstract: Despite the continuous efforts in improving both the effectiveness and efficiency of code search, two issues remained unsolved. First, programming languages have inherent strong structural linkages, and feature mining of code as text form would omit the structural information contained inside it. Second, there is a potential semantic relationship between code and query, it is challenging to align… ▽ More Despite the continuous efforts in improving both the effectiveness and efficiency of code search, two issues remained unsolved. First, programming languages have inherent strong structural linkages, and feature mining of code as text form would omit the structural information contained inside it. Second, there is a potential semantic relationship between code and query, it is challenging to align code and text across sequences so that vectors are spatially consistent during similarity matching. To tackle both issues, in this paper, a code search model named CSSAM (Code Semantics and Structures Attention Matching) is proposed. By introducing semantic and structural matching mechanisms, CSSAM effectively extracts and fuses multidimensional code features. Specifically, the cross and residual layer was developed to facilitate high-latitude spatial alignment of code and query at the token level. By leveraging the residual interaction, a matching module is designed to preserve more code semantics and descriptive features, that enhances the adhesion between the code and its corresponding query text. Besides, to improve the model's comprehension of the code's inherent structure, a code representation structure named CSRG (Code Semantic Representation Graph) is proposed for jointly representing abstract syntax tree nodes and the data flow of the codes. According to the experimental results on two publicly available datasets containing 540k and 330k code segments, CSSAM significantly outperforms the baselines in terms of achieving the highest SR@1/5/10, MRR, and NDCG@50 on both datasets respectively. Moreover, the ablation study is conducted to quantitatively measure the impact of each key component of CSSAM on the efficiency and effectiveness of code search, which offers the insights into the improvement of advanced code search solutions. △ Less

Submitted 8 August, 2022; originally announced August 2022.

Comments: arXiv admin note: text overlap with arXiv:1909.13516 by other authors

arXiv:2206.15314 [pdf, ps, other]

doi 10.1016/j.aop.2022.169062

Equation of State of Neutron-Rich Matter in $d$-Dimensions

Authors: Bao-Jun Cai, Bao-An Li

Abstract: Nuclear systems under constraints, with high degrees of symmetries and/or collectivities may be considered as moving effectively in spaces with reduced spatial dimensions. We first derive analytical expressions for the nucleon specific energy $E_0(ρ)$, pressure $P_0(ρ)$, incompressibility coefficient $K_0(ρ)$ and skewness coefficient $J_0(ρ)$ of symmetric nucleonic matter (SNM), the quadratic symm… ▽ More Nuclear systems under constraints, with high degrees of symmetries and/or collectivities may be considered as moving effectively in spaces with reduced spatial dimensions. We first derive analytical expressions for the nucleon specific energy $E_0(ρ)$, pressure $P_0(ρ)$, incompressibility coefficient $K_0(ρ)$ and skewness coefficient $J_0(ρ)$ of symmetric nucleonic matter (SNM), the quadratic symmetry energy $E_{\rm{sym}}(ρ)$, its slope parameter $L(ρ)$ and curvature coefficient $K_{\rm{sym}}(ρ)$ as well as the fourth-order symmetry energy $E_{\rm{sym,4}}(ρ)$ of neutron-rich matter in general $d$ spatial dimensions (abbreviated as "$d$D") in terms of the isoscalar and isovector parts of the isospin-dependent single-nucleon potential according to the generalized Hugenholtz-Van Hove (HVH) theorem. The equation of state (EOS) of nuclear matter in $d$D can be linked to that in the conventional 3-dimensional (3D) space by the $ε$-expansion which is a perturbative approach successfully used previously in treating second-order phase transitions and related critical phenomena and more recently in studying the EOS of cold atoms. The $ε$-expansion of nuclear EOS in $d$D based on a reference dimension $d_{\rm{f}}=d-ε$ is shown to be effective with $-1\lesssimε\lesssim1$ starting from $1\lesssim d_{\rm{f}}\lesssim3$ in comparison with the exact expressions derived using the HVH theorem. Moreover, the EOS of SNM (with/without considering its potential part) is found to be reduced (enhanced) in lower (higher) dimensions, indicating in particular that the many-nucleon system tends to be deeper bounded but saturate at higher densities in spaces with lower dimensions. The links between the EOSs in 3D and $d$D spaces from the $ε$-expansion provide new perspectives to the EOS of neutron-rich matter. △ Less

Submitted 21 July, 2022; v1 submitted 30 June, 2022; originally announced June 2022.

Comments: With minor revisions and a new figure. Accepted by Annals of Physics

Journal ref: Annals of Physics 444 (2022) 169062

arXiv:2205.05922 [pdf, other]

Ray Priors through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation

Authors: Jian Zhang, Yuanqing Zhang, Huan Fu, Xiaowei Zhou, Bowen Cai, Jinchi Huang, Rongfei Jia, Binqiang Zhao, Xing Tang

Abstract: Neural Radiance Fields (NeRF) have emerged as a potent paradigm for representing scenes and synthesizing photo-realistic images. A main limitation of conventional NeRFs is that they often fail to produce high-quality renderings under novel viewpoints that are significantly different from the training viewpoints. In this paper, instead of exploiting few-shot image synthesis, we study the novel view… ▽ More Neural Radiance Fields (NeRF) have emerged as a potent paradigm for representing scenes and synthesizing photo-realistic images. A main limitation of conventional NeRFs is that they often fail to produce high-quality renderings under novel viewpoints that are significantly different from the training viewpoints. In this paper, instead of exploiting few-shot image synthesis, we study the novel view extrapolation setting that (1) the training images can well describe an object, and (2) there is a notable discrepancy between the training and test viewpoints' distributions. We present RapNeRF (RAy Priors) as a solution. Our insight is that the inherent appearances of a 3D surface's arbitrary visible projections should be consistent. We thus propose a random ray casting policy that allows training unseen views using seen views. Furthermore, we show that a ray atlas pre-computed from the observed rays' viewing directions could further enhance the rendering quality for extrapolated views. A main limitation is that RapNeRF would remove the strong view-dependent effects because it leverages the multi-view consistency property. △ Less

Submitted 12 May, 2022; originally announced May 2022.

arXiv:2203.12773 [pdf, ps, other]

doi 10.1103/PhysRevC.105.064607

Investigating Effects of Relativistic Kinematics, Dimensionality, Interactions, and Short-Range Correlations on the Ratio of Quartic over Quadratic Nuclear Symmetry Energies

Authors: Bao-Jun Cai, Bao-An Li

Abstract: While ample evidence for the so-called empirical parabolic law of the Equation of State (EOS) of isospin asymmetric nuclear matter (ANM) has been obtained in many studies within both non-relativistic and relativistic nuclear many-body theories using various interactions, it has been unclear if there is any fundamental physics reason for the small quartic symmetry energy compared to the quadratic o… ▽ More While ample evidence for the so-called empirical parabolic law of the Equation of State (EOS) of isospin asymmetric nuclear matter (ANM) has been obtained in many studies within both non-relativistic and relativistic nuclear many-body theories using various interactions, it has been unclear if there is any fundamental physics reason for the small quartic symmetry energy compared to the quadratic one even as the ANM approaches pure neutron matter. Within both relativistic and non-relativistic Free Fermi Gas (FFG) models in coordinate spaces of arbitrary dimension $d$ with and without considering Short-Range Correlations (SRC) as well as non-linear Relativistic Mean Field (RMF) models, we study effects of relativistic kinematics, dimensionality, interactions and SRC on the ratio $Ψ(ρ)$ of quartic over quadratic symmetry energies in ANM EOSs. We found that the ratio $Ψ(ρ)$ in the FFG model depends strongly on the dimension $d$. While it is very small already in the normal 3D space, it could be even smaller in spaces with reduced dimensions for sub-systems of particles in heavy-ion reactions and/or whole neutron stars due to constraints, collectivities and/or symmetries. We also found that the ratio $Ψ(ρ)$ could theoretically become very large only at the ultra-relativistic limit far above the density reachable in neutron stars. On the other hand, nuclear interaction directly and/or indirectly through SRC-induced high-momentum nucleons affect significantly the density dependence of $Ψ(ρ)$ compared to the relativistic FFG model prediction. The SRC affects significantly not only the kinetic energy of symmetric nuclear matter but also the ratio $Ψ(ρ)$ while the relativistic corrections are found negligible. The results may help better understand the EOS of dense neutron-rich matter. △ Less

Submitted 3 June, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

Comments: A few wording modifications. Phys. Rev. C (2022) in press

Journal ref: Physical Review C 105, 064607 (2022)

arXiv:2203.06606 [pdf, ps, other]

doi 10.1371/journal.pone.0265109

Deep Learning for 1-Bit Compressed Sensing-based Superimposed CSI Feedback

Authors: Chaojin Qing, Qing Ye, Bin Cai, Wenhui Liu, Jiafan Wang

Abstract: In frequency-division duplexing (FDD) massive multiple-input multiple-output (MIMO) systems, 1-bit compressed sensing (CS)-based superimposed channel state information (CSI) feedback has shown many advantages, while still faces many challenges, such as low accuracy of the downlink CSI recovery and large processing delays. To overcome these drawbacks, this paper proposes a deep learning (DL) scheme… ▽ More In frequency-division duplexing (FDD) massive multiple-input multiple-output (MIMO) systems, 1-bit compressed sensing (CS)-based superimposed channel state information (CSI) feedback has shown many advantages, while still faces many challenges, such as low accuracy of the downlink CSI recovery and large processing delays. To overcome these drawbacks, this paper proposes a deep learning (DL) scheme to improve the 1-bit compressed sensing-based superimposed CSI feedback. On the user side, the downlink CSI is compressed with the 1-bit CS technique, superimposed on the uplink user data sequences (UL-US), and then sent back to the base station (BS). At the BS, based on the model-driven approach and assisted by the superimposition-interference cancellation technology, a multi-task detection network is first constructed for detecting both the UL-US and downlink CSI. In particular, this detection network is jointly trained to detect the UL-US and downlink CSI simultaneously, capturing a globally optimized network parameter. Then, with the recovered bits for the downlink CSI, a lightweight reconstruction scheme, which consists of an initial feature extraction of the downlink CSI with the simplified traditional method and a single hidden layer network, is utilized to reconstruct the downlink CSI with low processing delay. Compared with the 1-bit CS-based superimposed CSI feedback scheme, the proposed scheme improves the recovery accuracy of the UL-US and downlink CSI with lower processing delay and possesses robustness against parameter variations. △ Less

Submitted 13 March, 2022; originally announced March 2022.

Comments: 12 pages, 11 figures

Showing 1–50 of 186 results for author: Cai, B