-
User Behavior Analysis and Clustering in Peace Elite: Insights and Recommendations
Authors:
Yang Qiu,
Yuxin Gong,
Guanliang Liu
Abstract:
This study presents a comprehensive analysis of user behavior and clustering in Peace Elite, a popular mobile battle royale game, employing temporal and static data mining techniques to uncover distinct player segments. Our methodology encompasses time series K-means clustering, graph-based algorithms (DeepWalk and LINE), and static attribute clustering, visualized through innovative hybrid charts…
▽ More
This study presents a comprehensive analysis of user behavior and clustering in Peace Elite, a popular mobile battle royale game, employing temporal and static data mining techniques to uncover distinct player segments. Our methodology encompasses time series K-means clustering, graph-based algorithms (DeepWalk and LINE), and static attribute clustering, visualized through innovative hybrid charts. Key findings reveal significant variations in player engagement, skill levels, and social interactions across five primary user segments, ranging from highly active and skilled players to inactive or new users. We also analyze the impact of external factors on user retention and the network structure within clusters, uncovering correlations between cluster cohesion and player activity levels. This research provides valuable insights for game developers and marketers, offering data-driven recommendations for personalized game experiences, targeted marketing strategies, and improved player retention in online gaming environments.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videos
Authors:
Polina Turishcheva,
Paul G. Fahey,
Michaela Vystrčilová,
Laura Hansel,
Rachel Froebe,
Kayla Ponder,
Yongrong Qiu,
Konstantin F. Willeke,
Mohammad Bashiri,
Ruslan Baikulov,
Yu Zhu,
Lei Ma,
Shan Yu,
Tiejun Huang,
Bryan M. Li,
Wolf De Wulf,
Nina Kudryashova,
Matthias H. Hennig,
Nathalie L. Rochefort,
Arno Onken,
Eric Wang,
Zhiwei Ding,
Andreas S. Tolias,
Fabian H. Sinz,
Alexander S Ecker
Abstract:
Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same ta…
▽ More
Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same task under standardized conditions. However, there was no standardized benchmark to identify state-of-the-art dynamic models of the mouse visual system. To address this gap, we established the Sensorium 2023 Benchmark Competition with dynamic input, featuring a new large-scale dataset from the primary visual cortex of ten mice. This dataset includes responses from 78,853 neurons to 2 hours of dynamic stimuli per neuron, together with the behavioral measurements such as running speed, pupil dilation, and eye movements. The competition ranked models in two tracks based on predictive performance for neuronal responses on a held-out test set: one focusing on predicting in-domain natural stimuli and another on out-of-distribution (OOD) stimuli to assess model generalization. As part of the NeurIPS 2023 competition track, we received more than 160 model submissions from 22 teams. Several new architectures for predictive models were proposed, and the winning teams improved the previous state-of-the-art model by 50%. Access to the dataset as well as the benchmarking infrastructure will remain online at www.sensorium-competition.net.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Estimation of tail risk measures in finance: Approaches to extreme value mixture modeling
Authors:
Yujuan Qiu
Abstract:
This thesis evaluates most of the extreme mixture models and methods that have appended in the literature and implements them in the context of finance and insurance. The paper also reviews and studies extreme value theory, time series, volatility clustering, and risk measurement methods in detail. Comparing the performance of extreme mixture models and methods on different simulated distributions…
▽ More
This thesis evaluates most of the extreme mixture models and methods that have appended in the literature and implements them in the context of finance and insurance. The paper also reviews and studies extreme value theory, time series, volatility clustering, and risk measurement methods in detail. Comparing the performance of extreme mixture models and methods on different simulated distributions shows that the method based on kernel density estimation does not have an absolute superior or close to the best performance, especially for the estimation of the extreme upper or lower tail of the distribution. Preprocessing time series data using a generalized autoregressive conditional heteroskedasticity model (GARCH) and applying extreme value mixture models on extracted residuals from GARCH can improve the goodness of fit and the estimation of the tail distribution.
△ Less
Submitted 1 June, 2024;
originally announced July 2024.
-
Gemini Dark Matter
Authors:
Andrew Cheek,
Yu-Cheng Qiu,
Liang Tan
Abstract:
The $S_8/σ_8$ tension in the large scale structure can be explained by decaying dark matter with an almost degenerate spectrum and small enough decay width. Here we propose the Gemini dark matter model, which contains a heavy mother particle $χ_3$ and two twins $χ_{1/2}$ which are almost degenerate in mass and are produced at the same time. The dark sector is charged under the same Froggatt-Nielse…
▽ More
The $S_8/σ_8$ tension in the large scale structure can be explained by decaying dark matter with an almost degenerate spectrum and small enough decay width. Here we propose the Gemini dark matter model, which contains a heavy mother particle $χ_3$ and two twins $χ_{1/2}$ which are almost degenerate in mass and are produced at the same time. The dark sector is charged under the same Froggatt-Nielsen symmetry that can explain the hierarchy of the Standard model Yukawa couplings. The slightly heavier $χ_2$ decays into $χ_1$ and the axionic component of the flavon, which washes out the small scale structure and resolves $S_8/σ_8$ tension. We present the production mechanism of Gemini dark matter and viable parameter regions. We find that despite the preferred dark matter mass being $\mathcal{O}(1)$--$\mathcal{O}(100)$ keV, they constitute cold dark matter. The Gemini dark matter model predicts an abundance of dark radiation that will be probed in future measurements of the CMB.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Self-absorption of Hankel systems on monoids --a seemingly universal property
Authors:
Yong Han,
Yanqi Qiu,
Zipeng Wang
Abstract:
Given any cancellative monoid $\mathcal{M}$, we study the Hankel system determined by its multiplication table. We prove that the Hankel system admits self-absorption property provided that the monoid $\mathcal{M}$ has the local algebraic structure: \[ \big(ax = by, cx=dy, az=bw \,\, \text{in $\mathcal{M}$}\big)\Longrightarrow \big(cz=dw \,\, \text{in $\mathcal{M}$}\big). \] Our result holds for a…
▽ More
Given any cancellative monoid $\mathcal{M}$, we study the Hankel system determined by its multiplication table. We prove that the Hankel system admits self-absorption property provided that the monoid $\mathcal{M}$ has the local algebraic structure: \[ \big(ax = by, cx=dy, az=bw \,\, \text{in $\mathcal{M}$}\big)\Longrightarrow \big(cz=dw \,\, \text{in $\mathcal{M}$}\big). \] Our result holds for all group-embeddable monoids and goes beyond. In particular, it works for all cancellative Abelian monoids and most common non-Abelian cancellative monoids such as $$ \mathrm{SL}_d(\mathbb{N}): = \big\{[a_{ij}]_{1\le i,j\le d}\in \mathrm{SL}_d(\mathbb{Z})\big| a_{ij} \in \mathbb{N}\big\}. $$ The Hankel system determined by the multiplication table of a monoid is further generalized to that determined by level sets of any abstract two-variable map. We introduce an algebraic notion of lunar maps and establish a stronger hereditary self-absorption property for the corresponding generalized Hankel systems. As a consequence, we prove the self-absorption property for arbitrary spatial compression of the regular representation system $\{λ_G(g)\}_{g\in G}$ of any discrete group $G$, as well as the Hankel system $\{Γ_\ell^Φ\}$ determined by the level sets of any rational map of the form $Φ(x,y)=a x^m + b y^n$ with $a,b,m,n\in \mathbb{Z}^*$: \[ Γ_\ell^Φ(x, y)= \mathbf{1}(a x^m + b y^n= \ell), \quad x, y\in \mathbb{N}^*, \, \ell\in Φ(\mathbb{N}^*\times \mathbb{N}^*). \] The self-absorption property is applied to the study of completely bounded Fourier multipliers between Hardy spaces. Further applications are: i) exact complete bounded norm of the Carleman embedding in any dimension; ii) mixed Fourier-Schur multiplier inequalities with critical exponent $4/3$; iii) failure of hyper-complete-contractivity for the Poisson semigroup.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Detecting and Identifying Selection Structure in Sequential Data
Authors:
Yujia Zheng,
Zeyu Tang,
Yiwen Qiu,
Bernhard Schölkopf,
Kun Zhang
Abstract:
We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportun…
▽ More
We argue that the selective inclusion of data points based on latent objectives is common in practical situations, such as music sequences. Since this selection process often distorts statistical analysis, previous work primarily views it as a bias to be corrected and proposes various methods to mitigate its effect. However, while controlling this bias is crucial, selection also offers an opportunity to provide a deeper insight into the hidden generation process, as it is a fundamental mechanism underlying what we observe. In particular, overlooking selection in sequential data can lead to an incomplete or overcomplicated inductive bias in modeling, such as assuming a universal autoregressive structure for all dependencies. Therefore, rather than merely viewing it as a bias, we explore the causal structure of selection in sequential data to delve deeper into the complete causal process. Specifically, we show that selection structure is identifiable without any parametric assumptions or interventional experiments. Moreover, even in cases where selection variables coexist with latent confounders, we still establish the nonparametric identifiability under appropriate structural conditions. Meanwhile, we also propose a provably correct algorithm to detect and identify selection structures as well as other types of dependencies. The framework has been validated empirically on both synthetic data and real-world music.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Perverse schobers, stability conditions and quadratic differentials II: relative graded Brauer graph algebras
Authors:
Merlin Christ,
Fabian Haiden,
Yu Qiu
Abstract:
We introduce a class of dg-algebras which generalize the classical Brauer graph algebras. They are constructed from mixed-angulations of surfaces and often admit a (relative) Calabi--Yau structure. We discovered these algebras through two very distinct routes, one involving perverse schobers whose stalks are cyclic quotients of the derived categories of relative Ginzburg algebras, and another invo…
▽ More
We introduce a class of dg-algebras which generalize the classical Brauer graph algebras. They are constructed from mixed-angulations of surfaces and often admit a (relative) Calabi--Yau structure. We discovered these algebras through two very distinct routes, one involving perverse schobers whose stalks are cyclic quotients of the derived categories of relative Ginzburg algebras, and another involving deformations of partially wrapped Fukaya categories of surfaces. Applying the results of our previous work arXiv:2303.18249, we describe the spaces of stability conditions on the derived categories of these algebras in terms of spaces of quadratic differentials.
△ Less
Submitted 28 June, 2024;
originally announced July 2024.
-
Large CP Violation from the Minimum Seesaw Model
Authors:
Yu-Cheng Qiu,
Jin-Wei Wang,
Tsutomu T. Yanagida
Abstract:
The minimum seesaw model with two right-handed neutrinos is considered, where the lightest neutrino is naturally massless. Instead of adopting texture zeros in the lepton Yukawa matrices, which cause both theoretical and experimental troubles, here we propose two-$\boldsymbolε$ textures, where $\boldsymbolε$ is a small number. Combined with neutrino oscillation experimental data, we find that a la…
▽ More
The minimum seesaw model with two right-handed neutrinos is considered, where the lightest neutrino is naturally massless. Instead of adopting texture zeros in the lepton Yukawa matrices, which cause both theoretical and experimental troubles, here we propose two-$\boldsymbolε$ textures, where $\boldsymbolε$ is a small number. Combined with neutrino oscillation experimental data, we find that a large CP angle is preferred for the normal neutrino mass order. In contrast, the CP angle almost vanishes for the inverted order. This can be well-tested in near-future experiments, such as Hyper-Kamiokande. Besides, the predicted effective Majorana neutrino mass $m_{ee}$ and the total neutrino mass $\sum m^ν_i$ are also within reach of ongoing or future experiments.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy
Authors:
Chen Wang,
Kaiyi Ji,
Junyi Geng,
Zhongqiang Ren,
Taimeng Fu,
Fan Yang,
Yifan Guo,
Haonan He,
Xiangyu Chen,
Zitong Zhan,
Qiwei Du,
Shaoshu Su,
Bowen Li,
Yuheng Qiu,
Yi Du,
Qihang Li,
Yifan Yang,
Xiao Lin,
Zhipeng Zhao
Abstract:
Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeS…
▽ More
Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeSy) computational framework, imperative learning (IL), for robot autonomy, leveraging the generalization abilities of symbolic reasoning. The framework of IL consists of three primary components: a neural module, a reasoning engine, and a memory system. We formulate IL as a special bilevel optimization (BLO), which enables reciprocal learning over the three modules. This overcomes the label-intensive obstacles associated with data-driven approaches and takes advantage of symbolic reasoning concerning logical reasoning, physical principles, geometric analysis, etc. We discuss several optimization techniques for IL and verify their effectiveness in five distinct robot autonomy tasks including path planning, rule induction, optimal control, visual odometry, and multi-robot routing. Through various experiments, we show that IL can significantly enhance robot autonomy capabilities and we anticipate that it will catalyze further research across diverse domains.
△ Less
Submitted 6 July, 2024; v1 submitted 23 June, 2024;
originally announced June 2024.
-
Rapid and Accurate Diagnosis of Acute Aortic Syndrome using Non-contrast CT: A Large-scale, Retrospective, Multi-center and AI-based Study
Authors:
Yujian Hu,
Yilang Xiang,
Yan-Jie Zhou,
Yangyan He,
Shifeng Yang,
Xiaolong Du,
Chunlan Den,
Youyao Xu,
Gaofeng Wang,
Zhengyao Ding,
Jingyong Huang,
Wenjun Zhao,
Xuejun Wu,
Donglin Li,
Qianqian Zhu,
Zhenjiang Li,
Chenyang Qiu,
Ziheng Wu,
Yunjun He,
Chen Tian,
Yihui Qiu,
Zuodong Lin,
Xiaolong Zhang,
Yuan He,
Zhenpeng Yuan
, et al. (15 additional authors not shown)
Abstract:
Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed…
▽ More
Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. However, current triage practices in the ED can cause up to approximately half of patients with AAS to have an initially missed diagnosis or be misdiagnosed as having other acute chest pain conditions. Subsequently, these AAS patients will undergo clinically inaccurate or suboptimal differential diagnosis. Fortunately, even under these suboptimal protocols, nearly all these patients underwent non-contrast CT covering the aorta anatomy at the early stage of differential diagnosis. In this study, we developed an artificial intelligence model (DeepAAS) using non-contrast CT, which is highly accurate for identifying AAS and provides interpretable results to assist in clinical decision-making. Performance was assessed in two major phases: a multi-center retrospective study (n = 20,750) and an exploration in real-world emergency scenarios (n = 137,525). In the multi-center cohort, DeepAAS achieved a mean area under the receiver operating characteristic curve of 0.958 (95% CI 0.950-0.967). In the real-world cohort, DeepAAS detected 109 AAS patients with misguided initial suspicion, achieving 92.6% (95% CI 76.2%-97.5%) in mean sensitivity and 99.2% (95% CI 99.1%-99.3%) in mean specificity. Our AI model performed well on non-contrast CT at all applicable early stages of differential diagnosis workflows, effectively reduced the overall missed diagnosis and misdiagnosis rate from 48.8% to 4.8% and shortened the diagnosis time for patients with misguided initial suspicion from an average of 681.8 (74-11,820) mins to 68.5 (23-195) mins. DeepAAS could effectively fill the gap in the current clinical workflow without requiring additional tests.
△ Less
Submitted 24 June, 2024; v1 submitted 13 June, 2024;
originally announced June 2024.
-
Dynamic Response of Ionic Current in Conical Nanopores
Authors:
Zhe Liu,
Long Ma,
Hongwen Zhang,
Jiakun Zhuang,
Jia Man,
Zuzanna S. Siwy,
Yinghua Qiu
Abstract:
Ionic current rectification (ICR) of charged conical nanopores has various applications in fields including nanofluidics, bio-sensing, and energy conversion, whose function is closely related to the dynamic response of nanopores. The occurrence of ICR originates from the ion enrichment and depletion in conical pores, whose formation is found to be affected by the scanning rate of voltages. Here, t…
▽ More
Ionic current rectification (ICR) of charged conical nanopores has various applications in fields including nanofluidics, bio-sensing, and energy conversion, whose function is closely related to the dynamic response of nanopores. The occurrence of ICR originates from the ion enrichment and depletion in conical pores, whose formation is found to be affected by the scanning rate of voltages. Here, through time-dependent simulations, we investigate the variation of ion current under electric fields and the dynamic formation of ion enrichment and depletion, which can reflect the response time of conical nanopores. The response time of nanopores when ion enrichment forms i.e. at the on state is significantly longer than that with the formation of ion depletion i.e. at the off state. Our simulation results reveal the regulation of response time by different nanopore parameters including the surface charge density, pore length, tip, and base radius, as well as the applied conditions such as the voltage and bulk concentration. The response time of nanopores is closely related to the surface charge density, pore length, voltage, and bulk concentration. Our uncovered dynamic response mechanism of the ionic current can guide the design of nanofluidic devices with conical nanopores, including memristors, ionic switches, and rectifiers.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Structure of Massive Gauge/Gravity Scattering Amplitudes, Equivalence Theorems, and Extended Double-Copy with Compactified Warped Space
Authors:
Yanfeng Hang,
Wei-Wei Zhao,
Hong-Jian He,
Yin-Long Qiu
Abstract:
We study the structure of scattering amplitudes of massive Kaluza-Klein (KK) states in the compactified 5-dimensional warped gauge and gravity theories. We present systematic formulations of the gauge theory equivalence theorem (GAET) and the gravitational equivalence theorem (GRET) for warped KK theories in $R_ξ^{}$ gauge, where the GAET connects the scattering amplitudes of longitudinal KK gauge…
▽ More
We study the structure of scattering amplitudes of massive Kaluza-Klein (KK) states in the compactified 5-dimensional warped gauge and gravity theories. We present systematic formulations of the gauge theory equivalence theorem (GAET) and the gravitational equivalence theorem (GRET) for warped KK theories in $R_ξ^{}$ gauge, where the GAET connects the scattering amplitudes of longitudinal KK gauge bosons to that of the corresponding KK Goldstone bosons and the GRET connects the scattering amplitudes of KK gravitons of helicity-zero (helicity-one) to that of the corresponding gravitational KK Goldstone bosons. We analyze the structure of 3-point and 4-point scattering amplitudes of massive KK gauge bosons and of massive KK gravitons as well as their corresponding Goldstone bosons. We first prove the GAET and GRET explicitly for the fundamental 3-point KK gauge/gravity scattering amplitudes. We then demonstrate that the validity of the GAET and GRET for 4-point gauge/gravity scattering amplitudes can be reduced to the validity of GAET and GRET for 3-point gauge/gravity scattering amplitudes at tree level. With these, we study the double-copy construction of KK scattering amplitudes in the warped gauge/gravity theories. We newly realize the double-copy for massive 3-point full gauge/gravity amplitudes at tree level under proper correspondences of color-kinematics and of gauge/gravity couplings, whereas we can construct the double-copy for 4-point KK gauge/gravity amplitudes to the leading order (LO) of high energy expansion. We further demonstrate that this LO double-copy construction can be extended to $N$-point KK scattering amplitudes with $N\geqslant 4$.
△ Less
Submitted 27 June, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Association between a Failed Prominence Eruption and the Drainage of Mass from Another Prominence
Authors:
Jianchao Xue,
Li Feng,
Hui Li,
Ping Zhang,
Jun Chen,
Guanglu Shi,
Kaifan Ji,
Ye Qiu,
Chuan Li,
Lei Lu,
Beili Ying,
Ying Li,
Yu Huang,
Youping Li,
Jingwei Li,
Jie Zhao,
Dechao Song,
Shuting Li,
Zhengyuan Tian,
Yingna Su,
Qingmin Zhang,
Yunyi Ge,
Jiahui Shan,
Qiao Li,
Gen Li
, et al. (9 additional authors not shown)
Abstract:
Sympathetic eruptions of solar prominences have been studied for decades, however, it is usually difficult to identify their causal links. Here we present two failed prominence eruptions on 26 October 2022 and explore their connections. Using stereoscopic observations, the south prominence (PRO-S) erupts with untwisting motions, flare ribbons occur underneath, and new connections are formed during…
▽ More
Sympathetic eruptions of solar prominences have been studied for decades, however, it is usually difficult to identify their causal links. Here we present two failed prominence eruptions on 26 October 2022 and explore their connections. Using stereoscopic observations, the south prominence (PRO-S) erupts with untwisting motions, flare ribbons occur underneath, and new connections are formed during the eruption. The north prominence (PRO-N) rises up along with PRO-S, and its upper part disappears due to catastrophic mass draining along an elongated structure after PRO-S failed eruption. We suggest that the eruption of PRO-S initiates due to a kink instability, further rises up, and fails to erupt due to reconnection with surrounding fields. The elongated structure connecting PRO-N overlies PRO-S, which causes the rising up of PRO-N along with PRO-S and mass drainage after PRO-S eruption. This study suggests that a prominence may end its life through mass drainage forced by an eruption underneath.
△ Less
Submitted 20 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Compressed Video Quality Enhancement with Temporal Group Alignment and Fusion
Authors:
Qiang Zhu,
Yajun Qiu,
Yu Liu,
Shuyuan Zhu,
Bing Zeng
Abstract:
In this paper, we propose a temporal group alignment and fusion network to enhance the quality of compressed videos by using the long-short term correlations between frames. The proposed model consists of the intra-group feature alignment (IntraGFA) module, the inter-group feature fusion (InterGFF) module, and the feature enhancement (FE) module. We form the group of pictures (GoP) by selecting fr…
▽ More
In this paper, we propose a temporal group alignment and fusion network to enhance the quality of compressed videos by using the long-short term correlations between frames. The proposed model consists of the intra-group feature alignment (IntraGFA) module, the inter-group feature fusion (InterGFF) module, and the feature enhancement (FE) module. We form the group of pictures (GoP) by selecting frames from the video according to their temporal distances to the target enhanced frame. With this grouping, the composed GoP can contain either long- or short-term correlated information of neighboring frames. We design the IntraGFA module to align the features of frames of each GoP to eliminate the motion existing between frames. We construct the InterGFF module to fuse features belonging to different GoPs and finally enhance the fused features with the FE module to generate high-quality video frames. The experimental results show that our proposed method achieves up to 0.05dB gain and lower complexity compared to the state-of-the-art method.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Classification of Cellular Fake Surfaces
Authors:
Lucas Fagan,
Yang Qiu,
Zhenghan Wang
Abstract:
Generic polyhedra are interesting mathematical objects to study in their own right. In this paper, we initialize a systematic study of two-dimensional generic polyhedra with an eye towards applications to low-dimensional topology, especially the Andrews-Curtis and Zeeman conjectures. After recalling the basic notions of generic polyhedra and fake surfaces, we derive some interesting properties of…
▽ More
Generic polyhedra are interesting mathematical objects to study in their own right. In this paper, we initialize a systematic study of two-dimensional generic polyhedra with an eye towards applications to low-dimensional topology, especially the Andrews-Curtis and Zeeman conjectures. After recalling the basic notions of generic polyhedra and fake surfaces, we derive some interesting properties of fake surfaces. Our main result is a complete classification of acyclic cellular fake surfaces up to complexity 4 and a classification of acyclic cellular fake surfaces without small disks of complexity 5. From this classification, we prove the contractibility conjecture for acyclic cellular fake surfaces of complexity 4, and the embedded disk conjecture up to complexity 5. We provide evidence for the conjectures that the probability of being a spine among fake surfaces is 0 and that every contractible fake surface has an embedded disk.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
MSz: An Efficient Parallel Algorithm for Correcting Morse-Smale Segmentations in Error-Bounded Lossy Compressors
Authors:
Yuxiao Li,
Xin Liang,
Bei Wang,
Yongfeng Qiu,
Lin Yan,
Hanqi Guo
Abstract:
This research explores a novel paradigm for preserving topological segmentations in existing error-bounded lossy compressors. Today's lossy compressors rarely consider preserving topologies such as Morse-Smale complexes, and the discrepancies in topology between original and decompressed datasets could potentially result in erroneous interpretations or even incorrect scientific conclusions. In thi…
▽ More
This research explores a novel paradigm for preserving topological segmentations in existing error-bounded lossy compressors. Today's lossy compressors rarely consider preserving topologies such as Morse-Smale complexes, and the discrepancies in topology between original and decompressed datasets could potentially result in erroneous interpretations or even incorrect scientific conclusions. In this paper, we focus on preserving Morse-Smale segmentations in 2D/3D piecewise linear scalar fields, targeting the precise reconstruction of minimum/maximum labels induced by the integral line of each vertex. The key is to derive a series of edits during compression time; the edits are applied to the decompressed data, leading to an accurate reconstruction of segmentations while keeping the error within the prescribed error bound. To this end, we developed a workflow to fix extrema and integral lines alternatively until convergence within finite iterations; we accelerate each workflow component with shared-memory/GPU parallelism to make the performance practical for coupling with compressors. We demonstrate use cases with fluid dynamics, ocean, and cosmology application datasets with a significant acceleration with an NVIDIA A100 GPU.
△ Less
Submitted 5 July, 2024; v1 submitted 5 April, 2024;
originally announced June 2024.
-
Resolving the Orientations of and Separation between an Overlapping Pair of Dipole Emitters
Authors:
Yiyang Chen,
Yuanxin Qiu,
Matthew D. Lew
Abstract:
We prove that it is impossible to distinguish two spatially overlapping fluorescent molecules from a single rotating molecule, even if one modulates the polarization of pumping light or the detection dipole-spread function (DSF). If the target is known to be a dipole pair, existing imaging methods perform poorly for measuring their angular separation. We propose simultaneously modulating the excit…
▽ More
We prove that it is impossible to distinguish two spatially overlapping fluorescent molecules from a single rotating molecule, even if one modulates the polarization of pumping light or the detection dipole-spread function (DSF). If the target is known to be a dipole pair, existing imaging methods perform poorly for measuring their angular separation. We propose simultaneously modulating the excitation polarization and DSF, which demonstrates robust discrimination between dipole pairs versus single molecules. Our method improves the precision of measuring centroid orientation by 50% and angular separation by 4- to 6-fold over existing techniques.
△ Less
Submitted 26 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Low-Rank Similarity Mining for Multimodal Dataset Distillation
Authors:
Yue Xu,
Zhilin Lin,
Yusong Qiu,
Cewu Lu,
Yong-Lu Li
Abstract:
Though dataset distillation has witnessed rapid development in recent years, the distillation of multimodal data, e.g., image-text pairs, poses unique and under-explored challenges. Unlike unimodal data, image-text contrastive learning (ITC) data lack inherent categorization and should instead place greater emphasis on modality correspondence. In this work, we propose Low-Rank Similarity Mining (L…
▽ More
Though dataset distillation has witnessed rapid development in recent years, the distillation of multimodal data, e.g., image-text pairs, poses unique and under-explored challenges. Unlike unimodal data, image-text contrastive learning (ITC) data lack inherent categorization and should instead place greater emphasis on modality correspondence. In this work, we propose Low-Rank Similarity Mining (LoRS) for multimodal dataset distillation, that concurrently distills a ground truth similarity matrix with image-text pairs, and leverages low-rank factorization for efficiency and scalability. The proposed approach brings significant improvement to the existing algorithms, marking a significant contribution to the field of visual-language dataset distillation. We advocate adopting LoRS as a foundational synthetic data setup for image-text dataset distillation. Our code is available at https://github.com/silicx/LoRS_Distill.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Tighter yet more tractable relaxations and nontrivial instance generation for sparse standard quadratic optimization
Authors:
Immanuel Bomze,
Bo Peng,
Yuzhou Qiu,
E. Alper Yildirim
Abstract:
The Standard Quadratic optimization Problem (StQP), arguably the simplest among all classes of NP-hard optimization problems, consists of extremizing a quadratic form (the simplest nonlinear polynomial) over the standard simplex (the simplest polytope/compact feasible set). As a problem class, StQPs may be nonconvex with an exponential number of inefficient local solutions. StQPs arise in a multit…
▽ More
The Standard Quadratic optimization Problem (StQP), arguably the simplest among all classes of NP-hard optimization problems, consists of extremizing a quadratic form (the simplest nonlinear polynomial) over the standard simplex (the simplest polytope/compact feasible set). As a problem class, StQPs may be nonconvex with an exponential number of inefficient local solutions. StQPs arise in a multitude of applications, among them mathematical finance, machine learning (clustering), and modeling in biosciences (e.g., selection and ecology). This paper deals with such StQPs under an additional sparsity or cardinality constraint, which, even for convex objectives, renders NP-hard problems. One motivation to study StQPs under such sparsity restrictions is the high-dimensional portfolio selection problem with too many assets to handle, in particular, in the presence of transaction costs. Here, relying on modern conic optimization techniques, we present tractable convex relaxations for this relevant but difficult problem. We propose novel equivalent reformulations of these relaxations with significant dimensional reduction, which is essential for the tractability of these relaxations when the problem size grows. Moreover, we propose an instance generation procedure which systematically avoids too easy instances. Our extensive computational results illustrate the high quality of the relaxation bounds in a significant number of instances. Furthermore, in contrast with exact mixed-integer quadratic programming models, the solution time of the relaxations is very robust to the choices of the problem parameters. In particular, the reduced formulations achieve significant improvements in terms of the solution time over their counterparts.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Financial Deepening and Economic Growth in Select Emerging Markets with Currency Board Systems: Theory and Evidence
Authors:
Yujuan Qiu
Abstract:
This paper investigates some indicators of financial development in select countries with currency board systems and raises some questions about the connection between financial development and growth in currency board systems. Most of those cases are long past episodes of what we would now call emerging markets. However, the paper also looks at Hong Kong, the currency board system that is one of…
▽ More
This paper investigates some indicators of financial development in select countries with currency board systems and raises some questions about the connection between financial development and growth in currency board systems. Most of those cases are long past episodes of what we would now call emerging markets. However, the paper also looks at Hong Kong, the currency board system that is one of the world's largest and most advanced financial markets. The global financial crisis of 2008 09 created doubts about the efficiency of financial markets in advanced economies, including in Hong Kong, and unsettled the previous consensus that a large financial sector would be more stable than a smaller one.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
Spectral measure of large random Helson matrices
Authors:
Yanqi Qiu,
Guocheng Zhen
Abstract:
We study the limiting spectral measure of large random Helson matrices and large random matrices of certain patterned structures.
Given a real random variable $X \in L^{2+ \varepsilon}(\mathbb{P}) $ for some $\varepsilon > 0$ and $\mathrm{Var}(X) = 1$. For the random $n \times n$ Helson matrices generated by the independent copies of $X$, scaling the eigenvalues by $\sqrt{n}$, we prove the almos…
▽ More
We study the limiting spectral measure of large random Helson matrices and large random matrices of certain patterned structures.
Given a real random variable $X \in L^{2+ \varepsilon}(\mathbb{P}) $ for some $\varepsilon > 0$ and $\mathrm{Var}(X) = 1$. For the random $n \times n$ Helson matrices generated by the independent copies of $X$, scaling the eigenvalues by $\sqrt{n}$, we prove the almost sure weak convergence of the spectral measure to the standard Wigner semi-circular law. Similar results are established for large random matrices with certain general patterned structures.
△ Less
Submitted 12 June, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Silicon-integrated scandium-doped aluminum nitride electro-optic modulator
Authors:
Tianqi Xu,
Yushuai Liu,
Yuanmao Pu,
Yongxiang Yang,
Qize Zhong,
Xingyan Zhao,
Yang Qiu,
Yuan Dong,
Tao Wu,
Shaonan Zheng,
Ting Hu
Abstract:
Scandium-doped aluminum nitride (AlScN) with an asymmetric hexagonal wurtzite structure exhibits enhanced second-order nonlinear and piezoelectric properties compared to aluminum nitride (AlN), while maintaining a relatively large bandgap. It provides a promising platform for photonic integration and facilitates the seamless integration of passive and active functional devices. Here, we present th…
▽ More
Scandium-doped aluminum nitride (AlScN) with an asymmetric hexagonal wurtzite structure exhibits enhanced second-order nonlinear and piezoelectric properties compared to aluminum nitride (AlN), while maintaining a relatively large bandgap. It provides a promising platform for photonic integration and facilitates the seamless integration of passive and active functional devices. Here, we present the design, fabrication, and characterization of AlScN EO micro-ring modulators, introducing active functionalities to the chip-scale AlScN platform. These waveguide-integrated EO modulators employ sputtered AlScN thin films as the light-guiding medium, and the entire fabrication process is compatible with complementary metal oxide semiconductor (CMOS) technology. We characterize the high-frequency performance of an AlScN modulator for the first time, extracting a maximum in-device effective EO coefficient of 2.86 pm/V at 12 GHz. The devices show a minimum half-wave voltage-length product of 3.12 V*cm and a 3-dB modulation bandwidth of approximately 22 GHz. Our work provides a promising modulation scheme for cost-effective silicon-integrated photonics systems.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Network Diffusion -- Framework to Simulate Spreading Processes in Complex Networks
Authors:
Michał Czuba,
Mateusz Nurek,
Damian Serwata,
Yu-Xuan Qiu,
Mingshan Jia,
Katarzyna Musial,
Radosław Michalski,
Piotr Bródka
Abstract:
With the advancement of computational network science, its research scope has significantly expanded beyond static graphs to encompass more complex structures. The introduction of streaming, temporal, multilayer, and hypernetwork approaches has brought new possibilities and imposed additional requirements. For instance, by utilising these advancements, one can model structures such as social netwo…
▽ More
With the advancement of computational network science, its research scope has significantly expanded beyond static graphs to encompass more complex structures. The introduction of streaming, temporal, multilayer, and hypernetwork approaches has brought new possibilities and imposed additional requirements. For instance, by utilising these advancements, one can model structures such as social networks in a much more refined manner, which is particularly relevant in simulations of the spreading processes. Unfortunately, the pace of advancement is often too rapid for existing computational packages to keep up with the functionality updates. This results in a significant proliferation of tools used by researchers and, consequently, a lack of a universally accepted technological stack that would standardise experimental methods (as seen, e.g. in machine learning). This article addresses that issue by presenting an extended version of the Network Diffusion library. First, a survey of the existing approaches and toolkits for simulating spreading phenomena is shown and then, an overview of the framework functionalities. Finally, we report four case studies conducted with the package to demonstrate its usefulness: the impact of sanitary measures on the spread of COVID-19, the comparison of information diffusion on two temporal network models, and the effectiveness of seed selection methods in the task of influence maximisation in multilayer networks. We conclude the paper with a critical assessment of the library and the outline of still awaiting challenges to standardise research environments in computational network science.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
ROSE: Register Assisted General Time Series Forecasting with Decomposed Frequency Learning
Authors:
Yihang Wang,
Yuying Qiu,
Peng Chen,
Kai Zhao,
Yang Shu,
Zhongwen Rao,
Lujia Pan,
Bin Yang,
Chenjuan Guo
Abstract:
With the increasing collection of time series data from various domains, there arises a strong demand for general time series forecasting models pre-trained on a large number of time-series datasets to support a variety of downstream prediction tasks. Enabling general time series forecasting faces two challenges: how to obtain unified representations from multi-domian time series data, and how to…
▽ More
With the increasing collection of time series data from various domains, there arises a strong demand for general time series forecasting models pre-trained on a large number of time-series datasets to support a variety of downstream prediction tasks. Enabling general time series forecasting faces two challenges: how to obtain unified representations from multi-domian time series data, and how to capture domain-specific features from time series data across various domains for adaptive transfer in downstream tasks. To address these challenges, we propose a Register Assisted General Time Series Forecasting Model with Decomposed Frequency Learning (ROSE), a novel pre-trained model for time series forecasting. ROSE employs Decomposed Frequency Learning for the pre-training task, which decomposes coupled semantic and periodic information in time series with frequency-based masking and reconstruction to obtain unified representations across domains. We also equip ROSE with a Time Series Register, which learns to generate a register codebook to capture domain-specific representations during pre-training and enhances domain-adaptive transfer by selecting related register tokens on downstream tasks. After pre-training on large-scale time series data, ROSE achieves state-of-the-art forecasting performance on 8 real-world benchmarks. Remarkably, even in few-shot scenarios, it demonstrates competitive or superior performance compared to existing methods trained with full data.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Authors:
Shenyuan Gao,
Jiazhi Yang,
Li Chen,
Kashyap Chitta,
Yihang Qiu,
Andreas Geiger,
Jun Zhang,
Hongyang Li
Abstract:
World models can foresee the outcomes of different actions, which is of paramount importance for autonomous driving. Nevertheless, existing driving world models still have limitations in generalization to unseen environments, prediction fidelity of critical details, and action controllability for flexible application. In this paper, we present Vista, a generalizable driving world model with high f…
▽ More
World models can foresee the outcomes of different actions, which is of paramount importance for autonomous driving. Nevertheless, existing driving world models still have limitations in generalization to unseen environments, prediction fidelity of critical details, and action controllability for flexible application. In this paper, we present Vista, a generalizable driving world model with high fidelity and versatile controllability. Based on a systematic diagnosis of existing methods, we introduce several key ingredients to address these limitations. To accurately predict real-world dynamics at high resolution, we propose two novel losses to promote the learning of moving instances and structural information. We also devise an effective latent replacement approach to inject historical frames as priors for coherent long-horizon rollouts. For action controllability, we incorporate a versatile set of controls from high-level intentions (command, goal point) to low-level maneuvers (trajectory, angle, and speed) through an efficient learning strategy. After large-scale training, the capabilities of Vista can seamlessly generalize to different scenarios. Extensive experiments on multiple datasets show that Vista outperforms the most advanced general-purpose video generator in over 70% of comparisons and surpasses the best-performing driving world model by 55% in FID and 27% in FVD. Moreover, for the first time, we utilize the capacity of Vista itself to establish a generalizable reward for real-world action evaluation without accessing the ground truth actions.
△ Less
Submitted 6 June, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
Meta-Homogenization for Knitwear Simulation
Authors:
Chun Yuan,
Kui Wu,
Haoyang Shi,
Lei Lan,
Yuxing Qiu,
Cem Yuksel,
Huamin Wang,
Chenfanfu Jiang,
Yin Yang
Abstract:
This paper presents meta-homogenization, a spatially varying homogenization scheme for knitwear simulation. We are motivated by the observation that macro-scale fabric dynamics is strongly correlated with its underlying knitting patterns. Therefore, homogenization towards a single material is less effective when the knitting is complex and non-repetitive. Our method tackles this challenge by homog…
▽ More
This paper presents meta-homogenization, a spatially varying homogenization scheme for knitwear simulation. We are motivated by the observation that macro-scale fabric dynamics is strongly correlated with its underlying knitting patterns. Therefore, homogenization towards a single material is less effective when the knitting is complex and non-repetitive. Our method tackles this challenge by homogenizing the yarn-level material locally at volumetric elements. Assigning a virtual volume of a knitting structure enables us to model bending and twisting effects via a simple volume-preserving penalty and thus effectively alleviates the material nonlinearity. We employ an adjoint Gauss-Newton formulation to battle the dimensionality challenge of such per-element material optimization. This intuitive material model makes the forward simulation GPU-friendly. To this end, our pipeline also equips a novel domain-decomposed subspace solver crafted for GPU projective dynamics, which makes our simulator hundreds of times faster than the yarn-level simulator. Experiments validate the capability and effectiveness of meta-homogenization. Our method produces realistic animations of knitwear matching the quality of full-scale yarn-level simulations. It is also orders of magnitude faster than existing homogenization techniques in both the training and simulation stages.
△ Less
Submitted 23 May, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
Coarse-graining conformational dynamics with multi-dimensional generalized Langevin equation: how, when, and why
Authors:
Pinchen Xie,
Yunrui Qiu,
Weinan E
Abstract:
A data-driven ab initio generalized Langevin equation (AIGLE) approach is developed to learn and simulate high-dimensional, heterogeneous, coarse-grained conformational dynamics. Constrained by the fluctuation-dissipation theorem, the approach can build coarse-grained models in dynamical consistency with all-atom molecular dynamics. We also propose practical criteria for AIGLE to enforce long-term…
▽ More
A data-driven ab initio generalized Langevin equation (AIGLE) approach is developed to learn and simulate high-dimensional, heterogeneous, coarse-grained conformational dynamics. Constrained by the fluctuation-dissipation theorem, the approach can build coarse-grained models in dynamical consistency with all-atom molecular dynamics. We also propose practical criteria for AIGLE to enforce long-term dynamical consistency. Case studies of a toy polymer, with 20 coarse-grained sites, and the alanine dipeptide, with two dihedral angles, elucidate why one should adopt AIGLE or its Markovian limit for modeling coarse-grained conformational dynamics in practice.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
Spectral Editing of Activations for Large Language Model Alignment
Authors:
Yifu Qiu,
Zheng Zhao,
Yftah Ziser,
Anna Korhonen,
Edoardo M. Ponti,
Shay B. Cohen
Abstract:
Large language models (LLMs) often exhibit undesirable behaviours, such as generating untruthful or biased content. Editing their internal representations has been shown to be effective in mitigating such behaviours on top of the existing alignment methods. We propose a novel inference-time editing method, namely spectral editing of activations (SEA), to project the input representations into dire…
▽ More
Large language models (LLMs) often exhibit undesirable behaviours, such as generating untruthful or biased content. Editing their internal representations has been shown to be effective in mitigating such behaviours on top of the existing alignment methods. We propose a novel inference-time editing method, namely spectral editing of activations (SEA), to project the input representations into directions with maximal covariance with the positive demonstrations (e.g., truthful) while minimising covariance with the negative demonstrations (e.g., hallucinated). We also extend our method to non-linear editing using feature functions. We run extensive experiments on benchmarks concerning truthfulness and bias with six open-source LLMs of different sizes and model families. The results demonstrate the superiority of SEA in effectiveness, generalisation to similar tasks, as well as computation and data efficiency. We also show that SEA editing only has a limited negative impact on other model capabilities.
△ Less
Submitted 25 May, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition
Authors:
Honghui Chen,
Yuhang Qiu,
Jiabao Wang,
Pingping Chen,
Nam Ling
Abstract:
Internal Language Model (LM)-based methods use permutation language modeling (PLM) to solve the error correction caused by conditional independence in external LM-based methods. However, random permutations of human interference cause fit oscillations in the model training, and Iterative Refinement (IR) operation to improve multimodal information decoupling also introduces additional overhead. To…
▽ More
Internal Language Model (LM)-based methods use permutation language modeling (PLM) to solve the error correction caused by conditional independence in external LM-based methods. However, random permutations of human interference cause fit oscillations in the model training, and Iterative Refinement (IR) operation to improve multimodal information decoupling also introduces additional overhead. To address these issues, this paper proposes the Hierarchical Attention autoregressive Model with Adaptive Permutation (HAAP) to enhance the location-context-image interaction capability, improving autoregressive generalization with internal LM. First, we propose Implicit Permutation Neurons (IPN) to generate adaptive attention masks to dynamically exploit token dependencies. The adaptive masks increase the diversity of training data and prevent model dependency on a specific order. It reduces the training overhead of PLM while avoiding training fit oscillations. Second, we develop Cross-modal Hierarchical Attention mechanism (CHA) to couple context and image features. This processing establishes rich positional semantic dependencies between context and image while avoiding IR. Extensive experimental results show the proposed HAAP achieves state-of-the-art (SOTA) performance in terms of accuracy, complexity, and latency on several datasets.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering
Authors:
Xiaohan Zhang,
Yukui Qiu,
Zhenyu Sun,
Qi Liu
Abstract:
Recent progress in large-scale scene rendering has yielded Neural Radiance Fields (NeRF)-based models with an impressive ability to synthesize scenes across small objects and indoor scenes. Nevertheless, extending this idea to large-scale aerial rendering poses two critical problems. Firstly, a single NeRF cannot render the entire scene with high-precision for complex large-scale aerial datasets s…
▽ More
Recent progress in large-scale scene rendering has yielded Neural Radiance Fields (NeRF)-based models with an impressive ability to synthesize scenes across small objects and indoor scenes. Nevertheless, extending this idea to large-scale aerial rendering poses two critical problems. Firstly, a single NeRF cannot render the entire scene with high-precision for complex large-scale aerial datasets since the sampling range along each view ray is insufficient to cover buildings adequately. Secondly, traditional NeRFs are infeasible to train on one GPU to enable interactive fly-throughs for modeling massive images. Instead, existing methods typically separate the whole scene into multiple regions and train a NeRF on each region, which are unaccustomed to different flight trajectories and difficult to achieve fast rendering. To that end, we propose Aerial-NeRF with three innovative modifications for jointly adapting NeRF in large-scale aerial rendering: (1) Designing an adaptive spatial partitioning and selection method based on drones' poses to adapt different flight trajectories; (2) Using similarity of poses instead of (expert) network for rendering speedup to determine which region a new viewpoint belongs to; (3) Developing an adaptive sampling approach for rendering performance improvement to cover the entire buildings at different heights. Extensive experiments have conducted to verify the effectiveness and efficiency of Aerial-NeRF, and new state-of-the-art results have been achieved on two public large-scale aerial datasets and presented SCUTic dataset. Note that our model allows us to perform rendering over 4 times as fast as compared to multiple competitors. Our dataset, code, and model are publicly available at https://drliuqi.github.io/.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Current progress in corrosion of multi principal element alloys
Authors:
M. Ghorbani,
Z. Li,
Y. Qiu,
P. Marcus,
J. R. Scully,
O. Gharbi,
H. Luo,
R. K. Gupta,
Z. R. Zeng,
H. L. Fraser,
M. L. Taheri,
N. Birbilis
Abstract:
Whilst multi-principal element alloys (MPEAs) remain a promising class of materials owing to several attractive mechanical properties, their corrosion performance is also unique. In this concise review, we present an emerging overview of some of the general features related to MPEA corrosion, following a decade of work in the field. This includes highlighting some of the key aspects related to the…
▽ More
Whilst multi-principal element alloys (MPEAs) remain a promising class of materials owing to several attractive mechanical properties, their corrosion performance is also unique. In this concise review, we present an emerging overview of some of the general features related to MPEA corrosion, following a decade of work in the field. This includes highlighting some of the key aspects related to the electrochemical phenomena in MPEA corrosion, and the relevant future works required for a holistic mechanistic understanding. In addition, a comprehensive database of the reported corrosion performance of MPEAs is presented, based on works reported to date. The database is assembled to also allow users to undertake machine learning or their own data analysis, with a parsed representation of alloy composition, test electrolyte, and corrosion related parameters.
△ Less
Submitted 9 May, 2024;
originally announced May 2024.
-
Optimizing E-commerce Search: Toward a Generalizable and Rank-Consistent Pre-Ranking Model
Authors:
Enqiang Xu,
Yiming Qiu,
Junyang Bai,
Ping Zhang,
Dadong Miao,
Songlin Wang,
Guoyu Tang,
Lin Liu,
Mingming Li
Abstract:
In large e-commerce platforms, search systems are typically composed of a series of modules, including recall, pre-ranking, and ranking phases. The pre-ranking phase, serving as a lightweight module, is crucial for filtering out the bulk of products in advance for the downstream ranking module. Industrial efforts on optimizing the pre-ranking model have predominantly focused on enhancing ranking c…
▽ More
In large e-commerce platforms, search systems are typically composed of a series of modules, including recall, pre-ranking, and ranking phases. The pre-ranking phase, serving as a lightweight module, is crucial for filtering out the bulk of products in advance for the downstream ranking module. Industrial efforts on optimizing the pre-ranking model have predominantly focused on enhancing ranking consistency, model structure, and generalization towards long-tail items. Beyond these optimizations, meeting the system performance requirements presents a significant challenge. Contrasting with existing industry works, we propose a novel method: a Generalizable and RAnk-ConsistEnt Pre-Ranking Model (GRACE), which achieves: 1) Ranking consistency by introducing multiple binary classification tasks that predict whether a product is within the top-k results as estimated by the ranking model, which facilitates the addition of learning objectives on common point-wise ranking models; 2) Generalizability through contrastive learning of representation for all products by pre-training on a subset of ranking product embeddings; 3) Ease of implementation in feature construction and online deployment. Our extensive experiments demonstrate significant improvements in both offline metrics and online A/B test: a 0.75% increase in AUC and a 1.28% increase in CVR.
△ Less
Submitted 13 May, 2024; v1 submitted 9 May, 2024;
originally announced May 2024.
-
Merging Parameter Estimation and Classification Using LASSO
Authors:
Le Wang,
Ying Wang,
Yu Qiu,
Mian Li,
Håkan Hjalmarsson
Abstract:
Soft sensing is a way to indirectly obtain information of signals for which direct sensing is difficult or prohibitively expensive. It may not a priori be evident which sensors provide useful information about the target signal. There may be sensors irrelevant for the estimation as well as sensors for which the information is very poor. It is often required that the soft sensor should cover a wide…
▽ More
Soft sensing is a way to indirectly obtain information of signals for which direct sensing is difficult or prohibitively expensive. It may not a priori be evident which sensors provide useful information about the target signal. There may be sensors irrelevant for the estimation as well as sensors for which the information is very poor. It is often required that the soft sensor should cover a wide range of operating points. This means that some sensors may be useful in certain operating conditions while irrelevant in others, while others may have no bearing on the target signal whatsoever. However, this type of structural information is typically not available but has to be deduced from data. A further compounding issue is that multiple operating conditions may be described by the same model, but which ones is not known in advance either. In this contribution, we provide a systematic method to construct a soft sensor that can deal with these issues. While the different models can be used, we adopt the multi-input single output finite impulse response models since they are linear in the parameters. We propose a single estimation criterion, where the objectives are encoded in terms of model fit, model sparsity (reducing the number of different models), and model parameter coefficient sparsity (to exclude irrelevant sensors). A post-processing model clustering step is also included. As proof of concept, the method is tested on field test datasets from a prototype vehicle.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Design, analysis, and manufacturing of a glass-plastic hybrid minimalist aspheric panoramic annular lens
Authors:
Shaohua Gao,
Qi Jiang,
Yiqi Liao,
Yi Qiu,
Wanglei Ying,
Kailun Yang,
Kaiwei Wang,
Benhao Zhang,
Jian Bai
Abstract:
We propose a high-performance glass-plastic hybrid minimalist aspheric panoramic annular lens (ASPAL) to solve several major limitations of the traditional panoramic annular lens (PAL), such as large size, high weight, and complex system. The field of view (FoV) of the ASPAL is 360°x(35°~110°) and the imaging quality is close to the diffraction limit. This large FoV ASPAL is composed of only 4 len…
▽ More
We propose a high-performance glass-plastic hybrid minimalist aspheric panoramic annular lens (ASPAL) to solve several major limitations of the traditional panoramic annular lens (PAL), such as large size, high weight, and complex system. The field of view (FoV) of the ASPAL is 360°x(35°~110°) and the imaging quality is close to the diffraction limit. This large FoV ASPAL is composed of only 4 lenses. Moreover, we establish a physical structure model of PAL using the ray tracing method and study the influence of its physical parameters on compactness ratio. In addition, for the evaluation of local tolerances of annular surfaces, we propose a tolerance analysis method suitable for ASPAL. This analytical method can effectively analyze surface irregularities on annular surfaces and provide clear guidance on manufacturing tolerances for ASPAL. Benefiting from high-precision glass molding and injection molding aspheric lens manufacturing techniques, we finally manufactured 20 ASPALs in small batches. The weight of an ASPAL prototype is only 8.5 g. Our framework provides promising insights for the application of panoramic systems in space and weight-constrained environmental sensing scenarios such as intelligent security, micro-UAVs, and micro-robots.
△ Less
Submitted 5 May, 2024;
originally announced May 2024.
-
Privacy-Enhanced Database Synthesis for Benchmark Publishing
Authors:
Yongrui Zhong,
Yunqing Ge,
Jianbin Qin,
Shuyuan Zheng,
Bo Tang,
Yu-Xuan Qiu,
Rui Mao,
Ye Yuan,
Makoto Onizuka,
Chuan Xiao
Abstract:
Benchmarking is crucial for evaluating a DBMS, yet existing benchmarks often fail to reflect the varied nature of user workloads. As a result, there is increasing momentum toward creating databases that incorporate real-world user data to more accurately mirror business environments. However, privacy concerns deter users from directly sharing their data, underscoring the importance of creating syn…
▽ More
Benchmarking is crucial for evaluating a DBMS, yet existing benchmarks often fail to reflect the varied nature of user workloads. As a result, there is increasing momentum toward creating databases that incorporate real-world user data to more accurately mirror business environments. However, privacy concerns deter users from directly sharing their data, underscoring the importance of creating synthesized databases for benchmarking that also prioritize privacy protection. Differential privacy has become a key method for safeguarding privacy when sharing data, but the focus has largely been on minimizing errors in aggregate queries or classification tasks, with less attention given to benchmarking factors like runtime performance. This paper delves into the creation of privacy-preserving databases specifically for benchmarking, aiming to produce a differentially private database whose query performance closely resembles that of the original data. Introducing PrivBench, an innovative synthesis framework, we support the generation of high-quality data that maintains privacy. PrivBench uses sum-product networks (SPNs) to partition and sample data, enhancing data representation while securing privacy. The framework allows users to adjust the detail of SPN partitions and privacy settings, crucial for customizing privacy levels. We validate our approach, which uses the Laplace and exponential mechanisms, in maintaining privacy. Our tests show that PrivBench effectively generates data that maintains privacy and excels in query performance, consistently reducing errors in query execution time, query cardinality, and KL divergence.
△ Less
Submitted 2 May, 2024;
originally announced May 2024.
-
On orbit categories with dg enhancement
Authors:
Li Fan,
Bernhard Keller,
Yu Qiu
Abstract:
We show that pretriangulated dg categories enjoy a universal property and deduce that the passage to an orbit quotient commutes with the dg quotient. In particular, for a triangulated category with dg enhancement and an endofunctor, there exists a unique triangulated orbit category.
As an application, we prove that for any connective, smooth and proper dg algebra $A$, its perfect derived categor…
▽ More
We show that pretriangulated dg categories enjoy a universal property and deduce that the passage to an orbit quotient commutes with the dg quotient. In particular, for a triangulated category with dg enhancement and an endofunctor, there exists a unique triangulated orbit category.
As an application, we prove that for any connective, smooth and proper dg algebra $A$, its perfect derived category is equivalent to the generalized $(\mathbb{X}-1)$-cluster category of $A$. This implies that the orbit $m$-cluster category of $A$ is equivalent to the generalized $m$-cluster category of $A$, which implies a conjecture by Ikeda-Qiu for the case when $A$ is a smooth proper graded gentle algebra.
△ Less
Submitted 30 April, 2024;
originally announced May 2024.
-
Broad and Bi-directional narrow quasi-periodic fast-propagating wave trains associated with a filament-driven halo CME on 2023 April 21
Authors:
Xinping Zhou,
Yuandeng Shen,
Yihua Yan,
Ke Yu,
Zhining Qu,
Ahmed Ahmed Ibrahim,
Zehao Tang,
Chengrui Zhou,
Song Tan,
Ye Qiu,
Hongfei Liang
Abstract:
This paper presents three distinct wave trains that occurred on 2023 April 21: a broad quasi-periodic fast-propagating (QFP) wave train and a bi-directional narrow QFP wave train. The broad QFP wave train expands outward in a circular wavefront, while bi-directional narrow QFP wave trains propagate in the northward and southward directions, respectively. The concurrent presence of the wave trains…
▽ More
This paper presents three distinct wave trains that occurred on 2023 April 21: a broad quasi-periodic fast-propagating (QFP) wave train and a bi-directional narrow QFP wave train. The broad QFP wave train expands outward in a circular wavefront, while bi-directional narrow QFP wave trains propagate in the northward and southward directions, respectively. The concurrent presence of the wave trains offers a remarkable opportunity to investigate their respective triggering mechanisms. Measurement shows that the broad QFP wave train's speed is 300- 1100 km/s in different propagating directions. There is a significant difference in the speed of the bi-directional narrow QFP wave trains: the southward propagation achieves 1400 km/s, while the northward propagation only reaches about 550 km/s accompanied by a deceleration of about 1- 2 kms-2. Using the wavelet analysis, we find that the periodicity of the propagating wave trains in the southward and northward directions closely matches the quasi-periodic pulsations (QPPs) exhibited by the flares. Based on these results, the narrow QFP wave trains were most likely excited by the intermittent energy release in the accompanying flare. In contrast, the broad QFP wave train had a tight relationship with the erupting filament, probably attributed to the unwinding motion of the erupting filament or the leakage of the fast sausage wave train inside the filament body.
△ Less
Submitted 28 April, 2024;
originally announced April 2024.
-
A Conditional Independence Test in the Presence of Discretization
Authors:
Boyang Sun,
Yu Yao,
Huangyuan Hao,
Yumou Qiu,
Kun Zhang
Abstract:
Testing conditional independence has many applications, such as in Bayesian network learning and causal discovery. Different test methods have been proposed. However, existing methods generally can not work when only discretized observations are available. Specifically, consider $X_1$, $\tilde{X}_2$ and $X_3$ are observed variables, where $\tilde{X}_2$ is a discretization of latent variables…
▽ More
Testing conditional independence has many applications, such as in Bayesian network learning and causal discovery. Different test methods have been proposed. However, existing methods generally can not work when only discretized observations are available. Specifically, consider $X_1$, $\tilde{X}_2$ and $X_3$ are observed variables, where $\tilde{X}_2$ is a discretization of latent variables $X_2$. Applying existing test methods to the observations of $X_1$, $\tilde{X}_2$ and $X_3$ can lead to a false conclusion about the underlying conditional independence of variables $X_1$, $X_2$ and $X_3$. Motivated by this, we propose a conditional independence test specifically designed to accommodate the presence of such discretization. To achieve this, we design the bridge equations to recover the parameter reflecting the statistical information of the underlying latent continuous variables. An appropriate test statistic and its asymptotic distribution under the null hypothesis of conditional independence have also been derived. Both theoretical results and empirical validation have been provided, demonstrating the effectiveness of our test methods.
△ Less
Submitted 3 May, 2024; v1 submitted 26 April, 2024;
originally announced April 2024.
-
MDDD: Manifold-based Domain Adaptation with Dynamic Distribution for Non-Deep Transfer Learning in Cross-subject and Cross-session EEG-based Emotion Recognition
Authors:
Ting Luo,
Jing Zhang,
Yingwei Qiu,
Li Zhang,
Yaohua Hu,
Zhuliang Yu,
Zhen Liang
Abstract:
Emotion decoding using Electroencephalography (EEG)-based affective brain-computer interfaces represents a significant area within the field of affective computing. In the present study, we propose a novel non-deep transfer learning method, termed as Manifold-based Domain adaptation with Dynamic Distribution (MDDD). The proposed MDDD includes four main modules: manifold feature transformation, dyn…
▽ More
Emotion decoding using Electroencephalography (EEG)-based affective brain-computer interfaces represents a significant area within the field of affective computing. In the present study, we propose a novel non-deep transfer learning method, termed as Manifold-based Domain adaptation with Dynamic Distribution (MDDD). The proposed MDDD includes four main modules: manifold feature transformation, dynamic distribution alignment, classifier learning, and ensemble learning. The data undergoes a transformation onto an optimal Grassmann manifold space, enabling dynamic alignment of the source and target domains. This process prioritizes both marginal and conditional distributions according to their significance, ensuring enhanced adaptation efficiency across various types of data. In the classifier learning, the principle of structural risk minimization is integrated to develop robust classification models. This is complemented by dynamic distribution alignment, which refines the classifier iteratively. Additionally, the ensemble learning module aggregates the classifiers obtained at different stages of the optimization process, which leverages the diversity of the classifiers to enhance the overall prediction accuracy. The experimental results indicate that MDDD outperforms traditional non-deep learning methods, achieving an average improvement of 3.54%, and is comparable to deep learning methods. This suggests that MDDD could be a promising method for enhancing the utility and applicability of aBCIs in real-world scenarios.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
A Law of large numbers for vector-valued linear statistics of Bergman DPP
Authors:
Zhaofeng Lin,
Yanqi Qiu,
Kai Wang
Abstract:
We establish a law of large numbers for a certain class of vector-valued linear statistics for the Bergman determinantal point process on the unit disk. Our result seems to be the first LLN for vector-valued linear statistics in the setting of determinantal point processes. As an application, we prove that, for almost all configurations $X$ with respect to with respect to the Bergman determinantal…
▽ More
We establish a law of large numbers for a certain class of vector-valued linear statistics for the Bergman determinantal point process on the unit disk. Our result seems to be the first LLN for vector-valued linear statistics in the setting of determinantal point processes. As an application, we prove that, for almost all configurations $X$ with respect to with respect to the Bergman determinantal point process, the weighted Poincaré series (we denote by $d_{h}(\cdot,\cdot)$ the hyperbolic distance on $\mathbb{D}$) \begin{align*} \sum_{k=0}^\infty\sum_{x\in X\atop k\le d_{h}(z,x)<k+1}e^{-sd_{\mathrm{h}}(z,x)}f(x) \end{align*} cannot be simultaneously convergent for all Bergman functions $f\in A^2(\mathbb{D})$ whenever $1<s<3/2$. This confirms a result announced without proof in Bufetov-Qiu's work.
△ Less
Submitted 23 April, 2024;
originally announced April 2024.
-
Unsupervised Learning of Individual Kohn-Sham States: Interpretable Representations and Consequences for Downstream Predictions of Many-Body Effects
Authors:
Bowen Hou,
Jinyuan Wu,
Diana Y. Qiu
Abstract:
Representation learning for the electronic structure problem is a major challenge of machine learning in computational condensed matter and materials physics. Within quantum mechanical first principles approaches, Kohn-Sham density functional theory (DFT) is the preeminent tool for understanding electronic structure, and the high-dimensional wavefunctions calculated in this approach serve as the b…
▽ More
Representation learning for the electronic structure problem is a major challenge of machine learning in computational condensed matter and materials physics. Within quantum mechanical first principles approaches, Kohn-Sham density functional theory (DFT) is the preeminent tool for understanding electronic structure, and the high-dimensional wavefunctions calculated in this approach serve as the building block for downstream calculations of correlated many-body excitations and related physical observables. Here, we use variational autoencoders (VAE) for the unsupervised learning of high-dimensional DFT wavefunctions and show that these wavefunctions lie in a low-dimensional manifold within the latent space. Our model autonomously determines the optimal representation of the electronic structure, avoiding limitations due to manual feature engineering and selection in prior work. To demonstrate the utility of the latent space representation of the DFT wavefunction, we use it for the supervised training of neural networks (NN) for downstream prediction of the quasiparticle bandstructures within the GW formalism, which includes many-electron correlations beyond DFT. The GW prediction achieves a low error of 0.11 eV for a combined test set of metals and semiconductors drawn from the Computational 2D Materials Database (C2DB), suggesting that latent space representation captures key physical information from the original data. Finally, we explore the interpretability of the VAE representation and show that the successful representation learning and downstream prediction by our model is derived from the smoothness of the VAE latent space, which also enables the generation of wavefunctions on arbitrary points in latent space. Our work provides a novel and general machine-learning framework for investigating electronic structure and many-body physics.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
IFViT: Interpretable Fixed-Length Representation for Fingerprint Matching via Vision Transformer
Authors:
Yuhang Qiu,
Honghui Chen,
Xingbo Dong,
Zheng Lin,
Iman Yi Liao,
Massimo Tistarelli,
Zhe Jin
Abstract:
Determining dense feature points on fingerprints used in constructing deep fixed-length representations for accurate matching, particularly at the pixel level, is of significant interest. To explore the interpretability of fingerprint matching, we propose a multi-stage interpretable fingerprint matching network, namely Interpretable Fixed-length Representation for Fingerprint Matching via Vision T…
▽ More
Determining dense feature points on fingerprints used in constructing deep fixed-length representations for accurate matching, particularly at the pixel level, is of significant interest. To explore the interpretability of fingerprint matching, we propose a multi-stage interpretable fingerprint matching network, namely Interpretable Fixed-length Representation for Fingerprint Matching via Vision Transformer (IFViT), which consists of two primary modules. The first module, an interpretable dense registration module, establishes a Vision Transformer (ViT)-based Siamese Network to capture long-range dependencies and the global context in fingerprint pairs. It provides interpretable dense pixel-wise correspondences of feature points for fingerprint alignment and enhances the interpretability in the subsequent matching stage. The second module takes into account both local and global representations of the aligned fingerprint pair to achieve an interpretable fixed-length representation extraction and matching. It employs the ViTs trained in the first module with the additional fully connected layer and retrains them to simultaneously produce the discriminative fixed-length representation and interpretable dense pixel-wise correspondences of feature points. Extensive experimental results on diverse publicly available fingerprint databases demonstrate that the proposed framework not only exhibits superior performance on dense registration and matching but also significantly promotes the interpretability in deep fixed-length representations-based fingerprint matching.
△ Less
Submitted 12 April, 2024;
originally announced April 2024.
-
SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation
Authors:
Heyuan Li,
Ce Chen,
Tianhao Shi,
Yuda Qiu,
Sizhe An,
Guanying Chen,
Xiaoguang Han
Abstract:
While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. Although PanoHead proves the possibilities of using a large-scale dataset with images of both frontal and back views for full-head synthesis, it often caus…
▽ More
While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. Although PanoHead proves the possibilities of using a large-scale dataset with images of both frontal and back views for full-head synthesis, it often causes artifacts for back views. Based on our in-depth analysis, we found the reasons are mainly twofold. First, from network architecture perspective, we found each plane in the utilized tri-plane/tri-grid representation space tends to confuse the features from both sides, causing "mirroring" artifacts (e.g., the glasses appear in the back). Second, from data supervision aspect, we found that existing discriminator training in 3D GANs mainly focuses on the quality of the rendered image itself, and does not care much about its plausibility with the perspective from which it was rendered. This makes it possible to generate "face" in non-frontal views, due to its easiness to fool the discriminator. In response, we propose SphereHead, a novel tri-plane representation in the spherical coordinate system that fits the human head's geometric characteristics and efficiently mitigates many of the generated artifacts. We further introduce a view-image consistency loss for the discriminator to emphasize the correspondence of the camera parameters and the images. The combination of these efforts results in visually superior outcomes with significantly fewer artifacts. Our code and dataset are publicly available at https://lhyfst.github.io/spherehead.
△ Less
Submitted 16 July, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Unsupervised machine learning for supercooled liquids
Authors:
Yunrui Qiu,
Inhyuk Jang,
Xuhui Huang,
Arun Yethiraj
Abstract:
Unraveling the relation between structural information and the dynamic properties of supercooled liquids is one of the grand challenges of physics. Dynamic heterogeneity, characterized by the propensity of particles, is often used as a proxy for the dynamic slowing down. In this work, we introduce an unsupervised machine learning approach based on a time-lagged autoencoder (TAE) to elucidate the e…
▽ More
Unraveling the relation between structural information and the dynamic properties of supercooled liquids is one of the grand challenges of physics. Dynamic heterogeneity, characterized by the propensity of particles, is often used as a proxy for the dynamic slowing down. In this work, we introduce an unsupervised machine learning approach based on a time-lagged autoencoder (TAE) to elucidate the effect of structural features on the long-time dynamic heterogeneity of supercooled liquids. The TAE uses an autoencoder to reconstruct features at time $t + Δt$ from input features at time $t$ for individual particles, and the resulting latent space variables are considered as order parameters. In the Kob-Andersen system, with a $Δt$ about a thousand times smaller than the relaxation time, the TAE order parameter exhibits a remarkable correlation with the long-time propensity. We find that radial features on all length-scales are required to capture the long-time dynamics, consistent with recent simulations. This shows that fluctuations of structural features contain sufficient information about the long-time dynamic heterogeneity.
△ Less
Submitted 25 April, 2024; v1 submitted 5 April, 2024;
originally announced April 2024.
-
An Information Bottleneck Approach for Markov Model Construction
Authors:
Dedi Wang,
Yunrui Qiu,
Eric Beyerle,
Xuhui Huang,
Pratyush Tiwary
Abstract:
Markov state models (MSMs) are valuable for studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with the dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific…
▽ More
Markov state models (MSMs) are valuable for studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with the dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time requires state defined without significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process coarse grains time and space, integrating out rapid motions within metastable states. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), which unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multi-resolution Markovian models. When applied to mini-proteins trajectories, SPIB showcases unique advantages compared to competing methods. It automatically adjusts the number of metastable states based on a specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. Accordingly, we propose SPIB as an easy-to-implement methodology for end-to-end MSM construction.
△ Less
Submitted 10 June, 2024; v1 submitted 3 April, 2024;
originally announced April 2024.
-
CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)
Authors:
Xu Li,
Ruiqi Sun,
Jiameng Lv,
Peng Jia,
Nan Li,
Chengliang Wei,
Zou Hu,
Xinzhong Er,
Yun Chen,
Zhang Ban,
Yuedong Fang,
Qi Guo,
Dezi Liu,
Guoliang Li,
Lin Lin,
Ming Li,
Ran Li,
Xiaobo Li,
Yu Luo,
Xianmin Meng,
Jundan Nie,
Zhaoxiang Qi,
Yisheng Qiu,
Li Shao,
Hao Tian
, et al. (7 additional authors not shown)
Abstract:
Strong gravitational lensing is a powerful tool for investigating dark matter and dark energy properties. With the advent of large-scale sky surveys, we can discover strong lensing systems on an unprecedented scale, which requires efficient tools to extract them from billions of astronomical objects. The existing mainstream lens-finding tools are based on machine learning algorithms and applied to…
▽ More
Strong gravitational lensing is a powerful tool for investigating dark matter and dark energy properties. With the advent of large-scale sky surveys, we can discover strong lensing systems on an unprecedented scale, which requires efficient tools to extract them from billions of astronomical objects. The existing mainstream lens-finding tools are based on machine learning algorithms and applied to cut-out-centered galaxies. However, according to the design and survey strategy of optical surveys by CSST, preparing cutouts with multiple bands requires considerable efforts. To overcome these challenges, we have developed a framework based on a hierarchical visual Transformer with a sliding window technique to search for strong lensing systems within entire images. Moreover, given that multi-color images of strong lensing systems can provide insights into their physical characteristics, our framework is specifically crafted to identify strong lensing systems in images with any number of channels. As evaluated using CSST mock data based on an Semi-Analytic Model named CosmoDC2, our framework achieves precision and recall rates of 0.98 and 0.90, respectively. To evaluate the effectiveness of our method in real observations, we have applied it to a subset of images from the DESI Legacy Imaging Surveys and media images from Euclid Early Release Observations. 61 new strong lensing system candidates are discovered by our method. However, we also identified false positives arising primarily from the simplified galaxy morphology assumptions within the simulation. This underscores the practical limitations of our approach while simultaneously highlighting potential avenues for future improvements.
△ Less
Submitted 2 April, 2024;
originally announced April 2024.
-
Debiased calibration estimation using generalized entropy in survey sampling
Authors:
Yonghyun Kwon,
Jae Kwang Kim,
Yumou Qiu
Abstract:
Incorporating the auxiliary information into the survey estimation is a fundamental problem in survey sampling. Calibration weighting is a popular tool for incorporating the auxiliary information. The calibration weighting method of Deville and Sarndal (1992) uses a distance measure between the design weights and the final weights to solve the optimization problem with calibration constraints. Thi…
▽ More
Incorporating the auxiliary information into the survey estimation is a fundamental problem in survey sampling. Calibration weighting is a popular tool for incorporating the auxiliary information. The calibration weighting method of Deville and Sarndal (1992) uses a distance measure between the design weights and the final weights to solve the optimization problem with calibration constraints. This paper introduces a novel framework that leverages generalized entropy as the objective function for optimization, where design weights play a role in the constraints to ensure design consistency, rather than being part of the objective function. This innovative calibration framework is particularly attractive due to its generality and its ability to generate more efficient calibration weights compared to traditional methods based on Deville and Sarndal (1992). Furthermore, we identify the optimal choice of the generalized entropy function that achieves the minimum variance across various choices of the generalized entropy function under the same constraints. Asymptotic properties, such as design consistency and asymptotic normality, are presented rigorously. The results from a limited simulation study are also presented. We demonstrate a real-life application using agricultural survey data collected from Kynetec, Inc.
△ Less
Submitted 2 April, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Some super-Poincaré inequalities for gaussian-like measures on stratified Lie groups
Authors:
Yaozhong W. Qiu
Abstract:
We continue the $U$-bound program initiated in [J. Funct. Anal. 258, 814-851 (2010)] and prove super-Poincaré inequalities for a class of subelliptic probability measures defined on Métivier groups, the main ingredient in the proof being a Hardy-type inequality. In doing so, we recover and extend some previous results from the probabilistic viewpoint.
We continue the $U$-bound program initiated in [J. Funct. Anal. 258, 814-851 (2010)] and prove super-Poincaré inequalities for a class of subelliptic probability measures defined on Métivier groups, the main ingredient in the proof being a Hardy-type inequality. In doing so, we recover and extend some previous results from the probabilistic viewpoint.
△ Less
Submitted 25 May, 2024; v1 submitted 30 March, 2024;
originally announced April 2024.
-
A geometric realization of Koszul duality for graded gentle algebras
Authors:
Zixu Li,
Yu Qiu,
Yu Zhou
Abstract:
We show that the Koszul functor of a homologically smooth graded gentle algebra can be realized as the half rotation in a geometric model. As a byproduct, we prove an intersection-dim formula involving the Koszul functor.
We show that the Koszul functor of a homologically smooth graded gentle algebra can be realized as the half rotation in a geometric model. As a byproduct, we prove an intersection-dim formula involving the Koszul functor.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation
Authors:
Mariia Khan,
Yue Qiu,
Yuren Cong,
Jumana Abu-Khalaf,
David Suter,
Bodo Rosenhahn
Abstract:
Multi-class multi-instance segmentation is the task of identifying masks for multiple object classes and multiple instances of the same class within an image. The foundational Segment Anything Model (SAM) is designed for promptable multi-class multi-instance segmentation but tends to output part or sub-part masks in the "everything" mode for various real-world applications. Whole object segmentati…
▽ More
Multi-class multi-instance segmentation is the task of identifying masks for multiple object classes and multiple instances of the same class within an image. The foundational Segment Anything Model (SAM) is designed for promptable multi-class multi-instance segmentation but tends to output part or sub-part masks in the "everything" mode for various real-world applications. Whole object segmentation masks play a crucial role for indoor scene understanding, especially in robotics applications. We propose a new domain invariant Real-to-Simulation (Real-Sim) fine-tuning strategy for SAM. We use object images and ground truth data collected from Ai2Thor simulator during fine-tuning (real-to-sim). To allow our Segment Any Object Model (SAOM) to work in the "everything" mode, we propose the novel nearest neighbour assignment method, updating point embeddings for each ground-truth mask. SAOM is evaluated on our own dataset collected from Ai2Thor simulator. SAOM significantly improves on SAM, with a 28% increase in mIoU and a 25% increase in mAcc for 54 frequently-seen indoor object classes. Moreover, our Real-to-Simulation fine-tuning strategy demonstrates promising generalization performance in real environments without being trained on the real-world data (sim-to-real). The dataset and the code will be released after publication.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.