Search | arXiv e-print repository

Convergences of Combinatorial Ricci Flows to Degenerated Circle Packings in Hyperbolic Background Geometry

Authors: Guangming Hu, Sicheng Lu, Dong Tan, Youliang Zhong, Puchun Zhou

Abstract: This paper investigates a kind of degenerated circle packings in hyperbolic background geometry. A main problem is whether a prescribed total geodesic curvature data can be realized by a degenerated circle packing or not. We fully characterize the sufficient and necessary conditions and show the uniqueness. Furthermore, we introduce the combinatoral Ricci flow to find the desired degenerated circl… ▽ More This paper investigates a kind of degenerated circle packings in hyperbolic background geometry. A main problem is whether a prescribed total geodesic curvature data can be realized by a degenerated circle packing or not. We fully characterize the sufficient and necessary conditions and show the uniqueness. Furthermore, we introduce the combinatoral Ricci flow to find the desired degenerated circle packed surface, analougus to the methods of Chow-Luo and Takatsu. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 36 pages, 9 figures

MSC Class: 52C26; 57M50

arXiv:2407.05840 [pdf, other]

A 103-TOPS/mm$^2$ Integrated Photonic Computing Engine Enabling Next-Generation Reservoir Computing

Authors: Dongliang Wang, Yikun Nie, Gaolei Hu, Hon Ki Tsang, Chaoran Huang

Abstract: Reservoir computing (RC) is a leading machine learning algorithm for information processing due to its rich expressiveness. A new RC paradigm has recently emerged, showcasing superior performance and delivering more interpretable results with shorter training data sets and training times, representing the next generation of RC computing. This work presents the first realization of a high-speed nex… ▽ More Reservoir computing (RC) is a leading machine learning algorithm for information processing due to its rich expressiveness. A new RC paradigm has recently emerged, showcasing superior performance and delivering more interpretable results with shorter training data sets and training times, representing the next generation of RC computing. This work presents the first realization of a high-speed next-generation RC system on an integrated photonic chip. Our experimental results demonstrate state-of-the-art forecasting and classification performances under various machine learning tasks and achieve the fastest speeds of 60 Gbaud and a computing density of 103 tera operations/second/mm$^2$ (TOPS/mm$^2$). The passive system, composed of a simple star coupler with on-chip delay lines, offers several advantages over traditional RC systems, including no speed limitations, compact footprint, extremely high fabrication error tolerance, fewer metaparameters, and greater interpretability. This work lays the foundation for ultrafast on-chip photonic RC, representing significant progress toward developing next-generation high-speed photonic computing and signal processing. △ Less

Submitted 31 May, 2024; originally announced July 2024.

arXiv:2407.03835 [pdf, other]

7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition

Authors: Dimitrios Kollias, Stefanos Zafeiriou, Irene Kotsia, Abhinav Dhall, Shreya Ghosh, Chunchang Shao, Guanyu Hu

Abstract: This paper describes the 7th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with ECCV 2024. The 7th ABAW Competition addresses novel challenges in understanding human expressions and behaviors, crucial for the development of human-centered technologies. The Competition comprises of two sub-challenges: i) Multi-Task Learning… ▽ More This paper describes the 7th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with ECCV 2024. The 7th ABAW Competition addresses novel challenges in understanding human expressions and behaviors, crucial for the development of human-centered technologies. The Competition comprises of two sub-challenges: i) Multi-Task Learning (the goal is to learn at the same time, in a multi-task learning setting, to estimate two continuous affect dimensions, valence and arousal, to recognise between the mutually exclusive classes of the 7 basic expressions and 'other'), and to detect 12 Action Units); and ii) Compound Expression Recognition (the target is to recognise between the 7 mutually exclusive compound expression classes). s-Aff-Wild2, which is a static version of the A/V Aff-Wild2 database and contains annotations for valence-arousal, expressions and Action Units, is utilized for the purposes of the Multi-Task Learning Challenge; a part of C-EXPR-DB, which is an A/V in-the-wild database with compound expression annotations, is utilized for the purposes of the Compound Expression Recognition Challenge. In this paper, we introduce the two challenges, detailing their datasets and the protocols followed for each. We also outline the evaluation metrics, and highlight the baseline systems and their results. Additional information about the competition can be found at \url{https://affective-behavior-analysis-in-the-wild.github.io/7th}. △ Less

Submitted 8 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

arXiv:2406.14992 [pdf, other]

A multi-mesh approach for accurate computation of multi-target functionals in aerodynamics design

Authors: Guanghui Hu, Ruo Li, Jingfeng Wang

Abstract: Aerodynamic optimal design is crucial for enhancing performance of aircrafts, while calculating multi-target functionals through solving dual equations with arbitrary right-hand sides remains challenging. In this paper, a novel multi-target framework of DWR-based mesh refinement is proposed and analyzed. Theoretically, an extrapolation method is generalized to expand multi-variable functionals, wh… ▽ More Aerodynamic optimal design is crucial for enhancing performance of aircrafts, while calculating multi-target functionals through solving dual equations with arbitrary right-hand sides remains challenging. In this paper, a novel multi-target framework of DWR-based mesh refinement is proposed and analyzed. Theoretically, an extrapolation method is generalized to expand multi-variable functionals, which guarantees the dual equations of different objective functionals can be calculated separately. Numerically, an algorithm of calculating multi-target functionals is designed based on the multi-mesh approach, which can help to obtain different dual solutions simultaneously. One feature of our framework is the algorithm is easy to implement with the help of the hierarchical geometry tree structure and the calculation avoids the Galerkin orthogonality naturally. The framework takes a balance between different targets even when they are not the same orders of magnitude. While existing approach uses a linear combination of different components in multi-target functionals for adaptation, it introduces additional coefficients for adjusting. With each component calculated under a dual-consistent scheme, this multi-mesh framework addresses challenges such as the lift-drag ratio and other kinds of multi-target functionals, ensuring smooth convergence and precise calculations of dual solutions. △ Less

Submitted 21 June, 2024; originally announced June 2024.

arXiv:2406.12180 [pdf]

Unusual charge density wave introduced by Janus structure in monolayer vanadium dichalcogenides

Authors: Ziqiang Xu, Yan Shao, Chun Huang, Genyu Hu, Shihao Hu, Zhi-Lin Li, Xiaoyu Hao, Yanhui Hou, Teng Zhang, Jin-An Shi, Chen Liu, Jia-Ou Wang, Wu Zhou, Jiadong Zhou, Wei Ji, Jingsi Qiao, Xu Wu, Hong-Jun Gao, Yeliang Wang

Abstract: As a fundamental structural feature, the symmetry of materials determines the exotic quantum properties in transition metal dichalcogenides (TMDs) with charge density wave (CDW). Breaking the inversion symmetry, the Janus structure, an artificially constructed lattice, provides an opportunity to tune the CDW states and the related properties. However, limited by the difficulties in atomic-level fa… ▽ More As a fundamental structural feature, the symmetry of materials determines the exotic quantum properties in transition metal dichalcogenides (TMDs) with charge density wave (CDW). Breaking the inversion symmetry, the Janus structure, an artificially constructed lattice, provides an opportunity to tune the CDW states and the related properties. However, limited by the difficulties in atomic-level fabrication and material stability, the experimental visualization of the CDW states in 2D TMDs with Janus structure is still rare. Here, using surface selenization of VTe2, we fabricated monolayer Janus VTeSe. With scanning tunneling microscopy, an unusual root13-root13 CDW state with threefold rotational symmetry breaking was observed and characterized. Combined with theoretical calculations, we find this CDW state can be attributed to the charge modulation in the Janus VTeSe, beyond the conventional electron-phonon coupling. Our findings provide a promising platform for studying the CDW states and artificially tuning the electronic properties toward the applications. △ Less

Submitted 17 June, 2024; originally announced June 2024.

arXiv:2406.12164 [pdf, other]

A Mel Spectrogram Enhancement Paradigm Based on CWT in Speech Synthesis

Authors: Guoqiang Hu, Huaning Tan, Ruilai Li

Abstract: Acoustic features play an important role in improving the quality of the synthesised speech. Currently, the Mel spectrogram is a widely employed acoustic feature in most acoustic models. However, due to the fine-grained loss caused by its Fourier transform process, the clarity of speech synthesised by Mel spectrogram is compromised in mutant signals. In order to obtain a more detailed Mel spectrog… ▽ More Acoustic features play an important role in improving the quality of the synthesised speech. Currently, the Mel spectrogram is a widely employed acoustic feature in most acoustic models. However, due to the fine-grained loss caused by its Fourier transform process, the clarity of speech synthesised by Mel spectrogram is compromised in mutant signals. In order to obtain a more detailed Mel spectrogram, we propose a Mel spectrogram enhancement paradigm based on the continuous wavelet transform (CWT). This paradigm introduces an additional task: a more detailed wavelet spectrogram, which like the post-processing network takes as input the Mel spectrogram output by the decoder. We choose Tacotron2 and Fastspeech2 for experimental validation in order to test autoregressive (AR) and non-autoregressive (NAR) speech systems, respectively. The experimental results demonstrate that the speech synthesised using the model with the Mel spectrogram enhancement paradigm exhibits higher MOS, with an improvement of 0.14 and 0.09 compared to the baseline model, respectively. These findings provide some validation for the universality of the enhancement paradigm, as they demonstrate the success of the paradigm in different architectures. △ Less

Submitted 9 July, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

Comments: Accepted by IALP 2024

arXiv:2406.11030 [pdf, other]

FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture

Authors: Wenyan Li, Xinyu Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott

Abstract: Food is a rich and varied dimension of cultural heritage, crucial to both individuals and social groups. To bridge the gap in the literature on the often-overlooked regional diversity in this domain, we introduce FoodieQA, a manually curated, fine-grained image-text dataset capturing the intricate features of food cultures across various regions in China. We evaluate vision-language Models (VLMs)… ▽ More Food is a rich and varied dimension of cultural heritage, crucial to both individuals and social groups. To bridge the gap in the literature on the often-overlooked regional diversity in this domain, we introduce FoodieQA, a manually curated, fine-grained image-text dataset capturing the intricate features of food cultures across various regions in China. We evaluate vision-language Models (VLMs) and large language models (LLMs) on newly collected, unseen food images and corresponding questions. FoodieQA comprises three multiple-choice question-answering tasks where models need to answer questions based on multiple images, a single image, and text-only descriptions, respectively. While LLMs excel at text-based question answering, surpassing human accuracy, the open-sourced VLMs still fall short by 41\% on multi-image and 21\% on single-image VQA tasks, although closed-weights models perform closer to human levels (within 10\%). Our findings highlight that understanding food and its cultural implications remains a challenging and under-explored direction. △ Less

Submitted 16 June, 2024; originally announced June 2024.

arXiv:2406.10052 [pdf, other]

Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection

Authors: Haoyu Wang, Guoqiang Hu, Guodong Lin, Wei-Qiang Zhang, Jian Li

Abstract: As a robust and large-scale multilingual speech recognition model, Whisper has demonstrated impressive results in many low-resource and out-of-distribution scenarios. However, its encoder-decoder structure hinders its application to streaming speech recognition. In this paper, we introduce Simul-Whisper, which uses the time alignment embedded in Whisper's cross-attention to guide auto-regressive d… ▽ More As a robust and large-scale multilingual speech recognition model, Whisper has demonstrated impressive results in many low-resource and out-of-distribution scenarios. However, its encoder-decoder structure hinders its application to streaming speech recognition. In this paper, we introduce Simul-Whisper, which uses the time alignment embedded in Whisper's cross-attention to guide auto-regressive decoding and achieve chunk-based streaming ASR without any fine-tuning of the pre-trained model. Furthermore, we observe the negative effect of the truncated words at the chunk boundaries on the decoding results and propose an integrate-and-fire-based truncation detection model to address this issue. Experiments on multiple languages and Whisper architectures show that Simul-Whisper achieves an average absolute word error rate degradation of only 1.46% at a chunk size of 1 second, which significantly outperforms the current state-of-the-art baseline. △ Less

Submitted 14 June, 2024; originally announced June 2024.

Comments: Accepted by INTERSPEECH 2024

arXiv:2406.09776 [pdf, other]

Faster Convergence on Heterogeneous Federated Edge Learning: An Adaptive Clustered Data Sharing Approach

Authors: Gang Hu, Yinglei Teng, Nan Wang, Zhu Han

Abstract: Federated Edge Learning (FEEL) emerges as a pioneering distributed machine learning paradigm for the 6G Hyper-Connectivity, harnessing data from the Internet of Things (IoT) devices while upholding data privacy. However, current FEEL algorithms struggle with non-independent and non-identically distributed (non-IID) data, leading to elevated communication costs and compromised model accuracy. To ad… ▽ More Federated Edge Learning (FEEL) emerges as a pioneering distributed machine learning paradigm for the 6G Hyper-Connectivity, harnessing data from the Internet of Things (IoT) devices while upholding data privacy. However, current FEEL algorithms struggle with non-independent and non-identically distributed (non-IID) data, leading to elevated communication costs and compromised model accuracy. To address these statistical imbalances within FEEL, we introduce a clustered data sharing framework, mitigating data heterogeneity by selectively sharing partial data from cluster heads to trusted associates through sidelink-aided multicasting. The collective communication pattern is integral to FEEL training, where both cluster formation and the efficiency of communication and computation impact training latency and accuracy simultaneously. To tackle the strictly coupled data sharing and resource optimization, we decompose the overall optimization problem into the clients clustering and effective data sharing subproblems. Specifically, a distribution-based adaptive clustering algorithm (DACA) is devised basing on three deductive cluster forming conditions, which ensures the maximum sharing yield. Meanwhile, we design a stochastic optimization based joint computed frequency and shared data volume optimization (JFVO) algorithm, determining the optimal resource allocation with an uncertain objective function. The experiments show that the proposed framework facilitates FEEL on non-IID datasets with faster convergence rate and higher model accuracy in a limited communication environment. △ Less

Submitted 8 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

arXiv:2406.09304 [pdf]

Self-reconfigurable Multifunctional Memristive Nociceptor for Intelligent Robotics

Authors: Shengbo Wang, Mingchao Fang, Lekai Song, Cong Li, Jian Zhang, Arokia Nathan, Guohua Hu, Shuo Gao

Abstract: Artificial nociceptors, mimicking human-like stimuli perception, are of significance for intelligent robotics to work in hazardous and dynamic scenarios. One of the most essential characteristics of the human nociceptor is its self-adjustable attribute, which indicates that the threshold of determination of a potentially hazardous stimulus relies on environmental knowledge. This critical attribute… ▽ More Artificial nociceptors, mimicking human-like stimuli perception, are of significance for intelligent robotics to work in hazardous and dynamic scenarios. One of the most essential characteristics of the human nociceptor is its self-adjustable attribute, which indicates that the threshold of determination of a potentially hazardous stimulus relies on environmental knowledge. This critical attribute has been currently omitted, but it is highly desired for artificial nociceptors. Inspired by these shortcomings, this article presents, for the first time, a Self-Directed Channel (SDC) memristor-based self-reconfigurable nociceptor, capable of perceiving hazardous pressure stimuli under different temperatures and demonstrates key features of tactile nociceptors, including 'threshold,' 'no-adaptation,' and 'sensitization.' The maximum amplification of hazardous external stimuli is 1000%, and its response characteristics dynamically adapt to current temperature conditions by automatically altering the generated modulation schemes for the memristor. The maximum difference ratio of the response of memristors at different temperatures is 500%, and this adaptability closely mimics the functions of biological tactile nociceptors, resulting in accurate danger perception in various conditions. Beyond temperature adaptation, this memristor-based nociceptor has the potential to integrate different sensory modalities by applying various sensors, thereby achieving human-like perception capabilities in real-world environments. △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: 14 pages, 4 figures

arXiv:2406.07462 [pdf]

Rayleigh surface waves of extremal elastic materials

Authors: Yu Wei, Yi Chen, Wen Cheng, Xiaoning Liu, Gengkai Hu

Abstract: Extremal elastic materials here refer to a specific class of elastic materials whose elastic matrices exhibit one or more zero eigenvalues, resulting in soft deformation modes that, in principle, cost no energy. They can be approximated through artificially designed solid microstructures. Extremal elastic materials have exotic bulk wave properties unavailable with conventional solids due to the so… ▽ More Extremal elastic materials here refer to a specific class of elastic materials whose elastic matrices exhibit one or more zero eigenvalues, resulting in soft deformation modes that, in principle, cost no energy. They can be approximated through artificially designed solid microstructures. Extremal elastic materials have exotic bulk wave properties unavailable with conventional solids due to the soft modes, offering unprecedented opportunities for manipulating bulk waves, e.g., acting as phonon polarizers for elastic waves or invisibility cloaks for underwater acoustic waves. Despite their potential, Rayleigh surface waves, crucially linked to bulk wave behaviors of such extremal elastic materials, have largely remained unexplored so far. In this paper, we theoretically investigate the propagation of Rayleigh waves in extremal elastic materials based on continuum theory and verify our findings with designed microstructure metamaterials based on pantographic structures. Dispersion relations and polarizations of Rayleigh waves in extremal elastic materials are derived, and the impact of higher order gradient effects is also investigated by using strain gradient theory. This study provides a continuum model for exploring surface waves in extremal elastic materials and may stimulate applications of extremal elastic materials for controlling surface waves. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 8 figures

arXiv:2406.01503 [pdf, ps, other]

An inverse obstacle problem with a single pair of Cauchy data: Laplace's equation case

Authors: Xiaoxu Xu, Guanghui Hu

Abstract: This paper is concerned with an inverse obstacle problem for the Laplace's equation. The aim is to recover the constant conductivity coefficient in the equation and the boundary of a Dirichlet polygonal obstacle from a single pair of Cauchy data. Uniqueness results are established under some a priori assumptions on the input boundary value data. A domain-defined sampling method, based on the facto… ▽ More This paper is concerned with an inverse obstacle problem for the Laplace's equation. The aim is to recover the constant conductivity coefficient in the equation and the boundary of a Dirichlet polygonal obstacle from a single pair of Cauchy data. Uniqueness results are established under some a priori assumptions on the input boundary value data. A domain-defined sampling method, based on the factorization method originating from inverse acoustic scattering, has been proposed to recover both the constant conductivity coefficient and the polygonal obstacle. A hybrid strategy, which combines the sampling method and iterative scheme, is employed {\color{hgh}to reconstruct} the location and shape of the obstacle. Numerical examples indicate that our method is efficient. △ Less

Submitted 3 June, 2024; originally announced June 2024.

MSC Class: 35R30; 35R25; 35J25

arXiv:2406.01151 [pdf, other]

A 0.96pJ/SOP, 30.23K-neuron/mm^2 Heterogeneous Neuromorphic Chip With Fullerene-like Interconnection Topology for Edge-AI Computing

Authors: P. J. Zhou, Q. Yu, M. Chen, Y. C. Wang, L. W. Meng, Y. Zuo, N. Ning, Y. Liu, S. G. Hu, G. C. Qiao

Abstract: Edge-AI computing requires high energy efficiency, low power consumption, and relatively high flexibility and compact area, challenging the AI-chip design. This work presents a 0.96 pJ/SOP heterogeneous neuromorphic system-on-chip (SoC) with fullerene-like interconnection topology for edge-AI computing. The neuromorphic core integrates different technologies to augment computing energy efficiency,… ▽ More Edge-AI computing requires high energy efficiency, low power consumption, and relatively high flexibility and compact area, challenging the AI-chip design. This work presents a 0.96 pJ/SOP heterogeneous neuromorphic system-on-chip (SoC) with fullerene-like interconnection topology for edge-AI computing. The neuromorphic core integrates different technologies to augment computing energy efficiency, including sparse computing, partial membrane potential updates, and non-uniform weight quantization. Multiple neuromorphic cores and multi-mode routers form a fullerene-like network-on-chip (NoC). The average degree of communication nodes exceeds traditional topologies by 32%, with a minimal degree variance of 0.93, allowing advanced decentralized on-chip communication. Additionally, the NoC can be scaled up through extended off-chip high-level router nodes. A RISC-V CPU and a neuromorphic processor are tightly coupled and fabricated within a 5.42 mm^2 die area under 55 nm CMOS technology. The chip has a low power density of 0.52 mW/mm^2, reducing 67.5% compared to related works, and achieves a high neuron density of 30.23 K/mm^2. Eventually, the chip is demonstrated to be effective on different datasets and achieves 0.96 pJ/SOP energy efficiency. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 5 pages, 8 figures

arXiv:2406.00497 [pdf, ps, other]

Recent Advances in End-to-End Simultaneous Speech Translation

Authors: Xiaoqian Liu, Guoqiang Hu, Yangfan Du, Erfeng He, YingFeng Luo, Chen Xu, Tong Xiao, Jingbo Zhu

Abstract: Simultaneous speech translation (SimulST) is a demanding task that involves generating translations in real-time while continuously processing speech input. This paper offers a comprehensive overview of the recent developments in SimulST research, focusing on four major challenges. Firstly, the complexities associated with processing lengthy and continuous speech streams pose significant hurdles.… ▽ More Simultaneous speech translation (SimulST) is a demanding task that involves generating translations in real-time while continuously processing speech input. This paper offers a comprehensive overview of the recent developments in SimulST research, focusing on four major challenges. Firstly, the complexities associated with processing lengthy and continuous speech streams pose significant hurdles. Secondly, satisfying real-time requirements presents inherent difficulties due to the need for immediate translation output. Thirdly, striking a balance between translation quality and latency constraints remains a critical challenge. Finally, the scarcity of annotated data adds another layer of complexity to the task. Through our exploration of these challenges and the proposed solutions, we aim to provide valuable insights into the current landscape of SimulST research and suggest promising directions for future exploration. △ Less

Submitted 1 June, 2024; originally announced June 2024.

arXiv:2405.07408 [pdf, other]

Bayesian Spatially Clustered Compositional Regression: Linking intersectoral GDP contributions to Gini Coefficients

Authors: Jingcheng Meng, Yimeng Ren, Xuening Zhu, Guanyu Hu

Abstract: The Gini coefficient is an universally used measurement of income inequality. Intersectoral GDP contributions reveal the economic development of different sectors of the national economy. Linking intersectoral GDP contributions to Gini coefficients will provide better understandings of how the Gini coefficient is influenced by different industries. In this paper, a compositional regression with sp… ▽ More The Gini coefficient is an universally used measurement of income inequality. Intersectoral GDP contributions reveal the economic development of different sectors of the national economy. Linking intersectoral GDP contributions to Gini coefficients will provide better understandings of how the Gini coefficient is influenced by different industries. In this paper, a compositional regression with spatially clustered coefficients is proposed to explore heterogeneous effects over spatial locations under nonparametric Bayesian framework. Specifically, a Markov random field constraint mixture of finite mixtures prior is designed for Bayesian log contrast regression with compostional covariates, which allows for both spatially contiguous clusters and discontinous clusters. In addition, an efficient Markov chain Monte Carlo algorithm for posterior sampling that enables simultaneous inference on both cluster configurations and cluster-wise parameters is designed. The compelling empirical performance of the proposed method is demonstrated via extensive simulation studies and an application to 51 states of United States from 2019 Bureau of Economic Analysis. △ Less

Submitted 12 May, 2024; originally announced May 2024.

arXiv:2405.06841 [pdf, other]

Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis

Authors: Guanyu Hu, Eleni Papadopoulou, Dimitrios Kollias, Paraskevi Tzouveli, Jie Wei, Xinyu Yang

Abstract: The increasing integration of machine learning algorithms in daily life underscores the critical need for fairness and equity in their deployment. As these technologies play a pivotal role in decision-making, addressing biases across diverse subpopulation groups, including age, gender, and race, becomes paramount. Automatic affect analysis, at the intersection of physiology, psychology, and machin… ▽ More The increasing integration of machine learning algorithms in daily life underscores the critical need for fairness and equity in their deployment. As these technologies play a pivotal role in decision-making, addressing biases across diverse subpopulation groups, including age, gender, and race, becomes paramount. Automatic affect analysis, at the intersection of physiology, psychology, and machine learning, has seen significant development. However, existing databases and methodologies lack uniformity, leading to biased evaluations. This work addresses these issues by analyzing six affective databases, annotating demographic attributes, and proposing a common protocol for database partitioning. Emphasis is placed on fairness in evaluations. Extensive experiments with baseline and state-of-the-art methods demonstrate the impact of these changes, revealing the inadequacy of prior assessments. The findings underscore the importance of considering demographic attributes in affect analysis research and provide a foundation for more equitable methodologies. Our annotations, code and pre-trained models are available at: https://github.com/dkollias/Fair-Consistent-Affect-Analysis △ Less

Submitted 16 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Comments: accepted at IEEE FG 2024

arXiv:2405.05449 [pdf, other]

Markowitz Meets Bellman: Knowledge-distilled Reinforcement Learning for Portfolio Management

Authors: Gang Hu, Ming Gu

Abstract: Investment portfolios, central to finance, balance potential returns and risks. This paper introduces a hybrid approach combining Markowitz's portfolio theory with reinforcement learning, utilizing knowledge distillation for training agents. In particular, our proposed method, called KDD (Knowledge Distillation DDPG), consist of two training stages: supervised and reinforcement learning stages. Th… ▽ More Investment portfolios, central to finance, balance potential returns and risks. This paper introduces a hybrid approach combining Markowitz's portfolio theory with reinforcement learning, utilizing knowledge distillation for training agents. In particular, our proposed method, called KDD (Knowledge Distillation DDPG), consist of two training stages: supervised and reinforcement learning stages. The trained agents optimize portfolio assembly. A comparative analysis against standard financial models and AI frameworks, using metrics like returns, the Sharpe ratio, and nine evaluation indices, reveals our model's superiority. It notably achieves the highest yield and Sharpe ratio of 2.03, ensuring top profitability with the lowest risk in comparable return scenarios. △ Less

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2405.05179 [pdf, ps, other]

Detection of a piecewise linear crack with one incident wave

Authors: Xiaoxu Xu, Guanqiu Ma, Guanghui Hu

Abstract: This paper is concerned with inverse crack scattering problems for time-harmonic acoustic waves. We prove that a piecewise linear crack with the sound-soft boundary condition in two dimensions can be uniquely determined by the far-field data corresponding to a single incident plane wave or point source. We propose two non-iterative methods for imaging the location and shape of a crack. The first o… ▽ More This paper is concerned with inverse crack scattering problems for time-harmonic acoustic waves. We prove that a piecewise linear crack with the sound-soft boundary condition in two dimensions can be uniquely determined by the far-field data corresponding to a single incident plane wave or point source. We propose two non-iterative methods for imaging the location and shape of a crack. The first one is a contrast sampling method, while the second one is a variant of the classical factorization method but only with one incoming wave. Newton's iteration method is then employed for getting a more precise reconstruction result. Numerical examples are presented to show the effectiveness of the proposed hybrid method. △ Less

Submitted 8 May, 2024; originally announced May 2024.

arXiv:2405.04120 [pdf, ps, other]

Movable Antennas-Enabled Two-User Multicasting: Do We Really Need Alternating Optimization for Minimum Rate Maximization?

Authors: Guojie Hu, Qingqing Wu, Donghui Xu, Kui Xu, Jiangbo Si, Yunlong Cai, Naofal Al-Dhahir

Abstract: Movable antenna (MA) technology, which can reconfigure wireless channels by flexibly moving antenna positions in a specified region, has great potential for improving communication performance. In this paper, we consider a new setup of MAs-enabled multicasting, where we adopt a simple setting in which a linear MA array-enabled source (${\rm{S}}$) transmits a common message to two single-antenna us… ▽ More Movable antenna (MA) technology, which can reconfigure wireless channels by flexibly moving antenna positions in a specified region, has great potential for improving communication performance. In this paper, we consider a new setup of MAs-enabled multicasting, where we adopt a simple setting in which a linear MA array-enabled source (${\rm{S}}$) transmits a common message to two single-antenna users ${\rm{U}}_1$ and ${\rm{U}}_2$. We aim to maximize the minimum rate among these two users, by jointly optimizing the transmit beamforming and antenna positions at ${\rm{S}}$. Instead of utilizing the widely-used alternating optimization (AO) approach, we reveal, with rigorous proof, that the above two variables can be optimized separately: i) the optimal antenna positions can be firstly determined via the successive convex approximation technique, based on the rule of maximizing the correlation between ${\rm{S}}$-${\rm{U}}_1$ and ${\rm{S}}$-${\rm{U}}_2$ channels; ii) afterwards, the optimal closed-form transmit beamforming can be derived via simple arguments. Compared to AO, this new approach yields the same performance but reduces the computational complexities significantly. Moreover, it can provide insightful conclusions which are not possible with AO. △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.19249 [pdf, other]

A Nonnested Augmented Subspace Method for Kohn-Sham Equation

Authors: Guanghui Hu, Hehu Xie, Fei Xu, Gang Zhao

Abstract: In this paper, a novel adaptive finite element method is proposed to solve the Kohn-Sham equation based on the moving mesh (nonnested mesh) adaptive technique and the augmented subspace method. Different from the classical self-consistent field iterative algorithm which requires to solve the Kohn-Sham equation directly in each adaptive finite element space, our algorithm transforms the Kohn-Sham e… ▽ More In this paper, a novel adaptive finite element method is proposed to solve the Kohn-Sham equation based on the moving mesh (nonnested mesh) adaptive technique and the augmented subspace method. Different from the classical self-consistent field iterative algorithm which requires to solve the Kohn-Sham equation directly in each adaptive finite element space, our algorithm transforms the Kohn-Sham equation into some linear boundary value problems of the same scale in each adaptive finite element space, and then the wavefunctions derived from the linear boundary value problems are corrected by solving a small-scale Kohn-Sham equation defined in a low-dimensional augmented subspace. Since the new algorithm avoids solving large-scale Kohn-Sham equation directly, a significant improvement for the solving efficiency can be obtained. In addition, the adaptive moving mesh technique is used to generate the nonnested adaptive mesh for the nonnested augmented subspace method according to the singularity of the approximate wavefunctions. The modified Hessian matrix of the approximate wavefunctions is used as the metric matrix to redistribute the mesh. Through the moving mesh adaptive technique, the redistributed mesh is almost optimal. A number of numerical experiments are carried out to verify the efficiency and the accuracy of the proposed algorithm. △ Less

Submitted 30 April, 2024; originally announced April 2024.

MSC Class: 65N30; 65N25; 65L15; 65B99

arXiv:2404.16271 [pdf]

True random number generation using metastable 1T' molybdenum ditelluride

Authors: Yang Liu, Pengyu Liu, Yingyi Wen, Zihan Liang, Songwei Liu, Lekai Song, Jingfang Pei, Xiaoyue Fan, Teng Ma, Gang Wang, Shuo Gao, Kong-Pang Pun, Xiaolong Chen, Guohua Hu

Abstract: True random numbers play a critical role in secure cryptography. The generation relies on a stable and readily extractable entropy source. Here, from solution-processed structurally metastable 1T' MoTe2, we prove stable output of featureless, stochastic, and yet stable conductance noise at a broad temperature (down to 15 K) with minimal power consumption (down to 0.05 micro-W). Our characterizatio… ▽ More True random numbers play a critical role in secure cryptography. The generation relies on a stable and readily extractable entropy source. Here, from solution-processed structurally metastable 1T' MoTe2, we prove stable output of featureless, stochastic, and yet stable conductance noise at a broad temperature (down to 15 K) with minimal power consumption (down to 0.05 micro-W). Our characterizations and statistical analysis of the characteristics of the conductance noise suggest that the noise arises from the volatility of the stochastic polarization of the underlying ferroelectric dipoles in the 1T' MoTe2. Further, as proved in our experiments and indicated by our Monte Carlo simulation, the ferroelectric dipole polarization is a reliable entropy source with the stochastic polarization persistent and stable over time. Exploiting the conductance noise, we achieve the generation of true random numbers and demonstrate their use in common cryptographic applications, for example, password generation and data encryption. Besides, particularly, we show a privacy safeguarding approach to sensitive data that can be critical for the cryptography of neural networks. We believe our work will bring insights into the understanding of the metastable 1T' MoTe2 and, more importantly, underpin its great potential in secure cryptography. △ Less

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2404.12980 [pdf, other]

Ring-a-Pose: A Ring for Continuous Hand Pose Tracking

Authors: Tianhong Catherine Yu, Guilin Hu, Ruidong Zhang, Hyunchul Lim, Saif Mahmud, Chi-Jung Lee, Ke Li, Devansh Agarwal, Shuyang Nie, Jinseok Oh, François Guimbretière, Cheng Zhang

Abstract: We present Ring-a-Pose, a single untethered ring that tracks continuous 3D hand poses. Located in the center of the hand, the ring emits an inaudible acoustic signal that each hand pose reflects differently. Ring-a-Pose imposes minimal obtrusions on the hand, unlike multi-ring or glove systems. It is not affected by the choice of clothing that may cover wrist-worn systems. In a series of three use… ▽ More We present Ring-a-Pose, a single untethered ring that tracks continuous 3D hand poses. Located in the center of the hand, the ring emits an inaudible acoustic signal that each hand pose reflects differently. Ring-a-Pose imposes minimal obtrusions on the hand, unlike multi-ring or glove systems. It is not affected by the choice of clothing that may cover wrist-worn systems. In a series of three user studies with a total of 30 participants, we evaluate Ring-a-Pose's performance on pose tracking and micro-finger gesture recognition. Without collecting any training data from a user, Ring-a-Pose tracks continuous hand poses with a joint error of 14.1mm. The joint error decreases to 10.3mm for fine-tuned user-dependent models. Ring-a-Pose recognizes 7-class micro-gestures with a 90.60% and 99.27% accuracy for user-independent and user-dependent models, respectively. Furthermore, the ring exhibits promising performance when worn on any finger. Ring-a-Pose enables the future of smart rings to track and recognize hand poses using relatively low-power acoustic sensing. △ Less

Submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.06821 [pdf, other]

Uniqueness to inverse acoustic and elastic medium scattering problems with hyper-singular source method

Authors: Chun Liu, Guanghui Hu, Jianli Xiang, Jiayi Zhang

Abstract: This paper is concerned with inverse scattering problems of determining the support of an isotropic and homogeneous penetrable body from knowledge of multi-static far-field patterns in acoustics and in linear elasticity. The normal derivative of the total fields admits no jump on the interface of the scatterer in the trace sense. If the contrast function of the refractive index function or the den… ▽ More This paper is concerned with inverse scattering problems of determining the support of an isotropic and homogeneous penetrable body from knowledge of multi-static far-field patterns in acoustics and in linear elasticity. The normal derivative of the total fields admits no jump on the interface of the scatterer in the trace sense. If the contrast function of the refractive index function or the density function has a positive lower bound near the boundary, we propose a hyper-singular source method to prove uniqueness of inverse scattering with all incoming plane waves at a fixed energy. It is based on subtle analysis on the leading part of the scattered field when hyper-singular sources caused by the first derivative of the fundamental solution approach to a boundary point. As a by-product, we show that this hyper-singular method can be also used to determine the boundary value of a Holder continuous refractive index function in acoustics or a Holder continuous density function in linear elasticity. △ Less

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.05982 [pdf, other]

The Convergence of Prescribed Combinatorial Ricci Flows for Total Geodesic Curvatures in Spherical Background Geometry

Authors: Guangming Hu, Ziping Lei, Yu Sun, Puchun Zhou

Abstract: In this paper, we study the existence and rigidity of (degenerated) circle pattern metric with prescribed total geodesic curvatures in spherical background geometry. To find the (degenerated) circle pattern metric with prescribed total geodesic curvatures, we define some prescribed combinatorial Ricci flows and study the convergence of flows for (degenerated) circle pattern metrics. We solve the p… ▽ More In this paper, we study the existence and rigidity of (degenerated) circle pattern metric with prescribed total geodesic curvatures in spherical background geometry. To find the (degenerated) circle pattern metric with prescribed total geodesic curvatures, we define some prescribed combinatorial Ricci flows and study the convergence of flows for (degenerated) circle pattern metrics. We solve the prescribed total geodesic curvature problem and provide two methods to find the degenerated circle pattern metric with prescribed total geodesic curvatures. As far as we know, this is the first degenerated result for total geodesic curvatures in spherical background geometry. △ Less

Submitted 8 April, 2024; originally announced April 2024.

Comments: 29 pages, 7 figures

arXiv:2404.03395 [pdf, ps, other]

Movable Antennas-Assisted Secure Transmission Without Eavesdroppers' Instantaneous CSI

Authors: Guojie Hu, Qingqing Wu, Donghui Xu, Kui Xu, Jiangbo Si, Yunlong Cai, Naofal Al-Dhahir

Abstract: Movable antenna (MA) technology is highly promising for improving communication performance, due to its advantage of flexibly adjusting positions of antennas to reconfigure channel conditions. In this paper, we investigate MAs-assisted secure transmission under a legitimate transmitter Alice, a legitimate receiver Bob and multiple eavesdroppers. Specifically, we consider a practical scenario where… ▽ More Movable antenna (MA) technology is highly promising for improving communication performance, due to its advantage of flexibly adjusting positions of antennas to reconfigure channel conditions. In this paper, we investigate MAs-assisted secure transmission under a legitimate transmitter Alice, a legitimate receiver Bob and multiple eavesdroppers. Specifically, we consider a practical scenario where Alice has no any knowledge about the instantaneous non-line-of-sight component of the wiretap channel. Under this setup, we evaluate the secrecy performance by adopting the secrecy outage probability metric, the tight approximation of which is first derived by interpreting the Rician fading as a special case of Nakagami fading and concurrently exploiting the Laguerre series approximation. Then, we minimize the secrecy outage probability by jointly optimizing the transmit beamforming and positions of antennas at Alice. However, the problem is highly non-convex because the objective includes the complex incomplete gamma function. To tackle this challenge, we, for the first time, effectively approximate the inverse of the incomplete gamma function as a simple linear model. Based on this approximation, we arrive at a simplified problem with a clear structure, which can be solved via the developed alternating projected gradient ascent (APGA) algorithm. Considering the high complexity of the APGA, we further design another scheme where the zero-forcing based beamforming is adopted by Alice, and then we transform the problem into minimizing a simple function which is only related to positions of antennas at Alice.As demonstrated by simulations, our proposed schemes achieve significant performance gains compared to conventional schemes based on fixed-position antennas. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: Submitted for journal publication

arXiv:2404.00403 [pdf, other]

UniMEEC: Towards Unified Multimodal Emotion Recognition and Emotion Cause

Authors: Guimin Hu, Zhihong Zhu, Daniel Hershcovich, Hasti Seifi, Jiayuan Xie

Abstract: Multimodal emotion recognition in conversation (MERC) and multimodal emotion-cause pair extraction (MECPE) has recently garnered significant attention. Emotions are the expression of affect or feelings; responses to specific events, thoughts, or situations are known as emotion causes. Both are like two sides of a coin, collectively describing human behaviors and intents. However, most existing wor… ▽ More Multimodal emotion recognition in conversation (MERC) and multimodal emotion-cause pair extraction (MECPE) has recently garnered significant attention. Emotions are the expression of affect or feelings; responses to specific events, thoughts, or situations are known as emotion causes. Both are like two sides of a coin, collectively describing human behaviors and intents. However, most existing works treat MERC and MECPE as separate tasks, which may result in potential challenges in integrating emotion and cause in real-world applications. In this paper, we propose a Unified Multimodal Emotion recognition and Emotion-Cause analysis framework (UniMEEC) to explore the causality and complementarity between emotion and emotion cause. Concretely, UniMEEC reformulates the MERC and MECPE tasks as two mask prediction problems, enhancing the interaction between emotion and cause. Meanwhile, UniMEEC shares the prompt learning among modalities for probing modality-specific knowledge from the Pre-trained model. Furthermore, we propose a task-specific hierarchical context aggregation to control the information flow to the task. Experiment results on four public benchmark datasets verify the model performance on MERC and MECPE tasks and achieve consistent improvements compared with state-of-the-art methods. △ Less

Submitted 30 March, 2024; originally announced April 2024.

arXiv:2403.18791 [pdf, other]

Object Pose Estimation via the Aggregation of Diffusion Features

Authors: Tianfu Wang, Guosheng Hu, Hongguang Wang

Abstract: Estimating the pose of objects from images is a crucial task of 3D scene understanding, and recent approaches have shown promising results on very large benchmarks. However, these methods experience a significant performance drop when dealing with unseen objects. We believe that it results from the limited generalizability of image features. To address this problem, we have an in-depth analysis on… ▽ More Estimating the pose of objects from images is a crucial task of 3D scene understanding, and recent approaches have shown promising results on very large benchmarks. However, these methods experience a significant performance drop when dealing with unseen objects. We believe that it results from the limited generalizability of image features. To address this problem, we have an in-depth analysis on the features of diffusion models, e.g. Stable Diffusion, which hold substantial potential for modeling unseen objects. Based on this analysis, we then innovatively introduce these diffusion features for object pose estimation. To achieve this, we propose three distinct architectures that can effectively capture and aggregate diffusion features of different granularity, greatly improving the generalizability of object pose estimation. Our approach outperforms the state-of-the-art methods by a considerable margin on three popular benchmark datasets, LM, O-LM, and T-LESS. In particular, our method achieves higher accuracy than the previous best arts on unseen objects: 98.2% vs. 93.5% on Unseen LM, 85.9% vs. 76.3% on Unseen O-LM, showing the strong generalizability of our method. Our code is released at https://github.com/Tianfu18/diff-feats-pose. △ Less

Submitted 1 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

Comments: Accepted to CVPR2024

arXiv:2403.17676 [pdf]

Analysis on reservoir activation with the nonlinearity harnessed from solution-processed MoS2 devices

Authors: Songwei Liu, Yang Liu, Yingyi Wen, Jingfang Pei, Pengyu Liu, Lekai Song, Xiaoyue Fan, Wenchen Yang, Danmei Pan, Teng Ma, Yue Lin, Gang Wang, Guohua Hu

Abstract: Reservoir computing is a recurrent neural network that has been applied across various domains in machine learning. The implementation of reservoir computing, however, often demands heavy computations for activating the reservoir. Configuring physical reservoir networks and harnessing the nonlinearity from the underlying devices for activation is an emergent solution to address the computational c… ▽ More Reservoir computing is a recurrent neural network that has been applied across various domains in machine learning. The implementation of reservoir computing, however, often demands heavy computations for activating the reservoir. Configuring physical reservoir networks and harnessing the nonlinearity from the underlying devices for activation is an emergent solution to address the computational challenge. Herein, we analyze the feasibility of employing the nonlinearity from solution-processed molybdenum disulfide (MoS2) devices for reservoir activation. The devices, fabricated using liquid-phase exfoliated MoS2, exhibit a high-order nonlinearity achieved by Stark modulation of the MoS2 material. We demonstrate that this nonlinearity can be fitted and employed as the activation function to facilitate reservoir computing implementation. Notably, owing to the high-order nonlinearity, the network exhibits long-term synchronization and robust generalization abilities for approximating complex dynamical systems. Given the remarkable reservoir activation capability, coupled with the scalability of the device fabrication, our findings open the possibility for the physical realization of lightweight, efficient reservoir computing for, for instance, signal classification, motion tracking, and pattern recognition of complex time series as well as secure cryptography. As an example, we show the network can be appointed to generate chaotic random numbers for secure data encryption. △ Less

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.17221 [pdf, other]

Are Made and Missed Different? An analysis of Field Goal Attempts of Professional Basketball Players via Depth Based Testing Procedure

Authors: Kai Qi, Guanyu Hu, Wei Wu

Abstract: In this paper, we develop a novel depth-based testing procedure on spatial point processes to examine the difference in made and missed field goal attempts for NBA players. Specifically, our testing procedure can statistically detect the differences between made and missed field goal attempts for NBA players. We first obtain the depths of two processes under the polar coordinate system. A two-dime… ▽ More In this paper, we develop a novel depth-based testing procedure on spatial point processes to examine the difference in made and missed field goal attempts for NBA players. Specifically, our testing procedure can statistically detect the differences between made and missed field goal attempts for NBA players. We first obtain the depths of two processes under the polar coordinate system. A two-dimensional Kolmogorov-Smirnov test is then performed to test the difference between the depths of the two processes. Throughout extensive simulation studies, we show our testing procedure with good frequentist properties under both null hypothesis and alternative hypothesis. A comparison against the competing methods shows that our proposed procedure has better testing reliability and testing power. Application to the shot chart data of 191 NBA players in the 2017-2018 regular season offers interesting insights about these players' made and missed shot patterns. △ Less

Submitted 25 March, 2024; originally announced March 2024.

Comments: 26 pages, 6 figures

arXiv:2403.15758 [pdf, ps, other]

An endpoint estimate for the maximal Calderón commutator with rough kernel

Authors: Guoen Hu, Xudong Lai, Xiangxing Tao, Qingying Xue

Abstract: In this paper, the authors consider the endpoint estimates for the maximal Calderón commutator defined by $$T_{Ω,\,a}^*f(x)=\sup_{ε>0}\Big|\int_{|x-y|>ε}\frac{Ω(x-y)}{|x-y|^{d+1}} \big(a(x)-a(y)\big)f(y)dy\Big|,$$ where $Ω$ is homogeneous of degree zero, integrable on $S^{d-1}$ and has vanishing moment of order one, $a$ be a function on $\mathbb{R}^d$ such that… ▽ More In this paper, the authors consider the endpoint estimates for the maximal Calderón commutator defined by $$T_{Ω,\,a}^*f(x)=\sup_{ε>0}\Big|\int_{|x-y|>ε}\frac{Ω(x-y)}{|x-y|^{d+1}} \big(a(x)-a(y)\big)f(y)dy\Big|,$$ where $Ω$ is homogeneous of degree zero, integrable on $S^{d-1}$ and has vanishing moment of order one, $a$ be a function on $\mathbb{R}^d$ such that $\nabla a\in L^{\infty}(\mathbb{R}^d)$. The authors prove that if $Ω\in L\log L(S^{d-1})$, then $T^*_{Ω,\,a}$ satisfies an endpoint estimate of $L\log\log L$ type. △ Less

Submitted 14 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

Comments: 25 pages

MSC Class: 42B20

arXiv:2403.15274 [pdf]

Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review

Authors: Jinge Wang, Zien Cheng, Qiuming Yao, Li Liu, Dong Xu, Gangqing Hu

Abstract: The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines. We surveyed the applications of ChatGPT in bioinformatics and biomedical informatics throughout the year, covering omics, genetics, biomedical text mining, drug discovery, biomedical image understanding, bioinformatics programming, and bioinforma… ▽ More The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines. We surveyed the applications of ChatGPT in bioinformatics and biomedical informatics throughout the year, covering omics, genetics, biomedical text mining, drug discovery, biomedical image understanding, bioinformatics programming, and bioinformatics education. Our survey delineates the current strengths and limitations of this chatbot in bioinformatics and offers insights into potential avenues for future developments. △ Less

Submitted 12 June, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

Comments: Peer-reviewed and accepted by Quantitative Biology

arXiv:2403.14242 [pdf, other]

E-Syn: E-Graph Rewriting with Technology-Aware Cost Functions for Logic Synthesis

Authors: Chen Chen, Guangyu Hu, Dongsheng Zuo, Cunxi Yu, Yuzhe Ma, Hongce Zhang

Abstract: Logic synthesis plays a crucial role in the digital design flow. It has a decisive influence on the final Quality of Results (QoR) of the circuit implementations. However, existing multi-level logic optimization algorithms often employ greedy approaches with a series of local optimization steps. Each step breaks the circuit into small pieces (e.g., k-feasible cuts) and applies incremental changes… ▽ More Logic synthesis plays a crucial role in the digital design flow. It has a decisive influence on the final Quality of Results (QoR) of the circuit implementations. However, existing multi-level logic optimization algorithms often employ greedy approaches with a series of local optimization steps. Each step breaks the circuit into small pieces (e.g., k-feasible cuts) and applies incremental changes to individual pieces separately. These local optimization steps could limit the exploration space and may miss opportunities for significant improvements. To address the limitation, this paper proposes using e-graph in logic synthesis. The new workflow, named Esyn, makes use of the well-established e-graph infrastructure to efficiently perform logic rewriting. It explores a diverse set of equivalent Boolean representations while allowing technology-aware cost functions to better support delay-oriented and area-oriented logic synthesis. Experiments over a wide range of benchmark designs show our proposed logic optimization approach reaches a wider design space compared to the commonly used AIG-based logic synthesis flow. It achieves on average 15.29% delay saving in delay-oriented synthesis and 6.42% area saving for area-oriented synthesis. △ Less

Submitted 25 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: Accepted by DAC 2024; Please note that this is not the final camera-ready version

arXiv:2403.11656 [pdf, other]

LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model

Authors: Yuxin Cao, Jinghao Li, Xi Xiao, Derui Wang, Minhui Xue, Hao Ge, Wei Liu, Guangwu Hu

Abstract: Previous work has shown that well-crafted adversarial perturbations can threaten the security of video recognition systems. Attackers can invade such models with a low query budget when the perturbations are semantic-invariant, such as StyleFool. Despite the query efficiency, the naturalness of the minutia areas still requires amelioration, since StyleFool leverages style transfer to all pixels in… ▽ More Previous work has shown that well-crafted adversarial perturbations can threaten the security of video recognition systems. Attackers can invade such models with a low query budget when the perturbations are semantic-invariant, such as StyleFool. Despite the query efficiency, the naturalness of the minutia areas still requires amelioration, since StyleFool leverages style transfer to all pixels in each frame. To close the gap, we propose LocalStyleFool, an improved black-box video adversarial attack that superimposes regional style-transfer-based perturbations on videos. Benefiting from the popularity and scalably usability of Segment Anything Model (SAM), we first extract different regions according to semantic information and then track them through the video stream to maintain the temporal consistency. Then, we add style-transfer-based perturbations to several regions selected based on the associative criterion of transfer-based gradient information and regional area. Perturbation fine adjustment is followed to make stylized videos adversarial. We demonstrate that LocalStyleFool can improve both intra-frame and inter-frame naturalness through a human-assessed survey, while maintaining competitive fooling rate and query efficiency. Successful experiments on the high-resolution dataset also showcase that scrupulous segmentation of SAM helps to improve the scalability of adversarial attacks under high-resolution data. △ Less

Submitted 27 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

Comments: Accepted to 2024 IEEE Security and Privacy Workshops (SPW)

arXiv:2403.11613 [pdf, other]

Scattering Singularity in Topological Dielectric Photonic Crystals

Authors: Langlang Xiong, Xunya Jiang, Guangwei Hu

Abstract: The exploration of topology in natural materials and metamaterials has garnered significant attention. Notably, the one-dimensional (1D) and two-dimensional (2D) Su-Schrieffer-Heeger (SSH) model, assessed through tight-binding approximations, has been extensively investigated in both quantum and classical systems, encompassing general and higher-order topology. Despite these advancements, a compre… ▽ More The exploration of topology in natural materials and metamaterials has garnered significant attention. Notably, the one-dimensional (1D) and two-dimensional (2D) Su-Schrieffer-Heeger (SSH) model, assessed through tight-binding approximations, has been extensively investigated in both quantum and classical systems, encompassing general and higher-order topology. Despite these advancements, a comprehensive examination of these models from the perspective of wave physics, particularly the scattering view, remains underexplored. In this study, we systematically unveil the origin of the 1D and 2D Zak phases stemming from the zero-scattering point, termed the scattering singularity in k-space. Employing an expanded plane wave expansion, we accurately compute the reflective spectrum of an infinite 2D photonic crystal (2D-PhC). Analyzing the reflective spectrum reveals the presence of a zero-scattering line in the 2D-PhC, considered the topological origin of the non-trivial Zak phase. Two distinct models, representing omnidirectional non-trivial cases and directional non-trivial cases, are employed to substantiate these findings. Our work introduces a novel perspective for characterizing the nature of non-trivial topological phases. The identification of the zero-scattering line not only enhances our understanding of the underlying physics but also provides valuable insights for the design of innovative devices. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: 8 pages, 4 figures

arXiv:2403.08621 [pdf, ps, other]

Spin-resolved counting statistics as a sensitive probe of spin correlation in transport through a quantum dot spin valve

Authors: Guanjian Hu, Shikuan Wang, Jing Hu, RuiQiang Li, Yiying Yan, JunYan Luo

Abstract: We investigate the noise in spin transport through a single quantum dot (QD) tunnel coupled to ferromagnetic electrodes with noncollinear magnetizations. Based on a spin-resolved quantum master equation, auto- and cross-correlations of spin-resolved currents are analyzed to reveal the underlying spin transport dynamics and characteristics for various polarizations. We find the currents of majority… ▽ More We investigate the noise in spin transport through a single quantum dot (QD) tunnel coupled to ferromagnetic electrodes with noncollinear magnetizations. Based on a spin-resolved quantum master equation, auto- and cross-correlations of spin-resolved currents are analyzed to reveal the underlying spin transport dynamics and characteristics for various polarizations. We find the currents of majority and minority spins could be strongly autocorrelated despite uncorrelated charge transfer. The interplay between tunnel coupling and the Coulomb interaction gives rise to an exchange magnetic field, leading to the precession of the accumulated spin in the QD. It strongly suppresses the bunching of spin tunneling events and results in a unique double-peak structure in the noise of the net spin current. The spin autocorrelation is found to be susceptible to magnetization alignments, which may serve as a sensitive tool to measure the magnetization directions between the ferromagnetic electrodes. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 9 pages, 4 figures

arXiv:2403.08450 [pdf, ps, other]

Increasing stability for inverse source problem with limited-aperture far field data at multi-frequencies

Authors: Ibtissem Ben Aïcha, Guanghui Hu, Suliang Si

Abstract: We study the increasing stability of an inverse source problem for the Helmholtz equation from limited-aperture far field data at multiple wave numbers. The measurement data are givenby the far field patterns $u^\infity(\hat{x},k)$ for all observation directions in some neighborhood of a fixed direction $\hat{x}$ and for all wave numbers k belonging to a finite interval $(0,K)$. In this paper, we… ▽ More We study the increasing stability of an inverse source problem for the Helmholtz equation from limited-aperture far field data at multiple wave numbers. The measurement data are givenby the far field patterns $u^\infity(\hat{x},k)$ for all observation directions in some neighborhood of a fixed direction $\hat{x}$ and for all wave numbers k belonging to a finite interval $(0,K)$. In this paper, we discuss the increasing stability with respect to the width of the wavenumber interval $K>1$. In three dimensions we establish stability estimates of the $L^2$-norm and $H^{-1}$-norm of the source function from the far field data. The ill-posedness of the inverse source problem turns out to be of Hölder type while increasing the wavenumber band K. We also discuss an analytic continuation argument of the far-field data with respect to the wavenumbers at a fixed direction. △ Less

Submitted 13 March, 2024; originally announced March 2024.

MSC Class: 35R30; 78A46

arXiv:2403.08440 [pdf, other]

Increasing stability for inverse acoustic source problems in the time domain

Authors: Chun Liu, Suliang Si, Guanghui Hu, Bo Zhang

Abstract: This paper is concerned with inverse source problems for the acoustic wave equation in the full space R^3, where the source term is compactly supported in both time and spatial variables. The main goal is to investigate increasing stability for the wave equation in terms of the interval length of given parameters (e.g., bandwith of the temporal component of the source function). We establish incre… ▽ More This paper is concerned with inverse source problems for the acoustic wave equation in the full space R^3, where the source term is compactly supported in both time and spatial variables. The main goal is to investigate increasing stability for the wave equation in terms of the interval length of given parameters (e.g., bandwith of the temporal component of the source function). We establish increasing stability estimates of the L^2 -norm of the source function by using only the Dirichlet boundary data. Our method relies on the Huygens principle, the Fourier transform and explicit bounds for the continuation of analytic functions. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 26pages,7figures. arXiv admin note: substantial text overlap with arXiv:2402.15973

MSC Class: 35R30; 78A46

arXiv:2403.07340 [pdf, other]

Direct and inverse time-harmonic scattering by Dirichlet periodic curves with local perturbations

Authors: Guanghui Hu, Andreas Kirsch

Abstract: This is a continuation of the authors' previous work (A. Kirsch, Math. Meth. Appl. Sci., 45 (2022): 5737-5773.) on well-posedness of time-harmonic scattering by locally perturbed periodic curves of Dirichlet kind. The scattering interface is supposed to be given by a non-self-intersecting Lipschitz curve. We study properties of the Green's function and prove new well-posedness results for scatteri… ▽ More This is a continuation of the authors' previous work (A. Kirsch, Math. Meth. Appl. Sci., 45 (2022): 5737-5773.) on well-posedness of time-harmonic scattering by locally perturbed periodic curves of Dirichlet kind. The scattering interface is supposed to be given by a non-self-intersecting Lipschitz curve. We study properties of the Green's function and prove new well-posedness results for scattering of plane waves at a propagative wave number. In such a case there exist guided waves to the unperturbed problem, which are also known as Bounded States in the Continuity (BICs) in physics. In this paper uniqueness of the forward scattering follows from an orthogonal constraint condition enforcing on the total field to the unperturbed scattering problem. This constraint condition, which is also valid under the Neumann boundary condition, is derived from the singular perturbation arguments and also from the approach of approximating a plane wave by point source waves. For the inverse problem of determining the defect, we prove several uniqueness results using a finite or infinite number of point source and plane waves, depending on whether a priori information on the size and height of the defect is available. △ Less

Submitted 12 March, 2024; originally announced March 2024.

arXiv:2403.06439 [pdf, other]

Wide-Field, High-Resolution Reconstruction in Computational Multi-Aperture Miniscope Using a Fourier Neural Network

Authors: Qianwan Yang, Ruipeng Guo, Guorong Hu, Yujia Xue, Yunzhe Li, Lei Tian

Abstract: Traditional fluorescence microscopy is constrained by inherent trade-offs among resolution, field-of-view, and system complexity. To navigate these challenges, we introduce a simple and low-cost computational multi-aperture miniature microscope, utilizing a microlens array for single-shot wide-field, high-resolution imaging. Addressing the challenges posed by extensive view multiplexing and non-lo… ▽ More Traditional fluorescence microscopy is constrained by inherent trade-offs among resolution, field-of-view, and system complexity. To navigate these challenges, we introduce a simple and low-cost computational multi-aperture miniature microscope, utilizing a microlens array for single-shot wide-field, high-resolution imaging. Addressing the challenges posed by extensive view multiplexing and non-local, shift-variant aberrations in this device, we present SV-FourierNet, a novel multi-channel Fourier neural network. SV-FourierNet facilitates high-resolution image reconstruction across the entire imaging field through its learned global receptive field. We establish a close relationship between the physical spatially-varying point-spread functions and the network's learned effective receptive field. This ensures that SV-FourierNet has effectively encapsulated the spatially-varying aberrations in our system, and learned a physically meaningful function for image reconstruction. Training of SV-FourierNet is conducted entirely on a physics-based simulator. We showcase wide-field, high-resolution video reconstructions on colonies of freely moving C. elegans and imaging of a mouse brain section. Our computational multi-aperture miniature microscope, augmented with SV-FourierNet, represents a major advancement in computational microscopy and may find broad applications in biomedical research and other fields requiring compact microscopy solutions. △ Less

Submitted 30 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

arXiv:2403.06249 [pdf, other]

No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks

Authors: Gang Hu, Ke Qin, Chenhan Yuan, Min Peng, Alejandro Lopez-Lira, Benyou Wang, Sophia Ananiadou, Wanlong Yu, Jimin Huang, Qianqian Xie

Abstract: While the progression of Large Language Models (LLMs) has notably propelled financial analysis, their application has largely been confined to singular language realms, leaving untapped the potential of bilingual Chinese-English capacity. To bridge this chasm, we introduce ICE-PIXIU, seamlessly amalgamating the ICE-INTENT model and ICE-FLARE benchmark for bilingual financial analysis. ICE-PIXIU un… ▽ More While the progression of Large Language Models (LLMs) has notably propelled financial analysis, their application has largely been confined to singular language realms, leaving untapped the potential of bilingual Chinese-English capacity. To bridge this chasm, we introduce ICE-PIXIU, seamlessly amalgamating the ICE-INTENT model and ICE-FLARE benchmark for bilingual financial analysis. ICE-PIXIU uniquely integrates a spectrum of Chinese tasks, alongside translated and original English datasets, enriching the breadth and depth of bilingual financial modeling. It provides unrestricted access to diverse model variants, a substantial compilation of diverse cross-lingual and multi-modal instruction data, and an evaluation benchmark with expert annotations, comprising 10 NLP tasks, 20 bilingual specific tasks, totaling 95k datasets. Our thorough evaluation emphasizes the advantages of incorporating these bilingual datasets, especially in translation tasks and utilizing original English data, enhancing both linguistic flexibility and analytical acuity in financial contexts. Notably, ICE-INTENT distinguishes itself by showcasing significant enhancements over conventional LLMs and existing financial LLMs in bilingual milieus, underscoring the profound impact of robust bilingual data on the accuracy and efficacy of financial NLP. △ Less

Submitted 16 April, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: 24 pages, 5 figures, 12 tables, including Appendix

arXiv:2403.05777 [pdf, other]

Combinatorial p-th Calabi Flows for Total Geodesic Curvatures in hyperbolic background geometry

Authors: Guangming Hu, Ziping Lei, Yi Qi, Puchun Zhou

Abstract: In hyperbolic background geometry, we investigate a generalized circle packing (including circles, horocycles and hypercycles) with conical singularities on a surface with boundary, which has a total geodesic curvature on each generalized circle of this circle packing and a discrete Gaussian curvature on the center of each dual circle. The purpose of this paper is to find this type of circle packi… ▽ More In hyperbolic background geometry, we investigate a generalized circle packing (including circles, horocycles and hypercycles) with conical singularities on a surface with boundary, which has a total geodesic curvature on each generalized circle of this circle packing and a discrete Gaussian curvature on the center of each dual circle. The purpose of this paper is to find this type of circle packings with prescribed total geodesic curvatures on generalized circles and discrete Gaussian curvatures on centers of dual circles. To achieve this goal, we firstly establish existence and rigidity on this type of circle packings by the variational principle. Secondly, for $p>1$, we introduce combinatorial $p$-th Calabi flows to find the circle packing with prescribed total geodesic curvatures on generalized circles and discrete Gaussian curvatures on centers of dual circles for the first time. △ Less

Submitted 8 March, 2024; originally announced March 2024.

Comments: 25 pages, 4 figures

arXiv:2403.04329 [pdf, other]

A mechanism-driven reinforcement learning framework for shape optimization of airfoils

Authors: Jingfeng Wang, Guanghui Hu

Abstract: In this paper, a novel mechanism-driven reinforcement learning framework is proposed for airfoil shape optimization. To validate the framework, a reward function is designed and analyzed, from which the equivalence between the maximizing the cumulative reward and achieving the optimization objectives is guaranteed theoretically. To establish a quality exploration, and to obtain an accurate reward… ▽ More In this paper, a novel mechanism-driven reinforcement learning framework is proposed for airfoil shape optimization. To validate the framework, a reward function is designed and analyzed, from which the equivalence between the maximizing the cumulative reward and achieving the optimization objectives is guaranteed theoretically. To establish a quality exploration, and to obtain an accurate reward from the environment, an efficient solver for steady Euler equations is employed in the reinforcement learning method. The solver utilizes the Bézier curve to describe the shape of the airfoil, and a Newton-geometric multigrid method for the solution. In particular, a dual-weighted residual-based h-adaptive method is used for efficient calculation of target functional. To effectively streamline the airfoil shape during the deformation process, we introduce the Laplacian smoothing, and propose a Bézier fitting strategy, which not only remits mesh tangling but also guarantees a precise manipulation of the geometry. In addition, a neural network architecture is designed based on an attention mechanism to make the learning process more sensitive to the minor change of the airfoil geometry. Numerical experiments demonstrate that our framework can handle the optimization problem with hundreds of design variables. It is worth mentioning that, prior to this work, there are limited works combining such high-fidelity partial differential equatons framework with advanced reinforcement learning algorithms for design problems with such high dimensionality. △ Less

Submitted 26 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

Comments: 25 pages

arXiv:2402.19344 [pdf, other]

The 6th Affective Behavior Analysis in-the-wild (ABAW) Competition

Authors: Dimitrios Kollias, Panagiotis Tzirakis, Alan Cowen, Stefanos Zafeiriou, Irene Kotsia, Alice Baird, Chris Gagne, Chunchang Shao, Guanyu Hu

Abstract: This paper describes the 6th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with IEEE CVPR 2024. The 6th ABAW Competition addresses contemporary challenges in understanding human emotions and behaviors, crucial for the development of human-centered technologies. In more detail, the Competition focuses on affect related bench… ▽ More This paper describes the 6th Affective Behavior Analysis in-the-wild (ABAW) Competition, which is part of the respective Workshop held in conjunction with IEEE CVPR 2024. The 6th ABAW Competition addresses contemporary challenges in understanding human emotions and behaviors, crucial for the development of human-centered technologies. In more detail, the Competition focuses on affect related benchmarking tasks and comprises of five sub-challenges: i) Valence-Arousal Estimation (the target is to estimate two continuous affect dimensions, valence and arousal), ii) Expression Recognition (the target is to recognise between the mutually exclusive classes of the 7 basic expressions and 'other'), iii) Action Unit Detection (the target is to detect 12 action units), iv) Compound Expression Recognition (the target is to recognise between the 7 mutually exclusive compound expression classes), and v) Emotional Mimicry Intensity Estimation (the target is to estimate six continuous emotion dimensions). In the paper, we present these Challenges, describe their respective datasets and challenge protocols (we outline the evaluation metrics) and present the baseline systems as well as their obtained performance. More information for the Competition can be found in: https://affective-behavior-analysis-in-the-wild.github.io/6th. △ Less

Submitted 12 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

arXiv:2402.18834 [pdf, other]

A single-particle energy-conserving dissipative particle dynamics approach for simulating thermophoresis of nanoparticles in polymer networks

Authors: Yu Lu, Guo-Hui Hu

Abstract: Thermophoresis is an effective method to drive the motion of nanoparticles in fluids. The transport of nanoparticles in polymer networks has significant fundamental and applied importance in biology and medicine, and can be described as Brownian particles crossing entropic barriers. This study proposes a novel extension of dissipative particle dynamics (DPD), called the single-particle energy-cons… ▽ More Thermophoresis is an effective method to drive the motion of nanoparticles in fluids. The transport of nanoparticles in polymer networks has significant fundamental and applied importance in biology and medicine, and can be described as Brownian particles crossing entropic barriers. This study proposes a novel extension of dissipative particle dynamics (DPD), called the single-particle energy-conserving dissipative particle dynamics (seDPD), which combines the features of single-particle dissipative particle dynamics (sDPD) and energy-conserving dissipative particle dynamics (eDPD) to simulate the thermophoresis of nanoparticles under temperature gradients. The reliability of the seDPD method is verified by considering the viscosity, thermal diffusivity, and hydrodynamic drag force on the nanoparticles. Using this method, the transport of nanoparticles driven by the thermophoretic force across the polymer network is simulated. The results show that the nanoparticles exhibit the phenomenon of giant acceleration of diffusion (GAD) in the polymer network, indicating that Brownian particles can exhibit GAD when crossing entropic barriers. △ Less

Submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.16908 [pdf]

Lightweight, error-tolerant edge detection using memristor-enabled stochastic logics

Authors: Lekai Song, Pengyu Liu, Jingfang Pei, Yang Liu, Songwei Liu, Shengbo Wang, Leonard W. T. Ng, Tawfique Hasan, Kong-Pang Pun, Shuo Gao, Guohua Hu

Abstract: The demand for efficient edge vision has spurred the interest in developing stochastic computing approaches for performing image processing tasks. Memristors with inherent stochasticity readily introduce probability into the computations and thus enable stochastic image processing computations. Here, we present a stochastic computing approach for edge detection, a fundamental image processing tech… ▽ More The demand for efficient edge vision has spurred the interest in developing stochastic computing approaches for performing image processing tasks. Memristors with inherent stochasticity readily introduce probability into the computations and thus enable stochastic image processing computations. Here, we present a stochastic computing approach for edge detection, a fundamental image processing technique, facilitated with memristor-enabled stochastic logics. Specifically, we integrate the memristors with logic circuits and harness the stochasticity from the memristors to realize compact stochastic logics for stochastic number encoding and processing. The stochastic numbers, exhibiting well-regulated probabilities and correlations, can be processed to perform logic operations with statistical probabilities. This can facilitate lightweight stochastic edge detection for edge visual scenarios characterized with high-level noise errors. As a practical demonstration, we implement a hardware stochastic Roberts cross operator using the stochastic logics, and prove its exceptional edge detection performance, remarkably, with 95% less computational cost while withstanding 50% bit-flip errors. The results underscore the great potential of our stochastic edge detection approach in developing lightweight, error-tolerant edge vision hardware and systems for autonomous driving, virtual/augmented reality, medical imaging diagnosis, industrial automation, and beyond. △ Less

Submitted 20 March, 2024; v1 submitted 25 February, 2024; originally announced February 2024.

arXiv:2402.16318 [pdf, other]

Gradient-Guided Modality Decoupling for Missing-Modality Robustness

Authors: Hao Wang, Shengda Luo, Guosheng Hu, Jianguo Zhang

Abstract: Multimodal learning with incomplete input data (missing modality) is practical and challenging. In this work, we conduct an in-depth analysis of this challenge and find that modality dominance has a significant negative impact on the model training, greatly degrading the missing modality performance. Motivated by Grad-CAM, we introduce a novel indicator, gradients, to monitor and reduce modality d… ▽ More Multimodal learning with incomplete input data (missing modality) is practical and challenging. In this work, we conduct an in-depth analysis of this challenge and find that modality dominance has a significant negative impact on the model training, greatly degrading the missing modality performance. Motivated by Grad-CAM, we introduce a novel indicator, gradients, to monitor and reduce modality dominance which widely exists in the missing-modality scenario. In aid of this indicator, we present a novel Gradient-guided Modality Decoupling (GMD) method to decouple the dependency on dominating modalities. Specifically, GMD removes the conflicted gradient components from different modalities to achieve this decoupling, significantly improving the performance. In addition, to flexibly handle modal-incomplete data, we design a parameter-efficient Dynamic Sharing (DS) framework which can adaptively switch on/off the network parameters based on whether one modality is available. We conduct extensive experiments on three popular multimodal benchmarks, including BraTS 2018 for medical segmentation, CMU-MOSI, and CMU-MOSEI for sentiment analysis. The results show that our method can significantly outperform the competitors, showing the effectiveness of the proposed solutions. Our code is released here: https://github.com/HaoWang420/Gradient-guided-Modality-Decoupling. △ Less

Submitted 26 February, 2024; originally announced February 2024.

Comments: AAAI24

arXiv:2402.12659 [pdf, other]

FinBen: A Holistic Financial Benchmark for Large Language Models

Authors: Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu , et al. (9 additional authors not shown)

Abstract: LLMs have transformed NLP and shown promise in various fields, yet their potential in finance is underexplored due to a lack of comprehensive evaluation benchmarks, the rapid development of LLMs, and the complexity of financial tasks. In this paper, we introduce FinBen, the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks, covering seven critical… ▽ More LLMs have transformed NLP and shown promise in various fields, yet their potential in finance is underexplored due to a lack of comprehensive evaluation benchmarks, the rapid development of LLMs, and the complexity of financial tasks. In this paper, we introduce FinBen, the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks, covering seven critical aspects: information extraction (IE), textual analysis, question answering (QA), text generation, risk management, forecasting, and decision-making. FinBen offers several key innovations: a broader range of tasks and datasets, the first evaluation of stock trading, novel agent and Retrieval-Augmented Generation (RAG) evaluation, and three novel open-source evaluation datasets for text summarization, question answering, and stock trading. Our evaluation of 15 representative LLMs, including GPT-4, ChatGPT, and the latest Gemini, reveals several key findings: While LLMs excel in IE and textual analysis, they struggle with advanced reasoning and complex tasks like text generation and forecasting. GPT-4 excels in IE and stock trading, while Gemini is better at text generation and forecasting. Instruction-tuned LLMs improve textual analysis but offer limited benefits for complex tasks such as QA. FinBen has been used to host the first financial LLMs shared task at the FinNLP-AgentScen workshop during IJCAI-2024, attracting 12 teams. Their novel solutions outperformed GPT-4, showcasing FinBen's potential to drive innovation in financial LLMs. All datasets, results, and codes are released for the research community: https://github.com/The-FinAI/PIXIU. △ Less

Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: 26 pages, 11 figures

arXiv:2402.12088 [pdf, other]

Uniqueness, stability and algorithm for an inverse wave-number-dependent source problems

Authors: Mengjie Zhao, Suliang Si, Guanghui Hu

Abstract: This paper is concerned with an inverse wavenumber/frequency-dependent source problem for the Helmholtz equation. In two and three dimensions, the unknown source term is supposed to be compactly supported in spatial variables but independent on one spatial variable. The dependence of the source function on wavenumber/frequency is supposed to be unknown. Based on the Dirichlet-Laplacian and Fourier… ▽ More This paper is concerned with an inverse wavenumber/frequency-dependent source problem for the Helmholtz equation. In two and three dimensions, the unknown source term is supposed to be compactly supported in spatial variables but independent on one spatial variable. The dependence of the source function on wavenumber/frequency is supposed to be unknown. Based on the Dirichlet-Laplacian and Fourier-Transform methods, we develop two effcient non-iterative numerical algorithms to recover the wavenumber-dependent source. Uniqueness proof and increasing stability analysis are carried out in terms of the boundary measurement data of Dirichlet kind. Numerical experiments are conducted to illustrate the effectiveness and efficiency of the proposed methods. △ Less

Submitted 31 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: 26 pages, 35 figures

MSC Class: 35P25; 35Q30; 45Q05; 78A46

arXiv:2402.11230 [pdf, other]

Time-harmonic scattering by locally perturbed periodic structures with Dirichlet and Neumann boundary conditions

Authors: Guanghui Hu, Andreas Kirsch

Abstract: The paper is concerned with well-posedness of TE and TM polarizations of time-harmonic electromagnetic scattering by perfectly conducting periodic surfaces and periodically arrayed obstacles with local perturbations. The classical Rayleigh Expansion radiation condition does not always lead to well-posedness of the Helmholtz equation even in unperturbed periodic structures. We propose two equivalen… ▽ More The paper is concerned with well-posedness of TE and TM polarizations of time-harmonic electromagnetic scattering by perfectly conducting periodic surfaces and periodically arrayed obstacles with local perturbations. The classical Rayleigh Expansion radiation condition does not always lead to well-posedness of the Helmholtz equation even in unperturbed periodic structures. We propose two equivalent radiation conditions to characterize the radiating behavior of time-harmonic wave fields incited by a source term in an open waveguide under impenetrable boundary conditions. With these open waveguide radiation conditions, uniqueness and existence of time-harmonic scattering by incoming point source waves, plane waves and surface waves from locally perturbed periodic structures are established under either the Dirichlet or Neumann boundary condition. A DtN operator without using the Green's function is constructed for proving well-posedness of perturbed scattering problems. △ Less

Submitted 17 February, 2024; originally announced February 2024.

arXiv:2402.10186 [pdf, other]

Self-consistent Validation for Machine Learning Electronic Structure

Authors: Gengyuan Hu, Gengchen Wei, Zekun Lou, Philip H. S. Torr, Wanli Ouyang, Han-sen Zhong, Chen Lin

Abstract: Machine learning has emerged as a significant approach to efficiently tackle electronic structure problems. Despite its potential, there is less guarantee for the model to generalize to unseen data that hinders its application in real-world scenarios. To address this issue, a technique has been proposed to estimate the accuracy of the predictions. This method integrates machine learning with self-… ▽ More Machine learning has emerged as a significant approach to efficiently tackle electronic structure problems. Despite its potential, there is less guarantee for the model to generalize to unseen data that hinders its application in real-world scenarios. To address this issue, a technique has been proposed to estimate the accuracy of the predictions. This method integrates machine learning with self-consistent field methods to achieve both low validation cost and interpret-ability. This, in turn, enables exploration of the model's ability with active learning and instills confidence in its integration into real-world studies. △ Less

Submitted 15 February, 2024; originally announced February 2024.

Comments: 6 pages, 4 figures

Showing 1–50 of 523 results for author: Hu, G