Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 260 results for author: Gan, L

.
  1. arXiv:2408.12237  [pdf, other

    cs.AI cs.LG

    Weight Scope Alignment: A Frustratingly Easy Method for Model Merging

    Authors: Yichu Xu, Xin-Chun Li, Le Gan, De-Chuan Zhan

    Abstract: Merging models becomes a fundamental procedure in some applications that consider model efficiency and robustness. The training randomness or Non-I.I.D. data poses a huge challenge for averaging-based model fusion. Previous research efforts focus on element-wise regularization or neural permutations to enhance model averaging while overlooking weight scope variations among models, which can signif… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  2. arXiv:2408.10043  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Integrated Sensing and Communications

    Authors: Haoxian Niu, Jiancheng An, Anastasios Papazafeiropoulos, Lu Gan, Symeon Chatzinotas, Mérouane Debbah

    Abstract: Stacked intelligent metasurfaces (SIM) have recently emerged as a promising technology, which can realize transmit precoding in the wave domain. In this paper, we investigate a SIM-aided integrated sensing and communications system, in which SIM is capable of generating a desired beam pattern for simultaneously communicating with multiple downlink users and detecting a radar target. Specifically,… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 15 pages, 5 figures, accepted by IEEE WCL

  3. arXiv:2408.04837  [pdf, ps, other

    cs.IT eess.SP

    Multi-User MISO with Stacked Intelligent Metasurfaces: A DRL-Based Sum-Rate Optimization Approach

    Authors: Hao Liu, Jiancheng An, George C. Alexandropoulos, Derrick Wing Kwan Ng, Chau Yuen, Lu Gan

    Abstract: Stacked intelligent metasurfaces (SIMs) represent a novel signal processing paradigm that enables over-the-air processing of electromagnetic waves at the speed of light. Their multi-layer architecture exhibits customizable computational capabilities compared to conventional single-layer reconfigurable intelligent surfaces and metasurface lenses. In this paper, we deploy SIM to improve the performa… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 34 pages, 11 figures, 2 tables. arXiv admin note: text overlap with arXiv:2402.09006

  4. arXiv:2408.01091  [pdf, other

    cs.AI

    Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions

    Authors: Jin Gao, Lei Gan, Yuankai Li, Yixin Ye, Dequan Wang

    Abstract: Large multimodal models (LMMs) excel in adhering to human instructions. However, self-contradictory instructions may arise due to the increasing trend of multimodal interaction and context length, which is challenging for language beginners and vulnerable populations. We introduce the Self-Contradictory Instructions benchmark to evaluate the capability of LMMs in recognizing conflicting commands.… ▽ More

    Submitted 5 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: Accepted by the 18th European Conference on Computer Vision ECCV 2024

  5. arXiv:2407.20428  [pdf, ps, other

    math.RT

    Bounding regularity of $FI^m$-modules

    Authors: Wee Liang Gan, Khoa Ta

    Abstract: Let $FI$ be a skeleton of the category of finite sets and injective maps, and $FI^m$ the product of $m$ copies of $FI$. We prove that if an $FI^m$-module is generated in degree $\leqslant d$ and related in degree $\leqslant r$, then its regularity is bounded above by a function of $m$, $d$, and $r$.

    Submitted 29 July, 2024; originally announced July 2024.

  6. arXiv:2407.15053  [pdf, ps, other

    cs.IT

    Stacked Intelligent Metasurfaces for Task-Oriented Semantic Communications

    Authors: Guojun Huang, Jiancheng An, Zhaohui Yang, Lu Gan, Mehdi Bennis, Mérouane Debbah

    Abstract: Semantic communication leveraging advanced deep learning (DL) technologies enhances the efficiency, reliability, and security of information transmission. Emerging stacked intelligent metasurface (SIM) having a diffractive neural network (DNN) architecture allows performing complex calculations at the speed of light. In this letter, we introduce an innovative SIM-aided semantic communication syste… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  7. arXiv:2407.13214  [pdf, other

    cs.CV

    TXL-PBC: a freely accessible labeled peripheral blood cell dataset

    Authors: Lu Gan, Xi Li

    Abstract: In a recent study, we found that publicly BCCD and BCD datasets have significant issues such as labeling errors, insufficient sample size, and poor data quality. To address these problems, we performed sample deletion, re-labeling, and integration of these two datasets. Additionally, we introduced the PBC and Raabin-WBC datasets, and ultimately created a high-quality, sample-balanced new dataset,… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  8. arXiv:2407.07614  [pdf, other

    cs.CV

    MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

    Authors: Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, LeiLei Gan, Hao Jiang

    Abstract: Auto-regressive models have made significant progress in the realm of language generation, yet they do not perform on par with diffusion models in the domain of image synthesis. In this work, we introduce MARS, a novel framework for T2I generation that incorporates a specially designed Semantic Vision-Language Integration Expert (SemVIE). This innovative component integrates pre-trained LLMs by in… ▽ More

    Submitted 11 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures

  9. arXiv:2407.03566  [pdf, ps, other

    cs.IT eess.SP

    Stacked Intelligent Metasurfaces for Wireless Sensing and Communication: Applications and Challenges

    Authors: Hao Liu, Jiancheng An, Xing Jia, Shining Lin, Xianghao Yao, Lu Gan, Bruno Clerckx, Chau Yuen, Mehdi Bennis, Mérouane Debbah

    Abstract: The rapid advancement of wireless communication technologies has precipitated an unprecedented demand for high data rates, extremely low latency, and ubiquitous connectivity. In order to achieve these goals, stacked intelligent metasurfaces (SIM) has been developed as a novel solution to perform advanced signal processing tasks directly in the electromagnetic wave domain, thus achieving ultra-fast… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 pages, 5 figures, 1 table

  10. arXiv:2407.03316  [pdf, other

    nucl-ex hep-ex

    An Upper Limit on the Photoproduction Cross Section of the Spin-Exotic $π_1(1600)$

    Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole , et al. (124 additional authors not shown)

    Abstract: The spin-exotic hybrid meson $π_{1}(1600)$ is predicted to have a large decay rate to the $ωππ$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $γp \to ωπ^+ π^- p$, $γp \to ωπ^0 π^0 p$, and $γp\toωπ^-π^0Δ^{++}$ in the range $E_γ=$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction c… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures plus supplemental materials

  11. arXiv:2406.19585  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Effect of interfacial Fe3O4 nanoparticles on the microstructure and mechanical properties of textured alumina densified by ultrafast high-temperature sintering

    Authors: Rohit Pratyush Behera, Andrew Yun Ru Ng, Zehui Du, Chee Lip Gan, Hortense Le Ferrand

    Abstract: Alumina microplatelets coated with a small amount of Fe3O4 can be oriented via a rotating magnetic field to create texture. After ultrafast high-temperature sintering (UHS), Fe atoms are found at the grain boundaries and within the grains, influencing the mechanical properties. Here, we compare the microstructure and mechanical properties of textured alumina prepared with and without Fe3O4 and sin… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 10 pages, 11 figures, contains main manuscript and supplementary file

    Journal ref: Journal of the European Ceramic Society 44 (2024) 116696

  12. arXiv:2406.16989  [pdf, other

    cs.LG cs.AI

    Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning

    Authors: Ziyu Zhao, Leilei Gan, Guoyin Wang, Yuwei Hu, Tao Shen, Hongxia Yang, Kun Kuang, Fei Wu

    Abstract: Low-Rank Adaptation (LoRA) offers an efficient way to fine-tune large language models (LLMs). Its modular and plug-and-play nature allows the integration of various domain-specific LoRAs, enhancing LLM capabilities. Open-source platforms like Huggingface and Modelscope have introduced a new computational paradigm, Uploadable Machine Learning (UML). In UML, contributors use decentralized data to tr… ▽ More

    Submitted 16 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2402.09997

  13. arXiv:2406.12829  [pdf, other

    nucl-ex

    Measurement of Spin-Density Matrix Elements in $Δ^{++}(1232)$ photoproduction

    Authors: F. Afzal, C. S. Akondi, M. Albrecht, M. Amaryan, S. Arrigo, V. Arroyave, A. Asaturyan, A. Austregesilo, Z. Baldwin, F. Barbosa, J. Barlow, E. Barriga, R. Barsotti, D. Barton, V. Baturin, V. V. Berdnikov, T. Black, W. Boeglin, M. Boer, W. J. Briscoe, T. Britton, S. Cao, E. Chudakov, G. Chung, P. L. Cole , et al. (124 additional authors not shown)

    Abstract: We measure the spin-density matrix elements (SDMEs) of the $Δ^{++}(1232)$ in the photoproduction reaction $γp \to π^-Δ^{++}(1232)$ with the GlueX experiment in Hall D at Jefferson Lab. The measurement uses a linearly--polarized photon beam with energies from $8.2$ to $8.8$~GeV and the statistical precision of the SDMEs exceeds the previous measurement by three orders of magnitude for the momentum… ▽ More

    Submitted 26 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  14. arXiv:2406.11793  [pdf, other

    cs.RO

    FetchBench: A Simulation Benchmark for Robot Fetching

    Authors: Beining Han, Meenal Parakh, Derek Geng, Jack A Defay, Luyang Gan, Jia Deng

    Abstract: Fetching, which includes approaching, grasping, and retrieving, is a critical challenge for robot manipulation tasks. Existing methods primarily focus on table-top scenarios, which do not adequately capture the complexities of environments where both grasping and planning are essential. To address this gap, we propose a new benchmark FetchBench, featuring diverse procedural scenes that integrate b… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  15. arXiv:2406.09486  [pdf, other

    cs.CV cs.AI

    SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets

    Authors: Shenghua Wan, Ziyuan Chen, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based offline reinforcement Learning (RL) is a promising approach that leverages existing data effectively in many real-world applications, especially those involving high-dimensional inputs like images and videos. To alleviate the distribution shift issue in offline RL, existing model-based methods heavily rely on the uncertainty of learned dynamics. However, the model uncertainty estimatio… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 10 figures

  16. arXiv:2406.09231  [pdf, other

    nucl-th quant-ph

    Elastic scattering on a quantum computer

    Authors: Muhammad Yusf, Ling Gan, Cameron Moffat, Gautam Rupak

    Abstract: Scattering probes the internal structure of quantum systems. We calculate the two-particle elastic scattering phase shift for a short-ranged interaction on a quantum computer. Short-ranged interactions with a large scattering length or shallow bound state describe a universality class that is of interest in atomic, condensed matter, nuclear, and particle physics. The phase shift is calculated by r… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 2 tables

  17. arXiv:2406.09058  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook Design for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Ertugrul Basar, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can proactively reshape the characteristics of wireless channel environments. In RIS-assisted communication systems, the acquisition of channel state information (CSI) and the optimization of reflecting coefficients constitute major design challenges. To address these issues, codebook-based sol… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 36 pages, 12 figures, 2 tables, accepted by IEEE TCOM. arXiv admin note: text overlap with arXiv:2404.00265

  18. arXiv:2406.06852  [pdf, other

    cs.CR cs.AI cs.CL

    A Survey of Backdoor Attacks and Defenses on Large Language Models: Implications for Security Measures

    Authors: Shuai Zhao, Meihuizi Jia, Zhongliang Guo, Leilei Gan, Xiaoyu Xu, Jie Fu, Yichao Feng, Fengjun Pan, Luu Anh Tuan

    Abstract: The large language models (LLMs), which bridge the gap between human language understanding and complex problem-solving, achieve state-of-the-art performance on several NLP tasks, particularly in few-shot and zero-shot settings. Despite the demonstrable efficacy of LMMs, due to constraints on computational resources, users have to engage with open-source language models or outsource the entire tra… ▽ More

    Submitted 19 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  19. arXiv:2405.18763  [pdf, ps, other

    math.PR q-bio.PE

    Stationary distribution approximations of Two-island Wright-Fisher and seed-bank models using Stein's method

    Authors: Han L. Gan, Maite Wilke-Berenguer

    Abstract: We consider two finite population Markov chain models, the two-island Wright-Fisher model with mutation, and the seed-bank model with mutation. Despite the relatively simple descriptions of the two processes, the the exact form of their stationary distributions is in general intractable. For each of the two models we provide two approximation theorems with explicit upper bounds on the distance bet… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 37 pages

    MSC Class: 92D25; 60F05; 60J95

  20. arXiv:2405.13078  [pdf, other

    cs.LG

    Exploring Dark Knowledge under Various Teacher Capacities and Addressing Capacity Mismatch

    Authors: Xin-Chun Li, Wen-Shu Fan, Bowen Tao, Le Gan, De-Chuan Zhan

    Abstract: Knowledge Distillation (KD) could transfer the ``dark knowledge" of a well-performed yet large neural network to a weaker but lightweight one. From the view of output logits and softened probabilities, this paper goes deeper into the dark knowledge provided by teachers with different capacities. Two fundamental observations are: (1) a larger teacher tends to produce probability vectors that are le… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  21. arXiv:2405.06004  [pdf, other

    physics.ao-ph cs.AI cs.LG

    EWMoE: An effective model for global weather forecasting with mixture-of-experts

    Authors: Lihao Gan, Xin Man, Chenghong Zhang, Jie Shao

    Abstract: Weather forecasting is a crucial task for meteorologic research, with direct social and economic impacts. Recently, data-driven weather forecasting models based on deep learning have shown great potential, achieving superior performance compared with traditional numerical weather prediction methods. However, these models often require massive training data and computational resources. In this pape… ▽ More

    Submitted 23 August, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  22. arXiv:2404.14233  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

    Authors: Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu

    Abstract: The rapidly developing Large Vision Language Models (LVLMs) have shown notable capabilities on a range of multi-modal tasks, but still face the hallucination phenomena where the generated texts do not align with the given contexts, significantly restricting the usages of LVLMs. Most previous work detects and mitigates hallucination at the coarse-grained level or requires expensive annotation (e.g.… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  23. arXiv:2404.03386  [pdf, other

    cs.RO cs.AI cs.LG

    SENSOR: Imitate Third-Person Expert's Behaviors via Active Sensoring

    Authors: Kaichen Huang, Minghao Shao, Shenghua Wan, Hai-Hang Sun, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: In many real-world visual Imitation Learning (IL) scenarios, there is a misalignment between the agent's and the expert's perspectives, which might lead to the failure of imitation. Previous methods have generally solved this problem by domain alignment, which incurs extra computation and storage costs, and these methods fail to handle the \textit{hard cases} where the viewpoint gap is too large.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  24. arXiv:2404.03382  [pdf, other

    cs.LG cs.AI

    DIDA: Denoised Imitation Learning based on Domain Adaptation

    Authors: Kaichen Huang, Hai-Hang Sun, Shenghua Wan, Minghao Shao, Shuai Feng, Le Gan, De-Chuan Zhan

    Abstract: Imitating skills from low-quality datasets, such as sub-optimal demonstrations and observations with distractors, is common in real-world applications. In this work, we focus on the problem of Learning from Noisy Demonstrations (LND), where the imitator is required to learn from data with noise that often occurs during the processes of data collection or transmission. Previous IL methods improve t… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  25. arXiv:2404.00265  [pdf, ps, other

    cs.IT eess.SP

    Environment-Aware Codebook for RIS-Assisted MU-MISO Communications: Implementation and Performance Analysis

    Authors: Zhiheng Yu, Jiancheng An, Lu Gan, Chau Yuen

    Abstract: Reconfigurable intelligent surface (RIS) provides a new electromagnetic response control solution, which can reshape the characteristics of wireless channels. In this paper, we propose a novel environment-aware codebook protocol for RIS-assisted multi-user multiple-input single-output (MU-MISO) systems. Specifically, we first introduce a channel training protocol which consists of off-line and on-… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures, accepted by VTC2024-Spring

  26. arXiv:2403.09976  [pdf, other

    cs.LG cs.CV

    AD3: Implicit Action is the Key for World Models to Distinguish the Diverse Visual Distractors

    Authors: Yucen Wang, Shenghua Wan, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based methods have significantly contributed to distinguishing task-irrelevant distractors for visual control. However, prior research has primarily focused on heterogeneous distractors like noisy background videos, leaving homogeneous distractors that closely resemble controllable agents largely unexplored, which poses significant challenges to existing methods. To tackle this problem, we p… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

  27. Stacked Intelligent Metasurface Enabled LEO Satellite Communications Relying on Statistical CSI

    Authors: Shining Lin, Jiancheng An, Lu Gan, Mérouane Debbah, Chau Yuen

    Abstract: Low earth orbit (LEO) satellite communication systems have gained increasing attention as a crucial supplement to terrestrial wireless networks due to their extensive coverage area. This letter presents a novel system design for LEO satellite systems by leveraging stacked intelligent metasurface (SIM) technology. Specifically, the lightweight and energy-efficient SIM is mounted on a satellite to a… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures, accepted by IEEE WCL

  28. Channel Estimation for Stacked Intelligent Metasurface-Assisted Wireless Networks

    Authors: Xianghao Yao, Jiancheng An, Lu Gan, Marco Di Renzo, Chau Yuen

    Abstract: Emerging technologies, such as holographic multiple-input multiple-output (HMIMO) and stacked intelligent metasurface (SIM), are driving the development of wireless communication systems. Specifically, the SIM is physically constructed by stacking multiple layers of metasurfaces and has an architecture similar to an artificial neural network (ANN), which can flexibly manipulate the electromagnetic… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 13 pages, 3 figures, accepted by IEEE WCL

  29. arXiv:2403.03215  [pdf, other

    cs.RO

    A Safety-Critical Framework for UGVs in Complex Environments: A Data-Driven Discrepancy-Aware Approach

    Authors: Skylar X. Wei, Lu Gan, Joel W. Burdick

    Abstract: This work presents a novel data-driven multi-layered planning and control framework for the safe navigation of a class of unmanned ground vehicles (UGVs) in the presence of unknown stationary obstacles and additive modeling uncertainties. The foundation of this framework is a novel robust model predictive planner, designed to generate optimal collision-free trajectories given an occupancy grid map… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  30. arXiv:2402.13532  [pdf, other

    cs.CL

    Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation

    Authors: Quanyu Long, Yue Deng, LeiLei Gan, Wenya Wang, Sinno Jialin Pan

    Abstract: Dense retrievers and retrieval-augmented language models have been widely used in various NLP applications. Despite being designed to deliver reliable and secure outcomes, the vulnerability of retrievers to potential attacks remains unclear, raising concerns about their security. In this paper, we introduce a novel scenario where the attackers aim to covertly disseminate targeted misinformation, s… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  31. arXiv:2402.12168  [pdf, other

    cs.CR cs.AI cs.CL

    Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning

    Authors: Shuai Zhao, Leilei Gan, Luu Anh Tuan, Jie Fu, Lingjuan Lyu, Meihuizi Jia, Jinming Wen

    Abstract: Recently, various parameter-efficient fine-tuning (PEFT) strategies for application to language models have been proposed and successfully implemented. However, this raises the question of whether PEFT, which only updates a limited set of model parameters, constitutes security vulnerabilities when confronted with weight-poisoning backdoor attacks. In this study, we show that PEFT is more susceptib… ▽ More

    Submitted 29 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: NAACL Findings 2024

  32. arXiv:2402.09997  [pdf, other

    cs.AI cs.CL cs.LG

    LoraRetriever: Input-Aware LoRA Retrieval and Composition for Mixed Tasks in the Wild

    Authors: Ziyu Zhao, Leilei Gan, Guoyin Wang, Wangchunshu Zhou, Hongxia Yang, Kun Kuang, Fei Wu

    Abstract: Low-Rank Adaptation (LoRA) provides an effective yet efficient solution for fine-tuning large language models (LLM). The modular and plug-and-play nature of LoRA enables the integration of diverse domain-specific LoRAs to enhance the capabilities of LLMs. Previous research on exploiting multiple LoRAs either focuses on specific isolated downstream tasks or fixes the selection of LoRAs during train… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  33. arXiv:2402.09006  [pdf, other

    eess.SP

    DRL-Based Orchestration of Multi-User MISO Systems with Stacked Intelligent Metasurfaces

    Authors: Hao Liu, Jiancheng An, Derrick Wing Kwan Ng, George C. Alexandropoulos, Lu Gan

    Abstract: Stacked intelligent metasurfaces (SIM) represents an advanced signal processing paradigm that enables over-the-air processing of electromagnetic waves at the speed of light. Its multi-layer structure exhibits customizable increased computational capability compared to conventional single-layer reconfigurable intelligent surfaces and metasurface lenses. In this paper, we deploy SIM to improve the p… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: accepted by IEEE ICC 2024

  34. arXiv:2401.08939  [pdf, other

    cs.RO

    Enhancing Campus Mobility: Achievements and Challenges of Autonomous Shuttle "Snow Lion''

    Authors: Yingbing Chen, Jie Cheng, Sheng Wang, Hongji Liu, Xiaodong Mei, Xiaoyang Yan, Mingkai Tang, Ge Sun, Ya Wen, Junwei Cai, Xupeng Xie, Lu Gan, Mandan Chao, Ren Xin, Ming Liu, Jianhao Jiao, Kangcheng Liu, Lujia Wang

    Abstract: The rapid evolution of autonomous vehicles (AVs) has significantly influenced global transportation systems. In this context, we present ``Snow Lion'', an autonomous shuttle meticulously designed to revolutionize on-campus transportation, offering a safer and more efficient mobility solution for students, faculty, and visitors. The primary objective of this research is to enhance campus mobility b… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 9 pages, 9 figures

  35. arXiv:2312.15394  [pdf, ps, other

    math.FA

    Order Relations of the Wasserstein mean and the spectral geometric mean

    Authors: Luyining Gan, Huajun Huang

    Abstract: On the space of positive definite matrices, several operator means are popular and have been studied extensively. In this paper, we investigate the near order and the Löwner order relations on the curves defined by the Wasserstein mean and the spectral geometric mean. We show that the near order $\preceq $ is stronger than the eigenvalue entrywise order, and that… ▽ More

    Submitted 5 May, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

    Comments: 17 pages

    MSC Class: 15A42; 15A45; 15B48

  36. arXiv:2312.10831  [pdf, other

    math.PR

    Steady-state Dirichlet approximation of the Wright-Fisher model using the prelimit generator comparison approach of Stein's method

    Authors: Anton Braverman, Han L. Gan

    Abstract: The Wright-Fisher model, originating in Wright (1931) is one of the canonical probabilistic models used in mathematical population genetics to study how genetic type frequencies evolve in time. In this paper we bound the rate of convergence of the stationary distribution for a finite population Wright-Fisher Markov chain with parent independent mutation to the Dirichlet distribution. Our result im… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    MSC Class: 92D25; 60F05 (Primary) 60K99 (Secondary)

  37. arXiv:2311.09860  [pdf, other

    cs.CL

    GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets

    Authors: Wolfgang Otto, Matthäus Zloch, Lu Gan, Saurav Karmakar, Stefan Dietze

    Abstract: Named Entity Recognition (NER) models play a crucial role in various NLP tasks, including information extraction (IE) and text understanding. In academic writing, references to machine learning models and datasets are fundamental components of various computer science publications and necessitate accurate models for identification. Despite the advancements in NER, existing ground truth datasets do… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 10 pages, 1 figure, Accepted at EMNLP2023-Findings

  38. arXiv:2311.08179  [pdf, other

    eess.SP cs.AI

    Semi-Supervised Learning via Swapped Prediction for Communication Signal Recognition

    Authors: Weidong Wang, Hongshu Liao, Lu Gan

    Abstract: Deep neural networks have been widely used in communication signal recognition and achieved remarkable performance, but this superiority typically depends on using massive examples for supervised learning, whereas training a deep neural network on small datasets with few labels generally falls into overfitting, resulting in degenerated performance. To this end, we develop a semi-supervised learnin… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  39. IR-STP: Enhancing Autonomous Driving with Interaction Reasoning in Spatio-Temporal Planning

    Authors: Yingbing Chen, Jie Cheng, Lu Gan, Sheng Wang, Hongji Liu, Xiaodong Mei, Ming Liu

    Abstract: Considerable research efforts have been devoted to the development of motion planning algorithms, which form a cornerstone of the autonomous driving system (ADS). Nonetheless, acquiring an interactive and secure trajectory for the ADS remains challenging due to the complex nature of interaction modeling in planning. Modern planning methods still employ a uniform treatment of prediction outcomes an… ▽ More

    Submitted 15 February, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: 12 pages, 10 figures, accepted by IEEE-TITS at this January

    MSC Class: 68T40 ACM Class: I.0; J.2

  40. arXiv:2310.19372  [pdf, other

    cs.CV cs.AI cs.RO

    RGB-X Object Detection via Scene-Specific Fusion Modules

    Authors: Sri Aditya Deevi, Connor Lee, Lu Gan, Sushruth Nagesh, Gaurav Pandey, Soon-Jo Chung

    Abstract: Multimodal deep sensor fusion has the potential to enable autonomous vehicles to visually understand their surrounding environments in all weather conditions. However, existing deep sensor fusion methods usually employ convoluted architectures with intermingled multimodal features, requiring large coregistered multimodal datasets for training. In this work, we present an efficient and modular RGB-… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2024)

  41. arXiv:2310.11142  [pdf, other

    cs.CV cs.LG

    BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference

    Authors: Siqi Kou, Lei Gan, Dequan Wang, Chongxuan Li, Zhijie Deng

    Abstract: Diffusion models have impressive image generation capability, but low-quality generations still exist, and their identification remains challenging due to the lack of a proper sample-wise metric. To address this, we propose BayesDiff, a pixel-wise uncertainty estimator for generations from diffusion models based on Bayesian inference. In particular, we derive a novel uncertainty iteration principl… ▽ More

    Submitted 4 March, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  42. arXiv:2310.05629  [pdf, ps, other

    eess.AS cs.SD

    Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments

    Authors: Junkang Yang, Hongqing Liu, Lu Gan, Yi Zhou

    Abstract: Speech super-resolution (SSR) aims to predict a high resolution (HR) speech signal from its low resolution (LR) corresponding part. Most neural SSR models focus on producing the final result in a noise-free environment by recovering the spectrogram of high-frequency part of the signal and concatenating it with the original low-frequency part. Although these methods achieve high accuracy, they beco… ▽ More

    Submitted 9 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  43. arXiv:2309.04945  [pdf, other

    cs.PL cs.SE

    O2ATH: An OpenMP Offloading Toolkit for the Sunway Heterogeneous Manycore Platform

    Authors: Haoran Lin, Lifeng Yan, Qixin Chang, Haitian Lu, Chenlin Li, Quanjie He, Zeyu Song, Xiaohui Duan, Zekun Yin, Yuxuan Li, Zhao Liu, Wei Xue, Haohuan Fu, Lin Gan, Guangwen Yang, Weiguo Liu

    Abstract: The next generation Sunway supercomputer employs the SW26010pro processor, which features a specialized on-chip heterogeneous architecture. Applications with significant hotspots can benefit from the great computation capacity improvement of Sunway many-core architectures by carefully making intensive manual many-core parallelization efforts. However, some legacy projects with large codebases, suc… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 15 pages, 6 figures, 5 tables,

  44. arXiv:2308.01475  [pdf, other

    stat.ML cs.LG stat.ME

    Interpretable Machine Learning for Discovery: Statistical Challenges \& Opportunities

    Authors: Genevera I. Allen, Luqin Gan, Lili Zheng

    Abstract: New technologies have led to vast troves of large and complex datasets across many scientific domains and industries. People routinely use machine learning techniques to not only process, visualize, and make predictions from this big data, but also to make data-driven discoveries. These discoveries are often made using Interpretable Machine Learning, or machine learning models and techniques that… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  45. arXiv:2307.15509  [pdf

    cond-mat.soft

    Tunable topological phase transition in soft Rayleigh beam system with imperfect interfaces

    Authors: Tao Feng, Letian Gan, Shiheng Zhao, Zheng Chang, Siyang Li, Yaoting Xue, Xuxu Yang, Tuck-Whye Wong, Tiefeng Li, Weiqiu Chen

    Abstract: Acoustic metamaterials, particularly the topological insulators, exhibit exceptional wave characteristics that have sparked considerable research interest. The study of imperfect interfaces affect is of significant importance for the modeling of wave propagation behavior in topological insulators. This paper models a soft Rayleigh beam system with imperfect interfaces, and investigates its topolog… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 39 pages,8 figures

  46. arXiv:2307.09027  [pdf, other

    cs.CV cs.RO

    Online Self-Supervised Thermal Water Segmentation for Aerial Vehicles

    Authors: Connor Lee, Jonathan Gustafsson Frennert, Lu Gan, Matthew Anderson, Soon-Jo Chung

    Abstract: We present a new method to adapt an RGB-trained water segmentation network to target-domain aerial thermal imagery using online self-supervision by leveraging texture and motion cues as supervisory signals. This new thermal capability enables current autonomous aerial robots operating in near-shore environments to perform tasks such as visual navigation, bathymetry, and flow tracking at night. Our… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 8 pages, 4 figures, 3 tables

  47. arXiv:2307.08732  [pdf, other

    math-ph cond-mat.stat-mech hep-th math.CO

    On the algebraic area of cubic lattice walks

    Authors: Li Gan

    Abstract: We obtain an explicit formula to enumerate closed random walks on a cubic lattice with a specified length and 3D algebraic area. The 3D algebraic area is defined as the sum of algebraic areas obtained from the walk's projection onto the three Cartesian planes. This enumeration formula can be mapped onto the cluster coefficients of three types of particles that obey quantum exclusion statistics wit… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 14 pages, 5 figures, comments welcome

    Journal ref: Phys. Rev. E 108, 054104 (2023)

  48. arXiv:2307.08056  [pdf, ps, other

    math.CO cs.CC

    An algorithmic version of the Hajnal--Szemerédi theorem

    Authors: Luyining Gan, Jie Han, Jie Hu

    Abstract: A $K_r$-factor of a graph $G$ is a collection of vertex disjoint $r$-cliques covering $V(G)$. We prove the following algorithmic version of the classical Hajnal--Szemerédi Theorem in graph theory, when $r$ is considered as a constant. Given $r, c, n\in \mathbb{N}$ such that $n\in r\mathbb N$, let $G$ be an $n$-vertex graph with minimum degree at least $(1-1/r)n - c$. Then there is an algorithm wit… ▽ More

    Submitted 8 July, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: 31 pages

  49. arXiv:2306.13895  [pdf, other

    eess.SP cs.CV

    Open-Set RF Fingerprinting via Improved Prototype Learning

    Authors: Weidong Wang, Hongshu Liao, Lu Gan

    Abstract: Deep learning has been widely used in radio frequency (RF) fingerprinting. Despite its excellent performance, most existing methods only consider a closed-set assumption, which cannot effectively tackle signals emitted from those unknown devices that have never been seen during training. In this letter, we exploit prototype learning for open-set RF fingerprinting and propose two improvements, incl… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

  50. arXiv:2306.13893  [pdf, other

    eess.SP cs.AI cs.CV

    Radio Generation Using Generative Adversarial Networks with An Unrolled Design

    Authors: Weidong Wang, Jiancheng An, Hongshu Liao, Lu Gan, Chau Yuen

    Abstract: As a revolutionary generative paradigm of deep learning, generative adversarial networks (GANs) have been widely applied in various fields to synthesize realistic data. However, it is challenging for conventional GANs to synthesize raw signal data, especially in some complex cases. In this paper, we develop a novel GAN framework for radio generation called "Radio GAN". Compared to conventional met… ▽ More

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: Submitted to IEEE Transactions on Cognitive Communications and Networking on 20-Dec-2022