Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 135 results for author: Lu, A

.
  1. arXiv:2408.08827  [pdf, other

    cs.CV

    RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba

    Authors: Andong Lu, Wanyu Wang, Chenglong Li, Jin Tang, Bin Luo

    Abstract: Existing RGBT tracking methods often design various interaction models to perform cross-modal fusion of each layer, but can not execute the feature interactions among all layers, which plays a critical role in robust multimodal representation, due to large computational burden. To address this issue, this paper presents a novel All-layer multimodal Interaction Network, named AINet, which performs… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  2. arXiv:2408.07921  [pdf

    cs.LG

    Physics-Informed Neural Network for Predicting Out-of-Training-Range TCAD Solution with Minimized Domain Expertise

    Authors: Albert Lu, Yu Foon Chau, Hiu Yung Wong

    Abstract: Machine learning (ML) is promising in assisting technology computer-aided design (TCAD) simulations to alleviate difficulty in convergence and prolonged simulation time. While ML is widely used in TCAD, they either require access to the internal solver, require extensive domain expertise, are only trained by terminal quantities such as currents and voltages, and/or lack out-of-training-range predi… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  3. arXiv:2408.06907  [pdf, other

    cs.IT

    An Information Geometry Interpretation for Approximate Message Passing

    Authors: Bingyan Liu, An-An Lu, Mingrui Fan, Jiyuan Yang, Xiqi Gao

    Abstract: In this paper, we propose an information geometry (IG) framework to solve the standard linear regression problem. The proposed framework is an extension of the one for computing the mean of complex multivariate Gaussian distribution. By applying the proposed framework, the information geometry approach (IGA) and the approximate information geometry approach (AIGA) for basis pursuit de-noising (BPD… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 30 pages, 5 figures

  4. arXiv:2408.04579  [pdf, other

    cs.CV

    SAM2-Adapter: Evaluating & Adapting Segment Anything 2 in Downstream Tasks: Camouflage, Shadow, Medical Image Segmentation, and More

    Authors: Tianrun Chen, Ankang Lu, Lanyun Zhu, Chaotao Ding, Chunan Yu, Deyi Ji, Zejian Li, Lingyun Sun, Papa Mao, Ying Zang

    Abstract: The advent of large models, also known as foundation models, has significantly transformed the AI research landscape, with models like Segment Anything (SAM) achieving notable success in diverse image segmentation scenarios. Despite its advancements, SAM encountered limitations in handling some complex low-level segmentation tasks like camouflaged object and medical imaging. In response, in 2023,… ▽ More

    Submitted 10 August, 2024; v1 submitted 8 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: text overlap with arXiv:2304.09148

  5. arXiv:2408.02222  [pdf, other

    cs.CV

    Cross-modulated Attention Transformer for RGBT Tracking

    Authors: Yun Xiao, Jiacong Zhao, Andong Lu, Chenglong Li, Yin Lin, Bing Yin, Cong Liu

    Abstract: Existing Transformer-based RGBT trackers achieve remarkable performance benefits by leveraging self-attention to extract uni-modal features and cross-attention to enhance multi-modal feature interaction and template-search correlation computation. Nevertheless, the independent search-template correlation calculations ignore the consistency between branches, which can result in ambiguous and inappr… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: 10 pages, 5 figures

  6. arXiv:2407.18175  [pdf, other

    cs.LG cs.AI cs.CV

    Quasar-ViT: Hardware-Oriented Quantization-Aware Architecture Search for Vision Transformers

    Authors: Zhengang Li, Alec Lu, Yanyue Xie, Zhenglun Kong, Mengshu Sun, Hao Tang, Zhong Jia Xue, Peiyan Dong, Caiwen Ding, Yanzhi Wang, Xue Lin, Zhenman Fang

    Abstract: Vision transformers (ViTs) have demonstrated their superior accuracy for computer vision tasks compared to convolutional neural networks (CNNs). However, ViT models are often computation-intensive for efficient deployment on resource-limited edge devices. This work proposes Quasar-ViT, a hardware-oriented quantization-aware architecture search framework for ViTs, to design efficient ViT models for… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted by ICS 2024

  7. arXiv:2407.12322  [pdf, other

    cs.CV

    Frequency Guidance Matters: Skeletal Action Recognition by Frequency-Aware Mixed Transformer

    Authors: Wenhan Wu, Ce Zheng, Zihao Yang, Chen Chen, Srijan Das, Aidong Lu

    Abstract: Recently, transformers have demonstrated great potential for modeling long-term dependencies from skeleton sequences and thereby gained ever-increasing attention in skeleton action recognition. However, the existing transformer-based approaches heavily rely on the naive attention mechanism for capturing the spatiotemporal features, which falls short in learning discriminative representations that… ▽ More

    Submitted 29 July, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM Multimedia 2024

  8. arXiv:2406.19583  [pdf, other

    cs.IT

    Interference Cancellation Information Geometry Approach for Massive MIMO Channel Estimation

    Authors: An-An Lu, Bingyan Liu, Xiqi Gao

    Abstract: In this paper, the interference cancellation information geometry approaches (IC-IGAs) for massive MIMO channel estimation are proposed. The proposed algorithms are low-complexity approximations of the minimum mean square error (MMSE) estimation. To illustrate the proposed algorithms, a unified framework of the information geometry approach for channel estimation and its geometric explanation are… ▽ More

    Submitted 1 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 38 pages, 9 figures

  9. arXiv:2406.01291  [pdf, other

    astro-ph.GA

    WISDOM project XX -- Strong shear tearing molecular clouds apart in NGC 524

    Authors: Anan Lu, Daryl Haggard, Martin Bureau, Jindra Gensior, Sarah Jeffreson, Carmelle Robert, Thomas G. Williams, Fu-Heng Liang, Woorak Choi, Timothy A. Davis, Sara Babic, Hope Boyce, Benjamin Cheung, Laurent Drissen, Jacob S. Elford, Lijie Liu, Thomas Martin, Carter Rhea, Laurie Rousseau-Nepton, Ilaria Ruffa

    Abstract: Early-type galaxies (ETGs) are known to harbour dense spheroids of stars but scarce star formation (SF). Approximately a quarter of these galaxies have rich molecular gas reservoirs yet do not form stars efficiently. We study here the ETG NGC~524, with strong shear suspected to result in a smooth molecular gas disc and low star-formation efficiency (SFE). We present new spatially-resolved observat… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 17 pages, 10 figures. To be published in MNRAS, accepted on May 27

  10. arXiv:2405.19709  [pdf, other

    astro-ph.GA

    WISDOM Project -- XXI. Giant molecular clouds in the central region of the barred spiral galaxy NGC 613: a steep size -- linewidth relation

    Authors: Woorak Choi, Martin Bureau, Lijie Liu, Michele Cappellari, Timothy A. Davis, Jindra Gensior, Fu-Heng Liang, Anan Lu, Sanghyuk Moon, Ilaria Ruffa, Thomas G. Williams, Aeree Chung

    Abstract: NGC~613 is a nearby barred spiral galaxy with a nuclear ring. Exploiting high spatial resolution ($\approx20$ pc) Atacama Large Millimeter/sub-millimeter Array $^{12}$CO(1-0) observations, we study the giant molecular clouds (GMCs) in the nuclear ring and its vicinity, identifying $158$ spatially- and spectrally-resolved GMCs. The GMC sizes ($R_{\mathrm{c}}$) are comparable to those of the clouds… ▽ More

    Submitted 30 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: 15 pages, 8 figures, accepted for publication in MNRAS. arXiv admin note: text overlap with arXiv:2304.10471

  11. arXiv:2405.05428  [pdf, other

    cs.CV cs.CR cs.LG

    Adversary-Guided Motion Retargeting for Skeleton Anonymization

    Authors: Thomas Carr, Depeng Xu, Aidong Lu

    Abstract: Skeleton-based motion visualization is a rising field in computer vision, especially in the case of virtual reality (VR). With further advancements in human-pose estimation and skeleton extracting sensors, more and more applications that utilize skeleton data have come about. These skeletons may appear to be anonymous but they contain embedded personally identifiable information (PII). In this pap… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  12. arXiv:2405.02717  [pdf, other

    cs.CV

    AFter: Attention-based Fusion Router for RGBT Tracking

    Authors: Andong Lu, Wanyu Wang, Chenglong Li, Jin Tang, Bin Luo

    Abstract: Multi-modal feature fusion as a core investigative component of RGBT tracking emerges numerous fusion studies in recent years. However, existing RGBT tracking methods widely adopt fixed fusion structures to integrate multi-modal feature, which are hard to handle various challenges in dynamic scenarios. To address this problem, this work presents a novel \emph{A}ttention-based \emph{F}usion rou\emp… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Peer review

  13. arXiv:2404.14829  [pdf, other

    cs.LG cs.CV

    Revisiting Neural Networks for Continual Learning: An Architectural Perspective

    Authors: Aojun Lu, Tao Feng, Hangjie Yuan, Xiaotian Song, Yanan Sun

    Abstract: Efforts to overcome catastrophic forgetting have primarily centered around developing more effective Continual Learning (CL) methods. In contrast, less attention was devoted to analyzing the role of network architecture design (e.g., network depth, width, and components) in contributing to CL. This paper seeks to bridge this gap between network architecture design and CL, and to present a holistic… ▽ More

    Submitted 28 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

  14. arXiv:2404.07425  [pdf, ps, other

    eess.SP cs.IT

    Precoder Design for User-Centric Network Massive MIMO with Matrix Manifold Optimization

    Authors: Rui Sun, Li You, An-An Lu, Chen Sun, Xiqi Gao, Xiang-Gen Xia

    Abstract: In this paper, we investigate the precoder design for user-centric network (UCN) massive multiple-input multiple-output (mMIMO) downlink with matrix manifold optimization. In UCN mMIMO systems, each user terminal (UT) is served by a subset of base stations (BSs) instead of all the BSs, facilitating the implementation of the system and lowering the dimension of the precoders to be designed. By prov… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 13 pages, 9 figures, journal

  15. arXiv:2404.00986  [pdf, other

    cs.LG cs.CV

    Make Continual Learning Stronger via C-Flat

    Authors: Ang Bian, Wei Li, Hangjie Yuan, Chengrong Yu, Zixiang Zhao, Mang Wang, Aojun Lu, Tao Feng

    Abstract: Model generalization ability upon incrementally acquiring dynamically updating knowledge from sequentially arriving tasks is crucial to tackle the sensitivity-stability dilemma in Continual Learning (CL). Weight loss landscape sharpness minimization seeking for flat minima lying in neighborhoods with uniform low loss or smooth gradient is proven to be a strong training regime improving model gener… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  16. arXiv:2403.16151  [pdf, other

    cs.MA cs.IR

    Ultra Low-Cost Two-Stage Multimodal System for Non-Normative Behavior Detection

    Authors: Albert Lu, Stephen Cranefield

    Abstract: The online community has increasingly been inundated by a toxic wave of harmful comments. In response to this growing challenge, we introduce a two-stage ultra-low-cost multimodal harmful behavior detection method designed to identify harmful comments and images with high precision and recall rates. We first utilize the CLIP-ViT model to transform tweets and images into embeddings, effectively cap… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: to be appear in International Workshop on Coordination, Organizations, Institutions, Norms and Ethics for Governance of Multi-Agent Systems

  17. arXiv:2403.13588  [pdf, other

    cs.SE cs.CL

    Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models

    Authors: Chengzhe Feng, Yanan Sun, Ke Li, Pan Zhou, Jiancheng Lv, Aojun Lu

    Abstract: As Pre-trained Language Models (PLMs), a popular approach for code intelligence, continue to grow in size, the computational cost of their usage has become prohibitively expensive. Prompt learning, a recent development in the field of natural language processing, emerges as a potential solution to address this challenge. In this paper, we investigate the effectiveness of prompt learning in code in… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  18. arXiv:2403.12487  [pdf

    eess.SY

    Unveiling Four Key Factors for Tire Force Control Allocation in 4WID-4WIS Electric Vehicles at Handling Limits

    Authors: Ao Lu, Runfeng Li, Yunchang Yu, Ziwang Lu, Guangyu Tian

    Abstract: The four-wheel independent drive and four-wheel independent steering (4WID-4WIS) configurations enhance control flexibility and dynamic performance potential for more integrated electric vehicles. This paper comprehensively analyzes the impacts of four key factors on tire force control allocation: vertical load estimation, actuator dynamic characteristics, tire force constraints, and wheel steerin… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  19. arXiv:2403.02563  [pdf, ps, other

    cs.CV cs.CL

    Systemic Biases in Sign Language AI Research: A Deaf-Led Call to Reevaluate Research Agendas

    Authors: Aashaka Desai, Maartje De Meulder, Julie A. Hochgesang, Annemarie Kocab, Alex X. Lu

    Abstract: Growing research in sign language recognition, generation, and translation AI has been accompanied by calls for ethical development of such technologies. While these works are crucial to helping individual researchers do better, there is a notable lack of discussion of systemic biases or analysis of rhetoric that shape the research questions and methods in the field, especially as it remains domin… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  20. arXiv:2402.00033  [pdf, other

    cs.CV cs.AI

    LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition

    Authors: Youbing Hu, Yun Cheng, Anqi Lu, Zhiqiang Cao, Dawei Wei, Jie Liu, Zhijun Li

    Abstract: The Vision Transformer (ViT) excels in accuracy when handling high-resolution images, yet it confronts the challenge of significant spatial redundancy, leading to increased computational and memory requirements. To address this, we present the Localization and Focus Vision Transformer (LF-ViT). This model operates by strategically curtailing computational demands without impinging on performance.… ▽ More

    Submitted 7 January, 2024; originally announced February 2024.

  21. arXiv:2401.17697  [pdf, ps, other

    math.AP

    Suppression of Blowup by Slightly Superlinear Degradation in a Parabolic-Elliptic Keller--Segel System with Signal-dependent Motility

    Authors: Aijing Lu, Jie Jiang

    Abstract: In this paper, we consider an initial-Neumann boundary value problem for a parabolic-elliptic Keller-Segel system with signal-dependent motility and a source term. Previous research has rigorously shown that the source-free version of this system exhibits an infinite-time blowup phenomenon when dimension $N \geq 2$. In the current work, when $N \leq 3$, we establish uniform boundedness of global c… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  22. arXiv:2401.02035  [pdf, ps, other

    cs.IT

    Efficient Information Geometry Approach for Massive MIMO-OFDM Channel Estimation

    Authors: Jiyuan Yang, Yan Chen, Mingrui Fan, An-An Lu, Wen Zhong, Xiqi Gao, Xiaohu You, Xiang-Gen Xia, Dirk Slock

    Abstract: We investigate the channel estimation for massive multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) systems. We revisit the information geometry approach (IGA) for massive MIMO-OFDM channel estimation. By using the constant magnitude property of the entries of the measurement matrix, we find that the second-order natural parameters of the distributions on all th… ▽ More

    Submitted 3 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  23. arXiv:2401.01674  [pdf, other

    cs.CV

    Transformer RGBT Tracking with Spatio-Temporal Multimodal Tokens

    Authors: Dengdi Sun, Yajie Pan, Andong Lu, Chenglong Li, Bin Luo

    Abstract: Many RGBT tracking researches primarily focus on modal fusion design, while overlooking the effective handling of target appearance changes. While some approaches have introduced historical frames or fuse and replace initial templates to incorporate temporal information, they have the risk of disrupting the original target appearance and accumulating errors over time. To alleviate these limitation… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  24. arXiv:2312.16246  [pdf, other

    cs.CV

    Nighttime Person Re-Identification via Collaborative Enhancement Network with Multi-domain Learning

    Authors: Andong Lu, Tianrui Zha, Chenglong Li, Jin Tang, Xiaofeng Wang, Bin Luo

    Abstract: Prevalent nighttime ReID methods typically combine relighting networks and ReID networks in a sequential manner, which not only restricts the ReID performance by the quality of relighting images, but also neglects the effective collaborative modeling between image relighting and person ReID tasks. To handle these problems, we propose a novel Collaborative Enhancement Network called CENet, which pe… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  25. arXiv:2312.16244  [pdf, other

    cs.CV

    Modality-missing RGBT Tracking: Invertible Prompt Learning and High-quality Benchmarks

    Authors: Andong Lu, Jiacong Zhao, Chenglong Li, Jin Tang, Bin Luo

    Abstract: Current RGBT tracking research relies on the complete multi-modal input, but modal information might miss due to some factors such as thermal sensor self-calibration and data transmission error, called modality-missing challenge in this work. To address this challenge, we propose a novel invertible prompt learning approach, which integrates the content-preserving prompts into a well-trained tracki… ▽ More

    Submitted 20 March, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

  26. arXiv:2311.17848  [pdf, other

    astro-ph.GA

    WISDOM Project -- XVI. The link between circumnuclear molecular gas reservoirs and active galactic nucleus fuelling

    Authors: Jacob S. Elford, Timothy A. Davis, Ilaria Ruffa, Martin Bureau, Michele Cappellari, Jindra Gensior, Satoru Iguchi, Fu-Heng Liang, Lijie Liu, Anan Lu, Thomas G. Williams

    Abstract: We use high-resolution data from the millimetre-Wave Interferometric Survey of Dark Object Masses (WISDOM) project to investigate the connection between circumnuclear gas reservoirs and nuclear activity in a sample of nearby galaxies. Our sample spans a wide range of nuclear activity types including radio galaxies, Seyfert galaxies, low-luminosity active galactic nuclei (AGN) and inactive galaxies… ▽ More

    Submitted 24 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 15 pages plus 3 in the appendix, 8 figures plus 1 in the appendix, 3 tables plus 4 in the appendix

  27. arXiv:2311.15447  [pdf, other

    astro-ph.GA

    WISDOM project -- XVIII. Molecular gas distributions and kinematics of three megamaser galaxies

    Authors: Fu-Heng Liang, Mark D. Smith, Martin Bureau, Feng Gao, Timothy A. Davis, Michele Cappellari, Jacob S. Elford, Jenny E. Greene, Satoru Iguchi, Federico Lelli, Anan Lu, Ilaria Ruffa, Thomas G. Williams, Hengyue Zhang

    Abstract: The co-evolution of galaxies and supermassive black holes (SMBHs) underpins our understanding of galaxy evolution, but different methods to measure SMBH masses have only infrequently been cross-checked. We attempt to identify targets to cross-check two of the most accurate methods, megamaser and cold molecular gas dynamics. Three promising galaxies are selected from all those with existing megamas… ▽ More

    Submitted 10 December, 2023; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 17 pages, 5 figures, accepted by MNRAS

  28. arXiv:2311.00690  [pdf, other

    cs.HC cs.CV cs.LG

    What User Behaviors Make the Differences During the Process of Visual Analytics?

    Authors: Zekun Wu, Shahin Doroudian, Aidong Lu

    Abstract: The understanding of visual analytics process can benefit visualization researchers from multiple aspects, including improving visual designs and developing advanced interaction functions. However, the log files of user behaviors are still hard to analyze due to the complexity of sensemaking and our lack of knowledge on the related user behaviors. This work presents a study on a comprehensive data… ▽ More

    Submitted 3 December, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: This version corrects the issues of previous versions

  29. arXiv:2310.12108  [pdf, other

    astro-ph.GA astro-ph.HE

    New Black Hole Spin Values for Sagittarius A* Obtained with the Outflow Method

    Authors: Ruth A. Daly, Megan Donahue, Christopher P. O'Dea, Biny Sebastian, Daryl Haggard, Anan Lu

    Abstract: Six archival Chandra observations are matched with eight sets of radio data and studied in the context of the outflow method to measure and study the spin properties of $\rm{Sgr ~A^*}$. Three radio and X-ray data sets obtained simultaneously, or partially simultaneously, are identified as preferred for the purpose of measuring the spin properties of $\rm{Sgr ~A^*}$. Similar results are obtained wi… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted for publication in MNRAS on October 16, 2023

  30. arXiv:2310.11912  [pdf, other

    astro-ph.GA

    The JWST Galactic Center Survey -- A White Paper

    Authors: Rainer Schoedel, Steve Longmore, Jonny Henshaw, Adam Ginsburg, John Bally, Anja Feldmeier, Matt Hosek, Francisco Nogueras Lara, Anna Ciurlo, Mélanie Chevance, J. M. Diederik Kruijssen, Ralf Klessen, Gabriele Ponti, Pau Amaro-Seoane, Konstantina Anastasopoulou, Jay Anderson, Maria Arias, Ashley T. Barnes, Cara Battersby, Giuseppe Bono, Lucía Bravo Ferres, Aaron Bryant, Miguel Cano Gonzáalez, Santi Cassisi, Leonardo Chaves-Velasquez , et al. (85 additional authors not shown)

    Abstract: The inner hundred parsecs of the Milky Way hosts the nearest supermassive black hole, largest reservoir of dense gas, greatest stellar density, hundreds of massive main and post main sequence stars, and the highest volume density of supernovae in the Galaxy. As the nearest environment in which it is possible to simultaneously observe many of the extreme processes shaping the Universe, it is one of… ▽ More

    Submitted 14 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: This White Paper will be updated when required (e.g. new authors joining, editing of content). Most recent update: 24 Oct 2023

  31. arXiv:2310.07822  [pdf, other

    cs.RO

    Body-mounted MR-conditional Robot for Minimally Invasive Liver Intervention

    Authors: Zhefeng Huang, Anthony L. Gunderman, Samuel E. Wilcox, Saikat Sengupta, Jay Shah, Aiming Lu, David Woodrum, Yue Chen

    Abstract: MR-guided microwave ablation (MWA) has proven effective in treating hepatocellular carcinoma (HCC) with small-sized tumors, but the state-of-the-art technique suffers from sub-optimal workflow due to speed and accuracy of needle placement. This paper presents a compact body-mounted MR-conditional robot that can operate in closed-bore MR scanners for accurate needle guidance. The robotic platform c… ▽ More

    Submitted 25 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 10 figures

  32. Mechatronic Generation of Datasets for Acoustics Research

    Authors: Austin Lu, Ethaniel Moore, Arya Nallanthighall, Kanad Sarkar, Manan Mittal, Ryan M. Corey, Paris Smaragdis, Andrew Singer

    Abstract: We address the challenge of making spatial audio datasets by proposing a shared mechanized recording space that can run custom acoustic experiments: a Mechatronic Acoustic Research System (MARS). To accommodate a wide variety of experiments, we implement an extensible architecture for wireless multi-robot coordination which enables synchronized robot motion for dynamic scenes with moving speakers… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 5 pages, 5 figures, IWAENC 2022

  33. arXiv:2308.16486  [pdf, other

    cs.CV

    Illumination Distillation Framework for Nighttime Person Re-Identification and A New Benchmark

    Authors: Andong Lu, Zhang Zhang, Yan Huang, Yifan Zhang, Chenglong Li, Jin Tang, Liang Wang

    Abstract: Nighttime person Re-ID (person re-identification in the nighttime) is a very important and challenging task for visual surveillance but it has not been thoroughly investigated. Under the low illumination condition, the performance of person Re-ID methods usually sharply deteriorates. To address the low illumination challenge in nighttime person Re-ID, this paper proposes an Illumination Distillati… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by TMM

  34. arXiv:2308.05146  [pdf, other

    astro-ph.GA

    WISDOM Project -- XVII. Beam-by-beam Properties of the Molecular Gas in Early-type Galaxies

    Authors: Thomas G. Williams, Martin Bureau, Timothy A. Davis, Michele Cappellari, Woorak Choi, Jacob S. Elford, Satoru Iguchi, Jindra Gensior, Fu-Heng Liang, Anan Lu, Ilaria Ruffa, Hengyue Zhang

    Abstract: We present a study of the molecular gas of seven early-type galaxies with high angular resolution data obtained as part of the mm-Wave Interferometric Survey of Dark Object Masses (WISDOM) project with the Atacama Large Millimeter/submillimeter Array. Using a fixed spatial scale approach, we study the mass surface density ($Σ$) and velocity dispersion ($σ$) of the molecular gas on spatial scales r… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 32 pages (16 of Appendices), 39 Figures (27 in Appendices). Accepted for publication in MNRAS

  35. arXiv:2307.08848  [pdf

    physics.bio-ph q-bio.QM

    Microbiome-derived bile acids contribute to elevated antigenic response and bone erosion in rheumatoid arthritis

    Authors: Xiuli Su, Xiaona Li, Yanqin Bian, Qing Ren, Leiguang Li, Xiaohao Wu, Hemi Luan, Bing He, Xiaojuan He, Hui Feng, Xingye Cheng, Pan-Jun Kim, Leihan Tang, Aiping Lu, Lianbo Xiao, Liang Tian, Zhu Yang, Zongwei Cai

    Abstract: Rheumatoid arthritis (RA) is a chronic, disabling and incurable autoimmune disease. It has been widely recognized that gut microbial dysbiosis is an important contributor to the pathogenesis of RA, although distinct alterations in microbiota have been associated with this disease. Yet, the metabolites that mediate the impacts of the gut microbiome on RA are less well understood. Here, with microbi… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: 38 pages, 6 figures

  36. arXiv:2306.15123  [pdf, other

    cs.CL cs.CE

    Investigating Cross-Domain Behaviors of BERT in Review Understanding

    Authors: Albert Lu, Meng Jiang

    Abstract: Review score prediction requires review text understanding, a critical real-world application of natural language processing. Due to dissimilar text domains in product reviews, a common practice is fine-tuning BERT models upon reviews of differing domains. However, there has not yet been an empirical study of cross-domain behaviors of BERT models in the various tasks of product review understandin… ▽ More

    Submitted 27 June, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 9 pages, 1 figure, 2 tables

  37. arXiv:2306.08962  [pdf

    cond-mat.mtrl-sci

    Enhanced ferromagnetism in artificially stretched lattice in quasi two-dimensional Cr2Ge2Te6

    Authors: Hiroshi Idzuchi, Andres E Llacsahuanga Allcca, Anh Khoa Augustin Lu, Mitsuhiro Saito, Michel Houssa, Ruishen Meng, Kazutoshi Inoue, Xing-Chen Pan, Katsumi Tanigaki, Yuichi Ikuhara, Takeshi Nakanishi, Yong P Chen

    Abstract: In the fundamental understanding of magnetic interactions between atoms in solids, the crystal lattice is one of the key parameters. As the effective tool for controlling the lattice using tensile stress is limited, there are only few demonstrations of the control in magnetic properties with expanding the lattice structure. Here, we observe that the Curie temperature (Tc) of quasi two-dimensional… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  38. arXiv:2305.00666  [pdf, other

    cs.CV cs.AI

    Part Aware Contrastive Learning for Self-Supervised Action Recognition

    Authors: Yilei Hua, Wenhan Wu, Ce Zheng, Aidong Lu, Mengyuan Liu, Chen Chen, Shiqian Wu

    Abstract: In recent years, remarkable results have been achieved in self-supervised action recognition using skeleton sequences with contrastive learning. It has been observed that the semantic distinction of human action features is often represented by local body parts, such as legs or hands, which are advantageous for skeleton-based action recognition. This paper proposes an attention-based contrastive l… ▽ More

    Submitted 11 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: 7 pages, 4 figures, accepted by IJCAI 2023

  39. WISDOM Project -- XV. Giant Molecular Clouds in the Central Region of the Barred Spiral Galaxy NGC 5806

    Authors: Woorak Choi, Lijie Liu, Martin Bureau, Michele Cappellari, Timothy A. Davis, Jindra Gensior, Fu-Heng Liang, Anan Lu, Thomas G. Williams, Aeree Chung

    Abstract: We present high spatial resolution ($\approx24$ pc) Atacama Large Millimeter/sub-millimeter Array $^{12}$CO(2-1) observations of the central region of the nearby barred spiral galaxy NGC 5806. NGC 5806 has a highly structured molecular gas distribution with a clear nucleus, a nuclear ring and offset dust lanes. We identify $170$ spatially- and spectrally-resolved giant molecular clouds (GMCs). The… ▽ More

    Submitted 21 April, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted for publication in MNRAS, 20 pages, 16 figures

  40. WISDOM project -- XIV. SMBH mass in the early-type galaxies NGC0612, NGC1574, and NGC4261 from CO dynamical modelling

    Authors: Ilaria Ruffa, Timothy A. Davis, Michele Cappellari, Martin Bureau, Jacob S. Elford, Satoru Iguchi, Federico Lelli, Fu-Heng Liang, Lijie Liu, Anan Lu, Marc Sarzi, Thomas G. Williams

    Abstract: We present a CO dynamical estimate of the mass of the super-massive black hole (SMBH) in three nearby early-type galaxies: NGC0612, NGC1574 and NGC4261. Our analysis is based on Atacama Large Millimeter/submillimeter Array (ALMA) Cycle 3-6 observations of the $^{12}$CO(2-1) emission line with spatial resolutions of $14-58$ pc ($0.01"-0.26"$). We detect disc-like CO distributions on scales from… ▽ More

    Submitted 6 November, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: Main text: 20 pages, 14 Figures; Appendix: 7 pages, 6 Figures. Accepted for publication in MNRAS on 2023 March 28

  41. arXiv:2304.05934  [pdf, other

    cs.CV cs.CL

    ASL Citizen: A Community-Sourced Dataset for Advancing Isolated Sign Language Recognition

    Authors: Aashaka Desai, Lauren Berger, Fyodor O. Minakov, Vanessa Milan, Chinmay Singh, Kriston Pumphrey, Richard E. Ladner, Hal Daumé III, Alex X. Lu, Naomi Caselli, Danielle Bragg

    Abstract: Sign languages are used as a primary language by approximately 70 million D/deaf people world-wide. However, most communication technologies operate in spoken and written languages, creating inequities in access. To help tackle this problem, we release ASL Citizen, the first crowdsourced Isolated Sign Language Recognition (ISLR) dataset, collected with consent and containing 83,399 videos for 2,73… ▽ More

    Submitted 19 June, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

  42. arXiv:2304.03879  [pdf, other

    cs.IR cs.LG

    GPT4Rec: A Generative Framework for Personalized Recommendation and User Interests Interpretation

    Authors: Jinming Li, Wentao Zhang, Tian Wang, Guanglei Xiong, Alan Lu, Gerard Medioni

    Abstract: Recent advancements in Natural Language Processing (NLP) have led to the development of NLP-based recommender systems that have shown superior performance. However, current models commonly treat items as mere IDs and adopt discriminative modeling, resulting in limitations of (1) fully leveraging the content information of items and the language modeling capabilities of NLP models; (2) interpreting… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

  43. Device Image-IV Mapping using Variational Autoencoder for Inverse Design and Forward Prediction

    Authors: Thomas Lu, Albert Lu, Hiu Yung Wong

    Abstract: This paper demonstrates the learning of the underlying device physics by mapping device structure images to their corresponding Current-Voltage (IV) characteristics using a novel framework based on variational autoencoders (VAE). Since VAE is used, domain expertise is not required and the framework can be quickly deployed on any new device and measurement. This is expected to be useful in the comp… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 5 pages 6 figures

    Journal ref: 2023 International Conference on Simulation of Semiconductor Processes and Devices (SISPAD), Kobe, Japan, 2023, pp. 161-164

  44. Precoder Design for Massive MIMO Downlink with Matrix Manifold Optimization

    Authors: Rui Sun, Chen Wang, An-An Lu, Xiqi Gao, Xiang-Gen Xia

    Abstract: We investigate the weighted sum-rate (WSR) maximization linear precoder design for massive multiple-input multiple-output (MIMO) downlink. We consider a single-cell system with multiple users and propose a unified matrix manifold optimization framework applicable to total power constraint (TPC), per-user power constraint (PUPC) and per-antenna power constraint (PAPC). We prove that the precoders u… ▽ More

    Submitted 10 April, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: 16 pages, 11 figures, journal

    Journal ref: IEEE Transactions on Signal Processing, vol. 72, pp. 1065-1080, 2024

  45. Synchronization transitions in Kuramoto networks with higher-mode interaction

    Authors: Rico Berner, Annie Lu, Igor M. Sokolov

    Abstract: Synchronization is an omnipresent collective phenomenon in nature and technology, whose understanding is in particular for real-world systems still elusive. We study the synchronization transition in a phase oscillator system with two nonvanishing Fourier-modes in the interaction function and hence going beyond the Kuromoto paradigm. We show that the transition scenarios crucially depend on the in… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  46. arXiv:2303.02241  [pdf, other

    cs.CV cs.LG

    Domain adaptation using optimal transport for invariant learning using histopathology datasets

    Authors: Kianoush Falahkheirkhah, Alex Lu, David Alvarez-Melis, Grace Huynh

    Abstract: Histopathology is critical for the diagnosis of many diseases, including cancer. These protocols typically require pathologists to manually evaluate slides under a microscope, which is time-consuming and subjective, leading to interest in machine learning to automate analysis. However, computational techniques are limited by batch effects, where technical factors like differences in preparation pr… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  47. arXiv:2302.09185  [pdf, other

    cs.CL cs.AI cs.LG

    Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints

    Authors: Albert Lu, Hongxin Zhang, Yanzhe Zhang, Xuezhi Wang, Diyi Yang

    Abstract: The limits of open-ended generative models are unclear, yet increasingly important. What causes them to succeed and what causes them to fail? In this paper, we take a prompt-centric approach to analyzing and bounding the abilities of open-ended generative models. We present a generic methodology of analysis with two challenging prompt constraint types: structural and stylistic. These constraint ty… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: 27 pages, 13 figures, 11 tables, to be published in EACL 2023 Findings

  48. arXiv:2302.05989  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Van der Waals device integration beyond the limits of van der Waals forces via adhesive matrix transfer

    Authors: Peter F. Satterthwaite, Weikun Zhu, Patricia Jastrzebska-Perfect, Melbourne Tang, Hongze Gao, Hikari Kitadai, Ang-Yu Lu, Qishuo Tan, Shin-Yi Tang, Yu-Lun Chueh, Chia-Nung Kuo, Chin Shan Lue, Jing Kong, Xi Ling, Farnaz Niroui

    Abstract: Pristine van der Waals (vdW) interfaces between two-dimensional (2D) and other materials are core to emerging optical and electronic devices. Their direct fabrication is, however, challenged as the vdW forces are weak and cannot be tuned to accommodate integration of arbitrary layers without solvents, sacrificial-layers or high-temperatures, steps that can introduce damage. To address these limita… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  49. arXiv:2301.03410  [pdf, other

    cs.CV

    In Defense of Structural Symbolic Representation for Video Event-Relation Prediction

    Authors: Andrew Lu, Xudong Lin, Yulei Niu, Shih-Fu Chang

    Abstract: Understanding event relationships in videos requires a model to understand the underlying structures of events (i.e. the event type, the associated argument roles, and corresponding entities) and factual knowledge for reasoning. Structural symbolic representation (SSR) based methods directly take event types and associated argument roles/entities as inputs to perform reasoning. However, the state-… ▽ More

    Submitted 12 April, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: CVPRW 23, Learning with Limited Labelled Data

  50. arXiv:2212.14156  [pdf, other

    eess.SY

    Decentralized Voltage Control with Peer-to-peer Energy Trading in a Distribution Network

    Authors: Chen Feng, Andrew L. Lu, Yihsu Chen

    Abstract: Utilizing distributed renewable and energy storage resources via peer-to-peer (P2P) energy trading has long been touted as a solution to improve energy system's resilience and sustainability. Consumers and prosumers (those who have energy generation resources), however, do not have expertise to engage in repeated P2P trading, and the zero-marginal costs of renewables present challenges in determin… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.