Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 166 results for author: Lai, Z

.
  1. Chromosomal Structural Abnormality Diagnosis by Homologous Similarity

    Authors: Juren Li, Fanzhe Fu, Ran Wei, Yifei Sun, Zeyu Lai, Ning Song, Xin Chen, Yang Yang

    Abstract: Pathogenic chromosome abnormalities are very common among the general population. While numerical chromosome abnormalities can be quickly and precisely detected, structural chromosome abnormalities are far more complex and typically require considerable efforts by human experts for identification. This paper focuses on investigating the modeling of chromosome features and the identification of chr… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.07337  [pdf, other

    cs.NI eess.SP

    In-Orbit Processing or Not? Sunlight-Aware Task Scheduling for Energy-Efficient Space Edge Computing Networks

    Authors: Weisen Liu, Zeqi Lai, Qian Wu, Hewu Li, Qi Zhang, Zonglun Li, Yuanjie Li, Jun Liu

    Abstract: With the rapid evolution of space-borne capabilities, space edge computing (SEC) is becoming a new computation paradigm for future integrated space and terrestrial networks. Satellite edges adopt advanced on-board hardware, which not only enables new opportunities to perform complex intelligent tasks in orbit, but also involves new challenges due to the additional energy consumption in power-const… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted by IEEE INFOCOM 2024

  3. arXiv:2407.06623  [pdf, other

    cs.NI

    SKYCASTLE: Taming LEO Mobility to Facilitate Seamless and Low-latency Satellite Internet Services

    Authors: Jihao Li, Hewu Li, Zeqi Lai, Qian Wu, Weisen Liu, Xiaomo Wang, Yuanjie Li, Jun Liu, Qi Zhang

    Abstract: Emerging integrated space and terrestrial networks (ISTN) built upon low earth orbit (LEO) satellite constellations aim at providing planet-wide Internet services, not only for residential users, but also for mobile users (e.g., in airplane and cruise scenarios). Efficiently managing global mobility and keeping connections active for mobile users is critical for ISTN operators. However, our quanti… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 10 pages, 10 figures, accepted by IEEE INFOCOM 2024

    Journal ref: IEEE International Conference on Computer Communications 2024

  4. arXiv:2407.03799  [pdf, other

    cs.NI

    Your Mega-Constellations Can Be Slim:A Cost-Effective Approach for Constructing Survivable and Performant LEO Satellite Networks

    Authors: Zeqi Lai, Yibo Wang, Hewu Li, Qian Wu, Qi Zhang, Yunan Hou, Jun Liu, Yuanjie Li

    Abstract: In this paper, we investigate an important research problem facing the upcoming satellite Internet: from a network perspective, how many satellites exactly do we need to construct a survivable and performant LSN? To answer this question, we first formulate the survivable and performant LSN design (SPLD) problem, which aims to find the minimum number of needed satellites to construct an LSN that ca… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  5. arXiv:2407.00332  [pdf

    q-bio.OT cs.LG

    Machine Learning Models for Dengue Forecasting in Singapore

    Authors: Zi Iun Lai, Wai Kit Fung, Enquan Chew

    Abstract: With emerging prevalence beyond traditionally endemic regions, the global burden of dengue disease is forecasted to be one of the fastest growing. With limited direct treatment or vaccination currently available, prevention through vector control is widely believed to be the most effective form of managing outbreaks. This study examines traditional state space models (moving average, autoregressiv… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 12 pages, 6 figures

  6. arXiv:2406.19377  [pdf, ps, other

    math.OC math.NA

    Grassmannian optimization is NP-hard

    Authors: Zehua Lai, Lek-Heng Lim, Ke Ye

    Abstract: We show that unconstrained quadratic optimization over a Grassmannian $\operatorname{Gr}(k,n)$ is NP-hard. Our results cover all scenarios: (i) when $k$ and $n$ are both allowed to grow; (ii) when $k$ is arbitrary but fixed; (iii) when $k$ is fixed at its lowest possible value $1$. We then deduce the NP-hardness of unconstrained cubic optimization over the Stiefel manifold $\operatorname{V}(k,n)$… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages

    MSC Class: 03D15; 90C26; 90C23; 65K10; 68Q25; 90C60

  7. arXiv:2406.17601  [pdf, other

    cs.CV

    Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text

    Authors: Xinyang Li, Zhangyu Lai, Linning Xu, Yansong Qu, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong Ji

    Abstract: Recent advancements in 3D generation have leveraged synthetic datasets with ground truth 3D assets and predefined cameras. However, the potential of adopting real-world datasets, which can produce significantly more realistic 3D scenes, remains largely unexplored. In this work, we delve into the key challenge of the complex and scene-specific camera trajectories found in real-world captures. We in… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Code: https://github.com/imlixinyang/director3d

  8. arXiv:2406.17289  [pdf, other

    cs.IR cs.AI

    Hyperbolic Knowledge Transfer in Cross-Domain Recommendation System

    Authors: Xin Yang, Heng Chang, Zhijian Lai, Jinze Yang, Xingrun Li, Yu Lu, Shuaiqiang Wang, Dawei Yin, Erxue Min

    Abstract: Cross-Domain Recommendation (CDR) seeks to utilize knowledge from different domains to alleviate the problem of data sparsity in the target recommendation domain, and it has been gaining more attention in recent years. Although there have been notable advancements in this area, most current methods represent users and items in Euclidean space, which is not ideal for handling long-tail distributed… ▽ More

    Submitted 4 July, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  9. arXiv:2406.11821  [pdf, ps, other

    math.DG math.NA math.OC

    Simple matrix expressions for the curvatures of Grassmannian

    Authors: Zehua Lai, Lek-Heng Lim, Ke Ye

    Abstract: We show that modeling a Grassmannian as symmetric orthogonal matrices $\operatorname{Gr}(k,\mathbb{R}^n) \cong\{Q \in \mathbb{R}^{n \times n} : Q^{\scriptscriptstyle\mathsf{T}} Q = I, \; Q^{\scriptscriptstyle\mathsf{T}} = Q,\; \operatorname{tr}(Q)=2k - n\}$ yields exceedingly simple matrix formulas for various curvatures and curvature-related quantities, both intrinsic and extrinsic. These include… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages

    MSC Class: 15A75; 14M15

  10. arXiv:2406.08688  [pdf, other

    cs.SE cs.AI

    On Security Weaknesses and Vulnerabilities in Deep Learning Systems

    Authors: Zhongzheng Lai, Huaming Chen, Ruoxi Sun, Yu Zhang, Minhui Xue, Dong Yuan

    Abstract: The security guarantee of AI-enabled software systems (particularly using deep learning techniques as a functional core) is pivotal against the adversarial attacks exploiting software vulnerabilities. However, little attention has been paid to a systematic investigation of vulnerabilities in such systems. A common situation learned from the open source software community is that deep learning engi… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  11. arXiv:2406.08394  [pdf, other

    cs.CV

    VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks

    Authors: Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai

    Abstract: We present VisionLLM v2, an end-to-end generalist multimodal large model (MLLM) that unifies visual perception, understanding, and generation within a single framework. Unlike traditional MLLMs limited to text output, VisionLLM v2 significantly broadens its application scope. It excels not only in conventional visual question answering (VQA) but also in open-ended, cross-domain vision tasks such a… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 43 pages

  12. AI Cat Narrator: Designing an AI Tool for Exploring the Shared World and Social Connection with a Cat

    Authors: Zhenchi Lai, Janet Yi-Ching Huang, Rung-Huei Liang

    Abstract: As technology continues to advance, the interaction between humans and cats is becoming more diverse. Our research introduces a new tool called the AI Cat Narrator, which offers a unique perspective on the shared lives of humans and cats. We combined the method of ethnography with fictional storytelling, using a defamiliarization strategy to merge real-world data seen through the eyes of cats with… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages

  13. arXiv:2406.06068  [pdf, other

    cs.NI

    Instability of Self-Driving Satellite Mega-Constellation: From Theory to Practical Impacts on Network Lifetime and Capacity

    Authors: Yimei Chen, Yuanjie Li, Hewu Li, Lixin Liu, Li Ouyang, Jiabo Yang, Junyi Li, Jianping Wu, Qian Wu, Jun Liu, Zeqi Lai

    Abstract: Low Earth Orbit (LEO) satellite mega-constellations aim to enable high-speed Internet for numerous users anywhere on Earth. To safeguard their network infrastructure in congested outer space, they perform automatic orbital maneuvers to avoid collisions with external debris and satellites. However, our control-theoretic analysis and empirical validation using Starlink's space situational awareness… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  14. arXiv:2406.04822  [pdf, other

    cs.LG

    M2NO: Multiresolution Operator Learning with Multiwavelet-based Algebraic Multigrid Method

    Authors: Zhihao Li, Zhilu Lai, Xiaobo Wang, Wei Wang

    Abstract: Solving partial differential equations (PDEs) effectively necessitates a multi-scale approach, particularly critical in high-dimensional scenarios characterized by increasing grid points or resolution. Traditional methods often fail to capture the detailed features necessary for accurate modeling, presenting a significant challenge in scientific computing. In response, we introduce the Multiwavele… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  15. arXiv:2406.04649  [pdf, other

    cs.CV

    SMART: Scene-motion-aware human action recognition framework for mental disorder group

    Authors: Zengyuan Lai, Jiarui Yang, Songpengcheng Xia, Qi Wu, Zhen Sun, Wenxian Yu, Ling Pei

    Abstract: Patients with mental disorders often exhibit risky abnormal actions, such as climbing walls or hitting windows, necessitating intelligent video behavior monitoring for smart healthcare with the rising Internet of Things (IoT) technology. However, the development of vision-based Human Action Recognition (HAR) for these actions is hindered by the lack of specialized algorithms and datasets. In this… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  16. arXiv:2405.18376  [pdf, other

    cs.LG cs.CV

    Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning

    Authors: Dongjie Chen, Kartik Patwari, Zhengfeng Lai, Sen-ching Cheung, Chen-Nee Chuah

    Abstract: Source-Free Domain Adaptation (SFDA) aims to adapt a pre-trained source model to a target domain using only unlabeled target data. Current SFDA methods face challenges in effectively leveraging pre-trained knowledge and exploiting target domain data. Multimodal Large Language Models (MLLMs) offer remarkable capabilities in understanding visual and textual information, but their applicability to SF… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  17. arXiv:2405.13923  [pdf, other

    cs.CL

    Why Not Transform Chat Large Language Models to Non-English?

    Authors: Xiang Geng, Ming Zhu, Jiahuan Li, Zhejian Lai, Wei Zou, Shuaijie She, Jiaxin Guo, Xiaofeng Zhao, Yinglu Li, Yuang Li, Chang Su, Yanqing Zhao, Xinglin Lyu, Min Zhang, Jiajun Chen, Hao Yang, Shujian Huang

    Abstract: The scarcity of non-English data limits the development of non-English large language models (LLMs). Transforming English-centric LLMs to non-English has been identified as an effective and resource-efficient method. Previous works start from base LLMs and perform knowledge distillation (KD) with data generated by stronger LLMs, e.g. GPT-4. Compared to base LLMs, chat LLMs are further optimized fo… ▽ More

    Submitted 31 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  18. arXiv:2405.09874  [pdf, other

    cs.CV

    Dual3D: Efficient and Consistent Text-to-3D Generation with Dual-mode Multi-view Latent Diffusion

    Authors: Xinyang Li, Zhangyu Lai, Linning Xu, Jianfei Guo, Liujuan Cao, Shengchuan Zhang, Bo Dai, Rongrong Ji

    Abstract: We present Dual3D, a novel text-to-3D generation framework that generates high-quality 3D assets from texts in only $1$ minute.The key component is a dual-mode multi-view latent diffusion model. Given the noisy multi-view latents, the 2D mode can efficiently denoise them with a single latent denoising network, while the 3D mode can generate a tri-plane neural surface for consistent rendering-based… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Project Page: https://dual3d.github.io

  19. arXiv:2405.08047  [pdf, other

    math.OC cs.LG q-fin.PM

    Autonomous Sparse Mean-CVaR Portfolio Optimization

    Authors: Yizun Lin, Yangyu Zhang, Zhao-Rong Lai, Cheng Li

    Abstract: The $\ell_0$-constrained mean-CVaR model poses a significant challenge due to its NP-hard nature, typically tackled through combinatorial methods characterized by high computational demands. From a markedly different perspective, we propose an innovative autonomous sparse mean-CVaR portfolio model, capable of approximating the original $\ell_0$-constrained mean-CVaR model with arbitrary accuracy.… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  20. arXiv:2405.06965  [pdf, other

    cs.LG

    A De-singularity Subgradient Approach for the Extended Weber Location Problem

    Authors: Zhao-Rong Lai, Xiaotian Wu, Liangda Fang, Ziliang Chen

    Abstract: The extended Weber location problem is a classical optimization problem that has inspired some new works in several machine learning scenarios recently. However, most existing algorithms may get stuck due to the singularity at the data points when the power of the cost function $1\leqslant q<2$, such as the widely-used iterative Weiszfeld approach. In this paper, we establish a de-singularity subg… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024

  21. arXiv:2405.04100  [pdf, other

    cs.CV cs.LG

    ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios

    Authors: Dingrui Wang, Zheyuan Lai, Yuda Li, Yi Wu, Yuexin Ma, Johannes Betz, Ruigang Yang, Wei Li

    Abstract: Emergent-scene safety is the key milestone for fully autonomous driving, and reliable on-time prediction is essential to maintain safety in emergency scenarios. However, these emergency scenarios are long-tailed and hard to collect, which restricts the system from getting reliable predictions. In this paper, we build a new dataset, which aims at the long-term prediction with the inconspicuous stat… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: Accepted by ICRA 2024 as Oral Presentation

  22. arXiv:2405.01389  [pdf, other

    cs.LG

    Invariant Risk Minimization Is A Total Variation Model

    Authors: Zhao-Rong Lai, Weiwen Wang

    Abstract: Invariant risk minimization (IRM) is an arising approach to generalize invariant features to different environments in machine learning. While most related works focus on new IRM settings or new application scenarios, the mathematical essence of IRM remains to be properly explained. We verify that IRM is essentially a total variation based on $L^2$ norm (TV-$\ell_2$) of the learning risk with resp… ▽ More

    Submitted 17 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: ICML 2024

  23. arXiv:2404.19495  [pdf

    stat.AP econ.EM stat.ME stat.OT

    Percentage Coefficient (bp) -- Effect Size Analysis (Theory Paper 1)

    Authors: Xinshu Zhao, Dianshi Moses Li, Ze Zack Lai, Piper Liping Liu, Song Harris Ao, Fei You

    Abstract: Percentage coefficient (bp) has emerged in recent publications as an additional and alternative estimator of effect size for regression analysis. This paper retraces the theory behind the estimator. It's posited that an estimator must first serve the fundamental function of enabling researchers and readers to comprehend an estimand, the target of estimation. It may then serve the instrumental func… ▽ More

    Submitted 6 May, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  24. arXiv:2404.11576  [pdf, other

    cs.CV

    State-space Decomposition Model for Video Prediction Considering Long-term Motion Trend

    Authors: Fei Cui, Jiaojiao Fang, Xiaojiang Wu, Zelong Lai, Mengke Yang, Menghan Jia, Guizhong Liu

    Abstract: Stochastic video prediction enables the consideration of uncertainty in future motion, thereby providing a better reflection of the dynamic nature of the environment. Stochastic video prediction methods based on image auto-regressive recurrent models need to feed their predictions back into the latent space. Conversely, the state-space models, which decouple frame synthesis and temporal prediction… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  25. arXiv:2404.06663  [pdf, other

    cs.CV

    Multi-modal Document Presentation Attack Detection With Forensics Trace Disentanglement

    Authors: Changsheng Chen, Yongyi Deng, Liangwei Lin, Zitong Yu, Zhimao Lai

    Abstract: Document Presentation Attack Detection (DPAD) is an important measure in protecting the authenticity of a document image. However, recent DPAD methods demand additional resources, such as manual effort in collecting additional data or knowing the parameters of acquisition devices. This work proposes a DPAD method based on multi-modal disentangled traces (MMDT) without the above drawbacks. We first… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted to ICME 2024

  26. arXiv:2404.05253  [pdf, other

    cs.CV

    CodeEnhance: A Codebook-Driven Approach for Low-Light Image Enhancement

    Authors: Xu Wu, XianXu Hou, Zhihui Lai, Jie Zhou, Ya-nan Zhang, Witold Pedrycz, Linlin Shen

    Abstract: Low-light image enhancement (LLIE) aims to improve low-illumination images. However, existing methods face two challenges: (1) uncertainty in restoration from diverse brightness degradations; (2) loss of texture and color information caused by noise suppression and light enhancement. In this paper, we propose a novel enhancement approach, CodeEnhance, by leveraging quantized priors and image refin… ▽ More

    Submitted 30 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: 10 pages, 13 figures

  27. arXiv:2404.03611  [pdf, other

    cs.CV cs.AI

    InsectMamba: Insect Pest Classification with State Space Model

    Authors: Qianning Wang, Chenglin Wang, Zhixin Lai, Yucheng Zhou

    Abstract: The classification of insect pests is a critical task in agricultural technology, vital for ensuring food security and environmental sustainability. However, the complexity of pest identification, due to factors like high camouflage and species diversity, poses significant obstacles. Existing methods struggle with the fine-grained feature extraction needed to distinguish between closely related pe… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 13 pages, 5 figures

  28. arXiv:2403.19839  [pdf, other

    cs.LG cs.AI cs.CL

    The New Agronomists: Language Models are Experts in Crop Management

    Authors: Jing Wu, Zhixin Lai, Suiyao Chen, Ran Tao, Pan Zhao, Naira Hovakimyan

    Abstract: Crop management plays a crucial role in determining crop yield, economic profitability, and environmental sustainability. Despite the availability of management guidelines, optimizing these practices remains a complex and multifaceted challenge. In response, previous studies have explored using reinforcement learning with crop simulators, typically employing simple neural-network-based reinforceme… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  29. arXiv:2403.18417  [pdf, other

    cs.CV

    ECNet: Effective Controllable Text-to-Image Diffusion Models

    Authors: Sicheng Li, Keqiang Sun, Zhixin Lai, Xiaoshi Wu, Feng Qiu, Haoran Xie, Kazunori Miyata, Hongsheng Li

    Abstract: The conditional text-to-image diffusion models have garnered significant attention in recent years. However, the precision of these models is often compromised mainly for two reasons, ambiguous condition input and inadequate condition guidance over single denoising loss. To address the challenges, we introduce two innovative solutions. Firstly, we propose a Spatial Guidance Injector (SGI) which en… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  30. arXiv:2403.17343  [pdf, other

    cs.CV cs.CL cs.LG

    Residual-based Language Models are Free Boosters for Biomedical Imaging

    Authors: Zhixin Lai, Jing Wu, Suiyao Chen, Yucheng Zhou, Naira Hovakimyan

    Abstract: In this study, we uncover the unexpected efficacy of residual-based large language models (LLMs) as part of encoders for biomedical imaging tasks, a domain traditionally devoid of language or textual data. The approach diverges from established methodologies by utilizing a frozen transformer block, extracted from pre-trained LLMs, as an innovative encoder layer for the direct processing of visual… ▽ More

    Submitted 28 March, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  31. arXiv:2403.13335  [pdf, other

    cs.LG cs.AI

    Adaptive Ensembles of Fine-Tuned Transformers for LLM-Generated Text Detection

    Authors: Zhixin Lai, Xuesheng Zhang, Suiyao Chen

    Abstract: Large language models (LLMs) have reached human-like proficiency in generating diverse textual content, underscoring the necessity for effective fake text detection to avoid potential risks such as fake news in social media. Previous research has mostly tested single models on in-distribution datasets, limiting our understanding of how these models perform on different types of data for LLM-genera… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  32. arXiv:2403.07359  [pdf, other

    cs.CV

    FSC: Few-point Shape Completion

    Authors: Xianzu Wu, Xianfeng Wu, Tianyu Luan, Yajing Bai, Zhongyuan Lai, Junsong Yuan

    Abstract: While previous studies have demonstrated successful 3D object shape completion with a sufficient number of points, they often fail in scenarios when a few points, e.g. tens of points, are observed. Surprisingly, via entropy analysis, we find that even a few points, e.g. 64 points, could retain substantial information to help recover the 3D shape of the object. To address the challenge of shape com… ▽ More

    Submitted 18 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  33. arXiv:2402.14041  [pdf

    cs.LG cs.AI cs.DB

    E2USD: Efficient-yet-effective Unsupervised State Detection for Multivariate Time Series

    Authors: Zhichen Lai, Huan Li, Dalin Zhang, Yan Zhao, Weizhu Qian, Christian S. Jensen

    Abstract: Cyber-physical system sensors emit multivariate time series (MTS) that monitor physical system processes. Such time series generally capture unknown numbers of states, each with a different duration, that correspond to specific conditions, e.g., "walking" or "running" in human-activity monitoring. Unsupervised identification of such states facilitates storage and processing in subsequent data anal… ▽ More

    Submitted 27 May, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted by The Web Conference 2024 (WWW 2024)

  34. arXiv:2402.12791  [pdf, other

    physics.optics

    Dual-polarization huge photonic spin Hall shift and deep-subwavelength sensing based on topological singularities in one-dimensional photonic crystals

    Authors: Yufu Liu, Xianjun Wang, Yunlin Li, Haoran Zhang, Langlang Xiong, Xingchao Qi, Zhen Lai, Xuezhi Wang, Xunya Jiang

    Abstract: Although several efforts have been taken to enhance the photonic spin Hall shift in deep-subwavelength region, according to effective medium theory, the fundamental confliction between near-zero reflection coefficient and near-zero incident angle still hinders the further application. Here, we reveal a fundamental breakdown of effective medium theory due to the existing of topological singularity… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  35. arXiv:2402.06010  [pdf, other

    cs.LG stat.ML

    NPSVC++: Nonparallel Classifiers Encounter Representation Learning

    Authors: Junhong Zhang, Zhihui Lai, Jie Zhou, Guangfei Liang

    Abstract: This paper focuses on a specific family of classifiers called nonparallel support vector classifiers (NPSVCs). Different from typical classifiers, the training of an NPSVC involves the minimization of multiple objectives, resulting in the potential concerns of feature suboptimality and class dependency. Consequently, no effective learning scheme has been established to improve NPSVCs' performance… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  36. arXiv:2402.03264  [pdf, other

    cs.LG

    MobilityGPT: Enhanced Human Mobility Modeling with a GPT model

    Authors: Ammar Haydari, Dongjie Chen, Zhengfeng Lai, Michael Zhang, Chen-Nee Chuah

    Abstract: Generative models have shown promising results in capturing human mobility characteristics and generating synthetic trajectories. However, it remains challenging to ensure that the generated geospatial mobility data is semantically realistic, including consistent location sequences, and reflects real-world characteristics, such as constraining on geospatial limits. We reformat human mobility model… ▽ More

    Submitted 23 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  37. arXiv:2401.12217  [pdf, other

    cs.CV cs.LG

    Exploring Simple Open-Vocabulary Semantic Segmentation

    Authors: Zihang Lai

    Abstract: Open-vocabulary semantic segmentation models aim to accurately assign a semantic label to each pixel in an image from a set of arbitrary open-vocabulary texts. In order to learn such pixel-level alignment, current approaches typically rely on a combination of (i) image-level VL model (e.g. CLIP), (ii) ground truth masks, and (iii) custom grouping encoders. In this paper, we introduce S-Seg, a nove… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Code is available at: https://github.com/zlai0/S-Seg

  38. arXiv:2401.09695  [pdf

    cs.HC cs.AI

    Should ChatGPT Write Your Breakup Text? Exploring the Role of AI in Relationship Dissolution

    Authors: Yue Fu, Yixin Chen, Zelia Gomes Da Costa Lai, Alexis Hiniker

    Abstract: Relationships are essential to our happiness and wellbeing. The dissolution of a relationship, the final stage of relationship's lifecycle and one of the most stressful events in an individual's life, can have profound and long-lasting impacts on people. With the breakup process increasingly facilitated by computer-mediated communication (CMC), and the likely future influence of AI-mediated commun… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

  39. arXiv:2401.02625  [pdf, other

    physics.ins-det quant-ph

    Compact InGaAs/InP single-photon detector module with ultra-narrowband interference circuits

    Authors: Yan Zhengyu, Shi Tingting, Fan Yuanbin, Zhou Lai, Yuan Zhiliang

    Abstract: Gated InGaAs/InP avalanche photodiodes are the most practical device for detection of telecom single photons arriving at regular intervals.Here, we report the development of a compact single-photon detector (SPD) module measured just 8.8cm * 6cm * 2cm in size and fully integrated with driving signal generation, faint avalanche readout, and discrimination circuits as well as temperature regulation… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: Adv Devices Instrum. 2023;4:0029

  40. arXiv:2401.02047  [pdf

    cs.SI physics.soc-ph

    Covid19 Vaccine Acceptance and Deprivation in US Counties

    Authors: Zi Iun Lai, Jun Yang Ang

    Abstract: This report explores the central question of how socioeconomic status affects Covid19 vaccination rates in the United States, using existing open-source data. In general, a negative correlation exists between Area Deprivation Index (ADI) of a county and first dose, primary series and booster vaccination rates. Higher area deprivation correlated with polled vaccine hesitancy and lower search intere… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: 16 pages, 9 figures

  41. arXiv:2312.09758  [pdf, other

    cs.LG cs.AI stat.ME

    Diagnosing and Rectifying Fake OOD Invariance: A Restructured Causal Approach

    Authors: Ziliang Chen, Yongsen Zheng, Zhao-Rong Lai, Quanlong Guan, Liang Lin

    Abstract: Invariant representation learning (IRL) encourages the prediction from invariant causal features to labels de-confounded from the environments, advancing the technical roadmap of out-of-distribution (OOD) generalization. Despite spotlights around, recent theoretical results verified that some causal features recovered by IRLs merely pretend domain-invariantly in the training environments but fail… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: AAAI-2024

  42. arXiv:2312.00793  [pdf, other

    cs.AI cs.LO

    Variants of Tagged Sentential Decision Diagrams

    Authors: Deyuan Zhong, Mingwei Zhang, Quanlong Guan, Liangda Fang, Zhaorong Lai, Yong Lai

    Abstract: A recently proposed canonical form of Boolean functions, namely tagged sentential decision diagrams (TSDDs), exploits both the standard and zero-suppressed trimming rules. The standard ones minimize the size of sentential decision diagrams (SDDs) while the zero-suppressed trimming rules have the same objective as the standard ones but for zero-suppressed sentential decision diagrams (ZSDDs). The o… ▽ More

    Submitted 16 November, 2023; originally announced December 2023.

  43. arXiv:2310.20425  [pdf, other

    cs.LG

    Discussing the Spectrum of Physics-Enhanced Machine Learning; a Survey on Structural Mechanics Applications

    Authors: Marcus Haywood-Alexander, Wei Liu, Kiran Bacsa, Zhilu Lai, Eleni Chatzi

    Abstract: The intersection of physics and machine learning has given rise to the physics-enhanced machine learning (PEML) paradigm, aiming to improve the capabilities and reduce the individual shortcomings of data- or physics-only methods. In this paper, the spectrum of physics-enhanced machine learning methods, expressed across the defining axes of physics and data, is discussed by engaging in a comprehens… ▽ More

    Submitted 22 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  44. arXiv:2310.18714  [pdf, ps, other

    cs.AI

    An Investigation of Darwiche and Pearl's Postulates for Iterated Belief Update

    Authors: Quanlong Guan, Tong Zhu, Liangda Fang, Junming Qiu, Zhao-Rong Lai, Weiqi Luo

    Abstract: Belief revision and update, two significant types of belief change, both focus on how an agent modify her beliefs in presence of new information. The most striking difference between them is that the former studies the change of beliefs in a static world while the latter concentrates on a dynamically-changing world. The famous AGM and KM postulates were proposed to capture rational belief revision… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  45. arXiv:2310.17796  [pdf, other

    cs.CV cs.MM

    ControlLLM: Augment Language Models with Tools by Searching on Graphs

    Authors: Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Ziheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang

    Abstract: We present ControlLLM, a novel framework that enables large language models (LLMs) to utilize multi-modal tools for solving complex real-world tasks. Despite the remarkable performance of LLMs, they still struggle with tool invocation due to ambiguous user prompts, inaccurate tool selection and parameterization, and inefficient tool scheduling. To overcome these challenges, our framework comprises… ▽ More

    Submitted 18 December, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 24 pages, 9 figures, 12 tables

  46. 2SFGL: A Simple And Robust Protocol For Graph-Based Fraud Detection

    Authors: Zhirui Pan, Guangzhong Wang, Zhaoning Li, Lifeng Chen, Yang Bian, Zhongyuan Lai

    Abstract: Financial crime detection using graph learning improves financial safety and efficiency. However, criminals may commit financial crimes across different institutions to avoid detection, which increases the difficulty of detection for financial institutions which use local data for graph learning. As most financial institutions are subject to strict regulations in regards to data privacy protection… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: IEEE

  47. arXiv:2310.07699  [pdf, other

    cs.CV cs.AI cs.LG

    VeCLIP: Improving CLIP Training via Visual-enriched Captions

    Authors: Zhengfeng Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Haoping Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao

    Abstract: Large-scale web-crawled datasets are fundamental for the success of pre-training vision-language models, such as CLIP. However, the inherent noise and potential irrelevance of web-crawled AltTexts pose challenges in achieving precise image-text alignment. Existing methods utilizing large language models (LLMs) for caption rewriting have shown promise on small, curated datasets like CC3M and CC12M.… ▽ More

    Submitted 13 March, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: CV/ML

  48. arXiv:2310.07653  [pdf, other

    cs.AI

    Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

    Authors: Zeqiang Lai, Xizhou Zhu, Jifeng Dai, Yu Qiao, Wenhai Wang

    Abstract: The revolution of artificial intelligence content generation has been rapidly accelerated with the booming text-to-image (T2I) diffusion models. Within just two years of development, it was unprecedentedly of high-quality, diversity, and creativity that the state-of-the-art models could generate. However, a prevalent limitation persists in the effective communication with these popular T2I models,… ▽ More

    Submitted 11 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: Technical report. Project page at https://minidalle3.github.io/

  49. arXiv:2309.13230  [pdf, other

    cs.CL

    Unify word-level and span-level tasks: NJUNLP's Participation for the WMT2023 Quality Estimation Shared Task

    Authors: Xiang Geng, Zhejian Lai, Yu Zhang, Shimin Tao, Hao Yang, Jiajun Chen, Shujian Huang

    Abstract: We introduce the submissions of the NJUNLP team to the WMT 2023 Quality Estimation (QE) shared task. Our team submitted predictions for the English-German language pair on all two sub-tasks: (i) sentence- and word-level quality prediction; and (ii) fine-grained error span detection. This year, we further explore pseudo data methods for QE based on NJUQE framework (https://github.com/NJUNLP/njuqe).… ▽ More

    Submitted 11 December, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: WMT2023 System Paper

    Journal ref: https://aclanthology.org/2023.wmt-1.71

  50. arXiv:2308.07934  [pdf, other

    cs.CR cs.AI cs.CV cs.LG

    One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training

    Authors: Jianshuo Dong, Han Qiu, Yiming Li, Tianwei Zhang, Yuanjie Li, Zeqi Lai, Chao Zhang, Shu-Tao Xia

    Abstract: Deep neural networks (DNNs) are widely deployed on real-world devices. Concerns regarding their security have gained great attention from researchers. Recently, a new weight modification attack called bit flip attack (BFA) was proposed, which exploits memory fault inject techniques such as row hammer to attack quantized models in the deployment stage. With only a few bit flips, the target model ca… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: This work is accepted by the ICCV 2023. 14 pages