Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 51–100 of 375 results for author: Cui, L

.
  1. arXiv:2312.00307  [pdf, other

    astro-ph.SR astro-ph.HE

    VLBI detection of the AE Aqr twin, LAMOST J024048.51+195226.9

    Authors: Pengfei Jiang, Lang Cui, Xiang Liu, Bo Zhang, Yongfeng Huang, Hongmin Cao, Tao An, Jun Yang, Fengchun Shu, Guiping Tan, Jianping Yuan

    Abstract: LAMOST J024048.51+195226.9 (J0240+1952) was recently identified as the second AE Aquarii (AE Aqr)-type cataclysmic variable, possessing the fastest known rotating white dwarf. We performed a Very Long Baseline Interferometry (VLBI) observation of J0240+1952 utilizing the European VLBI Network at 1.7\,GHz, to obtain the first view of the radio morphology on mas scale. Our high-resolution VLBI image… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  2. arXiv:2311.16465  [pdf, other

    cs.CV

    TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering

    Authors: Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

    Abstract: The diffusion model has been proven a powerful generative model in recent years, yet remains a challenge in generating visual text. Several methods alleviated this issue by incorporating explicit text position and content as guidance on where and what text to render. However, these methods still suffer from several drawbacks, such as limited flexibility and automation, constrained capability of la… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2311.09802  [pdf, other

    cs.AI cs.CL

    Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs

    Authors: Sen Yang, Xin Li, Leyang Cui, Lidong Bing, Wai Lam

    Abstract: Though prompting LLMs with various reasoning structures produces reasoning proofs along with answers, these proofs are not ensured to be causal and reliable due to the inherent defects of LLMs. Tracking such deficiencies, we present a neuro-symbolic integration method, in which a neural LLM is used to represent the knowledge of the problem while an LLM-free symbolic solver is adopted to do deliber… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  4. arXiv:2311.07624  [pdf

    q-bio.PE stat.AP

    Disordered hyperuniformity signals functioning and resilience of self-organized vegetation patterns

    Authors: Wensi Hu, Quan-Xing Liu, Bo Wang, Nuo Xu, Lijuan Cui, Chi Xu

    Abstract: In harsh environments, organisms may self-organize into spatially patterned systems in various ways. So far, studies of ecosystem spatial self-organization have primarily focused on apparent orders reflected by regular patterns. However, self-organized ecosystems may also have cryptic orders that can be unveiled only through certain quantitative analyses. Here we show that disordered hyperuniformi… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Comments: 34 pages, 6 figures; Supplementary Materials, 19 pages, 10 figures, 2 tables

  5. arXiv:2311.07324  [pdf, other

    cs.LG

    DAGC: Data-Volume-Aware Adaptive Sparsification Gradient Compression for Distributed Machine Learning in Mobile Computing

    Authors: Rongwei Lu, Yutong Jiang, Yinan Mao, Chen Tang, Bin Chen, Laizhong Cui, Zhi Wang

    Abstract: Distributed machine learning (DML) in mobile environments faces significant communication bottlenecks. Gradient compression has emerged as an effective solution to this issue, offering substantial benefits in environments with limited bandwidth and metered data. Yet, they encounter severe performance drop in non-IID environments due to a one-size-fits-all compression approach, which does not accou… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  6. arXiv:2311.04025  [pdf, other

    gr-qc cond-mat.stat-mech

    General relativistic stochastic thermodynamics

    Authors: Tao Wang, Yifan Cai, Long Cui, Liu Zhao

    Abstract: Based on the recent work [1,2], we formulate the first law and the second law of stochastic thermodynamics in the framework of general relativity. These laws are established for a charged Brownian particle moving in a heat reservoir and subjecting to an external electromagnetic field in generic stationary spacetime background, and in order to maintain general covariance, they are presented respect… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 15 pages, 1 figure

  7. arXiv:2311.03924  [pdf, ps, other

    astro-ph.HE gr-qc

    Follow-up on the Supermassive Black Hole Binary Candidate J1048+7143: Successful Prediction of the Next Gamma-ray Flare and Refined Binary Parameters in the Framework of Jet Precession Model

    Authors: Emma Kun, Ilja Jaroschewski, Julia Becker Tjus, Silke Britzen, Sándor Frey, Krisztina Éva Gabányi, Lang Cui, Xin Wang, Yuling Shen

    Abstract: Analyzing single-dish and VLBI radio, as well as \textit{Fermi}-LAT $γ$-ray observations, we explained the three major flares in the $γ$-ray light curve of FSRQ J1048+7143 with the spin--orbit precession of the dominant mass black hole in a supermassive black hole binary system. Here, we report on the detection of a fourth $γ$-ray flare from J1048+7143, appearing in the time interval which was pre… ▽ More

    Submitted 8 February, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 9 pages, 4 figures, 3 tables. Accepted to ApJL

  8. Multi-band Cross-correlated Radio Variability of the Blazar 3C 279

    Authors: Krishna Mohana A, Alok C. Gupta, Alan P. Marscher, Yulia V. Sotnikova, S. G. Jorstad, Paul J. Wiita, Lang Cui, Margo F. Aller, Hugh D. Aller, Yu. A. Kovalev, Y. Y. Kovalev, Xiang Liu, T. V. Mufakharov, A. V. Popkov, M. G. Mingaliev, A. K. Erkenov, N. A. Nizhelsky, P. G. Tsybulev, Wei Zhao, Z. R. Weaver, D. A. Morozova

    Abstract: We present the results of our study of cross-correlations between long-term multi-band observations of the radio variability of the blazar 3C 279. More than a decade (2008-2022) of radio data were collected at seven different frequencies ranging from 2 GHz to 230 GHz. The multi-band radio light curves show variations in flux, with the prominent flare features appearing first at higher-frequency an… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

    Comments: Submitted revised version to MNRAS journal, 11 pages, 6 figures, 4 tables

    Journal ref: MNRAS 527 (2024) 6970

  9. arXiv:2310.20381  [pdf, other

    cs.CV cs.AI

    A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis

    Authors: Yingshu Li, Yunyi Liu, Zhanyu Wang, Xinyu Liang, Lei Wang, Lingqiao Liu, Leyang Cui, Zhaopeng Tu, Longyue Wang, Luping Zhou

    Abstract: This work conducts an evaluation of GPT-4V's multimodal capability for medical image analysis, with a focus on three representative tasks of radiology report generation, medical visual question answering, and medical visual grounding. For the evaluation, a set of prompts is designed for each task to induce the corresponding capability of GPT-4V to produce sufficiently good outputs. Three evaluatio… ▽ More

    Submitted 30 January, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  10. arXiv:2310.19740  [pdf, other

    cs.CL

    Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation

    Authors: Qintong Li, Leyang Cui, Lingpeng Kong, Wei Bi

    Abstract: Humans are widely involved in the evaluation of open-ended natural language generation tasks (NLG) that demand creativity, as automatic metrics often exhibit weak correlations with human judgments. Large language models (LLMs) recently have emerged as a scalable and cost-effective alternative to human evaluations. However, both humans and LLMs have limitations, i.e., inherent subjectivity and unre… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: We release our resources at \url{https://github.com/qtli/CoEval}

  11. arXiv:2310.14274  [pdf, other

    cs.LG

    Robust Visual Imitation Learning with Inverse Dynamics Representations

    Authors: Siyuan Li, Xun Wang, Rongchang Zuo, Kewu Sun, Lingfei Cui, Jishiyu Ding, Peng Liu, Zhe Ma

    Abstract: Imitation learning (IL) has achieved considerable success in solving complex sequential decision-making problems. However, current IL methods mainly assume that the environment for learning policies is the same as the environment for collecting expert datasets. Therefore, these methods may fail to work when there are slight differences between the learning and expert environments, especially for c… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  12. arXiv:2310.13345  [pdf, other

    cs.CR

    An LLM can Fool Itself: A Prompt-Based Adversarial Attack

    Authors: Xilie Xu, Keyi Kong, Ning Liu, Lizhen Cui, Di Wang, Jingfeng Zhang, Mohan Kankanhalli

    Abstract: The wide-ranging applications of large language models (LLMs), especially in safety-critical domains, necessitate the proper evaluation of the LLM's adversarial robustness. This paper proposes an efficient tool to audit the LLM's adversarial robustness via a prompt-based adversarial attack (PromptAttack). PromptAttack converts adversarial textual attacks into an attack prompt that can cause the vi… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  13. arXiv:2310.09015  [pdf, other

    astro-ph.HE astro-ph.GA

    Precessing jet nozzle connecting to a spinning black hole in M87

    Authors: Yuzhu Cui, Kazuhiro Hada, Tomohisa Kawashima, Motoki Kino, Weikang Lin, Yosuke Mizuno, Hyunwook Ro, Mareki Honma, Kunwoo Yi, Jintao Yu, Jongho Park, Wu Jiang, Zhiqiang Shen, Evgeniya Kravchenko, Juan-Carlos Algaba, Xiaopeng Cheng, Ilje Cho, Gabriele Giovannini, Marcello Giroletti, Taehyun Jung, Ru-Sen Lu, Kotaro Niinuma, Junghwan Oh, Ken Ohsuga, Satoko Sawada-Satoh , et al. (54 additional authors not shown)

    Abstract: The nearby radio galaxy M87 offers a unique opportunity to explore the connections between the central supermassive black hole and relativistic jets. Previous studies of the inner region of M87 revealed a wide opening angle for the jet originating near the black hole. The Event Horizon Telescope resolved the central radio source and found an asymmetric ring structure consistent with expectations f… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 41 pages, 7 figures, 7 tables

    Journal ref: 2023, Nature, 621, 711-715

  14. arXiv:2310.07988  [pdf, other

    quant-ph physics.optics

    Recovery of phase constant from two-photon interference pattern by phase retrieval algorithm

    Authors: Yuhang Lei, Wen Zhao, Liang cui, Xiaoyin Li

    Abstract: For a HOM interferometer with two independent incident pulses, the interference pattern can be affected by adding a dispersion medium on one of the incident directions, but there hasn't been a method to reconstruct the phase constant of the medium from the interference pattern. To solve it, we adapted two phase retrieval algorithms and used them to recover the phase difference function between the… ▽ More

    Submitted 14 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 12 pages, 8 figures

  15. arXiv:2310.07821  [pdf, other

    cs.CL

    Non-autoregressive Text Editing with Copy-aware Latent Alignments

    Authors: Yu Zhang, Yue Zhang, Leyang Cui, Guohong Fu

    Abstract: Recent work has witnessed a paradigm shift from Seq2Seq to Seq2Edit in the field of text editing, with the aim of addressing the slow autoregressive inference problem posed by the former. Despite promising results, Seq2Edit approaches still face several challenges such as inflexibility in generation and difficulty in generalizing to other languages. In this work, we propose a novel non-autoregress… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  16. arXiv:2310.07481  [pdf, other

    gr-qc cond-mat.stat-mech

    Iterative solution of relativistic Boltzmann equation in curved spacetime with application to kinetic coefficients

    Authors: Long Cui, Xin Hao, Liu Zhao

    Abstract: Under relaxation time approximation, we obtain an iterative solution to the relativistic Boltzmann equation in generic stationary spacetime. This solution provides a scheme to study non-equilibrium system order by order. As a specific example, we analytically calculated the covariant expressions of the particle flow and the energy momentum tensor up to the first order in relaxation time. Finally a… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 15 pages, 1 figure

  17. arXiv:2310.07299  [pdf, other

    cs.CL cs.AI

    RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

    Authors: Yue Zhang, Leyang Cui, Enbo Zhao, Wei Bi, Shuming Shi

    Abstract: Grammatical Error Correction (GEC) systems play a vital role in assisting people with their daily writing tasks. However, users may sometimes come across a GEC system that initially performs well but fails to correct errors when the inputs are slightly modified. To ensure an ideal user experience, a reliable GEC system should have the ability to provide consistent and accurate suggestions when enc… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (main conference, long paper)

  18. arXiv:2310.07163  [pdf

    astro-ph.IM hep-ph

    The Qitai Radio Telescope

    Authors: Na Wang, Qian Xu, Jun Ma, Zhiyong Liu, Qi Liu, Hailong Zhang, Xin Pei, Maozheng Chen, Richard N. Manchester, Kejia Lee, Xingwu Zheng, Hans J. Kärcher, Wulin Zhao, Hongwei Li, Dongwei Li, Martin Süss, Matthias Reichert, Zhongyi Zhu, Congsi Wang, Mingshuai Li, Rui Li, Ning Li, Guljaina Kazezkhan, Wenming Yan, Gang Wu , et al. (3 additional authors not shown)

    Abstract: This study presents a general outline of the Qitai radio telescope (QTT) project. Qitai, the site of the telescope, is a county of Xinjiang Uygur Autonomous Region of China, located in the east Tianshan Mountains at an elevation of about 1800 m. The QTT is a fully steerable, Gregorian type telescope with a standard parabolic main reflector of 110 m diameter. The QTT has adopted an um-brella suppor… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: 12 pages, 11 figures, accepted for publication in Science China Physics, Mechanics & Astronomy

    Journal ref: Sci China-Phys Mech Astron, 2023, 66: 289512

  19. arXiv:2310.05341  [pdf, other

    cs.CV cs.AI

    A Critical Look at Classic Test-Time Adaptation Methods in Semantic Segmentation

    Authors: Chang'an Yi, Haotian Chen, Yifan Zhang, Yonghui Xu, Lizhen Cui

    Abstract: Test-time adaptation (TTA) aims to adapt a model, initially trained on training data, to potential distribution shifts in the test data. Most existing TTA studies, however, focus on classification tasks, leaving a notable gap in the exploration of TTA for semantic segmentation. This pronounced emphasis on classification might lead numerous newcomers and engineers to mistakenly assume that classic… ▽ More

    Submitted 11 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  20. arXiv:2310.02930  [pdf, ps, other

    math.OC eess.SY

    Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem

    Authors: Leilei Cui, Zhong-Ping Jiang, Eduardo D. Sontag

    Abstract: This paper studies the effect of perturbations on the gradient flow of a general nonlinear programming problem, where the perturbation may arise from inaccurate gradient estimation in the setting of data-driven optimization. Under suitable conditions on the objective function, the perturbed gradient flow is shown to be small-disturbance input-to-state stable (ISS), which implies that, in the prese… ▽ More

    Submitted 16 April, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 20 pages

  21. arXiv:2310.00919  [pdf, other

    eess.IV cs.CV cs.LG

    BAAF: A Benchmark Attention Adaptive Framework for Medical Ultrasound Image Segmentation Tasks

    Authors: Gongping Chen, Lei Zhao, Xiaotao Yin, Liang Cui, Jianxun Zhang, Yu Dai

    Abstract: The AI-based assisted diagnosis programs have been widely investigated on medical ultrasound images. Complex scenario of ultrasound image, in which the coupled interference of internal and external factors is severe, brings a unique challenge for localize the object region automatically and precisely in ultrasound images. In this study, we seek to propose a more general and robust Benchmark Attent… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  22. arXiv:2309.17415  [pdf, other

    cs.CL

    Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts

    Authors: Jiahao Ying, Yixin Cao, Kai Xiong, Yidong He, Long Cui, Yongbin Liu

    Abstract: This study investigates the behaviors of Large Language Models (LLMs) when faced with conflicting prompts versus their internal memory. This will not only help to understand LLMs' decision mechanism but also benefit real-world applications, such as retrieval-augmented generation (RAG). Drawing on cognitive theory, we target the first scenario of decision-making styles where there is no superiority… ▽ More

    Submitted 20 February, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  23. arXiv:2309.12641  [pdf, other

    cs.CV

    Global Context Aggregation Network for Lightweight Saliency Detection of Surface Defects

    Authors: Feng Yan, Xiaoheng Jiang, Yang Lu, Lisha Cui, Shupan Li, Jiale Cao, Mingliang Xu, Dacheng Tao

    Abstract: Surface defect inspection is a very challenging task in which surface defects usually show weak appearances or exist under complex backgrounds. Most high-accuracy defect detection methods require expensive computation and storage overhead, making them less practical in some resource-constrained defect detection applications. Although some lightweight methods have achieved real-time inference speed… ▽ More

    Submitted 22 September, 2023; originally announced September 2023.

  24. arXiv:2309.11419  [pdf, other

    cs.CL cs.CV

    Kosmos-2.5: A Multimodal Literate Model

    Authors: Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

    Abstract: We present Kosmos-2.5, a multimodal literate model for machine reading of text-intensive images. Pre-trained on large-scale text-intensive images, Kosmos-2.5 excels in two distinct yet cooperative transcription tasks: (1) generating spatially-aware text blocks, where each block of text is assigned its spatial coordinates within the image, and (2) producing structured text output that captures styl… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

  25. arXiv:2309.01219  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

    Authors: Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge… ▽ More

    Submitted 24 September, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: work in progress; 32 pages

  26. arXiv:2308.11459  [pdf, other

    quant-ph

    Phase Dependent Hanbury-Brown and Twiss effect

    Authors: Xuan Tang, Yunxiao Zhang, Xueshi Guo, Liang Cui, Xiaoying Li, Z. Y. Ou

    Abstract: Hanbury-Brown and Twiss (HBT) effect is the foundation for stellar intensity interferometry. However, it is a phase insensitive two-photon interference effect. In this paper, we extend the HBT interferometer by mixing two phase-coherent input fields with coherent auxiliary fields before intensity correlation measurement and achieve phase sensitive two-photon interference so as to measure the compl… ▽ More

    Submitted 30 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: 5 pages, 6 figures

  27. arXiv:2308.01578  [pdf, other

    cs.LG cs.AI

    Unsupervised Representation Learning for Time Series: A Review

    Authors: Qianwen Meng, Hangwei Qian, Yong Liu, Yonghui Xu, Zhiqi Shen, Lizhen Cui

    Abstract: Unsupervised representation learning approaches aim to learn discriminative feature representations from unlabeled data, without the requirement of annotating every sample. Enabling unsupervised representation learning is extremely crucial for time series data, due to its unique annotation bottleneck caused by its complex characteristics and lack of visual cues compared with other data modalities.… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: In submission to IEEE

  28. arXiv:2307.12810  [pdf, other

    cs.IR

    HeteFedRec: Federated Recommender Systems with Model Heterogeneity

    Authors: Wei Yuan, Liang Qu, Lizhen Cui, Yongxin Tong, Xiaofang Zhou, Hongzhi Yin

    Abstract: Owing to the nature of privacy protection, federated recommender systems (FedRecs) have garnered increasing interest in the realm of on-device recommender systems. However, most existing FedRecs only allow participating clients to collaboratively train a recommendation model of the same public parameter size. Training a model of the same size for all clients can lead to suboptimal performance sinc… ▽ More

    Submitted 5 December, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

  29. arXiv:2307.10247  [pdf, other

    cs.CL cs.IR cs.LG

    Automated Action Model Acquisition from Narrative Texts

    Authors: Ruiqi Li, Leyang Cui, Songtuan Lin, Patrik Haslum

    Abstract: Action models, which take the form of precondition/effect axioms, facilitate causal and motivational connections between actions for AI agents. Action model acquisition has been identified as a bottleneck in the application of planning technology, especially within narrative planning. Acquiring action models from narrative texts in an automated way is essential, but challenging because of the inhe… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 10 pages, 3 figures

  30. arXiv:2307.08074  [pdf, other

    cs.CL cs.AI

    Disco-Bench: A Discourse-Aware Evaluation Benchmark for Language Modelling

    Authors: Longyue Wang, Zefeng Du, Donghuai Liu, Deng Cai, Dian Yu, Haiyun Jiang, Yan Wang, Leyang Cui, Shuming Shi, Zhaopeng Tu

    Abstract: Modeling discourse -- the linguistic phenomena that go beyond individual sentences, is a fundamental yet challenging aspect of natural language processing (NLP). However, existing evaluation benchmarks primarily focus on the evaluation of inter-sentence properties and overlook critical discourse phenomena that cross sentences. To bridge the gap, we propose Disco-Bench, a benchmark that can evaluat… ▽ More

    Submitted 21 July, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

    Comments: Zhaopeng Tu is the corresponding author

  31. arXiv:2307.03021  [pdf

    eess.SY cs.HC

    Shadow operator: Effective dynamic load change operation training in air separation processes based on industrial nonlinear MPC and Bloom's taxonomy

    Authors: Guanghui Yang, Zhijiang Shao, Rui Wang, Zuhua Xu, Lidan Cui

    Abstract: A novel human-machine interactive training method for dynamic load change operation in air separation processes (ASPs) is proposed. A shadow operator (SO) is developed in this method to train ASP operators through industrial model predictive control (IMPC) and Bloom's taxonomy. First, a nonlinear two-layer IMPC machine algorithm is developed for dynamic load change operation. The IMPC uses a linea… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 16 pages, 18 figures

  32. arXiv:2306.11485  [pdf, other

    cs.CL

    Explicit Syntactic Guidance for Neural Text Generation

    Authors: Yafu Li, Leyang Cui, Jianhao Yan, Yongjing Yin, Wei Bi, Shuming Shi, Yue Zhang

    Abstract: Most existing text generation models follow the sequence-to-sequence paradigm. Generative Grammar suggests that humans generate natural language texts by learning language grammar. We propose a syntax-guided generation schema, which generates the sequence guided by a constituency parse tree in a top-down direction. The decoding process can be decomposed into two parts: (1) predicting the infilling… ▽ More

    Submitted 25 June, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  33. arXiv:2306.10248  [pdf, other

    astro-ph.HE astro-ph.CO

    Multi-wavelength temporal variability of the blazar PKS 1510-089

    Authors: Q. Yuan, Pankaj Kushwaha, Alok C. Gupta, Ashutosh Tripathi, Paul J. Wiita, M. Zhang, X. Liu, Anne Lahteenmaki, Merja Tornikoski, Joni Tammi, Venkatessh Ramakrishnan, L. Cui, X. Wang, M. F. Gu, Cosimo Bambi, A. E. Volvach

    Abstract: We perform correlation and periodicity search analyses on long-term multi-band light curves of the FSRQ 1510-089 observed by the space-based Fermi--Large Area Telescope in gamma-rays, the SMARTS and Steward Observatory telescopes in optical and near-infrared (NIR) and the 13.7 m radio telescope in Metsahovi Radio Observatory between 2008 and 2018. The z-transform discrete correlation function meth… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in ApJ; 20 pages, 9 figures, 4 tables

  34. arXiv:2306.08871  [pdf, other

    cs.SI cs.CY

    Med-MMHL: A Multi-Modal Dataset for Detecting Human- and LLM-Generated Misinformation in the Medical Domain

    Authors: Yanshen Sun, Jianfeng He, Shuo Lei, Limeng Cui, Chang-Tien Lu

    Abstract: The pervasive influence of misinformation has far-reaching and detrimental effects on both individuals and society. The COVID-19 pandemic has witnessed an alarming surge in the dissemination of medical misinformation. However, existing datasets pertaining to misinformation predominantly focus on textual information, neglecting the inclusion of visual elements, and tend to center solely on COVID-19… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  35. arXiv:2306.08604  [pdf, other

    cs.LG cs.AI cs.CR

    A Unified Framework of Graph Information Bottleneck for Robustness and Membership Privacy

    Authors: Enyan Dai, Limeng Cui, Zhengyang Wang, Xianfeng Tang, Yinghan Wang, Monica Cheng, Bing Yin, Suhang Wang

    Abstract: Graph Neural Networks (GNNs) have achieved great success in modeling graph-structured data. However, recent works show that GNNs are vulnerable to adversarial attacks which can fool the GNN model to make desired predictions of the attacker. In addition, training data of GNNs can be leaked under membership inference attacks. This largely hinders the adoption of GNNs in high-stake domains such as e-… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

  36. arXiv:2305.15676  [pdf, other

    cs.CL

    Enhancing Grammatical Error Correction Systems with Explanations

    Authors: Yuejiao Fei, Leyang Cui, Sen Yang, Wai Lam, Zhenzhong Lan, Shuming Shi

    Abstract: Grammatical error correction systems improve written communication by detecting and correcting language mistakes. To help language learners better understand why the GEC system makes a certain correction, the causes of errors (evidence words) and the corresponding error types are two key factors. To enhance GEC systems with explanations, we introduce EXPECT, a large dataset annotated with evidence… ▽ More

    Submitted 10 June, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 9 pages, 7 figures, accepted to the main conference of ACL 2023

  37. arXiv:2305.13614  [pdf, other

    cs.CL

    LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation

    Authors: Siyuan Chen, Mengyue Wu, Kenny Q. Zhu, Kunyao Lan, Zhiling Zhang, Lyuchun Cui

    Abstract: Empowering chatbots in the field of mental health is receiving increasing amount of attention, while there still lacks exploration in developing and evaluating chatbots in psychiatric outpatient scenarios. In this work, we focus on exploring the potential of ChatGPT in powering chatbots for psychiatrist and patient simulation. We collaborate with psychiatrists to identify objectives and iterativel… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  38. arXiv:2305.13242  [pdf, other

    cs.CL

    MAGE: Machine-generated Text Detection in the Wild

    Authors: Yafu Li, Qintong Li, Leyang Cui, Wei Bi, Zhilin Wang, Longyue Wang, Linyi Yang, Shuming Shi, Yue Zhang

    Abstract: Large language models (LLMs) have achieved human-level text generation, emphasizing the need for effective AI-generated text detection to mitigate risks like the spread of fake news and plagiarism. Existing research has been constrained by evaluating detection methods on specific domains or particular language models. In practical scenarios, however, the detector faces texts from various domains o… ▽ More

    Submitted 21 May, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2024

  39. arXiv:2305.13225  [pdf, other

    cs.CL

    Multi-Task Instruction Tuning of LLaMa for Specific Scenarios: A Preliminary Study on Writing Assistance

    Authors: Yue Zhang, Leyang Cui, Deng Cai, Xinting Huang, Tao Fang, Wei Bi

    Abstract: Proprietary Large Language Models (LLMs), such as ChatGPT, have garnered significant attention due to their exceptional capabilities in handling a diverse range of tasks. Recent studies demonstrate that open-sourced smaller foundational models, such as 7B-size LLaMA, can also display remarkable proficiency in tackling diverse tasks when fine-tuned using instruction-driven data. In this work, we in… ▽ More

    Submitted 9 October, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

  40. arXiv:2305.12147  [pdf, other

    cs.CL cs.AI

    LogiCoT: Logical Chain-of-Thought Instruction-Tuning

    Authors: Hanmeng Liu, Zhiyang Teng, Leyang Cui, Chaoli Zhang, Qiji Zhou, Yue Zhang

    Abstract: Generative Pre-trained Transformer 4 (GPT-4) demonstrates impressive chain-of-thought reasoning ability. Recent work on self-instruction tuning, such as Alpaca, has focused on enhancing the general proficiency of models. These instructions enable the model to achieve performance comparable to GPT-3.5 on general tasks like open-domain text generation and paraphrasing. However, they fall short of he… ▽ More

    Submitted 28 October, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

  41. arXiv:2305.10855  [pdf, other

    cs.CV

    TextDiffuser: Diffusion Models as Text Painters

    Authors: Jingye Chen, Yupan Huang, Tengchao Lv, Lei Cui, Qifeng Chen, Furu Wei

    Abstract: Diffusion models have gained increasing attention for their impressive generation abilities but currently struggle with rendering accurate and coherent text. To address this issue, we introduce TextDiffuser, focusing on generating images with visually appealing text that is coherent with backgrounds. TextDiffuser consists of two stages: first, a Transformer model generates the layout of keywords e… ▽ More

    Submitted 30 October, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  42. arXiv:2305.10013  [pdf, other

    cs.CL cs.AI

    When Gradient Descent Meets Derivative-Free Optimization: A Match Made in Black-Box Scenario

    Authors: Chengcheng Han, Liqing Cui, Renyu Zhu, Jianing Wang, Nuo Chen, Qiushi Sun, Xiang Li, Ming Gao

    Abstract: Large pre-trained language models (PLMs) have garnered significant attention for their versatility and potential for solving a wide spectrum of natural language processing (NLP) tasks. However, the cost of running these PLMs may be prohibitive. Furthermore, PLMs may not be open-sourced due to commercial considerations and potential risks of misuse, such as GPT-3. The parameters and gradients of PL… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  43. Quantum Reliability

    Authors: L. X. Cui, Y-M. Du, C. P. Sun

    Abstract: Quantum technology has led to increasingly sophisticated and complex quantum devices. Assessing their reliability (quantum reliability) is an important issue. Although reliability theory for classical devices has been well developed in industry and technology, a suitable metric on quantum reliability and its loss has not been systematically investigated. Since reliability-loss depends on the proce… ▽ More

    Submitted 22 October, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 5 pages, 3 figures. Comments welcome!

    Journal ref: Phys.Rev.Lett.131,160203 (2023)

  44. arXiv:2305.01221  [pdf, ps, other

    math.AP

    Affine Toda system of $\mathbf{A}$ and $\mathbf{C}^t$ type: compactness and affine Weyl group

    Authors: Leilei Cui, Zhaohu Nie, Wen Yang

    Abstract: The local mass is a fundamental quantized information that characterizes the blow-up solution to the Toda system and has a profound relationship with its underlying algebraic structure. In \cite{Lin-Yang-Zhong-2020}, it was observed that the associated Weyl group can be employed to represent this information for the $\mathbf{A}_n$, $\mathbf{B}_n$, $\mathbf{C}_n$ and $\mathbf{G}_2$ type Toda system… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 40 pages

  45. arXiv:2305.00783  [pdf, other

    cs.IR

    Explicit Knowledge Graph Reasoning for Conversational Recommendation

    Authors: Xuhui Ren, Tong Chen, Quoc Viet Hung Nguyen, Lizhen Cui, Zi Huang, Hongzhi Yin

    Abstract: Traditional recommender systems estimate user preference on items purely based on historical interaction records, thus failing to capture fine-grained yet dynamic user interests and letting users receive recommendation only passively. Recent conversational recommender systems (CRSs) tackle those limitations by enabling recommender systems to interact with the user to obtain her/his current prefere… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  46. arXiv:2304.08492  [pdf, other

    cs.CV

    STRAP: Structured Object Affordance Segmentation with Point Supervision

    Authors: Leiyao Cui, Xiaoxue Chen, Hao Zhao, Guyue Zhou, Yixin Zhu

    Abstract: With significant annotation savings, point supervision has been proven effective for numerous 2D and 3D scene understanding problems. This success is primarily attributed to the structured output space; i.e., samples with high spatial affinity tend to share the same labels. Sharing this spirit, we study affordance segmentation with point supervision, wherein the setting inherits an unexplored dual… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

    Comments: Code: https://github.com/LeiyaoCui/STRAP

  47. arXiv:2304.07194  [pdf, ps, other

    math.AP

    Normalized solutions for a Kirchhoff type equations with potential in $\mathbb{R}^3$

    Authors: Leilei Cui, Qihan He, Zongyan Lv, Xuexiu Zhong

    Abstract: In the present paper, we study the existence of normalized solutions to the following Kirchhoff type equations \begin{equation*} -\left(a+b\int_{\R^3}|\nabla u|^2\right)Δu+V(x)u+λu=g(u)~\hbox{in}~\R^3 \end{equation*} satisfying the normalized constraint $\displaystyle\int_{\R^3}u^2=c$, where $a,b,c>0$ are prescribed constants, and the nonlinearities $g(u)$ are very general and of mass super-critic… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 21 pages

  48. arXiv:2304.03501  [pdf, other

    cs.IR

    Continuous Input Embedding Size Search For Recommender Systems

    Authors: Yunke Qu, Tong Chen, Xiangyu Zhao, Lizhen Cui, Kai Zheng, Hongzhi Yin

    Abstract: Latent factor models are the most popular backbones for today's recommender systems owing to their prominent performance. Latent factor models represent users and items as real-valued embedding vectors for pairwise similarity computation, and all embeddings are traditionally restricted to a uniform size that is relatively large (e.g., 256-dimensional). With the exponentially expanding user base an… ▽ More

    Submitted 7 March, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: To appear in SIGIR'23

  49. arXiv:2304.01612  [pdf, other

    cs.CL

    EDeR: A Dataset for Exploring Dependency Relations Between Events

    Authors: Ruiqi Li, Patrik Haslum, Leyang Cui

    Abstract: Relation extraction is a central task in natural language processing (NLP) and information retrieval (IR) research. We argue that an important type of relation not explored in NLP or IR research to date is that of an event being an argument - required or optional - of another event. We introduce the human-annotated Event Dependency Relation dataset (EDeR) which provides this dependency relation. T… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  50. Application of an ontology for model cards to generate computable artifacts for linking machine learning information from biomedical research

    Authors: Muhammad Amith, Licong Cui, Kirk Roberts, Cui Tao

    Abstract: Model card reports provide a transparent description of machine learning models which includes information about their evaluation, limitations, intended use, etc. Federal health agencies have expressed an interest in model cards report for research studies using machine-learning based AI. Previously, we have developed an ontology model for model card reports to structure and formalize these report… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Journal ref: Companion Proceedings of the ACM Web Conference 2023