Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 201–250 of 1,683 results for author: Cheng, Y

.
  1. arXiv:2402.08930  [pdf

    physics.optics

    Subwavelength Photorefractive Grating in a Thin-Film Lithium Niobate Microcavity

    Authors: Jiankun Hou, Jiefu Zhu, Ruixin Ma, Boyi Xue, Yicheng Zhu, Jintian Lin, Xiaoshun Jiang, Xianfeng Chen, Ya Cheng, Li Ge, Yuanlin Zheng, Wenjie Wan

    Abstract: Subwavelength gratings play a fundamental and pivotal role in numerous science and applications for wave manipulation, exhibiting distinctive features such as filtering, phase manipulation, and anti-reflection. However, conventional fabrication methods for ultrasmall periodic structures are constrained by the fundamental optical diffraction limit, making it challenging to produce subwavelength gra… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  2. arXiv:2402.07792  [pdf, other

    cs.LG cs.DC

    Empowering Federated Learning for Massive Models with NVIDIA FLARE

    Authors: Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

    Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  3. Enhanced Frequency Conversion in Parity-Time Symmetry Line

    Authors: Jiankun Hou, Jiefu Zhu, Ruixin Ma, Boyi Xue, Yicheng Zhu, Jintian Lin, Xiaoshun Jiang, Yuanlin Zheng, Xianfeng Chen, Ya Cheng, Li Ge, Wenjie Wan

    Abstract: Non-Hermitian degeneracies reveal intriguing and non-trivial behaviors in open physical systems. Examples like Parity-Time (PT) symmetry breaking, topological encircling chirality, and enhanced sensing near an exceptional point (EP) are often associated with the abrupt nature of the phase transition around these degeneracies. Here we experimentally observe a cavity-enhanced second-harmonic frequen… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  4. arXiv:2402.05956  [pdf, other

    cs.LG

    Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting

    Authors: Peng Chen, Yingying Zhang, Yunyao Cheng, Yang Shu, Yihang Wang, Qingsong Wen, Bin Yang, Chenjuan Guo

    Abstract: Transformers for time series forecasting mainly model time series from limited or fixed scales, making it challenging to capture different characteristics spanning various scales. We propose Pathformer, a multi-scale Transformer with adaptive pathways. It integrates both temporal resolution and temporal distance for multi-scale modeling. Multi-scale division divides the time series into different… ▽ More

    Submitted 6 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by the 12th International Conference on Learning Representations (ICLR 2024)

  5. arXiv:2402.05383  [pdf, other

    nucl-ex hep-ex

    First measurement of the yield of $^8$He isotopes produced in liquid scintillator by cosmic-ray muons at Daya Bay

    Authors: Daya Bay Collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: Daya Bay presents the first measurement of cosmogenic $^8$He isotope production in liquid scintillator, using an innovative method for identifying cascade decays of $^8$He and its child isotope, $^8$Li. We also measure the production yield of $^9$Li isotopes using well-established methodology. The results, in units of 10$^{-8}μ^{-1}$g$^{-1}$cm$^{2}$, are 0.307$\pm$0.042, 0.341$\pm$0.040, and 0.546… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  6. arXiv:2402.05347  [pdf, ps, other

    math.NA

    Robust Implicit Adaptive Low Rank Time-Stepping Methods for Matrix Differential Equations

    Authors: Daniel Appelö, Yingda Cheng

    Abstract: In this work, we develop implicit rank-adaptive schemes for time-dependent matrix differential equations. The dynamic low rank approximation (DLRA) is a well-known technique to capture the dynamic low rank structure based on Dirac-Frenkel time-dependent variational principle. In recent years, it has attracted a lot of attention due to its wide applicability. Our schemes are inspired by the three-s… ▽ More

    Submitted 17 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    MSC Class: 65

  7. arXiv:2402.04195  [pdf, other

    cs.CV

    Instance by Instance: An Iterative Framework for Multi-instance 3D Registration

    Authors: Xinyue Cao, Xiyu Zhang, Yuxin Cheng, Zhaoshuai Qi, Yanning Zhang, Jiaqi Yang

    Abstract: Multi-instance registration is a challenging problem in computer vision and robotics, where multiple instances of an object need to be registered in a standard coordinate system. In this work, we propose the first iterative framework called instance-by-instance (IBI) for multi-instance 3D registration (MI-3DReg). It successively registers all instances in a given scenario, starting from the easies… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 14 pages, 12 figures, 10 tables

  8. arXiv:2402.02700  [pdf, ps, other

    cs.LG stat.ML

    Sample Complexity Characterization for Linear Contextual MDPs

    Authors: Junze Deng, Yuan Cheng, Shaofeng Zou, Yingbin Liang

    Abstract: Contextual Markov decision processes (CMDPs) describe a class of reinforcement learning problems in which the transition kernels and reward functions can change over time with different MDPs indexed by a context variable. While CMDPs serve as an important framework to model many real-world applications with time-varying environments, they are largely unexplored from theoretical perspective. In thi… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: accepted to AIstats2024

  9. arXiv:2402.02334  [pdf, other

    cs.LG cs.AI

    Arithmetic Feature Interaction Is Necessary for Deep Tabular Learning

    Authors: Yi Cheng, Renjun Hu, Haochao Ying, Xing Shi, Jian Wu, Wei Lin

    Abstract: Until recently, the question of the effective inductive bias of deep models on tabular data has remained unanswered. This paper investigates the hypothesis that arithmetic feature interaction is necessary for deep tabular learning. To test this point, we create a synthetic tabular dataset with a mild feature interaction assumption and examine a modified transformer architecture enabling arithmetic… ▽ More

    Submitted 19 March, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: 11 pages, 8 figures, to be published to AAAI2024

    ACM Class: I.2.4

  10. arXiv:2402.01220  [pdf, other

    cs.CV cs.CR

    Delving into Decision-based Black-box Attacks on Semantic Segmentation

    Authors: Zhaoyu Chen, Zhengyang Shan, Jingwen Chang, Kaixun Jiang, Dingkang Yang, Yiting Cheng, Wenqiang Zhang

    Abstract: Semantic segmentation is a fundamental visual task that finds extensive deployment in applications with security-sensitive considerations. Nonetheless, recent work illustrates the adversarial vulnerability of semantic segmentation models to white-box attacks. However, its adversarial robustness against black-box attacks has not been fully explored. In this paper, we present the first exploration o… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  11. arXiv:2402.00036  [pdf, other

    cs.CV cs.LG

    Kronecker Product Feature Fusion for Convolutional Neural Network in Remote Sensing Scene Classification

    Authors: Yinzhu Cheng

    Abstract: Remote Sensing Scene Classification is a challenging and valuable research topic, in which Convolutional Neural Network (CNN) has played a crucial role. CNN can extract hierarchical convolutional features from remote sensing imagery, and Feature Fusion of different layers can enhance CNN's performance. Two successful Feature Fusion methods, Add and Concat, are employed in certain state-of-the-art… ▽ More

    Submitted 8 January, 2024; originally announced February 2024.

  12. arXiv:2402.00033  [pdf, other

    cs.CV cs.AI

    LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition

    Authors: Youbing Hu, Yun Cheng, Anqi Lu, Zhiqiang Cao, Dawei Wei, Jie Liu, Zhijun Li

    Abstract: The Vision Transformer (ViT) excels in accuracy when handling high-resolution images, yet it confronts the challenge of significant spatial redundancy, leading to increased computational and memory requirements. To address this, we present the Localization and Focus Vision Transformer (LF-ViT). This model operates by strategically curtailing computational demands without impinging on performance.… ▽ More

    Submitted 7 January, 2024; originally announced February 2024.

  13. arXiv:2401.17992  [pdf, other

    cs.CV cs.LG

    Multilinear Operator Networks

    Authors: Yixin Cheng, Grigorios G. Chrysos, Markos Georgopoulos, Volkan Cevher

    Abstract: Despite the remarkable capabilities of deep neural networks in image recognition, the dependence on activation functions remains a largely unexplored area and has yet to be eliminated. On the other hand, Polynomial Networks is a class of models that does not require activation functions, but have yet to perform on par with modern architectures. In this work, we aim close this gap and propose MONet… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: International Conference on Learning Representations Poster(2024)

  14. SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks

    Authors: Xingning Dong, Qingpei Guo, Tian Gan, Qing Wang, Jianlong Wu, Xiangyuan Ren, Yuan Cheng, Wei Chu

    Abstract: We present a framework for learning cross-modal video representations by directly pre-training on raw data to facilitate various downstream video-text tasks. Our main contributions lie in the pre-training framework and proxy tasks. First, based on the shortcomings of two mainstream pixel-level pre-training architectures (limited applications or less efficient), we propose Shared Network Pre-traini… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted by TCSVT (IEEE Transactions on Circuits and Systems for Video Technology)

  15. arXiv:2401.17475  [pdf, other

    math.CO

    A Dirac-type theorem for arbitrary Hamiltonian $H$-linked digraphs

    Authors: Zhilan Wang, Jin Yan, Yangyang Cheng

    Abstract: Given any digraph $D$, let $\mathcal{P}(D)$ be the family of all directed paths in $D$, and let $H$ be a digraph with the arc set $A(H)=\{a_1, \ldots, a_k\}$. The digraph $D$ is called arbitrary Hamiltonian $H$-linked if for any injective mapping $f: V(H)\rightarrow V(D)$ and any integer set $\mathcal{N}=\{n_1, \ldots, n_k\}$ with $n_i\geq4$ for each $i\in\{1, \ldots, k\}$, there exists a mapping… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    MSC Class: 05C20; 05C70; 05C07

  16. arXiv:2401.16402  [pdf, other

    cs.CV cs.AI

    A Survey on Visual Anomaly Detection: Challenge, Approach, and Prospect

    Authors: Yunkang Cao, Xiaohao Xu, Jiangning Zhang, Yuqi Cheng, Xiaonan Huang, Guansong Pang, Weiming Shen

    Abstract: Visual Anomaly Detection (VAD) endeavors to pinpoint deviations from the concept of normality in visual data, widely applied across diverse domains, e.g., industrial defect inspection, and medical lesion detection. This survey comprehensively examines recent advancements in VAD by identifying three primary challenges: 1) scarcity of training data, 2) diversity of visual modalities, and 3) complexi… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Work in progress. Yunkang Cao, Xiaohao Xu, and Jiangning Zhang contribute equally to this work

  17. arXiv:2401.15287  [pdf, other

    cs.CV cs.DM math.NA

    Applications of Tao General Difference in Discrete Domain

    Authors: Linmi Tao, Ruiyang Liu, Donglai Tao, Wu Xia, Feilong Ma, Yu Cheng, Jingmao Cui

    Abstract: Numerical difference computation is one of the cores and indispensable in the modern digital era. Tao general difference (TGD) is a novel theory and approach to difference computation for discrete sequences and arrays in multidimensional space. Built on the solid theoretical foundation of the general difference in a finite interval, the TGD operators demonstrate exceptional signal processing capab… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: This paper is the application part of the paper "Tao General Differential and Difference: Theory and Application". The theory part of the paper is renamed as "A Theory of General Difference in Continuous and Discrete Domain", which is Arxived in arXiv:2305.08098v2

  18. Eloquent: A More Robust Transmission Scheme for LLM Token Streaming

    Authors: Hanchen Li, Yuhan Liu, Yihua Cheng, Siddhant Ray, Kuntai Du, Junchen Jiang

    Abstract: To render each generated token in real-time for users, the Large Language Model (LLM) server generates tokens one by one and streams each token (or group of a few tokens) through the network to the user right after generation, which we refer to as LLM token streaming. However, under unstable network conditions, the LLM token streaming experience could suffer greatly from stalls since one packet lo… ▽ More

    Submitted 16 June, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: In SIGCOMM Workshop on Networks for AI Computing (NAIC '24)

  19. arXiv:2401.12920  [pdf, other

    cs.AI

    Truck Parking Usage Prediction with Decomposed Graph Neural Networks

    Authors: Rei Tamaru, Yang Cheng, Steven Parker, Ernie Perry, Bin Ran, Soyoung Ahn

    Abstract: Truck parking on freight corridors faces the major challenge of insufficient parking spaces. This is exacerbated by the Hour-of-Service (HOS) regulations, which often result in unauthorized parking practices, causing safety concerns. It has been shown that providing accurate parking usage prediction can be a cost-effective solution to reduce unsafe parking practices. In light of this, existing stu… ▽ More

    Submitted 12 August, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  20. arXiv:2401.12531  [pdf, ps, other

    math.LO

    Some reflections on the relationship between logical incompleteness and concrete incompleteness

    Authors: Yong Cheng

    Abstract: In this paper, we aim to conceptually examine the relationship between logical incompleteness and concrete incompleteness which both study the incompleteness phenomenon. We argue for two main theses. Firstly, the current research on concrete incompleteness reals both similarities and differences between logical incompleteness and concrete incompleteness. Similarities between them are not universal… ▽ More

    Submitted 1 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: 26 pages

    MSC Class: 03A05; 00A30; 03-02

  21. Exploring the Gas-Phase Metallicity Gradients of Star-forming Galaxies at Cosmic Noon

    Authors: Yingjie Cheng, Mauro Giavalisco, Raymond C. Simons, Zhiyuan Ji, Darren Stroupe, Nikko J. Cleri

    Abstract: We explore the relationships between the [O/H] gas-phase metallicity radial gradients and multiple galaxy properties for 238 star-forming galaxies at 0.6<z<2.6 selected from the CANDELS Ly$α$ Emission at Reionization (CLEAR) survey with stellar mass 8.5 < log $M_{*}/M_{\odot}$ < 10.5. The gradients cover the range from -0.11 to 0.22 dex kpc$^{-1}$, with the median value close to zero. We reconstru… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 22 pages, 21 figures, accepted for publication in APJ

    Journal ref: The Astrophysical Journal, 2024, Volume 964, Issue 1, id.94, 17 pp

  22. arXiv:2401.11944  [pdf, other

    cs.CL cs.AI cs.CV

    CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark

    Authors: Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie Wang, Ruibin Yuan, Yizhi Li, Zekun Wang, Yudong Liu, Yu-Hsuan Tsai, Fengji Zhang, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu

    Abstract: As the capabilities of large multimodal models (LMMs) continue to advance, evaluating the performance of LMMs emerges as an increasing need. Additionally, there is an even larger gap in evaluating the advanced knowledge and reasoning abilities of LMMs in non-English contexts such as Chinese. We introduce CMMMU, a new Chinese Massive Multi-discipline Multimodal Understanding benchmark designed to e… ▽ More

    Submitted 18 March, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

  23. arXiv:2401.11749  [pdf, ps, other

    math.LO

    On Rosser theories

    Authors: Yong Cheng

    Abstract: Rosser theories play an important role in the study of the incompleteness phenomenon and mete-mathematics of arithmetic. In this paper, we first define notions of $n$-Rosser theories, exact $n$-Rosser theories, effectively $n$-Rosser theories and effectively exact $n$-Rosser theories (see Definition 1.6). Our definitions are not restricted to arithmetic languages. Then we systematically examine pr… ▽ More

    Submitted 29 July, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

    Comments: 25 pages

    MSC Class: 03F40; 03F30; 03F25

  24. arXiv:2401.09769  [pdf, other

    cs.SI cs.AI cs.LG

    Learning from Graphs with Heterophily: Progress and Future

    Authors: Chenghua Gong, Yao Cheng, Xiang Li, Caihua Shan, Siqiang Luo

    Abstract: Graphs are structured data that models complex relations between real-world entities. Heterophilous graphs, where linked nodes are prone to be with different labels or dissimilar features, have recently attracted significant attention and found many applications. Meanwhile, increasing efforts have been made to advance learning from heterophilous graphs. Although there exist surveys on the relevant… ▽ More

    Submitted 24 July, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  25. arXiv:2401.09745  [pdf, other

    nucl-th nucl-ex

    Impact of Limited Statistics on the Measured Hyper-Order Cumulants of Net-Proton Distributions in Heavy-Ion Collisions

    Authors: Lizhu Chen, Ye-Yin Zhao, Yunshan Cheng, Gang Wang, Zhiming Li, Yuanfang Wu

    Abstract: Hyper-order cumulants $C_5/C_1$ and $C_6/C_2$ of net-baryon distributions are anticipated to offer crucial insights into the phase transition from quark-gluon plasma to hadronic matter in heavy-ion collisions. However, the accuracy of $C_5$ and $C_6$ is highly contingent on the fine shape of the distribution's tail, the detectable range of which could be essentially truncated by low statistics. In… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 6 pages, 7 figures

  26. arXiv:2401.08964  [pdf, other

    cs.HC

    Evidence-centered Assessment for Writing with Generative AI

    Authors: Yixin Cheng, Kayley Lyons, Guanliang Chen, Dragan Gasevic, Zachari Swiecki

    Abstract: We propose a learning analytics-based methodology for assessing the collaborative writing of humans and generative artificial intelligence. Framed by the evidence-centered design, we used elements of knowledge-telling, knowledge transformation, and cognitive presence to identify assessment claims; we used data collected from the CoAuthor writing tool as potential evidence for these claims; and we… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  27. arXiv:2401.08834  [pdf, other

    cond-mat.mtrl-sci

    Structure and lattice excitations of the copper substituted lead oxyapatite Pb$_{9.06(7)}$Cu$_{0.94(6)}$(PO$_{3.92(4)}$)$_{6}$O$_{0.96(3)}$

    Authors: Qiang Zhang, Yingdong Guan, Yongqiang Cheng, Lujin Min, Jong K. Keum, Zhiqiang Mao, Matthew B. Stone

    Abstract: The copper substituted lead oxyapatite, Pb$_{10-x}$Cu$_{x}$(PO$_{3.92(4)}$)$_{6}$O$_{0.96(3)}$ (x=0.94(6)) was studied using neutron and x-ray diffraction and neutron spectroscopy techniques. The crystal structure of the main phase of our sample, which has come to be colloquially known as LK-99, is verified to possess a hexagonal structure with space group $P 6_{3}/m$, alongside the presence of im… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 11 pages, 8 figures. Physical Review Materials, In press

  28. arXiv:2401.07708  [pdf, other

    cond-mat.quant-gas cond-mat.str-el hep-lat hep-th quant-ph

    Emergent Gauge Theory in Rydberg Atom Arrays

    Authors: Yanting Cheng, Hui Zhai

    Abstract: Rydberg atom arrays have emerged as a novel platform exhibiting rich quantum many-body physics and offering promise for universal quantum computation. The Rydberg blockade effect plays an essential role in establishing many-body correlations in this system. In this review, we will highlight that the lattice gauge theory is an efficient description of the Rydberg blockade effect and overview recent… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 12 pages, 5 figures

  29. arXiv:2401.07081  [pdf, other

    cs.NI

    6Rover: Leveraging Reinforcement Learning-based Address Pattern Mining Approach for Discovering Active Targets in IPv6 Unseeded Space

    Authors: Zhichao Zhang, Zhaoxin Zhang, Yanan Cheng, Ning Li

    Abstract: The discovery of active IPv6 addresses represents a pivotal challenge in IPv6 network survey, as it is a prerequisite for downstream tasks such as network topology measurements and security analysis. With the rapid spread of IPv6 networks in recent years, many researchers have focused on improving the hit rate, efficiency, and coverage of IPv6 scanning methods, resulting in considerable advancemen… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  30. arXiv:2401.06541  [pdf, other

    cs.CL cs.AI

    Medical Dialogue Generation via Intuitive-then-Analytical Differential Diagnosis

    Authors: Kaishuai Xu, Wenjun Hou, Yi Cheng, Jian Wang, Wenjie Li

    Abstract: Medical dialogue systems have attracted growing research attention as they have the potential to provide rapid diagnoses, treatment plans, and health consultations. In medical dialogues, a proper diagnosis is crucial as it establishes the foundation for future consultations. Clinicians typically employ both intuitive and analytic reasoning to formulate a differential diagnosis. This reasoning proc… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Work in progress

  31. arXiv:2401.05970  [pdf

    physics.optics physics.app-ph

    On-chip wavelength division multiplexing by angled multimode interferometer fabricated on erbium-doped thin film lithium niobate on insulator

    Authors: Jinli Han, Rui Bao, Rongbo Wu, Zhaoxiang Liu, Zhe Wang, Chao Sun, Zhihao Zhang, Mengqi Li, Zhiwei Fang, Min Wang, Haisu Zhang, Ya Cheng

    Abstract: Photonic integrated circuits based on erbium doped thin film lithium niobate on insulator has attracted broad interests with insofar various waveguide amplifiers and microlasers demonstrated. Wideband operation facilitated by the broadband absorption and emission of erbium ions necessitates the functional integration of wavelength filter and multiplexer on the same chip. Here a low-loss wavelength… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures

  32. arXiv:2401.05654  [pdf, other

    cs.AI cs.CL cs.LG

    Towards Conversational Diagnostic AI

    Authors: Tao Tu, Anil Palepu, Mike Schaekermann, Khaled Saab, Jan Freyberg, Ryutaro Tanno, Amy Wang, Brenna Li, Mohamed Amin, Nenad Tomasev, Shekoofeh Azizi, Karan Singhal, Yong Cheng, Le Hou, Albert Webson, Kavita Kulkarni, S Sara Mahdavi, Christopher Semturs, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias, Alan Karthikesalingam, Vivek Natarajan

    Abstract: At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introdu… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 46 pages, 5 figures in main text, 19 figures in appendix

  33. arXiv:2401.05507  [pdf, other

    cs.CL cs.AI

    InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks

    Authors: Xueyu Hu, Ziyu Zhao, Shuang Wei, Ziwei Chai, Qianli Ma, Guoyin Wang, Xuwu Wang, Jing Su, Jingjing Xu, Ming Zhu, Yao Cheng, Jianbo Yuan, Jiwei Li, Kun Kuang, Yang Yang, Hongxia Yang, Fei Wu

    Abstract: In this paper, we introduce InfiAgent-DABench, the first benchmark specifically designed to evaluate LLM-based agents on data analysis tasks. These tasks require agents to end-to-end solving complex tasks by interacting with an execution environment. This benchmark contains DAEval, a dataset consisting of 257 data analysis questions derived from 52 CSV files, and an agent framework which incorpora… ▽ More

    Submitted 11 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: 27 pages, 7 figures, work in progress

  34. arXiv:2401.04354  [pdf, other

    cs.CV

    Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition

    Authors: Xuzheng Yu, Chen Jiang, Wei Zhang, Tian Gan, Linlin Chao, Jianan Zhao, Yuan Cheng, Qingpei Guo, Wei Chu

    Abstract: With the explosive growth of video data in real-world applications, a comprehensive representation of videos becomes increasingly important. In this paper, we address the problem of video scene recognition, whose goal is to learn a high-level video representation to classify scenes in videos. Due to the diversity and complexity of video contents in realistic scenarios, this task remains a challeng… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  35. arXiv:2401.03844  [pdf, other

    cs.CV

    Fully Attentional Networks with Self-emerging Token Labeling

    Authors: Bingyin Zhao, Zhiding Yu, Shiyi Lan, Yutao Cheng, Anima Anandkumar, Yingjie Lao, Jose M. Alvarez

    Abstract: Recent studies indicate that Vision Transformers (ViTs) are robust against out-of-distribution scenarios. In particular, the Fully Attentional Network (FAN) - a family of ViT backbones, has achieved state-of-the-art robustness. In this paper, we revisit the FAN models and improve their pre-training with a self-emerging token labeling (STL) framework. Our method contains a two-stage training framew… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2023, pp. 5585-5595

  36. arXiv:2401.03476  [pdf, other

    cs.MM cs.AI cs.HC cs.SD eess.AS

    Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness

    Authors: Sicheng Yang, Zunnan Xu, Haiwei Xue, Yongkang Cheng, Shaoli Huang, Mingming Gong, Zhiyong Wu

    Abstract: Current talking avatars mostly generate co-speech gestures based on audio and text of the utterance, without considering the non-speaking motion of the speaker. Furthermore, previous works on co-speech gesture generation have designed network structures based on individual gesture datasets, which results in limited data volume, compromised generalizability, and restricted speaker movements. To tac… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 6 pages, 3 figures, ICASSP 2024

  37. arXiv:2401.03428  [pdf, other

    cs.AI cs.MA

    Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects

    Authors: Yuheng Cheng, Ceyao Zhang, Zhengwen Zhang, Xiangrui Meng, Sirui Hong, Wenhao Li, Zihao Wang, Zekai Wang, Feng Yin, Junhua Zhao, Xiuqiang He

    Abstract: Intelligent agents stand out as a potential path toward artificial general intelligence (AGI). Thus, researchers have dedicated significant effort to diverse implementations for them. Benefiting from recent progress in large language models (LLMs), LLM-based agents that use universal natural language as an interface exhibit robust generalization capabilities across various applications -- from ser… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

  38. arXiv:2401.02901  [pdf, other

    hep-ph hep-ex

    Charged-current non-standard neutrino interactions at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding , et al. (177 additional authors not shown)

    Abstract: The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-… ▽ More

    Submitted 19 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 25 pages, 16 figures, 6 tables; 36 pages, format changed, references added

  39. arXiv:2401.00974  [pdf, other

    cs.LG cs.AI

    Downstream Task-Oriented Generative Model Selections on Synthetic Data Training for Fraud Detection Models

    Authors: Yinan Cheng, Chi-Hua Wang, Vamsi K. Potluru, Tucker Balch, Guang Cheng

    Abstract: Devising procedures for downstream task-oriented generative model selections is an unresolved problem of practical importance. Existing studies focused on the utility of a single family of generative models. They provided limited insights on how synthetic data practitioners select the best family generative models for synthetic training tasks given a specific combination of machine learning model… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: The following article has been accepted by ICAIF22, Synthetic Data for AI in Finance; see https://sites.google.com/view/icaif-synthetic-2022/program

  40. arXiv:2401.00701  [pdf, other

    cs.CV

    Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning

    Authors: Kaibin Tian, Yanhua Cheng, Yi Liu, Xinglin Hou, Quan Chen, Han Li

    Abstract: In recent years, text-to-video retrieval methods based on CLIP have experienced rapid development. The primary direction of evolution is to exploit the much wider gamut of visual and textual cues to achieve alignment. Concretely, those methods with impressive performance often design a heavy fusion block for sentence (words)-video (frames) interaction, regardless of the prohibitive computation com… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  41. arXiv:2401.00625  [pdf, ps, other

    cs.LG

    Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models

    Authors: Guangji Bai, Zheng Chai, Chen Ling, Shiyu Wang, Jiaying Lu, Nan Zhang, Tingwei Shi, Ziyang Yu, Mengdan Zhu, Yifei Zhang, Carl Yang, Yue Cheng, Liang Zhao

    Abstract: The burgeoning field of Large Language Models (LLMs), exemplified by sophisticated models like OpenAI's ChatGPT, represents a significant advancement in artificial intelligence. These models, however, bring forth substantial challenges in the high consumption of computational, memory, energy, and financial resources, especially in environments with limited resource capabilities. This survey aims t… ▽ More

    Submitted 3 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    Comments: Preprint. GitHub repo: https://github.com/tiingweii-shii/Awesome-Resource-Efficient-LLM-Papers

  42. arXiv:2401.00395  [pdf, other

    stat.ME

    Energetic Variational Gaussian Process Regression for Computer Experiments

    Authors: Lulu Kang, Yuanxing Cheng, Yiwei Wang, Chun Liu

    Abstract: The Gaussian process (GP) regression model is a widely employed surrogate modeling technique for computer experiments, offering precise predictions and statistical inference for the computer simulators that generate experimental data. Estimation and inference for GP can be performed in both frequentist and Bayesian frameworks. In this chapter, we construct the GP model through variational inferenc… ▽ More

    Submitted 1 April, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 19 pages, 7 figures, 3 tables

  43. arXiv:2401.00204  [pdf, other

    astro-ph.CO hep-ph

    Electromagnetic Radiation from Binary Stars Mediated by Ultralight Scalar

    Authors: Ya-Ze Cheng, Wen-Hao Wu, Yan Cao

    Abstract: We present the electromagnetic (EM) dipole radiation flux from an eccentric Keplerian binary endowed with scalar charges, in the presence of scalar-photon coupling $φA_μA^μ$ or $φF_{μν}F^{μν}$. The scalar radiation is suppressed for orbital frequency below the scalar mass, while the scalar-mediated indirect EM radiation survives. We examine the constraints imposed on the scalar-photon and scalar-c… ▽ More

    Submitted 1 July, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 5 figures, 2 tables; errors corrected; revised discussions on the asymptotic limits of indirect radiation; adding results for the angular momentum flux associated with the dipole radiation of massive scalar/vector fields and the 1PN charged binary conservative dynamics; comments are welcome

  44. arXiv:2401.00151  [pdf, other

    cs.CV cs.CR

    CamPro: Camera-based Anti-Facial Recognition

    Authors: Wenjun Zhu, Yuan Sun, Jiani Liu, Yushi Cheng, Xiaoyu Ji, Wenyuan Xu

    Abstract: The proliferation of images captured from millions of cameras and the advancement of facial recognition (FR) technology have made the abuse of FR a severe privacy threat. Existing works typically rely on obfuscation, synthesis, or adversarial examples to modify faces in images to achieve anti-facial recognition (AFR). However, the unmodified images captured by camera modules that contain sensitive… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Accepted by NDSS Symposium 2024

  45. arXiv:2401.00148  [pdf, other

    cs.CR cs.CV

    TPatch: A Triggered Physical Adversarial Patch

    Authors: Wenjun Zhu, Xiaoyu Ji, Yushi Cheng, Shibo Zhang, Wenyuan Xu

    Abstract: Autonomous vehicles increasingly utilize the vision-based perception module to acquire information about driving environments and detect obstacles. Correct detection and classification are important to ensure safe driving decisions. Existing works have demonstrated the feasibility of fooling the perception models such as object detectors and image classifiers with printed adversarial patches. Howe… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Appeared in 32nd USENIX Security Symposium (USENIX Security 23)

  46. Enhancing Low-Resource Relation Representations through Multi-View Decoupling

    Authors: Chenghao Fan, Wei Wei, Xiaoye Qu, Zhenyi Lu, Wenfeng Xie, Yu Cheng, Dangyang Chen

    Abstract: Recently, prompt-tuning with pre-trained language models (PLMs) has demonstrated the significantly enhancing ability of relation extraction (RE) tasks. However, in low-resource scenarios, where the available training data is scarce, previous prompt-based methods may still perform poorly for prompt-based representation learning due to a superficial understanding of the relation. To this end, we hig… ▽ More

    Submitted 29 May, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  47. arXiv:2312.17115  [pdf, other

    cs.CL cs.CY

    How Far Are LLMs from Believable AI? A Benchmark for Evaluating the Believability of Human Behavior Simulation

    Authors: Yang Xiao, Yi Cheng, Jinlan Fu, Jiashuo Wang, Wenjie Li, Pengfei Liu

    Abstract: In recent years, AI has demonstrated remarkable capabilities in simulating human behaviors, particularly those implemented with large language models (LLMs). However, due to the lack of systematic evaluation of LLMs' simulated behaviors, the believability of LLMs among humans remains ambiguous, i.e., it is unclear which behaviors of LLMs are convincingly human-like and which need further improveme… ▽ More

    Submitted 15 June, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

  48. arXiv:2312.16899  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Anomalous exchange bias effect in ferromagnetic VI3 flakes

    Authors: Xi Zhang, Xiuquan Xia, Qiye Liu, Yonggang He, Le Wang, Junhao Lin, Jia-Wei Mei, Yingchun Cheng, Jun-Feng Dai

    Abstract: The exchange bias (EB) effect, pivotal in magnetic data storage and sensing devices, has been observed not only in interfacial regions but also in intrinsic ferromagnetic materials. Here, we've uncovered a robust and stable exchange bias effect within the layered van der Waals (vdW) ferromagnet VI3 employing magnetic circular dichroism microscopy. At 10 K, we observed a significant exchange field… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

  49. arXiv:2312.15746  [pdf, other

    cs.IR cs.AI

    Large Language Models are Not Stable Recommender Systems

    Authors: Tianhui Ma, Yuan Cheng, Hengshu Zhu, Hui Xiong

    Abstract: With the significant successes of large language models (LLMs) in many natural language processing tasks, there is growing interest among researchers in exploring LLMs for novel recommender systems. However, we have observed that directly using LLMs as a recommender system is usually unstable due to its inherent position bias. To this end, we introduce exploratory research and find consistent patt… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  50. Thermal Relic Right-Handed Neutrino Dark Matter

    Authors: Yu Cheng, Jie Sheng, Tsutomu T. Yanagida

    Abstract: It is known that two heavy Majorana right-handed neutrinos are sufficient to generate the baryon asymmetry in the present universe. Thus, it is interesting to identify the third right-handed neutrino $N$ with the dark matter. We impose a new discrete symmetry $Z_2$ on this dark matter neutrino to stabilize it. However, the $U(1)_{B-L}$ gauge boson $A'$ couples to the right-handed neutrino $N$. If… ▽ More

    Submitted 21 May, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

    Comments: 6 pages, 2 figures