Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 6,576 results for author: Yang, J

.
  1. arXiv:2407.21016  [pdf, other

    cs.CV

    Add-SD: Rational Generation without Manual Reference

    Authors: Lingfeng Yang, Xinyu Zhang, Xiang Li, Jinwen Chen, Kun Yao, Gang Zhang, Errui Ding, Lingqiao Liu, Jingdong Wang, Jian Yang

    Abstract: Diffusion models have exhibited remarkable prowess in visual generalization. Building on this success, we introduce an instruction-based object addition pipeline, named Add-SD, which automatically inserts objects into realistic scenes with rational sizes and positions. Different from layout-conditioned methods, Add-SD is solely conditioned on simple text prompts rather than any other human-costly… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  2. arXiv:2407.20816  [pdf, other

    hep-th

    On the definition of Carrollian amplitudes in general dimensions

    Authors: Wen-Bin Liu, Jiang Long, Hong-Yang Xiao, Jing-Long Yang

    Abstract: Carrollian amplitude is the natural object that defines the correlator of the boundary Carrollian field theory. In this work, we will elaborate on its proper definition in general dimensions. We use the vielbein field on the unit sphere to define the fundamental field with non-vanishing helicity in the local Cartesian frame which is the building block of the Carrollian amplitude. In general dimens… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 56 pages

  3. arXiv:2407.20571  [pdf, other

    cs.HC cs.SE

    Considering Visualization Example Galleries

    Authors: Junran Yang, Andrew McNutt, Leilani Battle

    Abstract: Example galleries are often used to teach, document, and advertise visually-focused domain-specific languages and libraries, such as those producing visualizations, diagrams, or webpages. Despite their ubiquity, there is no consensus on the role of "example galleries", let alone what the best practices might be for their creation or curation. To understand gallery meaning and usage, we interviewed… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  4. arXiv:2407.20562  [pdf, ps, other

    math.GN

    Quasisymmetric minimality on packing dimension for homogeneous perfect sets

    Authors: Shishuang Liu, Yanzhe Li, Jiaojiao Yang

    Abstract: In this paper, we study the quasisymmetric packing minimality of homogeneous perfect sets, and obtain that a special class of homogeneous perfect sets with $\operatorname{dim}_{P}E=1$ is quasisymmetrically packing minimal.

    Submitted 30 July, 2024; originally announced July 2024.

  5. arXiv:2407.20551  [pdf, ps, other

    hep-ex

    Observation of $D^0\to b_1(1235)^- e^+ν_e$ and evidence for $D^+\to b_1(1235)^0 e^+ν_e$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (647 additional authors not shown)

    Abstract: By analyzing a data sample of $e^+e^-$ collisions with center-of-mass energy $\sqrt{s}=3.773$ GeV, corresponding to an integrated luminosity of $7.9~\rm {fb}^{-1}$ collected with the BESIII detector operating at the BEPCII collider, we study semileptonic decays of the $D^{0(+)}$ mesons into the axial-vector meson $b_1(1235)$ via the decay $b_1(1235)\to ωπ$. The decay… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 9 pages, 2 figures

  6. arXiv:2407.20141  [pdf, other

    cs.CV

    DDAP: Dual-Domain Anti-Personalization against Text-to-Image Diffusion Models

    Authors: Jing Yang, Runping Xi, Yingxin Lai, Xun Lin, Zitong Yu

    Abstract: Diffusion-based personalized visual content generation technologies have achieved significant breakthroughs, allowing for the creation of specific objects by just learning from a few reference photos. However, when misused to fabricate fake news or unsettling content targeting individuals, these technologies could cause considerable societal harm. To address this problem, current methods generate… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted by IJCB 2024

  7. arXiv:2407.20078  [pdf, other

    cs.CV

    Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset

    Authors: Yimian Dai, Mengxuan Xiao, Yiming Zhu, Huan Wang, Kehua Guo, Jian Yang

    Abstract: Infrared small target detection poses unique challenges due to the scarcity of intrinsic target features and the abundance of similar background distractors. We argue that background semantics play a pivotal role in distinguishing visually similar objects for this task. To address this, we introduce a new task -- clustered infrared small target detection, and present DenseSIRST, a novel benchmark… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  8. arXiv:2407.20009  [pdf, ps, other

    hep-ex

    Measurement of the $\boldsymbol{e^{+}e^{-}\to K^+K^-ψ(2S)}$ Cross Section at Center-of-Mass Energies from 4.699 to 4.951 GeV and Search for $\boldsymbol{Z_{cs}^{\pm}}$ in the $\boldsymbol{Z_{cs}^\pm\to K^\pmψ(2S)}$ Decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (646 additional authors not shown)

    Abstract: We perform the first investigation of the process $e^{+}e^{-}\to K^+K^-ψ(2S)$ and report its Born cross sections over a range of center-of-mass energies from 4.699 to 4.951~GeV. The measurements are carried out using several partial reconstruction techniques using data samples collected by the BESIII detector with a total integrated luminosity of 2.5~fb$^{-1}$. We search for new tetraquark candida… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures

  9. arXiv:2407.19981  [pdf, other

    cs.CV

    Adversarial Robustness in RGB-Skeleton Action Recognition: Leveraging Attention Modality Reweighter

    Authors: Chao Liu, Xin Liu, Zitong Yu, Yonghong Hou, Huanjing Yue, Jingyu Yang

    Abstract: Deep neural networks (DNNs) have been applied in many computer vision tasks and achieved state-of-the-art (SOTA) performance. However, misclassification will occur when DNNs predict adversarial examples which are created by adding human-imperceptible adversarial noise to natural examples. This limits the application of DNN in security-critical fields. In order to enhance the robustness of models,… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: Accepted by IJCB 2024

  10. arXiv:2407.19689  [pdf, other

    math.OC

    PDOT: a Practical Primal-Dual Algorithm and a GPU-Based Solver for Optimal Transport

    Authors: Haihao Lu, Jinwen Yang

    Abstract: In this paper, we propose a practical primal-dual algorithm with theoretical guarantees and develop a GPU-based solver, which we dub PDOT, for solving large-scale optimal transport problems. Compared to Sinkhorn algorithm or classic LP algorithms, PDOT can achieve high-accuracy solution while efficiently taking advantage of modern computing architecture, i.e., GPUs. On the theoretical side, we sho… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  11. arXiv:2407.19628  [pdf, other

    cs.CV

    Text2LiDAR: Text-guided LiDAR Point Cloud Generation via Equirectangular Transformer

    Authors: Yang Wu, Kaihua Zhang, Jianjun Qian, Jin Xie, Jian Yang

    Abstract: The complex traffic environment and various weather conditions make the collection of LiDAR data expensive and challenging. Achieving high-quality and controllable LiDAR data generation is urgently needed, controlling with text is a common practice, but there is little research in this field. To this end, we propose Text2LiDAR, the first efficient, diverse, and text-controllable LiDAR data generat… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  12. arXiv:2407.19078  [pdf, other

    cs.LG stat.ML

    Practical Marketplace Optimization at Uber Using Causally-Informed Machine Learning

    Authors: Bobby Chen, Siyu Chen, Jason Dowlatabadi, Yu Xuan Hong, Vinayak Iyer, Uday Mantripragada, Rishabh Narang, Apoorv Pandey, Zijun Qin, Abrar Sheikh, Hongtao Sun, Jiaqi Sun, Matthew Walker, Kaichen Wei, Chen Xu, Jingnan Yang, Allen T. Zhang, Guoqing Zhang

    Abstract: Budget allocation of marketplace levers, such as incentives for drivers and promotions for riders, has long been a technical and business challenge at Uber; understanding lever budget changes' impact and estimating cost efficiency to achieve predefined budgets is crucial, with the goal of optimal allocations that maximize business value; we introduce an end-to-end machine learning and optimization… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: To be published in the 2nd Workshop on Causal Inference and Machine Learning in Practice, KDD 2024, August 25 to 29, 2024, Barcelona, Spain, 10 pages

    MSC Class: 62J99

  13. arXiv:2407.19053  [pdf, other

    cs.SE

    A Study of Using Multimodal LLMs for Non-Crash Functional Bug Detection in Android Apps

    Authors: Bangyan Ju, Jin Yang, Tingting Yu, Tamerlan Abdullayev, Yuanyuan Wu, Dingbang Wang, Yu Zhao

    Abstract: Numerous approaches employing various strategies have been developed to test the graphical user interfaces (GUIs) of mobile apps. However, traditional GUI testing techniques, such as random and model-based testing, primarily focus on generating test sequences that excel in achieving high code coverage but often fail to act as effective test oracles for non-crash functional (NCF) bug detection. To… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  14. arXiv:2407.18625  [pdf, other

    cs.ET cs.AI cs.NE

    Topology Optimization of Random Memristors for Input-Aware Dynamic SNN

    Authors: Bo Wang, Shaocong Wang, Ning Lin, Yi Li, Yifei Yu, Yue Zhang, Jichang Yang, Xiaoshan Wu, Yangu He, Songqi Wang, Rui Chen, Guoqi Li, Xiaojuan Qi, Zhongrui Wang, Dashan Shang

    Abstract: There is unprecedented development in machine learning, exemplified by recent large language models and world simulators, which are artificial neural networks running on digital computers. However, they still cannot parallel human brains in terms of energy efficiency and the streamlined adaptability to inputs of different difficulties, due to differences in signal representation, optimization, run… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 15 pages, 5 figures

  15. arXiv:2407.18054  [pdf, other

    eess.IV cs.CV

    LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels

    Authors: Ziwei Cui, Jingfeng Yao, Lunbin Zeng, Juan Yang, Wenyu Liu, Xinggang Wang

    Abstract: The segmentation of cell nuclei in tissue images stained with the blood dye hematoxylin and eosin (H$\&$E) is essential for various clinical applications and analyses. Due to the complex characteristics of cellular morphology, a large receptive field is considered crucial for generating high-quality segmentation. However, previous methods face challenges in achieving a balance between the receptiv… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  16. arXiv:2407.17570  [pdf, other

    astro-ph.GA astro-ph.CO

    A SPectroscopic survey of biased halos In the Reionization Era (ASPIRE): Broad-line AGN at $z=4-5$ revealed by JWST/NIRCam WFSS

    Authors: Xiaojing Lin, Feige Wang, Xiaohui Fan, Zheng Cai, Jaclyn B. Champagne, Fengwu Sun, Marta Volonteri, Jinyi Yang, Joseph F. Hennawi, Eduardo Bañados, Aaron Barth, Anna-Christina Eilers, Emanuele Paolo Farina, Weizhe Liu, Xiangyu Jin, Hyunsung D. Jun, Alessandro Lupi, Koki Kakiichi, Chiara Mazzucchelli, Masafusa Onoue, Zhiwei Pan, Elia Pizzati, Sofía Rojas-Ruiz, Jan-Torge Schindler, Benny Trakhtenbrot , et al. (11 additional authors not shown)

    Abstract: Low-luminosity AGNs with low-mass black holes (BHs) in the early universe are fundamental to understanding the BH growth and their co-evolution with the host galaxies. Utilizing JWST NIRCam Wide Field Slitless Spectroscopy (WFSS), we perform a systematic search for broad-line ${\rm Hα}$ emitters (BHAEs) at $z\approx 4-5$ in 25 fields of the ASPIRE (A SPectroscopic survey of biased halos In the Rei… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 19 pages, 13 figures, 4 tables. Accepted by the ApJ

  17. arXiv:2407.17184  [pdf, other

    hep-ex

    Search for $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times10^{9}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII, we find an evidence of the $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay with a statistical significance of 3.1$σ$. Its decay branching fraction is measured to be $(12.24\pm4.60(\mathrm{stat.})\pm2.37(\mathrm{syst.})\pm4.68(\mathrm{extr.}))\times 10^{-4}$, where the first uncertainty is stati… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  18. arXiv:2407.16910  [pdf

    cond-mat.mtrl-sci

    Operando probing of nanocracking in CuO-derived Cu during CO$_2$ electroreduction

    Authors: Jiawei Wan, Ershuai Liu, Woong Choi, Jiayun Liang, Buyu Zhang, Keon-Han Kim, Xianhu Sun, Meng Zhang, Han Xue, Yi Chen, Qiubo Zhang, Changlian Wen, Ji Yang, Karen C. Bustillo, Peter Ercius, Denis Leshchev, Ji Su, Zakaria Y. Al Balushi, Adam Z. Weber, Mark Asta, Alexis T. Bell, Walter S. Drisdell, Haimei Zheng

    Abstract: Identifying and controlling active sites in electrocatalysis remains a grand challenge due to restructuring of catalysts in the complex chemical environments during operation. Inactive precatalysts can transform into active catalysts under reaction conditions, such as oxide-derived Cu (OD-Cu) for CO$_2$ electroreduction displaying improved production of multicarbon (C$_{2+}$) chemicals. Revealing… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  19. arXiv:2407.16432  [pdf

    quant-ph

    Integrated high-performance error correction for continuous-variable quantum key distribution

    Authors: Chuang Zhou, Yang Li, Li Ma, Jie Yang, Wei Huang, Ao Sun, Heng Wang, Yujie Luo, Yong Li, Ziyang Chen, Francis C. M. Lau, Yichen Zhang, Song Yu, Hong Guo, Bingjie Xu

    Abstract: An integrated error-correction scheme with high throughput, low frame errors rate (FER) and high reconciliation efficiency under low signal to noise ratio (SNR) is one of the major bottlenecks to realize high-performance and low-cost continuous variable quantum key distribution (CV-QKD). To solve this long-standing problem, a novel two-stage error correction method with limited precision that is s… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  20. arXiv:2407.16144  [pdf, other

    math.OC

    Restarted Halpern PDHG for Linear Programming

    Authors: Haihao Lu, Jinwen Yang

    Abstract: In this paper, we propose and analyze a new matrix-free primal-dual algorithm, called restarted Halpern primal-dual hybrid gradient (rHPDHG), for solving linear programming (LP). We show that rHPDHG can achieve optimal accelerated linear convergence on feasible and bounded LP. Furthermore, we present a refined analysis that demonstrates an accelerated two-stage convergence of rHPDHG over the vanil… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  21. arXiv:2407.15851  [pdf, other

    cs.CV cs.AI cs.CY cs.HC cs.LG

    A Survey on Trustworthiness in Foundation Models for Medical Image Analysis

    Authors: Congzhen Shi, Ryan Rezai, Jiaxi Yang, Qi Dou, Xiaoxiao Li

    Abstract: The rapid advancement of foundation models in medical imaging represents a significant leap toward enhancing diagnostic accuracy and personalized treatment. However, the deployment of foundation models in healthcare necessitates a rigorous examination of their trustworthiness, encompassing privacy, robustness, reliability, explainability, and fairness. The current body of survey literature on foun… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  22. arXiv:2407.15435  [pdf, other

    cs.CV

    Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures

    Authors: Ruizhe Wang, Chunliang Hua, Tomakayev Shingys, Mengyuan Niu, Qingxin Yang, Lizhong Gao, Yi Zheng, Junyan Yang, Qiao Wang

    Abstract: The photorealistic reconstruction and rendering of architectural scenes have extensive applications in industries such as film, games, and transportation. It also plays an important role in urban planning, architectural design, and the city's promotion, especially in protecting historical and cultural relics. The 3D Gaussian Splatting, due to better performance over NeRF, has become a mainstream t… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  23. arXiv:2407.15414  [pdf, other

    cs.LG cs.CR

    Weights Shuffling for Improving DPSGD in Transformer-based Models

    Authors: Jungang Yang, Zhe Ji, Liyao Xiang

    Abstract: Differential Privacy (DP) mechanisms, especially in high-dimensional settings, often face the challenge of maintaining privacy without compromising the data utility. This work introduces an innovative shuffling mechanism in Differentially-Private Stochastic Gradient Descent (DPSGD) to enhance the utility of large models at the same privacy guarantee of the unshuffled case. Specifically, we reveal… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  24. arXiv:2407.15369  [pdf, other

    cs.CV

    Sparse Prior Is Not All You Need: When Differential Directionality Meets Saliency Coherence for Infrared Small Target Detection

    Authors: Fei Zhou, Maixia Fu, Yulei Qian, Jian Yang, Yimian Dai

    Abstract: Infrared small target detection is crucial for the efficacy of infrared search and tracking systems. Current tensor decomposition methods emphasize representing small targets with sparsity but struggle to separate targets from complex backgrounds due to insufficient use of intrinsic directional information and reduced target visibility during decomposition. To address these challenges, this study… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE TIM, Minor Revision

  25. arXiv:2407.15156  [pdf, other

    math.NA

    Computational and analytical studies of a new nonlocal phase-field crystal model in two dimensions

    Authors: Qiang Du, Kai Wang, Jiang Yang

    Abstract: A nonlocal phase-field crystal (NPFC) model is presented as a nonlocal counterpart of the local phase-field crystal (LPFC) model and a special case of the structural PFC (XPFC) derived from classical field theory for crystal growth and phase transition. The NPFC incorporates a finite range of spatial nonlocal interactions that can account for both repulsive and attractive effects. The specific for… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  26. arXiv:2407.14895  [pdf, other

    cs.IR

    Strategic Coupon Allocation for Increasing Providers' Sales Experiences in Two-sided Marketplaces

    Authors: Koya Ohashi, Sho Sekine, Deddy Jobson, Jie Yang, Naoki Nishimura, Noriyoshi Sukegawa, Yuichi Takano

    Abstract: In a two-sided marketplace, network effects are crucial for competitiveness, and platforms need to retain users through advanced customer relationship management as much as possible. Maintaining numerous providers' stable and active presence on the platform is highly important to enhance the marketplace's scale and diversity. The strongest motivation for providers to continue using the platform is… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: 8 pages, 10 figures, KDD 2024 Workshop on Two-sided Marketplace Optimization: Search, Pricing, Matching & Growth

  27. arXiv:2407.14875  [pdf, other

    cs.CL

    Seal: Advancing Speech Language Models to be Few-Shot Learners

    Authors: Shuyu Lei, Lingen Liu, Jiaolong Yang, Yasen Jiao, Yuxiang Yang, Yushu Yang, Xiang Guo

    Abstract: Existing auto-regressive language models have demonstrated a remarkable capability to perform a new task with just a few examples in prompt, without requiring any additional training. In order to extend this capability to a multi-modal setting (i.e. speech and language), this paper introduces the Seal model, an abbreviation for speech language model. It incorporates a novel alignment method, in wh… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  28. arXiv:2407.14564  [pdf, ps, other

    eess.IV cs.AI cs.CV cs.LG

    APS-USCT: Ultrasound Computed Tomography on Sparse Data via AI-Physic Synergy

    Authors: Yi Sheng, Hanchen Wang, Yipei Liu, Junhuan Yang, Weiwen Jiang, Youzuo Lin, Lei Yang

    Abstract: Ultrasound computed tomography (USCT) is a promising technique that achieves superior medical imaging reconstruction resolution by fully leveraging waveform information, outperforming conventional ultrasound methods. Despite its advantages, high-quality USCT reconstruction relies on extensive data acquisition by a large number of transducers, leading to increased costs, computational demands, exte… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: MICCAI

  29. arXiv:2407.14198  [pdf

    cs.CV eess.IV

    Double-Shot 3D Shape Measurement with a Dual-Branch Network

    Authors: Mingyang Lei, Jingfan Fan, Long Shao, Hong Song, Deqiang Xiao, Danni Ai, Tianyu Fu, Ying Gu, Jian Yang

    Abstract: The structured light (SL)-based 3D measurement techniques with deep learning have been widely studied, among which speckle projection profilometry (SPP) and fringe projection profilometry (FPP) are two popular methods. However, they generally use a single projection pattern for reconstruction, resulting in fringe order ambiguity or poor reconstruction accuracy. To alleviate these problems, we prop… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  30. arXiv:2407.13932  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Excitation laser energy dependence of the gap-mode TERS spectra of WS$_2$ and MoS$_2$ on silver

    Authors: Andrey Krayev, Eleonora Isotta, Lauren Hoang, Jerry A. Yang, Kathryn Neilson, Minyuan Wang, Noah Haughn, Eric Pop, Andrew Mannix, Oluwaseyi Balogun, Chih-Feng Wang

    Abstract: We present a systematic study of the dependence of gap mode tip-enhanced Raman scattering (TERS) of mono- and bi-layer WS$_2$ and MoS$_2$ as a function of excitation laser energy. We collected consecutive TERS maps of mono-and bi-layer regions with 6 different excitation lasers. To decrease the acquisition time, we used for the first time concurrent excitation and collection with two lasers simult… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 21 pages, 10 figures

  31. arXiv:2407.13896  [pdf, ps, other

    cs.LG cs.AI

    Data-Algorithm-Architecture Co-Optimization for Fair Neural Networks on Skin Lesion Dataset

    Authors: Yi Sheng, Junhuan Yang, Jinyang Li, James Alaina, Xiaowei Xu, Yiyu Shi, Jingtong Hu, Weiwen Jiang, Lei Yang

    Abstract: As Artificial Intelligence (AI) increasingly integrates into our daily lives, fairness has emerged as a critical concern, particularly in medical AI, where datasets often reflect inherent biases due to social factors like the underrepresentation of marginalized communities and socioeconomic barriers to data collection. Traditional approaches to mitigating these biases have focused on data augmenta… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: MICCAI

  32. arXiv:2407.13851  [pdf, other

    cs.CV cs.LG cs.MM

    X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

    Authors: Sirnam Swetha, Jinyu Yang, Tal Neiman, Mamshad Nayeem Rizve, Son Tran, Benjamin Yao, Trishul Chilimbi, Mubarak Shah

    Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) have revolutionized the field of vision-language understanding by integrating visual perception capabilities into Large Language Models (LLMs). The prevailing trend in this field involves the utilization of a vision encoder derived from vision-language contrastive learning (CL), showing expertise in capturing overall representations w… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV2024

  33. arXiv:2407.13499  [pdf, other

    cs.CR

    Three-State Information Hiding: Provably Secure Asymmetric Steganography

    Authors: Minhao Bai, Jinshuai Yang, Kaiyi Pang, Xu Xin, Yongfeng Huang

    Abstract: The rise of language models has provided a fertile ground for the application of steganography. Due to their qualified output, steganographic texts become similar to human and have attracted most of the steganography researchers' attention. However, running a language model requires a strong computation platform. It limits the applicable scenario of steganography, since those electronic devices co… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  34. arXiv:2407.12863  [pdf, other

    cs.CL cs.AI

    Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models

    Authors: Jung Hyun Lee, June Yong Yang, Byeongho Heo, Dongyoon Han, Kang Min Yoo

    Abstract: Large Language Models (LLMs) have demonstrated impressive problem-solving capabilities in mathematics through step-by-step reasoning chains. However, they are susceptible to reasoning errors that impact the quality of subsequent reasoning chains and the final answer due to language models' autoregressive token-by-token generating nature. Recent works have proposed adopting external verifiers to gu… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  35. arXiv:2407.12772  [pdf, other

    cs.CL cs.CV

    LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

    Authors: Kaichen Zhang, Bo Li, Peiyuan Zhang, Fanyi Pu, Joshua Adrian Cahyono, Kairui Hu, Shuai Liu, Yuanhan Zhang, Jingkang Yang, Chunyuan Li, Ziwei Liu

    Abstract: The advances of large foundation models necessitate wide-coverage, low-cost, and zero-contamination benchmarks. Despite continuous exploration of language model evaluations, comprehensive studies on the evaluation of Large Multi-modal Models (LMMs) remain limited. In this work, we introduce LMMS-EVAL, a unified and standardized multimodal benchmark framework with over 50 tasks and more than 10 mod… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Code ad leaderboard are available at https://github.com/EvolvingLMMs-Lab/lmms-eval and https://huggingface.co/spaces/lmms-lab/LiveBench

  36. arXiv:2407.12744  [pdf, other

    cond-mat.supr-con

    Negligible Normal Fluid in Superconducting State of Heavily Overdoped Bi$_2$Sr$_2$CaCu$_2$O$_{8+δ}$ Detected by Ultra-Low Temperature Angle-Resolved Photoemission Spectroscopy

    Authors: Chaohui Yin, Qinghong Wang, Yuyang Xie, Yiwen Chen, Junhao Liu, Jiangang Yang, Junjie Jia, Xing Zhang, Wenkai Lv, Hongtao Yan, Hongtao Rong, Shenjin Zhang, Zhimin Wang, Nan Zong, Lijuan Liu, Rukang Li, Xiaoyang Wang, Fengfeng Zhang, Feng Yang, Qinjun Peng, Zuyan Xu, Guodong Liu, Hanqing Mao, Lin Zhao, Xintong Li , et al. (1 additional authors not shown)

    Abstract: In high temperature cuprate superconductors, it was found that in the overdoped region the superfluid density decreases with the increase of hole doping. One natural question is whether there exists normal fluid in the superconducting state in the overdoped region. In this paper, we have carried out high-resolution ultra-low temperature laser-based angle-resolved photoemission measurements on a he… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures

    Journal ref: Chinese Physics B 33, 077405 (2024)

  37. arXiv:2407.12604  [pdf, ps, other

    cs.IT cs.DS cs.SI

    Exact Graph Matching in Correlated Gaussian-Attributed Erdős-Rényi Model

    Authors: Joonhyuk Yang, Hye Won Chung

    Abstract: Graph matching problem aims to identify node correspondence between two or more correlated graphs. Previous studies have primarily focused on models where only edge information is provided. However, in many social networks, not only the relationships between users, represented by edges, but also their personal information, represented by features, are present. In this paper, we address the challen… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: IEEE International Symposium on Information Theory (ISIT) 2024

  38. arXiv:2407.12529  [pdf, other

    cs.CL

    Crafting the Path: Robust Query Rewriting for Information Retrieval

    Authors: Ingeol Baek, Jimin Lee, Joonho Yang, Hwanhee Lee

    Abstract: Query rewriting aims to generate a new query that can complement the original query to improve the information retrieval system. Recent studies on query rewriting, such as query2doc (Q2D), query2expand (Q2E) and querey2cot (Q2C), rely on the internal knowledge of Large Language Models (LLMs) to generate a relevant passage to add information to the query. Nevertheless, the efficacy of these methodo… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 1 figure, 12 tables

  39. arXiv:2407.12485  [pdf, other

    eess.SY

    Record 202.3 Tb/s Transmission over Field-Deployed Fibre using 15.6 THz S+C+L-Bands

    Authors: Jiaqian Yang, Eric Sillekens, Benjamin J. Puttnam, Ronit Sohanpal, Mindaugas Jarmolovičius, Romulo Aparecido, Henrique Buglia, Ruben S. Luis, Ralf Stolte, Polina Bayvel, Robert I. Killey

    Abstract: Ultra-wideband, field-deployed metropolitan fibre transmission is experimentally demonstrated, measuring a record 202.3 Tb/s GMI and 189.5 Tb/s after decoding with 20.9 dBm launch power and lumped amplification only. An experimentally-optimised 5 dB pre-tilt over the 15.6 THz optical bandwidth was applied to overcome ISRS.

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 3 pages plus 1 page reference, 6 figures, submit to ECOC 2024

  40. arXiv:2407.12476  [pdf, other

    eess.SY

    Experimental validation of the closed-form GN model accounting for distributed Raman amplification in an S+C+L-band hybrid amplified long-haul transmission system

    Authors: Jiaqian Yang, Henrique Buglia, Eric Sillekens, Mingming Tan, Pratim Hazarika, Dini Pratiwi, Ronit Sohanpal, Mindaugas Jarmolovičius, Romulo Aparecido, Ralf Stolte, Wladek Forysiak, Polina Bayvel, Robert I. Killey

    Abstract: The accuracy of a recently-developed closed-form GN nonlinear interference model is evaluated in experimental 1065 km S+C+L band WDM transmission with backward Raman pumping. The model accurately estimates the nonlinear interference and ASE with total SNR error of less than 0.6 dB.

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 3 pages plus 1 page reference, 4 figures, submitted to ECOC 2024

  41. arXiv:2407.12435  [pdf, other

    cs.CV

    F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions

    Authors: Jie Yang, Xuesong Niu, Nan Jiang, Ruimao Zhang, Siyuan Huang

    Abstract: Existing 3D human object interaction (HOI) datasets and models simply align global descriptions with the long HOI sequence, while lacking a detailed understanding of intermediate states and the transitions between states. In this paper, we argue that fine-grained semantic alignment, which utilizes state-level descriptions, offers a promising paradigm for learning semantically rich HOI representati… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: ECCV24

  42. arXiv:2407.12270  [pdf, other

    hep-ex

    Observation of $Λ_c^+ \to Λa_0(980)^+$ and Evidence for $Σ(1380)^+$ in $Λ_c^+ \to Λπ^+ η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Based on $6.1~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies from 4.600~GeV to 4.843~GeV with the BESIII detector at the BEPCII collider, a partial wave analysis of $Λ_c^+\toΛπ^+η$ is performed, and branching fractions and decay asymmetry parameters of intermediate processes are determined. The process $Λ_c^+\toΛa_0(980)^+$ is observed for the first time, and… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 16 pages, 8 figures

  43. arXiv:2407.11886  [pdf

    cond-mat.mtrl-sci

    Automated production of batched unclonable micro-patterns anti-counterfeiting labels with strong robustness and rapid recognition speed

    Authors: Yuzheng He, Zunshuai Zhang, Yifei Xing, Zhiyuan Lang, Jinbo Wu, Jiong Yang

    Abstract: Anti-counterfeiting technologies are indeed crucial for information security and protecting product authenticity. Traditional anti-counterfeiting methods have their limitations due to their clonable nature. Exploring new technologies, particularly those based on pixel-level textures is a promising avenue to address the clonable issue due to high encoding capacity. However, research in this field i… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 22 pages, 8 figures

    MSC Class: 68U10; 92E99 ACM Class: J.2; J.7

  44. A cryogenic on-chip microwave pulse generator for large-scale superconducting quantum computing

    Authors: Zenghui Bao, Yan Li, Zhiling Wang, Jiahui Wang, Jize Yang, Haonan Xiong, Yipu Song, Yukai Wu, Hongyi Zhang, Luming Duan

    Abstract: For superconducting quantum processors, microwave signals are delivered to each qubit from room-temperature electronics to the cryogenic environment through coaxial cables. Limited by the heat load of cabling and the massive cost of electronics, such an architecture is not viable for millions of qubits required for fault-tolerant quantum computing. Monolithic integration of the control electronics… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 12 pages, 4 figures

    Journal ref: Nat Commun 15, 5958 (2024)

  45. arXiv:2407.11727  [pdf, ps, other

    hep-ex hep-ph

    Measurement of the branching fraction of $D^+_s\to \ell^+ν_\ell$ via $e^+e^-\to D^{*+}_{s} D^{*-}_{s}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Based on $10.64~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data taken at center-of-mass energies between 4.237 and 4.699 GeV with the BESIII detector, we study the leptonic $D^+_s$ decays using the $e^+e^-\to D^{*+}_{s} D^{*-}_{s}$ process. The branching fractions of $D_s^+\to\ell^+ν_{\ell}\,(\ell=μ,τ)$ are measured to be $\mathcal{B}(D_s^+\toμ^+ν_μ)=(0.547\pm0.026_{\rm stat}\pm0.016_{\rm syst})\%$ a… ▽ More

    Submitted 18 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 27 pages, 13 figures

  46. arXiv:2407.11691  [pdf, other

    cs.CV

    VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models

    Authors: Haodong Duan, Junming Yang, Yuxuan Qiao, Xinyu Fang, Lin Chen, Yuan Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang, Jiaqi Wang, Dahua Lin, Kai Chen

    Abstract: We present VLMEvalKit: an open-source toolkit for evaluating large multi-modality models based on PyTorch. The toolkit aims to provide a user-friendly and comprehensive framework for researchers and developers to evaluate existing multi-modality models and publish reproducible evaluation results. In VLMEvalKit, we implement over 70 different large multi-modality models, including both proprietary… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  47. arXiv:2407.11534  [pdf, other

    cs.LG cs.AI

    LRQ: Optimizing Post-Training Quantization for Large Language Models by Learning Low-Rank Weight-Scaling Matrices

    Authors: Jung Hyun Lee, Jeonghoon Kim, June Yong Yang, Se Jung Kwon, Eunho Yang, Kang Min Yoo, Dongsoo Lee

    Abstract: With the commercialization of large language models (LLMs), weight-activation quantization has emerged to compress and accelerate LLMs, achieving high throughput while reducing inference costs. However, existing post-training quantization (PTQ) techniques for quantizing weights and activations of LLMs still suffer from non-negligible accuracy drops, especially on massive multitask language underst… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Preprint

  48. arXiv:2407.11027  [pdf, other

    cs.LG cs.AI

    A robust three-way classifier with shadowed granular-balls based on justifiable granularity

    Authors: Jie Yang, Lingyun Xiaodiao, Guoyin Wang, Witold Pedrycz, Shuyin Xia, Qinghua Zhang, Di Wu

    Abstract: The granular-ball (GB)-based classifier introduced by Xia, exhibits adaptability in creating coarse-grained information granules for input, thereby enhancing its generality and flexibility. Nevertheless, the current GB-based classifiers rigidly assign a specific class label to each data instance and lacks of the necessary strategies to address uncertain instances. These far-fetched certain classif… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  49. arXiv:2407.11006  [pdf, other

    cs.CL cs.AI cs.LG cs.PF

    How Good Is It? Evaluating the Efficacy of Common versus Domain-Specific Prompts on Foundational Large Language Models

    Authors: Oluyemi Enoch Amujo, Shanchieh Jay Yang

    Abstract: Recently, large language models (LLMs) have expanded into various domains. However, there remains a need to evaluate how these models perform when prompted with commonplace queries compared to domain-specific queries, which may be useful for benchmarking prior to fine-tuning domain-specific downstream tasks. This study evaluates LLMs, specifically Gemma-2B and Gemma-7B, across diverse domains, inc… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures, 2 tables, and algorithms

  50. arXiv:2407.11003  [pdf, other

    cs.CL cs.AI cs.LG

    Using Large Language Models in Public Transit Systems, San Antonio as a case study

    Authors: Ramya Jonnala, Gongbo Liang, Jeong Yang, Izzat Alsmadi

    Abstract: The integration of large language models into public transit systems represents a significant advancement in urban transportation management and passenger experience. This study examines the impact of LLMs within San Antonio's public transit system, leveraging their capabilities in natural language processing, data analysis, and real time communication. By utilizing GTFS and other public transport… ▽ More

    Submitted 25 June, 2024; originally announced July 2024.