Showing 1–50 of 791 results for author: Pan, Z

  1. arXiv:2408.08228  [pdf, other]

    eess.IV cs.CV

    Rethinking Medical Anomaly Detection in Brain MRI: An Image Quality Assessment Perspective

    Authors: Zixuan Pan, Jun Xia, Zheyu Yan, Guoyue Xu, Yawen Wu, Zhenge Jia, Jianxu Chen, Yiyu Shi

    Abstract: Reconstruction-based methods, particularly those leveraging autoencoders, have been widely adopted to perform anomaly detection in brain MRI. While most existing works try to improve detection accuracy by proposing new model structures or algorithms, we tackle the problem through image quality assessment, an underexplored perspective in the field. We propose a fusion quality loss function that com… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2408.03440  [pdf, other]

    eess.AS cs.SD

    TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement

    Authors: Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux

    Abstract: Time-frequency (TF) domain dual-path models achieve high-fidelity speech separation. While some previous state-of-the-art (SoTA) models rely on RNNs, this reliance means they lack the parallelizability, scalability, and versatility of Transformer blocks. Given the wide-ranging success of pure Transformer-based architectures in other fields, in this work we focus on removing the RNN from TF-domain… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Accepted to IWAENC 2024

  3. arXiv:2408.03438  [pdf, other]

    eess.AS cs.SD

    Enhanced Reverberation as Supervision for Unsupervised Speech Separation

    Authors: Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux

    Abstract: Reverberation as supervision (RAS) is a framework that allows for training monaural speech separation models from multi-channel mixtures in an unsupervised manner. In RAS, models are trained so that sources predicted from a mixture at an input channel can be mapped to reconstruct a mixture at a target channel. However, stable unsupervised training has so far only been achieved in over-determined s… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Accepted to Interspeech 2024

  4. arXiv:2408.00402  [pdf, other]

    astro-ph.GA

    Changing-look Active Galactic Nuclei from the Dark Energy Spectroscopic Instrument. II. Statistical Properties from the First Data Release

    Authors: Wei-Jian Guo, Hu Zou, Claire L. Greenwell, David M. Alexander, Victoria A. Fawcett, Zhiwei Pan, Malgorzata Siudek, Jessica Nicole Aguilar, Steven Ahlen, David Brooks, Todd Claybaugh, Kyle Dawson, Axel De La Macorra, Peter Doel, Andreu Font-Ribera, Enrique Gaztanaga, Satya Gontcho A Gontcho, Gaston Gutierrez, Robert Kehoe, Theodore Kisner, Martin Landriau, Laurent Le Guillou, Marc Manera, Aaron Meisner, Ramon Mique , et al. (11 additional authors not shown)

    Abstract: We present the identification of changing-look active galactic nuclei (CL-AGNs) from the Dark Energy Spectroscopic Instrument First Data Release and Sloan Digital Sky Survey Data Release 16 at $z \leq 0.9$. To confirm the CL-AGNs, we utilize spectral flux calibration assessment via an [O III]-based calibration, pseudo-photometry examination, and visual inspection. This rigorous selection proc… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Submitted to ApJS, comments welcome

  5. arXiv:2408.00266  [pdf]

    physics.chem-ph

    Nanostructured Fe2O3/CuxO Heterojunction for Enhanced Solar Redox Flow Battery Performance

    Authors: Jiaming Ma, Milad Sabzehparvar, Ziyan Pan, Giulia Tagliabue

    Abstract: Solar redox flow batteries (SRFB) have received much attention as an alternative integrated technology for simultaneous conversion and storage of solar energy. Yet, the photocatalytic efficiency of semiconductor-based single photoelectrode, such as hematite, remains low due to the trade-off between fast electron hole recombination and insufficient light utilization, as well as inferior reaction ki… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  6. arXiv:2407.20579  [pdf, other]

    astro-ph.IM astro-ph.CO

    Measurement and Modeling of Polarized Atmosphere at the South Pole with SPT-3G

    Authors: A. Coerver, J. A. Zebrowski, S. Takakura, W. L. Holzapfel, P. A. R. Ade, A. J. Anderson, Z. Ahmed, B. Ansarinejad, M. Archipley, L. Balkenhol, D. Barron, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, P. M. Chichura, A. Chokshi , et al. (80 additional authors not shown)

    Abstract: We present the detection and characterization of fluctuations in linearly polarized emission from the atmosphere above the South Pole. These measurements make use of Austral winter survey data from the SPT-3G receiver on the South Pole Telescope in three frequency bands centered at 95, 150, and 220 GHz. We use the cross-correlation between detectors to produce an unbiased estimate of the power in… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 32 pages, 28 figures

  7. arXiv:2407.20544  [pdf, other]

    cs.CR cs.AR

    Automated Physical Design Watermarking Leveraging Graph Neural Networks

    Authors: Ruisi Zhang, Rachel Selina Rajarathnam, David Z. Pan, Farinaz Koushanfar

    Abstract: This paper presents AutoMarks, an automated and transferable watermarking framework that leverages graph neural networks to reduce the watermark search overheads during the placement stage. AutoMarks's novel automated watermark search is accomplished by (i) constructing novel graph and node features with physical, semantic, and design constraint-aware representation; (ii) designing a data-efficien… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted to MLCAD24, code: https://github.com/ruisizhang123/PD_WM_GNN

  8. arXiv:2407.19441  [pdf, other]

    cs.NE

    Competition-based Adaptive ReLU for Deep Neural Networks

    Authors: Junjia Chen, Zhibin Pan

    Abstract: Activation functions introduce nonlinearity into deep neural networks. Most popular activation functions allow positive values to pass through while blocking or suppressing negative values. From the idea that positive values and negative values are equally important, and they must compete for activation, we proposed a new Competition-based Adaptive ReLU (CAReLU). CAReLU scales the input values bas… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  9. arXiv:2407.17570  [pdf, other]

    astro-ph.GA astro-ph.CO

    A SPectroscopic survey of biased halos In the Reionization Era (ASPIRE): Broad-line AGN at $z=4-5$ revealed by JWST/NIRCam WFSS

    Authors: Xiaojing Lin, Feige Wang, Xiaohui Fan, Zheng Cai, Jaclyn B. Champagne, Fengwu Sun, Marta Volonteri, Jinyi Yang, Joseph F. Hennawi, Eduardo Bañados, Aaron Barth, Anna-Christina Eilers, Emanuele Paolo Farina, Weizhe Liu, Xiangyu Jin, Hyunsung D. Jun, Alessandro Lupi, Koki Kakiichi, Chiara Mazzucchelli, Masafusa Onoue, Zhiwei Pan, Elia Pizzati, Sofía Rojas-Ruiz, Jan-Torge Schindler, Benny Trakhtenbrot , et al. (11 additional authors not shown)

    Abstract: Low-luminosity AGNs with low-mass black holes (BHs) in the early universe are fundamental to understanding the BH growth and their co-evolution with the host galaxies. Utilizing JWST NIRCam Wide Field Slitless Spectroscopy (WFSS), we perform a systematic search for broad-line ${\rm H}\alpha$ emitters (BHAEs) at $z\approx 4-5$ in 25 fields of the ASPIRE (A SPectroscopic survey of biased halos In the Rei… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 19 pages, 13 figures, 4 tables. Accepted by the ApJ

  10. arXiv:2407.16189  [pdf, other]

    cs.CV

    EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse Principles

    Authors: Zicheng Pan, Xiaohan Yu, Yongsheng Gao

    Abstract: Source-free domain adaptation (SFDA) aims to transfer knowledge from a labelled source domain to an unlabelled target domain. A major challenge in SFDA is deriving accurate categorical information for the target domain, especially when sample embeddings from different classes appear similar. This issue is particularly pronounced in fine-grained visual categorization tasks, where inter-class differ… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 12 pages, 3 figures. Accepted by BMVC2024

  11. arXiv:2407.15344  [pdf, other]

    physics.flu-dyn

    Error propagation of direct pressure gradient integration and a Helmholtz-Hodge decomposition based pressure field reconstruction method for image velocimetry

    Authors: Lanyu Li, Jeffrey McClure, Grady B. Wright, Jared P. Whitehead, Jin Wang, Zhao Pan

    Abstract: Recovering pressure fields from image velocimetry measurements has two general strategies: i) directly integrating the pressure gradients from the momentum equation and ii) solving or enforcing the pressure Poisson equation (divergence of the pressure gradients). In this work, we analyze the error propagation of the former strategy and provide some practical insights. For example, we establish the… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  12. arXiv:2407.14868  [pdf, other]

    cs.CV

    Dual High-Order Total Variation Model for Underwater Image Restoration

    Authors: Yuemei Li, Guojia Hou, Peixian Zhuang, Zhenkuan Pan

    Abstract: Underwater images are typically characterized by color cast, haze, blurring, and uneven illumination due to the selective absorption and scattering when light propagates through the water, which limits their practical applications. Underwater image enhancement and restoration (UIER) is one crucial mode to improve the visual quality of underwater images. However, most existing UIER methods concentr… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: 13 pages, 10 figures

  13. arXiv:2407.14360  [pdf]

    cond-mat.mtrl-sci

    Unraveling the multistage phase transformations in monolayer Mo-Te compounds

    Authors: Zemin Pan, Tao Jian, Hui Zhang, Xiaoyu Lin, Chao Zhu, Jinghao Deng, Zhengbo Cheng, Chuansheng Liu, Chendong Zhang

    Abstract: Monolayer MoTe2 exhibits a variety of derivative structural phases and associated novel electronic properties that enable a wealth of potential applications in future electronic and optoelectronic devices. However, a comprehensive study focusing on the complexities of the controllable phase evolution in this atomically thin film has yet to be performed. This work aims to address this issue by syst… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 17 pages, 5 figures

  14. arXiv:2407.13698  [pdf, other]

    q-fin.ST cs.CE cs.LG

    International Trade Flow Prediction with Bilateral Trade Provisions

    Authors: Zijie Pan, Stepan Gordeev, Jiahui Zhao, Ziyi Meng, Caiwen Ding, Sandro Steinbach, Dongjin Song

    Abstract: This paper presents a novel methodology for predicting international bilateral trade flows, emphasizing the growing importance of Preferential Trade Agreements (PTAs) in the global trade landscape. Acknowledging the limitations of traditional models like the Gravity Model of Trade, this study introduces a two-stage approach combining explainable machine learning and factorization models. The first… ▽ More

    Submitted 23 June, 2024; originally announced July 2024.

  15. arXiv:2407.13578  [pdf, other]

    cs.CL cs.AI

    Large Language Models as Reliable Knowledge Bases?

    Authors: Danna Zheng, Mirella Lapata, Jeff Z. Pan

    Abstract: The NLP community has recently shown a growing interest in leveraging Large Language Models (LLMs) for knowledge-intensive tasks, viewing LLMs as potential knowledge bases (KBs). However, the reliability and extent to which LLMs can function as KBs remain underexplored. While previous studies suggest LLMs can encode knowledge within their parameters, the amount of parametric knowledge alone is not… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  16. Exposure Completing for Temporally Consistent Neural High Dynamic Range Video Rendering

    Authors: Jiahao Cui, Wei Jiang, Zhan Peng, Zhiyu Pan, Zhiguo Cao

    Abstract: High dynamic range (HDR) video rendering from low dynamic range (LDR) videos where frames are of alternate exposure encounters significant challenges, due to the exposure change and absence at each time stamp. The exposure change and absence make existing methods generate flickering HDR results. In this paper, we propose a novel paradigm to render HDR frames via completing the absent exposure info… ▽ More

    Submitted 4 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 9 pages, 6 figures, accepted by ACM-MM 2024 (poster)

  17. arXiv:2407.12798  [pdf, other]

    cs.CV

    Multi-Granularity and Multi-modal Feature Interaction Approach for Text Video Retrieval

    Authors: Wenjun Li, Shudong Wang, Dong Zhao, Shenghui Xu, Zhaoming Pan, Zhimin Zhang

    Abstract: The key of the text-to-video retrieval (TVR) task lies in learning the unique similarity between each pair of text (consisting of words) and video (consisting of audio and image frames) representations. However, some problems exist in the representation alignment of video and text, such as a text, and further each word, are of different importance for video frames. Besides, audio usually carries a… ▽ More

    Submitted 20 June, 2024; originally announced July 2024.

  18. arXiv:2407.12023  [pdf, other]

    cs.CL cs.AI

    CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models

    Authors: Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Zhi-Long Ji, Jin-Feng Bai, Zhen-Ru Pan, Fan-Hu Zeng, Jian Xu, Jia-Xin Zhang, Cheng-Lin Liu

    Abstract: Due to the rapid advancements in multimodal large language models, evaluating their multimodal mathematical capabilities continues to receive wide attention. Despite the datasets like MathVista proposed benchmarks for assessing mathematical capabilities in multimodal scenarios, there is still a lack of corresponding evaluation tools and datasets for fine-grained assessment in the context of K12 ed… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  19. arXiv:2407.11620  [pdf]

    eess.SP

    A Deep Learning-Based Target Radial Length Estimation Method through HRRP Sequence

    Authors: Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang

    Abstract: This paper introduces an innovative deep learning-based method for end-to-end target radial length estimation from HRRP (High Resolution Range Profile) sequences. Firstly, the HRRP sequences are normalized and transformed into GAF (Gram Angular Field) images to effectively capture and utilize the temporal information. Subsequently, these GAF images serve as the input for a pretrained ResNet-101 mo… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 2 pages, 2 figures. Accepted by APCAP 2024

  20. arXiv:2407.11291  [pdf, ps, other]

    math.RA

    Normal forms of elements in the Weyl algebra and Dixmier Conjecture

    Authors: Gang Han, Zhennan Pan, Yulin Chen

    Abstract: A result of A. Joseph says that any nilpotent or semisimple element $z$ in the Weyl algebra $A_1$ over some algebraically closed field $K$ of characteristic 0 has a normal form up to the action of the automorphism group of $A_1$. It is shown in this note that the normal form corresponds to some unique pair of integers $(k,n)$ with $k\ge n\ge 0$, and will be called the Joseph norm form of $z$. Simila… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  21. arXiv:2407.08236  [pdf, other]

    eess.SP

    HRRPGraphNet: A Graph Neural Network Based Approach for HRRP Radar Target Recognition

    Authors: Lingfeng Chen, Panhe Hu, Zhiliang Pan, Xiao Sun, Zehao Wang

    Abstract: High Resolution Range Profiles (HRRP) have become a key area of focus in the domain of Radar Automatic Target Recognition (RATR). Despite the success of data-driven neural network-based HRRP recognition, challenges such as insufficient training samples persist in its real-world application. This letter introduces HRRPGraphNet, a novel Graph Neural Network (GNN) model designed specifically for HRRP… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  22. arXiv:2407.08199  [pdf, other]

    cs.CV

    SRPose: Two-view Relative Pose Estimation with Sparse Keypoints

    Authors: Rui Yin, Yulun Zhang, Zherong Pan, Jianjun Zhu, Cheng Wang, Biao Jia

    Abstract: Two-view pose estimation is essential for map-free visual relocalization and object pose tracking tasks. However, traditional matching methods suffer from time-consuming robust estimators, while deep learning-based pose regressors only cater to camera-to-world pose estimation, lacking generalizability to different image sizes and camera intrinsics. In this paper, we propose SRPose, a sparse keypoi… ▽ More

    Submitted 18 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: 30 pages, 11 figures, to be published in ECCV 2024

  23. arXiv:2407.07835  [pdf, other]

    cs.CV cs.AI

    RoBus: A Multimodal Dataset for Controllable Road Networks and Building Layouts Generation

    Authors: Tao Li, Ruihang Li, Huangnan Zheng, Shanding Ye, Shijian Li, Zhijie Pan

    Abstract: Automated 3D city generation, focusing on road networks and building layouts, is in high demand for applications in urban design, multimedia games and autonomous driving simulations. The surge of generative AI facilitates designing city layouts based on deep learning models. However, the lack of high-quality datasets and benchmarks hinders the progress of these data-driven methods in generating ro… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  24. arXiv:2407.07346  [pdf, other]

    cs.LG cs.CE

    INSIGHT: Universal Neural Simulator for Analog Circuits Harnessing Autoregressive Transformers

    Authors: Souradip Poddar, Youngmin Oh, Yao Lai, Hanqing Zhu, Bosun Hwang, David Z. Pan

    Abstract: Analog front-end design heavily relies on specialized human expertise and costly trial-and-error simulations, which motivated many prior works on analog design automation. However, efficient and effective exploration of the vast and complex design space remains constrained by the time-consuming nature of SPICE simulations, making effective design automation a challenging endeavor. In this paper, w… ▽ More

    Submitted 6 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  25. arXiv:2407.03227  [pdf, other]

    cs.CL cs.AI cs.DB

    Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning

    Authors: Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan

    Abstract: We focus on Text-to-SQL semantic parsing from the perspective of Large Language Models. Motivated by challenges related to the size of commercial database schemata and the deployability of business intelligence solutions, we propose an approach that dynamically retrieves input database information and uses abstract syntax trees to select few-shot examples for in-context learning. Furthermore, we… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  26. arXiv:2407.02038  [pdf, other]

    cs.CV

    Camera-LiDAR Cross-modality Gait Recognition

    Authors: Wenxuan Guo, Yingping Liang, Zhiyu Pan, Ziheng Xi, Jianjiang Feng, Jie Zhou

    Abstract: Gait recognition is a crucial biometric identification technique. Camera-based gait recognition has been widely applied in both research and industrial fields. LiDAR-based gait recognition has also begun to evolve most recently, due to the provision of 3D structural information. However, in certain applications, cameras fail to recognize persons, such as in low-light environments and long-distance… ▽ More

    Submitted 4 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ECCV 2024

  27. arXiv:2407.01971  [pdf, other]

    cs.CV

    Pseudo-Labeling by Multi-Policy Viewfinder Network for Image Cropping

    Authors: Zhiyu Pan, Kewei Wang, Yizheng Wu, Liwen Xiao, Jiahao Cui, Zhicheng Wang, Zhiguo Cao

    Abstract: Automatic image cropping models predict reframing boxes to enhance image aesthetics. Yet, the scarcity of labeled data hinders the progress of this task. To overcome this limitation, we explore the possibility of utilizing both labeled and unlabeled data together to expand the scale of training data for image cropping models. This idea can be implemented in a pseudo-labeling way: producing pseudo… ▽ More

    Submitted 4 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 18 pages, 8 figures

  28. arXiv:2407.00909  [pdf, other]

    cs.IR cs.CV

    Heterogeneous Graph-based Framework with Disentangled Representations Learning for Multi-target Cross Domain Recommendation

    Authors: Xiaopeng Liu, Juan Zhang, Chongqi Ren, Shenghui Xu, Zhaoming Pan, Zhimin Zhang

    Abstract: CDR (Cross-Domain Recommendation), i.e., leveraging information from multiple domains, is a critical solution to data sparsity problem in recommendation system. The majority of previous research either focused on single-target CDR (STCDR) by utilizing data from the source domains to improve the model's performance on the target domain, or applied dual-target CDR (DTCDR) by integrating data from th… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  29. arXiv:2407.00817  [pdf]

    cs.AR

    Multi-Objective Optimization for Common-Centroid Placement of Analog Transistors

    Authors: Supriyo Maji, Hyungjoo Park, Gi moon Hong, Souradip Poddar, David Z. Pan

    Abstract: In analog circuits, process variation can cause unpredictability in circuit performance. Common-centroid (CC) type layouts have been shown to mitigate process-induced variations and are widely used to match circuit elements. Nevertheless, selecting the most suitable CC topology necessitates careful consideration of important layout constraints. Manual handling of these constraints becomes challeng… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  30. arXiv:2406.18588  [pdf, other]

    cs.CV cs.LG

    Varying Manifolds in Diffusion: From Time-varying Geometries to Visual Saliency

    Authors: Junhao Chen, Manyi Li, Zherong Pan, Xifeng Gao, Changhe Tu

    Abstract: Deep generative models learn the data distribution, which is concentrated on a low-dimensional manifold. The geometric analysis of distribution transformation provides a better understanding of data structure and enables a variety of applications. In this paper, we study the geometric properties of the diffusion model, whose forward diffusion process and reverse generation process construct a seri… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  31. arXiv:2406.18539  [pdf, other]

    cs.CV cs.GR

    TexPainter: Generative Mesh Texturing with Multi-view Consistency

    Authors: Hongkun Zhang, Zherong Pan, Congyi Zhang, Lifeng Zhu, Xifeng Gao

    Abstract: The recent success of pre-trained diffusion models unlocks the possibility of the automatic generation of textures for arbitrary 3D meshes in the wild. However, these models are trained in the screen space, while converting them to a multi-view consistent texture image poses a major obstacle to the output quality. In this paper, we propose a novel method to enforce multi-view consistency. Our meth… ▽ More

    Submitted 17 May, 2024; originally announced June 2024.

    Comments: Accepted by SIGGRAPH 2024

  32. arXiv:2406.18169  [pdf, ps, other]

    astro-ph.HE hep-ph

    Timing and Scintillation Studies of Pulsars in Globular Cluster M3 (NGC 5272) with FAST

    Authors: Baoda Li, Li-yun Zhang, Jumei Yao, Dejiang Yin, Ralph P. Eatough, Minghui Li, Yifeng Li, Yujie Lian, Yu Pan, Yinfeng Dai, Yaowei Li, Xingnan Zhang, Tianhao Su, Yuxiao Wu, Tong Liu, Kuo Liu, Lin Wang, Lei Qian, Zhichen Pan

    Abstract: We present the phase-connected timing solutions of all the five pulsars in globular cluster (GC) M3 (NGC 5272), namely PSRs M3A to F (PSRs J1342+2822A to F), with the exception of PSR M3C, from FAST archival data. In these timing solutions, those of PSRs M3E, and F are obtained for the first time. We find that PSRs M3E and F have low mass companions, and are in circular orbits with periods of 7.1… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures, accepted for publication in The Astrophysical Journal

  33. Start from Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

    Authors: Wen Zhang, Yajing Xu, Peng Ye, Zhiwei Huang, Zezhong Xu, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graph (KG) completion aims to find out missing triples in a KG. Some tasks, such as link prediction and instance completion, have been proposed for KG completion. They are triple-level tasks with some elements in a missing triple given to predict the missing element of the triple. However, knowing some elements of the missing triple in advance is not always a realistic setting. In this p… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Paper accepted by TKDE in 2024

  34. arXiv:2406.18115  [pdf, other]

    cs.RO cs.AI cs.CV

    Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps

    Authors: Dicong Qiu, Wenzong Ma, Zhenfu Pan, Hui Xiong, Junwei Liang

    Abstract: Open-Vocabulary Mobile Manipulation (OVMM) is a crucial capability for autonomous robots, especially when faced with the challenges posed by unknown and dynamic environments. This task requires robots to explore and build a semantic understanding of their surroundings, generate feasible plans to achieve manipulation goals, adapt to environmental changes, and comprehend natural language instruction… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Open-vocabulary, Mobile Manipulation, Dynamic Environments, 3D Semantic Maps, Zero-shot, LLMs, VLMs, 18 pages, 2 figures

  35. arXiv:2406.16776  [pdf, other]

    cs.CV

    Instance Consistency Regularization for Semi-Supervised 3D Instance Segmentation

    Authors: Yizheng Wu, Zhiyu Pan, Kewei Wang, Xingyi Li, Jiahao Cui, Liwen Xiao, Guosheng Lin, Zhiguo Cao

    Abstract: Large-scale datasets with point-wise semantic and instance labels are crucial to 3D instance segmentation but also expensive. To leverage unlabeled data, previous semi-supervised 3D instance segmentation approaches have explored self-training frameworks, which rely on high-quality pseudo labels for consistency regularization. They intuitively utilize both instance and semantic pseudo labels in a j… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 14 pages, 10 figures

  36. arXiv:2406.15835  [pdf]

    cond-mat.mtrl-sci

    Alternating-Chiral Charge Density Waves and Hybrid Ferrimagnetism in Monolayered NbTe2

    Authors: Yusong Bai, Guohua Cao, Jinghao Deng, Haomin Fei, Xiaoyu Lin, Leiqiang Li, Chao Zhu, Zemin Pan, Tao Jian, Da Huo, Zhengbo Cheng, Chih-Kang Shih, Ping Cui, Chendong Zhang, Zhenyu Zhang

    Abstract: Intertwining of different quantum degrees of freedom manifests exotic quantum phenomena in many-body systems, especially in reduced dimensionality. Here we show that monolayered NbTe2 serves as an ideal platform where lattice, charge, and spin degrees of freedom manifest cooperatively, leading to a new and threading order of chirality. By using spin-polarized scanning tunneling microscopy/spectros… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  37. arXiv:2406.14282  [pdf, other]

    cs.CL cs.AI

    Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

    Authors: Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, Jinjie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

    Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fin… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Work in progress

  38. arXiv:2406.14052  [pdf, other]

    eess.IV cs.CV

    Perspective+ Unet: Enhancing Segmentation with Bi-Path Fusion and Efficient Non-Local Attention for Superior Receptive Fields

    Authors: Jintong Hu, Siyan Chen, Zhiyi Pan, Sen Zeng, Wenming Yang

    Abstract: Precise segmentation of medical images is fundamental for extracting critical clinical information, which plays a pivotal role in enhancing the accuracy of diagnoses, formulating effective treatment plans, and improving patient outcomes. Although Convolutional Neural Networks (CNNs) and non-local attention methods have achieved notable success in medical image segmentation, they either struggle to… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 13 pages, 5 figures

  39. arXiv:2406.11682  [pdf, other]

    cs.CL cs.AI cs.CR

    Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack

    Authors: Shangqing Tu, Zhuoran Pan, Wenxuan Wang, Zhexin Zhang, Yuliang Sun, Jifan Yu, Hongning Wang, Lei Hou, Juanzi Li

    Abstract: Large language models (LLMs) have been increasingly applied to various domains, which triggers increasing concerns about LLMs' safety on specialized domains, e.g. medicine. However, testing the domain-specific safety of LLMs is challenging due to the lack of domain knowledge-driven attacks in existing benchmarks. To bridge this gap, we propose a new task, knowledge-to-jailbreak, which aims to gene… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 18 pages, 14 figures, 11 tables

  40. arXiv:2406.10283  [pdf, other]

    cs.CL cs.SD eess.AS

    Attentive Merging of Hidden Embeddings from Pre-trained Speech Model for Anti-spoofing Detection

    Authors: Zihan Pan, Tianchi Liu, Hardik B. Sailor, Qiongqiong Wang

    Abstract: Self-supervised learning (SSL) speech representation models, trained on large speech corpora, have demonstrated effectiveness in extracting hierarchical speech embeddings through multiple transformer layers. However, the behavior of these embeddings in specific tasks remains uncertain. This paper investigates the multi-layer behavior of the WavLM model in anti-spoofing and proposes an attentive me… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  41. arXiv:2406.06544  [pdf, other]

    cs.AR cs.AI

    TSB: Tiny Shared Block for Efficient DNN Deployment on NVCIM Accelerators

    Authors: Yifan Qin, Zheyu Yan, Zixuan Pan, Wujie Wen, Xiaobo Sharon Hu, Yiyu Shi

    Abstract: Compute-in-memory (CIM) accelerators using non-volatile memory (NVM) devices offer promising solutions for energy-efficient and low-latency Deep Neural Network (DNN) inference execution. However, practical deployment is often hindered by the challenge of dealing with the massive amount of model weight parameters impacted by the inherent device variations within non-volatile computing-in-memory (NV… ▽ More

    Submitted 8 May, 2024; originally announced June 2024.

  42. arXiv:2406.06357  [pdf, other]

    cs.CL cs.AI

    MASSW: A New Dataset and Benchmark Tasks for AI-Assisted Scientific Workflows

    Authors: Xingjian Zhang, Yutong Xie, Jin Huang, Jinge Ma, Zhaoying Pan, Qijia Liu, Ziyang Xiong, Tolga Ergen, Dongsub Shim, Honglak Lee, Qiaozhu Mei

    Abstract: Scientific innovation relies on detailed workflows, which include critical steps such as analyzing literature, generating ideas, validating these ideas, interpreting results, and inspiring follow-up research. However, scientific publications that document these workflows are extensive and unstructured. This makes it difficult for both human researchers and AI systems to effectively navigate and ex…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:1706.03762 by other authors

  43. arXiv:2406.05720  [pdf, other]

    cs.AI cs.MA

    VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft

    Authors: Yubo Dong, Xukun Zhu, Zhengzhe Pan, Linchao Zhu, Yi Yang

    Abstract: In this paper, we aim to evaluate multi-agent systems against complex dependencies, including spatial, causal, and temporal constraints. First, we construct a new benchmark, named VillagerBench, within the Minecraft environment. VillagerBench comprises diverse tasks crafted to test various aspects of multi-agent collaboration, from workload distribution to dynamic adaptation and synchronized task e…

    Submitted 9 June, 2024; originally announced June 2024.

  44. arXiv:2406.05641  [pdf, other]

    cs.CV

    PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction

    Authors: Shangyu Chen, Zizheng Pan, Jianfei Cai, Dinh Phung

    Abstract: Personalizing a large-scale pretrained Text-to-Image (T2I) diffusion model is challenging as it typically struggles to make an appropriate trade-off between its training data distribution and the target distribution, i.e., learning a novel concept with only a few target images to achieve personalization (aligning with the personalized target) while preserving text editability (aligning with divers…

    Submitted 9 June, 2024; originally announced June 2024.

  45. arXiv:2406.05250  [pdf, other]

    cs.AI cs.AR cs.LG

    LLM-Enhanced Bayesian Optimization for Efficient Analog Layout Constraint Generation

    Authors: Guojin Chen, Keren Zhu, Seunggeun Kim, Hanqing Zhu, Yao Lai, Bei Yu, David Z. Pan

    Abstract: Analog layout synthesis faces significant challenges due to its dependence on manual processes, considerable time requirements, and performance instability. Current Bayesian Optimization (BO)-based techniques for analog layout synthesis, despite their potential for automation, suffer from slow convergence and extensive data needs, limiting their practical application. This paper presents the \text…

    Submitted 19 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  46. arXiv:2406.05130  [pdf, other]

    cs.CL

    An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models

    Authors: Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Multimodal large language models (MLLMs) fine-tuned with multimodal instruction datasets have demonstrated remarkable capabilities in multimodal tasks. However, fine-tuning all parameters of MLLMs has become challenging as they usually contain billions of parameters. To address this issue, we study parameter-efficient fine-tuning (PEFT) methods for MLLMs. We aim to identify effective methods for e…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACL Findings 2024

  47. arXiv:2406.03777  [pdf, other]

    cs.LG cs.AI

    Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices

    Authors: Ruiyang Qin, Dancheng Liu, Zheyu Yan, Zhaoxuan Tan, Zixuan Pan, Zhenge Jia, Meng Jiang, Ahmed Abbasi, Jinjun Xiong, Yiyu Shi

    Abstract: The scaling laws have become the de facto guidelines for designing large language models (LLMs), but they were studied under the assumption of unlimited computing resources for both training and inference. As LLMs are increasingly used as personalized intelligent assistants, their customization (i.e., learning through fine-tuning) and deployment onto resource-constrained edge devices will become m…

    Submitted 13 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Benchmarking paper

  48. arXiv:2406.03283  [pdf, other]

    cs.SE cs.AI

    Enhancing Repository-Level Code Generation with Integrated Contextual Information

    Authors: Zhiyuan Pan, Xing Hu, Xin Xia, Xiaohu Yang

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in code generation tasks. However, repository-level code generation presents unique challenges, particularly due to the need to utilize information spread across multiple files within a repository. Existing retrieval-based approaches sometimes fall short as they are limited in obtaining a broader and deeper repository context.…

    Submitted 5 June, 2024; originally announced June 2024.

  49. arXiv:2406.02880  [pdf, other]

    cs.CV cs.AI

    Controllable Talking Face Generation by Implicit Facial Keypoints Editing

    Authors: Dong Zhao, Jiaying Shi, Wenjun Li, Shudong Wang, Shenghui Xu, Zhaoming Pan

    Abstract: Audio-driven talking face generation has garnered significant interest within the domain of digital human research. Existing methods are encumbered by intricate model architectures that are intricately dependent on each other, complicating the process of re-editing image or video inputs. In this work, we present ControlTalk, a talking face generation method to control face expression deformation b…

    Submitted 4 June, 2024; originally announced June 2024.

  50. arXiv:2406.01763  [pdf, other]

    math.OC cs.RO

    Provably Feasible and Stable White-Box Trajectory Optimization

    Authors: Zherong Pan, Yifan Zhu

    Abstract: We study the problem of Trajectory Optimization (TO) for a general class of stiff and constrained dynamic systems. We establish a set of mild assumptions, under which we show that TO converges numerically stably to a locally optimal and feasible solution up to arbitrary user-specified error tolerance. Our key observation is that all prior works use SQP as a black-box solver, where a TO problem is…

    Submitted 23 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.