
Showing 1–50 of 473 results for author: Zhuang, J

  1. arXiv:2407.20181  [pdf, other]

    cs.CR cs.AI cs.DC cs.LG

    Blockchain for Large Language Model Security and Safety: A Holistic Survey

    Authors: Caleb Geren, Amanda Board, Gaby G. Dagher, Tim Andersen, Jun Zhuang

    Abstract: With the advent of accessible interfaces for interacting with large language models, there has been an associated explosion in both their commercial and academic interest. Consequently, there has also been a sudden burst of novel attacks associated with large language models, jeopardizing user data on a massive scale. Situated at a comparable crossroads in its development, and equally prolific to…

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: Submitted to SIGKDD Explorations

  2. arXiv:2407.19375  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Topological Phase Transition in Quasi-One-Dimensional Bismuth Iodide Bi4I4

    Authors: W. X. Zhao, M. Yang, X. Du, Y. D. Li, K. Y. Zhai, Y. Q. Hu, J. F. Han, Y. Huang, Z. K. Liu, Y. G. Yao, J. C. Zhuang, Y. Du, J. J. Zhou, Y. L. Chen, L. X. Yang

    Abstract: The exploration of topological quantum materials and topological phase transitions is at the forefront of modern condensed matter physics. Quasi-one-dimensional (quasi-1D) bismuth iodide Bi4I4 exhibits versatile topological phases of matter including weak topological insulator (WTI) and higher-order topological insulator (HOTI) phases with high tunability in response to external parameters. In thi…

    Submitted 27 July, 2024; originally announced July 2024.

  3. arXiv:2407.17706  [pdf, other]

    quant-ph cs.LG

    Investigating and Mitigating Barren Plateaus in Variational Quantum Circuits: A Survey

    Authors: Jack Cunningham, Jun Zhuang

    Abstract: In recent years, variational quantum circuits (VQCs) have been widely explored to advance quantum circuits against classic models on various domains, such as quantum chemistry and quantum machine learning. Similar to classic machine-learning models, VQCs can be optimized through gradient-based approaches. However, the gradient variance of VQCs may dramatically vanish as the number of qubits or lay…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: preprint, under review. Please feel free to reach out if your work fits within our scope

  4. arXiv:2407.15613  [pdf, other]

    cs.CV

    Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning

    Authors: Xiangyan Qu, Jing Yu, Keke Gai, Jiamin Zhuang, Yuanmin Tang, Gang Xiong, Gaopeng Gou, Qi Wu

    Abstract: Recent work shows that documents from encyclopedias serve as helpful auxiliary information for zero-shot learning. Existing methods align the entire semantics of a document with corresponding images to transfer knowledge. However, they disregard that semantic information is not equivalent between them, resulting in a suboptimal alignment. In this work, we propose a novel network to extract multi-v…

    Submitted 23 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Accepted to ACM International Conference on Multimedia (MM) 2024

  5. arXiv:2407.13139  [pdf, other]

    cs.CV

    Image Inpainting Models are Effective Tools for Instruction-guided Image Editing

    Authors: Xuan Ju, Junhao Zhuang, Zhaoyang Zhang, Yuxuan Bian, Qiang Xu, Ying Shan

    Abstract: This is the technical report for the winning solution of the CVPR2024 GenAI Media Generation Challenge Workshop's Instruction-guided Image Editing track. Instruction-guided image editing has been largely studied in recent years. The most advanced methods, such as SmartEdit and MGIE, usually combine large language models with diffusion models through joint training, where the former provides text u…

    Submitted 17 July, 2024; originally announced July 2024.

  6. arXiv:2407.12940  [pdf, other]

    cs.RO cs.CV

    KiGRAS: Kinematic-Driven Generative Model for Realistic Agent Simulation

    Authors: Jianbo Zhao, Jiaheng Zhuang, Qibin Zhou, Taiyu Ban, Ziyao Xu, Hangning Zhou, Junhe Wang, Guoan Wang, Zhiheng Li, Bin Li

    Abstract: Trajectory generation is a pivotal task in autonomous driving. Recent studies have introduced the autoregressive paradigm, leveraging the state transition model to approximate future trajectory distributions. This paradigm closely mirrors the real-world trajectory generation process and has achieved notable success. However, its potential is limited by the ineffective representation of realistic t…

    Submitted 17 July, 2024; originally announced July 2024.

  7. arXiv:2407.08839  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG

    A Survey on the Application of Generative Adversarial Networks in Cybersecurity: Prospective, Direction and Open Research Scopes

    Authors: Md Mashrur Arifin, Md Shoaib Ahmed, Tanmai Kumar Ghosh, Jun Zhuang, Jyh-haw Yeh

    Abstract: With the proliferation of Artificial Intelligence, there has been a massive increase in the amount of data required to be accumulated and disseminated digitally. As the data are available online in digital landscapes with complex and sophisticated infrastructures, it is crucial to implement various defense mechanisms based on cybersecurity. Generative Adversarial Networks (GANs), which are deep le…

    Submitted 11 July, 2024; originally announced July 2024.

  8. arXiv:2407.05578  [pdf, other]

    cs.CV

    FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance

    Authors: Jiedong Zhuang, Jiaqi Hu, Lianrui Mu, Rui Hu, Xiaoyu Liang, Jiangnan Ye, Haoji Hu

    Abstract: CLIP has achieved impressive zero-shot performance after pre-training on a large-scale dataset consisting of paired image-text data. Previous works have utilized CLIP by incorporating manually designed visual prompts like colored circles and blur masks into the images to guide the model's attention, showing enhanced zero-shot performance in downstream tasks. Although these methods have achieved pr…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  9. arXiv:2407.04064  [pdf, other]

    cs.RO

    Collision Avoidance for Multiple UAVs in Unknown Scenarios with Causal Representation Disentanglement

    Authors: Jiafan Zhuang, Zihao Xia, Gaofei Han, Boxi Wang, Wenji Li, Dongliang Wang, Zhifeng Hao, Ruichu Cai, Zhun Fan

    Abstract: Deep reinforcement learning (DRL) has achieved remarkable progress in online path planning tasks for multi-UAV systems. However, existing DRL-based methods often suffer from performance degradation when tackling unseen scenarios, since the non-causal factors in visual representations adversely affect policy learning. To address this issue, we propose a novel representation learning approach, i.e.,…

    Submitted 15 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  10. arXiv:2407.04056  [pdf, other]

    cs.RO

    Robust Policy Learning for Multi-UAV Collision Avoidance with Causal Feature Selection

    Authors: Jiafan Zhuang, Gaofei Han, Zihao Xia, Boxi Wang, Wenji Li, Dongliang Wang, Zhifeng Hao, Ruichu Cai, Zhun Fan

    Abstract: In unseen and complex outdoor environments, collision avoidance navigation for unmanned aerial vehicle (UAV) swarms presents a challenging problem. It requires UAVs to navigate through various obstacles and complex backgrounds. Existing collision avoidance navigation methods based on deep reinforcement learning show promising performance but suffer from poor generalization abilities, resulting in…

    Submitted 15 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  11. arXiv:2407.03116  [pdf, other]

    quant-ph

    Hardware-efficient variational quantum algorithm in trapped-ion quantum computer

    Authors: J. -Z. Zhuang, Y. -K. Wu, L. -M. Duan

    Abstract: We study a hardware-efficient variational quantum algorithm ansatz tailored for the trapped-ion quantum simulator, HEA-TI. We leverage programmable single-qubit rotations and global spin-spin interactions among all ions, reducing the dependence on resource-intensive two-qubit gates in conventional gate-based methods. We apply HEA-TI to state engineering of cluster states and analyze the scaling of…

    Submitted 3 July, 2024; originally announced July 2024.

  12. arXiv:2407.01599  [pdf, other]

    cs.CL cs.CR cs.CV cs.LG

    JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models

    Authors: Haibo Jin, Leyang Hu, Xinuo Li, Peiyan Zhang, Chonghan Chen, Jun Zhuang, Haohan Wang

    Abstract: The rapid evolution of artificial intelligence (AI) through developments in Large Language Models (LLMs) and Vision-Language Models (VLMs) has brought significant advancements across various technological domains. While these models enhance capabilities in natural language processing and visual interactive tasks, their growing adoption raises critical concerns regarding security and ethical alignm…

    Submitted 24 July, 2024; v1 submitted 25 June, 2024; originally announced July 2024.

    Comments: 45 pages

  13. arXiv:2406.19844  [pdf, other]

    cs.CV cs.RO

    StreamMOTP: Streaming and Unified Framework for Joint 3D Multi-Object Tracking and Trajectory Prediction

    Authors: Jiaheng Zhuang, Guoan Wang, Siyu Zhang, Xiyang Wang, Hangning Zhou, Ziyao Xu, Chi Zhang, Zhiheng Li

    Abstract: 3D multi-object tracking and trajectory prediction are two crucial modules in autonomous driving systems. Generally, the two tasks are handled separately in traditional paradigms and a few methods have started to explore modeling these two tasks in a joint manner recently. However, these approaches suffer from the limitations of single-frame training and inconsistent coordinate representations bet…

    Submitted 28 June, 2024; originally announced June 2024.

  14. arXiv:2406.15054  [pdf]

    physics.chem-ph physics.bio-ph

    Dynamic Response of Ionic Current in Conical Nanopores

    Authors: Zhe Liu, Long Ma, Hongwen Zhang, Jiakun Zhuang, Jia Man, Zuzanna S. Siwy, Yinghua Qiu

    Abstract: Ionic current rectification (ICR) of charged conical nanopores has various applications in fields including nanofluidics, bio-sensing, and energy conversion, whose function is closely related to the dynamic response of nanopores. The occurrence of ICR originates from the ion enrichment and depletion in conical pores, whose formation is found to be affected by the scanning rate of voltages. Here, t…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 30 pages, 5 figures

    Journal ref: ACS Appl. Mater. Interfaces 2024, 16 (23), 30496-30505

  15. arXiv:2406.09053  [pdf, ps, other]

    eess.SP

    Joint Channel Estimation and Prediction for Massive MIMO with Frequency Hopping Sounding

    Authors: Yiming Zhu, Jiawei Zhuang, Gangle Sun, Hongwei Hou, Li You, Wenjin Wang

    Abstract: In massive multiple-input multiple-output (MIMO) systems, the downlink transmission performance heavily relies on accurate channel state information (CSI). Constrained by the transmitted power, user equipment always transmits sounding reference signals (SRSs) to the base station through frequency hopping, which will be leveraged to estimate uplink CSI and subsequently predict downlink CSI. This pa…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:2406.08012  [pdf, other]

    astro-ph.HE

    Interaction of an outflow with surrounding gaseous clouds as the origin of the late-time radio flares in TDEs

    Authors: Jialun Zhuang, Rong-Feng Shen, Guobin Mou, Wenbin Lu

    Abstract: A close encounter between a star and a supermassive black hole (SMBH) results in the tidal disruption of the star, known as a tidal disruption event (TDE). Recently, a few TDEs, e.g., ASASSN-15oi and AT2018hyz, have shown late-time (hundreds of days after their UV/optical peaks) radio flares with radio luminosities of $10^{38\sim39}$ erg/s. The super-Eddington fallback or accretion in a TDE may gene…

    Submitted 26 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 13 pages, 13 figures. Submitted to ApJ. A new version with some modifications. Comments are welcome

  17. arXiv:2406.06959  [pdf, other]

    cs.LG cs.AI

    Unleashing the Denoising Capability of Diffusion Prior for Solving Inverse Problems

    Authors: Jiawei Zhang, Jiaxin Zhuang, Cheng Jin, Gen Li, Yuantao Gu

    Abstract: The recent emergence of diffusion models has significantly advanced the precision of learnable priors, presenting innovative avenues for addressing inverse problems. Since inverse problems inherently entail maximum a posteriori estimation, previous works have endeavored to integrate diffusion priors into the optimization frameworks. However, prevailing optimization-based inverse algorithms primari…

    Submitted 11 June, 2024; originally announced June 2024.

  18. arXiv:2406.05392  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas

    Authors: Chengyuan Deng, Yiqun Duan, Xin Jin, Heng Chang, Yijun Tian, Han Liu, Henry Peng Zou, Yiqiao Jin, Yijia Xiao, Yichen Wang, Shenghao Wu, Zongxing Xie, Kuofeng Gao, Sihong He, Jun Zhuang, Lu Cheng, Haohan Wang

    Abstract: Large Language Models (LLMs) have achieved unparalleled success across diverse language modeling tasks in recent years. However, this progress has also intensified ethical concerns, impacting the deployment of LLMs in everyday contexts. This paper provides a comprehensive survey of ethical challenges associated with LLMs, from longstanding issues such as copyright infringement, systematic bias, an…

    Submitted 8 June, 2024; originally announced June 2024.

  19. arXiv:2406.03368  [pdf, other]

    cs.CL cs.AI

    IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

    Authors: David Ifeoluwa Adelani, Jessica Ojo, Israel Abebe Azime, Jian Yun Zhuang, Jesujoba O. Alabi, Xuanli He, Millicent Ochieng, Sara Hooker, Andiswa Bukula, En-Shiun Annie Lee, Chiamaka Chukwuneke, Happy Buzaaba, Blessing Sibanda, Godson Kalipe, Jonathan Mukiibi, Salomon Kabongo, Foutse Yuehgoh, Mmasibidi Setaka, Lolwethu Ndolela, Nkiruka Odu, Rooweither Mabuya, Shamsuddeen Hassan Muhammad, Salomey Osei, Sokhar Samb, Tadesse Kebede Guge, et al. (1 additional author not shown)

    Abstract: Despite the widespread adoption of large language models (LLMs), their remarkable capabilities remain limited to a few high-resource languages. Additionally, many low-resource languages (e.g., African languages) are often evaluated only on basic text classification tasks due to the lack of appropriate or comprehensive benchmarks outside of high-resource languages. In this paper, we introduce IrokoB…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Under review

  20. arXiv:2406.03097  [pdf, other]

    cs.LG cs.AI

    Enhancing the Resilience of Graph Neural Networks to Topological Perturbations in Sparse Graphs

    Authors: Shuqi He, Jun Zhuang, Ding Wang, Luyao Peng, Jun Song

    Abstract: Graph neural networks (GNNs) have been extensively employed in node classification. Nevertheless, recent studies indicate that GNNs are vulnerable to topological perturbations, such as adversarial attacks and edge disruptions. Considerable efforts have been devoted to mitigating these challenges. For example, pioneering Bayesian methodologies, including GraphSS and LlnDT, incorporate Bayesian labe…

    Submitted 5 June, 2024; originally announced June 2024.

  21. arXiv:2406.01264  [pdf, other]

    cs.CV

    FreeTumor: Advance Tumor Segmentation via Large-Scale Tumor Synthesis

    Authors: Linshan Wu, Jiaxin Zhuang, Xuefeng Ni, Hao Chen

    Abstract: AI-driven tumor analysis has garnered increasing attention in healthcare. However, its progress is significantly hindered by the lack of annotated tumor cases, which requires radiologists to invest a lot of effort in collection and annotation. In this paper, we introduce a highly practical solution for robust tumor synthesis and segmentation, termed FreeTumor, which refers to annotation-free synth…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Preprint

  22. arXiv:2405.19590  [pdf, other]

    cs.LG

    Weights Augmentation: it has never ever ever ever let her model down

    Authors: Junbin Zhuang, Guiguang Din, Yunyi Yan

    Abstract: Weights play an essential role in deep learning network models. Unlike network structure design, this article proposes the concept of weight augmentation, focusing on weight exploration. The core of Weight Augmentation Strategy (WAS) is to adopt random transformed weight coefficients training and transformed coefficients, named Shadow Weight (SW), for networks that can be used to calculate loss func…

    Submitted 29 May, 2024; originally announced May 2024.

  23. arXiv:2405.11830  [pdf, other]

    cond-mat.mtrl-sci

    Fe2+ partitioning in Al-free pyrolite: consequences for seismic velocities and heterogeneities

    Authors: Jingyi Zhuang, Renata Wentzcovitch

    Abstract: Iron partitioning among the main lower mantle phases, bridgmanite (Bm) and ferropericlase (Fp), has non-monotonic behavior owing to the high-spin to low-spin crossover in ferrous iron (Fe2+) in Fp. Results of previous studies of the iron partitioning coefficient between these phases, $K_D$, still have considerable uncertainty. Here, we investigate the Fe2+ partitioning behavior using well-document…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 18 pages, 5 figures

  24. arXiv:2405.01606  [pdf, other]

    quant-ph cs.LG

    Improving Trainability of Variational Quantum Circuits via Regularization Strategies

    Authors: Jun Zhuang, Jack Cunningham, Chaowen Guan

    Abstract: In the era of noisy intermediate-scale quantum (NISQ), variational quantum circuits (VQCs) have been widely applied in various domains, advancing the superiority of quantum circuits against classic models. Similar to classic models, regular VQCs can be optimized by various gradient-based methods. However, the optimization may be initially trapped in barren plateaus or eventually entangled in saddl…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: preprint, under review. TL;DR: we propose a regularization strategy to improve the trainability of VQCs

  25. arXiv:2404.16374  [pdf]

    physics.geo-ph

    Revisiting Seismicity Criticality: A New Framework for Bias Correction of Statistical Seismology Model Calibrations

    Authors: Jiawei Li, Didier Sornette, Zhongliang Wu, Jiancang Zhuang, Changsheng Jiang

    Abstract: The Epidemic-Type Aftershock Sequences (ETAS) model and its variants effectively capture the space-time clustering of seismicity, setting the standard for earthquake forecasting. Accurate unbiased ETAS calibration is thus crucial. But we identify three sources of bias, (i) boundary effects, (ii) finite-size effects, and (iii) censorship, which are often overlooked or misinterpreted, causing errors…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 36 pages, 7 figures, 5 tables

  26. arXiv:2404.15760  [pdf, other]

    cs.LG cs.AI stat.ML

    Debiasing Machine Unlearning with Counterfactual Examples

    Authors: Ziheng Chen, Jia Wang, Jun Zhuang, Abbavaram Gowtham Reddy, Fabrizio Silvestri, Jin Huang, Kaushiki Nag, Kun Kuang, Xin Ning, Gabriele Tolomei

    Abstract: The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1…

    Submitted 24 April, 2024; originally announced April 2024.

  27. arXiv:2404.15580  [pdf, other]

    cs.CV

    MiM: Mask in Mask Self-Supervised Pre-Training for 3D Medical Image Analysis

    Authors: Jiaxin Zhuang, Linshan Wu, Qiong Wang, Varut Vardhanabhuti, Lin Luo, Hao Chen

    Abstract: The Vision Transformer (ViT) has demonstrated remarkable performance in Self-Supervised Learning (SSL) for 3D medical image analysis. Mask AutoEncoder (MAE) for feature pre-training can further unleash the potential of ViT on various medical vision tasks. However, due to large spatial sizes with much higher dimensions of 3D medical images, the lack of hierarchical design for MAE may hinder the per…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: submitted to journal

  28. arXiv:2404.08852  [pdf, other]

    math.NA math.CV

    Complex variable solution on over-/under-break shallow tunnelling in gravitational geomaterial with reasonable far-field displacement

    Authors: Luo-bin Lin, Fu-quan Chen, Jin-ping Zhuang

    Abstract: Over-/under-break excavation is a common phenomenon in shallow tunnelling, which is nonetheless not generally considered in existing complex variable solutions. In this paper, a new equilibrium mechanical model on over-/under-break shallow tunnelling in gravitational geomaterial is established by fixing far-field ground surface to form a corresponding mixed boundary problem. With integration of a…

    Submitted 12 April, 2024; originally announced April 2024.

  29. arXiv:2404.06039  [pdf, other]

    cs.HC

    Breathing New Life into Existing Visualizations: A Natural Language-Driven Manipulation Framework

    Authors: Can Liu, Jiacheng Yu, Yuhan Guo, Jiayi Zhuang, Yuchu Luo, Xiaoru Yuan

    Abstract: We propose an approach to manipulate existing interactive visualizations to answer users' natural language queries. We analyze the natural language tasks and propose a design space of a hierarchical task structure, which allows for a systematic decomposition of complex queries. We introduce a four-level visualization manipulation space to facilitate in-situ manipulations for visualizations, enabli…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 21 pages

  30. Multi-Level Label Correction by Distilling Proximate Patterns for Semi-supervised Semantic Segmentation

    Authors: Hui Xiao, Yuting Hong, Li Dong, Diqun Yan, Jiayan Zhuang, Junjie Xiong, Dongtai Liang, Chengbin Peng

    Abstract: Semi-supervised semantic segmentation relieves the reliance on large-scale labeled data by leveraging unlabeled data. Recent semi-supervised semantic segmentation approaches mainly resort to pseudo-labeling methods to exploit unlabeled data. However, unreliable pseudo-labeling can undermine the semi-supervision processes. In this paper, we propose an algorithm called Multi-Level Label Correction (…

    Submitted 9 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 8 figures. IEEE Transactions on Multimedia, 2024

  31. arXiv:2403.01976  [pdf, other]

    cs.CL

    SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis

    Authors: Hengxing Cai, Xiaochen Cai, Junhan Chang, Sihang Li, Lin Yao, Changxin Wang, Zhifeng Gao, Hongshuai Wang, Yongge Li, Mujie Lin, Shuwen Yang, Jiankun Wang, Mingjun Xu, Jin Huang, Fang Xi, Jiaxi Zhuang, Yuqi Yin, Yaqi Li, Changhong Chen, Zheng Cheng, Zifeng Zhao, Linfeng Zhang, Guolin Ke

    Abstract: Recent breakthroughs in Large Language Models (LLMs) have revolutionized natural language understanding and generation, sparking significant interest in applying them to scientific literature analysis. However, existing benchmarks fail to adequately evaluate the proficiency of LLMs in this domain, particularly in scenarios requiring higher-level abilities beyond mere memorization and the handling…

    Submitted 18 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  32. arXiv:2402.17300  [pdf, other]

    eess.IV

    VoCo: A Simple-yet-Effective Volume Contrastive Learning Framework for 3D Medical Image Analysis

    Authors: Linshan Wu, Jiaxin Zhuang, Hao Chen

    Abstract: Self-Supervised Learning (SSL) has demonstrated promising results in 3D medical image analysis. However, the lack of high-level semantics in pre-training still heavily hinders the performance of downstream tasks. We observe that 3D medical images contain relatively consistent contextual position information, i.e., consistent geometric relations between different organs, which leads to a potential…

    Submitted 17 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by CVPR 2024. The camera-ready version will soon be available

  33. arXiv:2402.10409  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning

    Authors: Jun Zhuang, Casey Kennington

    Abstract: As new research on Large Language Models (LLMs) continues, it is difficult to keep up with new research and models. To help researchers synthesize the new research many have written survey papers, but even those have become numerous. In this paper, we develop a method to automatically assign survey papers to a taxonomy. We collect the metadata of 144 LLM survey papers and explore three paradigms t…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: TL;DR: We collected metadata about LLM surveys and developed a method for categorizing them into a taxonomy, indicating the superiority of graph representation learning over language models and revealing the efficacy of fine-tuning using weak labels

  34. Probing the interaction energy of two $^{85}$Rb atoms in an optical tweezer via spin-motion coupling

    Authors: Jun Zhuang, Kun-Peng Wang, Peng-Xiang Wang, Ming-Rui Wei, Bahtiyar Mamat, Cheng Sheng, Peng Xu, Min Liu, Jin Wang, Xiao-Dong He, Ming-Sheng Zhan

    Abstract: The inherent polarization gradients in tight optical tweezers can be used to couple the atomic spins to the two-body motion under the action of a microwave spin-flip transition, so that such a spin-motion coupling offers an important control knob on the motional states of optically trapped two colliding atoms. Here, after preparing two elastically scattering $^{85}$Rb atoms in the three-dimensiona…

    Submitted 2 July, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: 8 pages, 5 figures

    Journal ref: Phys. Rev. A 109, 043320 (2024)

  35. Influences of Divalent Ions in Natural Seawater/River Water on Nanofluidic Osmotic Energy Generation

    Authors: Fenhong Song, Xuan An, Long Ma, Jiakun Zhuang, Yinghua Qiu

    Abstract: Besides the dominant NaCl, natural seawater/river water contains trace multivalent ions, which can provide effective screening to surface charges. Here, in both negatively and positively charged nanopores, influences from divalent ions as counterions and coions have been investigated on the performance of osmotic energy conversion (OEC) under natural salt gradients. As counterions, trace Ca2+ ions…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 24 pages, 5 figures

    Journal ref: Langmuir 2022, 38 (42), 12935-12943

  36. arXiv:2402.04920  [pdf]

    physics.chem-ph

    Characterization of the Surface Charge Property and Porosity of Track-etched Polymer Membranes

    Authors: Jiakun Zhuang, Long Ma, Yinghua Qiu

    Abstract: As an important property of porous membranes, the surface charge property determines many ionic behaviors of nanopores, such as ionic conductance and selectivity. Based on the dependence of electric double layers on bulk concentrations, ionic conductance through nanopores at high and low concentrations is governed by the bulk conductance and surface charge density, respectively. Here, through the…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 25 pages, 4 figures

    Journal ref: Electrophoresis 2022, 43 (23-24), 2428-2435

  37. arXiv:2401.14828  [pdf, other]

    cs.CV

    TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts

    Authors: Jingyu Zhuang, Di Kang, Yan-Pei Cao, Guanbin Li, Liang Lin, Ying Shan

    Abstract: Text-driven 3D scene editing has gained significant attention owing to its convenience and user-friendliness. However, existing methods still lack accurate control of the specified appearance and location of the editing result due to the inherent limitations of the text description. To this end, we propose a 3D scene editing framework, TIPEditor, that accepts both text and image prompts and a 3D b…

    Submitted 25 April, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted by SIGGRAPH 2024 & ACM Transactions on Graphics

  38. arXiv:2401.11261  [pdf, other]

    cs.LG cs.CV

    Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient

    Authors: Weiguo Lu, Xuan Wu, Deng Ding, Jinqiao Duan, Jirong Zhuang, Gangnan Yuan

    Abstract: Diffusion models (DMs) are a type of generative model that has a huge impact on image synthesis and beyond. They achieve state-of-the-art generation results in various generative tasks. A great diversity of conditioning inputs, such as text or bounding boxes, are accessible to control the generation. In this work, we propose a conditioning mechanism utilizing Gaussian mixture models (GMMs) as feat…

    Submitted 1 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  39. SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration

    Authors: Jinming Zhuang, Zhuoping Yang, Shixin Ji, Heng Huang, Alex K. Jones, Jingtong Hu, Yiyu Shi, Peipei Zhou

    Abstract: With the increase in the computation intensity of the chip, the mismatch between computation layer shapes and the available computation resource significantly limits the utilization of the chip. Driven by this observation, prior works discuss spatial accelerators or dataflow architecture to maximize the throughput. However, using spatial accelerators could potentially increase the execution latenc…

    Submitted 18 February, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Journal ref: 2024 ACM/SIGDA International Symposium on Field Programmable Gate Arrays (FPGA '24)

  40. arXiv:2401.00695  [pdf, other]

    cs.CV

    Credible Teacher for Semi-Supervised Object Detection in Open Scene

    Authors: Jingyu Zhuang, Kuo Wang, Liang Lin, Guanbin Li

    Abstract: Semi-Supervised Object Detection (SSOD) has achieved resounding success by leveraging unlabeled data to improve detection performance. However, in Open Scene Semi-Supervised Object Detection (O-SSOD), unlabeled data may contain unknown objects not observed in the labeled data, which will increase uncertainty in the model's predictions for known objects. It is detrimental to the current methods th…

    Submitted 2 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  41. arXiv:2312.10903  [pdf, other]

    cs.LG cs.AI

    Robust Node Representation Learning via Graph Variational Diffusion Networks

    Authors: Jun Zhuang, Mohammad Al Hasan

    Abstract: Node representation learning by using Graph Neural Networks (GNNs) has been widely explored. However, in recent years, compelling evidence has revealed that GNN-based node representation learning can be substantially deteriorated by delicately-crafted perturbations in a graph structure. To learn robust node representation in the presence of perturbations, various works have been proposed to safegu…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: preprint, under review

  42. arXiv:2312.03594  [pdf, other

    cs.CV

    A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting

    Authors: Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen

    Abstract: Advancing image inpainting is challenging as it requires filling user-specified regions for various intents, such as background filling and object synthesis. Existing approaches focus on either context-aware filling or object synthesis using text descriptions. However, achieving both tasks simultaneously is challenging due to differing training strategies. To overcome this challenge, we introduce…

    Submitted 23 July, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Project page with code: https://powerpaint.github.io/

  43. arXiv:2312.02991  [pdf, other

    cs.AR

    REFRESH FPGAs: Sustainable FPGA Chiplet Architectures

    Authors: Peipei Zhou, Jinming Zhuang, Stephen Cahoon, Yue Tang, Zhuoping Yang, Xingzhen Chen, Yiyu Shi, Jingtong Hu, Alex K. Jones

    Abstract: There is a growing call for greater amounts of increasingly agile computational power for edge and cloud infrastructure to serve the computationally complex needs of ubiquitous computing devices. Thus, an important challenge is addressing the holistic environmental impacts of these next-generation computing systems. To accomplish this, a life-cycle view of sustainability for computing advancements…

    Submitted 27 November, 2023; originally announced December 2023.

  44. arXiv:2312.02597  [pdf, other

    physics.atom-ph

    Mitigating noise of residual electric fields for single Rydberg atoms with electron photodesorption

    Authors: Bahtiyar Mamat, Cheng Sheng, Xiaodong He, Jiayi Hou, Peng Xu, Kunpeng Wang, Jun Zhuang, Mingrui Wei, Min Liu, Jin Wang, Mingsheng Zhan

    Abstract: Rydberg atoms as versatile tools for quantum applications are extremely sensitive to electric fields. When utilizing these atoms, it becomes imperative to comprehensively characterize and mitigate any residual electric fields present in the environment. Particularly for single Rydberg atoms trapped in optical tweezers in a compact quartz vacuum cell, we have identified that a significant source of…

    Submitted 26 February, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

  45. arXiv:2311.18420  [pdf, other

    cs.CV

    TeG-DG: Textually Guided Domain Generalization for Face Anti-Spoofing

    Authors: Lianrui Mu, Jianhong Bai, Xiaoxuan He, Jiangnan Ye, Xiaoyu Liang, Yuchen Yang, Jiedong Zhuang, Haoji Hu

    Abstract: Enhancing the domain generalization performance of Face Anti-Spoofing (FAS) techniques has emerged as a research focus. Existing methods are dedicated to extracting domain-invariant features from various training domains. Despite the promising performance, the extracted features inevitably contain residual style feature bias (e.g., illumination, capture device), resulting in inferior generalizatio…

    Submitted 30 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  46. arXiv:2311.16417  [pdf, other

    cs.AR

    Challenges and Opportunities to Enable Large-Scale Computing via Heterogeneous Chiplets

    Authors: Zhuoping Yang, Shixin Ji, Xingzhen Chen, Jinming Zhuang, Weifeng Zhang, Dharmesh Jani, Peipei Zhou

    Abstract: Fast-evolving artificial intelligence (AI) algorithms such as large language models have been driving the ever-increasing computing demands in today's data centers. Heterogeneous computing with domain-specific architectures (DSAs) brings many opportunities when scaling up and scaling out the computing system. In particular, heterogeneous chiplet architecture is favored to keep scaling up and scali…

    Submitted 4 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

  47. arXiv:2311.07211  [pdf, other

    q-fin.CP

    A Gaussian Process Based Method with Deep Kernel Learning for Pricing High-dimensional American Options

    Authors: Jirong Zhuang, Deng Ding, Weiguo Lu, Xuan Wu, Gangnan Yuan

    Abstract: In this work, we present a novel machine learning approach for pricing high-dimensional American options based on the modified Gaussian process regression (GPR). We incorporate deep kernel learning and sparse variational Gaussian processes to address the challenges traditionally associated with GPR. These challenges include its diminished reliability in high-dimensional scenarios and the excessive…

    Submitted 18 April, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 21 pages, 8 figures

  48. arXiv:2311.02893  [pdf

    cond-mat.mtrl-sci

    Topological electronic structure and spin texture of quasi-one-dimensional higher-order topological insulator Bi4Br4

    Authors: W. X. Zhao, M. Yang, R. Z. Xu, X. Du, Y. D. Li, K. Y. Zhai, C. Peng, D. Pei, H. Gao, Y. W. Li, L. X. Xu, J. F. Han, Y. Huang, Z. K. Liu, Y. G. Yao, J. C. Zhuang, Y. Du, J. J. Zhou, Y. L. Chen, L. X. Yang

    Abstract: The notion of topological insulators (TIs), characterized by an insulating bulk and conducting topological surface states, can be extended to higher-order topological insulators (HOTIs) hosting gapless modes localized at the boundaries of two or more dimensions lower than the insulating bulk [1-5]. In this work, by performing high-resolution angle-resolved photoemission spectroscopy (ARPES) measureme…

    Submitted 6 November, 2023; originally announced November 2023.

  49. arXiv:2310.01159  [pdf, other

    eess.IV cs.CV cs.LG

    Iterative Semi-Supervised Learning for Abdominal Organs and Tumor Segmentation

    Authors: Jiaxin Zhuang, Luyang Luo, Zhixuan Chen, Linshan Wu

    Abstract: Deep-learning (DL) based methods are playing an important role in the task of abdominal organs and tumors segmentation in CT scans. However, the large requirements of annotated datasets heavily limit its development. The FLARE23 challenge provides a large-scale dataset with both partially and fully annotated data, which also focuses on both segmentation accuracy and computational efficiency. In th…

    Submitted 2 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: text overlap with arXiv:2309.05405

  50. arXiv:2309.16137  [pdf, other

    cs.CV

    Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval

    Authors: Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gang Xiong, Yue Hu, Qi Wu

    Abstract: Different from Composed Image Retrieval task that requires expensive labels for training task-specific models, Zero-Shot Composed Image Retrieval (ZS-CIR) involves diverse tasks with a broad range of visual content manipulation intent that could be related to domain, scene, object, and attribute. The key challenge for ZS-CIR tasks is to learn a more accurate image representation that has adaptive…

    Submitted 15 December, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

    Journal ref: AAAI 2024