Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 51–100 of 4,184 results for author: Huang, J

.
  1. arXiv:2406.04337  [pdf, other

    cs.CV cs.AI

    Coherent Zero-Shot Visual Instruction Generation

    Authors: Quynh Phung, Songwei Ge, Jia-Bin Huang

    Abstract: Despite the advances in text-to-image synthesis, particularly with diffusion models, generating visual instructions that require consistent representation and smooth state transitions of objects across sequential steps remains a formidable challenge. This paper introduces a simple, training-free framework to tackle the issues, capitalizing on the advancements in diffusion models and large language… ▽ More

    Submitted 8 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: https://instruct-vis-zero.github.io/

  2. arXiv:2406.04160  [pdf, other

    astro-ph.EP astro-ph.SR

    Disk Evolution Study Through Imaging of Nearby Young Stars (DESTINYS): PDS 111, an old T Tauri star with a young-looking disk

    Authors: Annelotte Derkink, Christian Ginski, Paola Pinilla, Nicolas Kurtovic, Lex Kaper, Alex de Koter, Per-Gunnar Valegård, Eric Mamajek, Frank Backs, Myriam Benisty, Til Birnstiel, Gabriele Columba, Carsten Dominik, Antonio Garufi, Michiel Hogerheijde, Rob van Holstein, Jane Huang, François Ménard, Christian Rab, María Claudia Ramírez-Tannus, Álvaro Ribas, Jonathan P. Williams, Alice Zurlo

    Abstract: The interplay between T Tauri stars and their circumstellar disks, and how this impacts the onset of planet formation has yet to be established. We studied a seemingly old T Tauri star, PDS 111, and its disk. We analyzed optical, infrared, and sub-millimeter observations obtained with VLT/X-shooter, Mercator/HERMES, TESS, VLT/SPHERE, and ALMA, providing a new view on PDS 111 and its protoplanetary… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 23 pages, 23 figures, accepted by A&A (abstract shortened)

  3. arXiv:2406.03711  [pdf, other

    physics.flu-dyn cs.AI

    Pi-fusion: Physics-informed diffusion model for learning fluid dynamics

    Authors: Jing Qiu, Jiancheng Huang, Xiangdong Zhang, Zeng Lin, Minglei Pan, Zengding Liu, Fen Miao

    Abstract: Physics-informed deep learning has been developed as a novel paradigm for learning physical dynamics recently. While general physics-informed deep learning methods have shown early promise in learning fluid dynamics, they are difficult to generalize in arbitrary time instants in real-world scenario, where the fluid motion can be considered as a time-variant trajectory involved large-scale particle… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  4. arXiv:2406.03683  [pdf, other

    cs.LG stat.ML

    Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models

    Authors: Ding Huang, Ting Li, Jian Huang

    Abstract: We propose a Bayesian framework for fine-tuning large diffusion models with a novel network structure called Bayesian Power Steering (BPS). We clarify the meaning behind adaptation from a \textit{large probability space} to a \textit{small probability space} and explore the task of fine-tuning pre-trained models using learnable modules from a Bayesian perspective. BPS extracts task-specific knowle… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 25 pages, 26 figures, and 4 tables

    MSC Class: 62G05; 68T07

  5. arXiv:2406.02987  [pdf, other

    cs.CV

    Enhancing Multimodal Large Language Models with Multi-instance Visual Prompt Generator for Visual Representation Enrichment

    Authors: Wenliang Zhong, Wenyi Wu, Qi Li, Rob Barton, Boxin Du, Shioulin Sam, Karim Bouyarmane, Ismail Tutar, Junzhou Huang

    Abstract: Multimodal Large Language Models (MLLMs) have achieved SOTA performance in various visual language tasks by fusing the visual representations with LLMs leveraging some visual adapters. In this paper, we first establish that adapters using query-based Transformers such as Q-former is a simplified Multi-instance Learning method without considering instance heterogeneity/correlation. We then propose… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  6. arXiv:2406.02885  [pdf, other

    cs.RO

    Homotopic Path Set Planning for Robot Manipulation and Navigation

    Authors: Jing Huang, Yunxi Tang, Kwok Wai Samuel Au

    Abstract: This paper addresses path set planning that yields important applications in robot manipulation and navigation such as path generation for deformable object keypoints and swarms. A path set refers to the collection of finite agent paths to represent the overall spatial path of a group of keypoints or a swarm, whose collective properties meet spatial and topological constraints. As opposed to plann… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages, 19 figures, 2 tables, conference

  7. arXiv:2406.02546  [pdf, other

    hep-ph astro-ph.CO

    Dark photon limits from patchy dark screening of the cosmic microwave background

    Authors: Fiona McCarthy, Dalila Pirvu, J. Colin Hill, Junwu Huang, Matthew C. Johnson, Keir K. Rogers

    Abstract: Dark photons that kinetically mix with the Standard Model photon give rise to new spectral anisotropies (patchy dark screening) in the cosmic microwave background (CMB) due to conversion of photons to dark photons within large-scale structure. We utilize predictions for this patchy dark screening signal to provide the tightest constraints to date on the dark photon kinetic mixing parameter (… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 7+12 pages, 3+13 figures. Data products available at https://users.flatironinstitute.org/~fmccarthy/dark_photon_screening_maps/ V2 only has minor changes to these comments

  8. arXiv:2406.02252  [pdf, other

    cs.DC

    Exploring the Efficiency of Renewable Energy-based Modular Data Centers at Scale

    Authors: Jinghan Sun, Zibo Gong, Anup Agarwal, Shadi Noghabi, Ranveer Chandra, Marc Snir, Jian Huang

    Abstract: Modular data centers (MDCs) that can be placed right at the energy farms and powered mostly by renewable energy, are proven to be a flexible and effective approach to lowering the carbon footprint of data centers. However, the main challenge of using renewable energy is the high variability of power produced, which implies large volatility in powering computing resources at MDCs, and degraded appl… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  9. arXiv:2406.01894  [pdf, other

    cs.CV

    SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks

    Authors: Yi Pan, Jun-Jie Huang, Zihan Chen, Wentao Zhao, Ziyue Wang

    Abstract: Robust and imperceptible adversarial video attack is challenging due to the spatial and temporal characteristics of videos. The existing video adversarial attack methods mainly take a gradient-based approach and generate adversarial videos with noticeable perturbations. In this paper, we propose a novel Sparse Adversarial Video Attack via Spatio-Temporal Invertible Neural Networks (SVASTIN) to gen… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  10. arXiv:2406.01791  [pdf, other

    cs.CV

    Hybrid-Learning Video Moment Retrieval across Multi-Domain Labels

    Authors: Weitong Cai, Jiabo Huang, Shaogang Gong

    Abstract: Video moment retrieval (VMR) is to search for a visual temporal moment in an untrimmed raw video by a given text query description (sentence). Existing studies either start from collecting exhaustive frame-wise annotations on the temporal boundary of target moments (fully-supervised), or learn with only the video-level video-text pairing labels (weakly-supervised). The former is poor in generalisa… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by BMVC2022

  11. arXiv:2406.01602  [pdf, other

    physics.data-an hep-ex nucl-ex

    Effectiveness of denoising diffusion probabilistic models for fast and high-fidelity whole-event simulation in high-energy heavy-ion experiments

    Authors: Yeonju Go, Dmitrii Torbunov, Timothy Rinn, Yi Huang, Haiwang Yu, Brett Viren, Meifeng Lin, Yihui Ren, Jin Huang

    Abstract: Artificial intelligence (AI) generative models, such as generative adversarial networks (GANs), variational auto-encoders, and normalizing flows, have been widely used and studied as efficient alternatives for traditional scientific simulations. However, they have several drawbacks, including training instability and inability to cover the entire data distribution, especially for regions where dat… ▽ More

    Submitted 23 May, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures

  12. arXiv:2406.01502  [pdf

    math.NA physics.soc-ph

    Spatiotemporal evolution of PM2.5 diffusion in Cheng-Yu urban agglomeration in response to COVID-19 lockdown using complex network

    Authors: Jiaxian Huang, Yi Huang, Yong Zhang, Jiao Zhang

    Abstract: As the decrease in human activities resulting from the COVID-19 control measures had a significant impact on air quality, the epidemic provided an opportunity to investigate the extent to which air pollution is influenced by human activities and review existing measures. However, the corresponding diffusion pattern on a city scale is seldom mentioned at present stage, therefore, we chose the Cheng… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  13. arXiv:2406.01159  [pdf, other

    cs.CV

    Dimba: Transformer-Mamba Diffusion Models

    Authors: Zhengcong Fei, Mingyuan Fan, Changqian Yu, Debang Li, Youqiang Zhang, Junshi Huang

    Abstract: This paper unveils Dimba, a new text-to-image diffusion model that employs a distinctive hybrid architecture combining Transformer and Mamba elements. Specifically, Dimba sequentially stacked blocks alternate between Transformer and Mamba layers, and integrate conditional information through the cross-attention layer, thus capitalizing on the advantages of both architectural paradigms. We investig… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2406.00993  [pdf

    eess.SP cs.HC q-bio.OT

    Detection of Acetone as a Gas Biomarker for Diabetes Based on Gas Sensor Technology

    Authors: Jiaming Wei, Tong Liu, Jipeng Huang, Xiaowei Li, Yurui Qi, Gangyin Luo

    Abstract: With the continuous development and improvement of medical services, there is a growing demand for improving diabetes diagnosis. Exhaled breath analysis, characterized by its speed, convenience, and non-invasive nature, is leading the trend in diagnostic development. Studies have shown that the acetone levels in the breath of diabetes patients are higher than normal, making acetone a basis for dia… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 9 pages, 14 figures

  16. arXiv:2406.00857  [pdf, other

    astro-ph.IM

    Modeling the refractive index profile n(z) of polar ice for ultra-high energy neutrino experiments

    Authors: S. Ali, P. Allison, S. Archambault, J. J. Beatty, D. Z. Besson, A. Bishop, P. Chen, Y. C. Chen, B. A. Clark, W. Clay, A. Connolly, K. Couberly, L. Cremonesi, A. Cummings, P. Dasgupta, R. Debolt, S. de Kockere, K. D. de Vries, C. Deaconu, M. A. DuVernois, J. Flaherty, E. Friedman, R. Gaior, P. Giri, J. Hanson , et al. (45 additional authors not shown)

    Abstract: We develop an in-situ index of refraction profile using the transit time of radio signals broadcast from an englacial transmitter to 2-5 km distant radio-frequency receivers, deployed at depths up to 200 m. Maxwell's equations generally admit two ray propagation solutions from a given transmitter, corresponding to a direct path (D) and a refracted path (R); the measured D vs. R (dt(D,R)) timing di… ▽ More

    Submitted 11 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  17. arXiv:2406.00320  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

    Authors: Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, Ruiqi Li, Zhou Zhao

    Abstract: Video-to-audio (V2A) generation aims to synthesize content-matching audio from silent video, and it remains challenging to build V2A models with high generation quality, efficiency, and visual-audio temporal synchrony. We propose Frieren, a V2A model based on rectified flow matching. Frieren regresses the conditional transport vector field from noise to spectrogram latent with straight paths and c… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  18. arXiv:2405.21050  [pdf, other

    cs.CV cs.LG

    Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models

    Authors: Xinxi Zhang, Song Wen, Ligong Han, Felix Juefei-Xu, Akash Srivastava, Junzhou Huang, Hao Wang, Molei Tao, Dimitris N. Metaxas

    Abstract: Adapting large-scale pre-trained generative models in a parameter-efficient manner is gaining traction. Traditional methods like low rank adaptation achieve parameter efficiency by imposing constraints but may not be optimal for tasks requiring high representation capacity. We propose a novel spectrum-aware adaptation framework for generative models. Our method adjusts both singular values and the… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  19. arXiv:2405.20871  [pdf, other

    hep-ph hep-ex

    A Nearest-neighbor Expansion of Lepton Flavor Mixing in Powers of the $μ$-$τ$ Permutation Symmetry Breaking Effect

    Authors: Jihong Huang

    Abstract: We point out that the observed pattern of lepton flavor mixing can be well described by a proper nearest-neighbor expansion of a constant $3\times 3$ unitary matrix in powers of a small parameter characterizing the fine effect of $μ$-$τ$ permutation symmetry breaking. We take an example of this kind for illustration, and provide complete discussions on the usefulness in the study of leptonic CP vi… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 16 pages, 3 figures, 2 tables

  20. arXiv:2405.20618  [pdf, other

    math.NA cs.CG

    CPAFT: A Consistent Parallel Advancing Front Technique for Unstructured Triangular/Tetrahedral Mesh Generation

    Authors: Chengdi Ma, Jizu Huang, Hao Luo, Chao Yang

    Abstract: Compared with the remarkable progress made in parallel numerical solvers of partial differential equations,the development of algorithms for generating unstructured triangular/tetrahedral meshes has been relatively sluggish. In this paper, we propose a novel, consistent parallel advancing front technique (CPAFT) by combining the advancing front technique, the domain decomposition method based on s… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    MSC Class: 65M50; 65M55; 68W10

  21. arXiv:2405.20588  [pdf, other

    cs.CL

    DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models

    Authors: Taolin Zhang, Qizhou Chen, Dongyang Li, Chengyu Wang, Xiaofeng He, Longtao Huang, Hui Xue, Jun Huang

    Abstract: Recently, while large language models (LLMs) have demonstrated impressive results, they still suffer from hallucination, i.e., the generation of false information. Model editing is the task of fixing factual mistakes in LLMs; yet, most previous works treat it as a one-time task, paying little attention to ever-emerging mistakes generated by LLMs. We address the task of sequential model editing (SM… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: ACL2024 findings

  22. arXiv:2405.20380  [pdf, other

    cs.AI cs.CR cs.CV

    Gradient Inversion of Federated Diffusion Models

    Authors: Jiyue Huang, Chi Hong, Lydia Y. Chen, Stefanie Roos

    Abstract: Diffusion models are becoming defector generative models, which generate exceptionally high-resolution image data. Training effective diffusion models require massive real data, which is privately owned by distributed parties. Each data party can collaboratively train diffusion models in a federated learning manner by sharing gradients instead of the raw data. In this paper, we study the privacy l… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  23. arXiv:2405.20334  [pdf, other

    cs.CV cs.GR

    VividDream: Generating 3D Scene with Ambient Dynamics

    Authors: Yao-Chih Lee, Yi-Ting Chen, Andrew Wang, Ting-Hsuan Liao, Brandon Y. Feng, Jia-Bin Huang

    Abstract: We introduce VividDream, a method for generating explorable 4D scenes with ambient dynamics from a single input image or text prompt. VividDream first expands an input image into a static 3D point cloud through iterative inpainting and geometry merging. An ensemble of animated videos is then generated using video diffusion models with quality refinement techniques and conditioned on renderings of… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Project page: https://vivid-dream-4d.github.io

  24. arXiv:2405.19879  [pdf, other

    physics.ins-det hep-ex

    Refractive index in the JUNO liquid scintillator

    Authors: H. S. Zhang, M. Beretta, S. Cialdi, C. X. Yang, J. H. Huang, F. Ferraro, G. F. Cao, G. Reina, Z. Y. Deng, E. Suerra, S. Altilia, V. Antonelli, D. Basilico, A. Brigatti, B. Caccianiga, M. G. Giammarchi, C. Landini, P. Lombardi, L. Miramonti, E. Percalli, G. Ranucci, A. C. Re, P. Saggese, M. D. C. Torri, S. Aiello , et al. (51 additional authors not shown)

    Abstract: In the field of rare event physics, it is common to have huge masses of organic liquid scintillator as detection medium. In particular, they are widely used to study neutrino properties or astrophysical neutrinos. Thanks to its safety properties (such as low toxicity and high flash point) and easy scalability, linear alkyl benzene is the most common solvent used to produce liquid scintillators for… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 6 pages, 9 figures

  25. arXiv:2405.19665  [pdf

    eess.SY cs.AI cs.LG

    A novel fault localization with data refinement for hydroelectric units

    Authors: Jialong Huang, Junlin Song, Penglong Lian, Mengjie Gan, Zhiheng Su, Benhao Wang, Wenji Zhu, Xiaomin Pu, Jianxiao Zou, Shicai Fan

    Abstract: Due to the scarcity of fault samples and the complexity of non-linear and non-smooth characteristics data in hydroelectric units, most of the traditional hydroelectric unit fault localization methods are difficult to carry out accurate localization. To address these problems, a sparse autoencoder (SAE)-generative adversarial network (GAN)-wavelet noise reduction (WNR)- manifold-boosted deep learni… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6pages,4 figures,Conference on Decision and Control(CDC) conference

  26. arXiv:2405.19642  [pdf

    cs.AI

    Few-shot fault diagnosis based on multi-scale graph convolution filtering for industry

    Authors: Mengjie Gan, Penglong Lian, Zhiheng Su, Jiyang Zhang, Jialong Huang, Benhao Wang, Jianxiao Zou, Shicai Fan

    Abstract: Industrial equipment fault diagnosis often encounter challenges such as the scarcity of fault data, complex operating conditions, and varied types of failures. Signal analysis, data statistical learning, and conventional deep learning techniques face constraints under these conditions due to their substantial data requirements and the necessity for transfer learning to accommodate new failure mode… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 2 figures, 2 tables, 63rd IEEE Conference on Decision and Control

  27. arXiv:2405.19574  [pdf, other

    astro-ph.EP astro-ph.SR

    On Kinematic Measurements of Self-Gravity in Protoplanetary Disks

    Authors: Sean M. Andrews, Richard Teague, Christopher P. Wirth, Jane Huang, Zhaohuan Zhu

    Abstract: Using controlled injection and recovery experiments, we devised an analysis prescription to assess the quality of dynamical measurements of protoplanetary disk gas masses based on resolved (CO) spectral line data, given observational limitations (resolution, sampling, noise), measurement bias, and ambiguities in the geometry and physical conditions. With sufficient data quality, this approach perf… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ApJ, in press; 32 pages, 29 figures

  28. arXiv:2405.19465  [pdf, other

    cs.CV

    RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

    Authors: Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li

    Abstract: Text-Video Retrieval (TVR) aims to align relevant video content with natural language queries. To date, most state-of-the-art TVR methods learn image-to-video transfer learning based on large-scale pre-trained visionlanguage models (e.g., CLIP). However, fully fine-tuning these pre-trained models for TVR incurs prohibitively expensive computation costs. To this end, we propose to conduct efficient… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 Findings

  29. arXiv:2405.18991  [pdf, other

    cs.CV cs.CL cs.MM

    EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture

    Authors: Jiaqi Xu, Xinyi Zou, Kunzhe Huang, Yunkuo Chen, Bo Liu, MengLi Cheng, Xing Shi, Jun Huang

    Abstract: This paper presents EasyAnimate, an advanced method for video generation that leverages the power of transformer architecture for high-performance outcomes. We have expanded the DiT framework originally designed for 2D image synthesis to accommodate the complexities of 3D video generation by incorporating a motion module block. It is used to capture temporal dynamics, thereby ensuring the producti… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 6 pages, 5 figures

  30. arXiv:2405.18440  [pdf, other

    physics.app-ph

    Spatiotemporal Diffusion Metamaterials: Theories and Applications

    Authors: Jinrong Liu, Liujun Xu, Jiping Huang

    Abstract: Diffusion metamaterials with artificial spatial structures have significant potential in controlling energy and mass transfer. Those static structures may lead to functionality and tunability constraints, impeding the application scope of diffusion metamaterials. Dynamic structures, adding the temporal dimension, have recently provided a new possibility for electric charge and heat diffusion regul… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures. A perspective accepted by APL in press

  31. arXiv:2405.18284  [pdf, other

    stat.ML cs.LG

    Adaptive debiased SGD in high-dimensional GLMs with streaming data

    Authors: Ruijian Han, Lan Luo, Yuanhang Luo, Yuanyuan Lin, Jian Huang

    Abstract: Online statistical inference facilitates real-time analysis of sequentially collected data, making it different from traditional methods that rely on static datasets. This paper introduces a novel approach to online inference in high-dimensional generalized linear models, where we update regression coefficient estimates and their standard errors upon each new data arrival. In contrast to existing… ▽ More

    Submitted 1 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 37 pages, 4 figures

  32. arXiv:2405.18258  [pdf, other

    cs.CV cs.AI cs.CL

    Text-only Synthesis for Image Captioning

    Authors: Qing Zhou, Junlin Huang, Qiang Li, Junyu Gao, Qi Wang

    Abstract: From paired image-text training to text-only training for image captioning, the pursuit of relaxing the requirements for high-cost and large-scale annotation of good quality data remains consistent. In this paper, we propose Text-only Synthesis for Image Captioning (ToCa), which further advances this relaxation with fewer human labor and less computing time. Specifically, we deconstruct caption te… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  33. arXiv:2405.17792  [pdf, other

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  34. arXiv:2405.17659  [pdf, other

    eess.IV cs.CV

    Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has sh… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  35. arXiv:2405.17532  [pdf, other

    cs.CV

    ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance

    Authors: Jiannan Huang, Jun Hao Liew, Hanshu Yan, Yuyang Yin, Yao Zhao, Yunchao Wei

    Abstract: Recent text-to-image customization works have been proven successful in generating images of given concepts by fine-tuning the diffusion models on a few examples. However, these methods tend to overfit the concepts, resulting in failure to create the concept under multiple conditions (e.g. headphone is missing when generating a <sks> dog wearing a headphone'). Interestingly, we notice that the bas… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  36. arXiv:2405.17509  [pdf, other

    cs.LG

    Reference Neural Operators: Learning the Smooth Dependence of Solutions of PDEs on Geometric Deformations

    Authors: Ze Cheng, Zhongkai Hao, Xiaoqiang Wang, Jianing Huang, Youjia Wu, Xudan Liu, Yiru Zhao, Songming Liu, Hang Su

    Abstract: For partial differential equations on domains of arbitrary shapes, existing works of neural operators attempt to learn a mapping from geometries to solutions. It often requires a large dataset of geometry-solution pairs in order to obtain a sufficiently accurate neural operator. However, for many industrial applications, e.g., engineering design optimization, it can be prohibitive to satisfy the r… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  37. arXiv:2405.16464  [pdf, other

    cs.RO cs.CV

    Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 Challenge

    Authors: Tianchen Deng, Yi Zhou, Wenhua Wu, Mingrui Li, Jingwei Huang, Shuhong Liu, Yanzeng Song, Hao Zuo, Yanbo Wang, Yutao Yue, Hesheng Wang, Weidong Chen

    Abstract: This technical report presents the 1st winning model for UG2+, a task in CVPR 2024 UAV Tracking and Pose-Estimation Challenge. This challenge faces difficulties in drone detection, UAV-type classification and 2D/3D trajectory estimation in extreme weather conditions with multi-modal sensor information, including stereo vision, various Lidars, Radars, and audio arrays. Leveraging this information… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024 workshop. The 1st winning model in CVPR 2024 UG2+ challenge. The code and configuration of our method are available at https://github.com/dtc111111/Multi-Modal-UAV

  38. arXiv:2405.15967  [pdf, other

    astro-ph.SR physics.plasm-ph physics.space-ph

    3-Minute Oscillations in the Upper Corona: Evidence from Parker Solar Probe

    Authors: Zesen Huang, Marco Velli, Chen Shi, Yingjie Zhu, B. D. G. Chandran, Victor Réville, Trevor Bowen, Nikos Sioulas, Marc Pulupa, Jia Huang, Sheng Huang

    Abstract: Recent observations of Parker Solar Probe (PSP) from around the Alfvén surface have shown that the trace magnetic power spectrum density (PSD) is often characterized by a shallow-inertial double power law, where in the low frequency energy injection range, the power spectrum is shallow (flatter than $1/f$), and in the inertial range the spectrum is steep, with a scaling index of [1.5, 1.67]. Conse… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  39. arXiv:2405.15954  [pdf, other

    nucl-ex hep-ex

    Searches for new physics below twice the electron mass with GERDA

    Authors: GERDA Collaboration, M. Agostini, A. Alexander, G. R. Araujo, A. M. Bakalyarov, M. Balata, I. Barabanov, L. Baudis, C. Bauer, S. Belogurov, A. Bettini, L. Bezrukov, V. Biancacci, E. Bossio, V. Bothe, R. Brugnera, A. Caldwell, S. Calgaro, C. Cattadori, A. Chernogorov, P. -J. Chiu, T. Comellato, V. D'Andrea, E. V. Demidova, N. Di Marco , et al. (86 additional authors not shown)

    Abstract: A search for full energy depositions from bosonic keV-scale dark matter candidates of masses between 65 keV and 1021 keV has been performed with data collected during Phase II of the GERmanium Detector Array (GERDA) experiment. Our analysis includes direct dark matter absorption as well as dark Compton scattering. With a total exposure of 105.5 kg yr, no evidence for a signal above the background… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 20 pages, 12 figures, 7 tables

  40. arXiv:2405.15895  [pdf, other

    cs.LG

    Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

    Authors: Pranshu Malviya, Jerry Huang, Quentin Fournier, Sarath Chandar

    Abstract: The optimal model for a given task is often challenging to determine, requiring training multiple models from scratch which becomes prohibitive as dataset and model sizes grow. A more efficient alternative is to reuse smaller pre-trained models by expanding them, however, this is not widely adopted as how this impacts training dynamics remains poorly understood. While prior works have introduced s… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  41. arXiv:2405.15844  [pdf, other

    astro-ph.SR physics.space-ph

    Near subsonic solar wind outflow from an active region

    Authors: Tamar Ervin, Stuart D. Bale, Samuel T. Badman, Trevor A. Bowen, Pete Riley, Kristoff Paulson, Yeimy J. Rivera, Orlando Romeo, Nikos Sioulas, Davin E. Larson, Jaye L. Verniero, Ryan M. Dewey, Jia Huang

    Abstract: During Parker Solar Probe (Parker) Encounter 15 (E15), we observe an 18-hour period of near subsonic ($\mathrm{M_S \sim}$ 1) and sub-Alfvénic (SA), $\mathrm{M_A}$ <<< 1, slow speed solar wind from 22 to 15.6 R$_\odot$. As the most extreme SA interval measured to date and skirting the solar wind sonic point, it is the deepest Parker has probed into the formation and acceleration region of the solar… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 14 figures

  42. arXiv:2405.15491  [pdf, other

    cs.CV

    GSDeformer: Direct Cage-based Deformation for 3D Gaussian Splatting

    Authors: Jiajun Huang, Hongchuan Yu

    Abstract: We present GSDeformer, a method that achieves free-form deformation on 3D Gaussian Splatting(3DGS) without requiring any architectural changes. Our method extends cage-based deformation, a traditional mesh deformation method, to 3DGS. This is done by converting 3DGS into a novel proxy point cloud representation, where its deformation can be used to infer the transformations to apply on the 3D gaus… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: For project page, see https://jhuangbu.github.io/gsdeformer

  43. arXiv:2405.15484  [pdf, other

    astro-ph.HE

    Spin, inclination, and magnetic field evolution of magnetar population in vacuum and plasma-filled magnetospheres

    Authors: Jun-Xiang Huang, Hou-Jun Lü, Jared Rice, En-Wei Liang

    Abstract: Magnetars are potential energy sources or central engines for numerous transient phenomena in the Universe. How newborn magnetars evolve in different environments remains an open question. Based on both observed and candidate magnetars, it is found that the periods of all magnetars or candidates appear as a bimodal distribution, and are defined as the ``long-P'' and ``short-P'' magnetar subclasses… ▽ More

    Submitted 18 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 18 pages, 2 Tables, and 9 Figures. PRD in press, and matched with the published verison

  44. arXiv:2405.15347  [pdf, ps, other

    math.AP

    Normalized ground states for the mass supercritical Schrödinger-Bopp-Podolsky system: existence, limit behavior, strong instability

    Authors: Juan Huang, Sheng Wang

    Abstract: This paper concerns the normalized ground states for the nonlinear Schrödinger equation in the Bopp-Podolsky electrodynamics. This equation has a nonlocal nonlinearity and a mass supercritical power nonlinearity, both of which have deep impact on the geometry of the corresponding functional, and thus on the existence, limit behavior and stability of the normalized ground states. In the present stu… ▽ More

    Submitted 30 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  45. arXiv:2405.14961  [pdf, other

    cs.CV cs.LG

    SFDDM: Single-fold Distillation for Diffusion models

    Authors: Chi Hong, Jiyue Huang, Robert Birke, Dick Epema, Stefanie Roos, Lydia Y. Chen

    Abstract: While diffusion models effectively generate remarkable synthetic images, a key limitation is the inference inefficiency, requiring numerous sampling steps. To accelerate inference and maintain high-quality synthesis, teacher-student distillation is applied to compress the diffusion models in a progressive and binary manner by retraining, e.g., reducing the 1024-step model to a 128-step model in 3… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  46. arXiv:2405.14572  [pdf, other

    math.NA

    Multicontinuum Homogenization for Coupled Flow and Transport Equations

    Authors: Dmitry Ammosov, W. T. Leung, Buzheng Shan, Jian Huang

    Abstract: In this paper, we present the derivation of a multicontinuum model for the coupled flow and transport equations by applying multicontinuum homogenization. We perform the multicontinuum expansion for both flow and transport solutions and formulate novel coupled constraint cell problems to capture the multiscale property, where oversampled regions are utilized to avoid boundary effects. Assuming the… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  47. arXiv:2405.14303  [pdf, other

    cs.LG

    Similarity-Navigated Conformal Prediction for Graph Neural Networks

    Authors: Jianqing Song, Jianguo Huang, Wenyu Jiang, Baoming Zhang, Shuangjie Li, Chongjun Wang

    Abstract: Graph Neural Networks have achieved remarkable accuracy in semi-supervised node classification tasks. However, these results lack reliable uncertainty estimates. Conformal prediction methods provide a theoretical guarantee for node classification tasks, ensuring that the conformal prediction set contains the ground-truth label with a desired probability (e.g., 95%). In this paper, we empirically s… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  48. arXiv:2405.14135  [pdf, other

    cs.LG cs.AI

    Learning Geospatial Region Embedding with Heterogeneous Graph

    Authors: Xingchen Zou, Jiani Huang, Xixuan Hao, Yuhao Yang, Haomin Wen, Yibo Yan, Chao Huang, Yuxuan Liang

    Abstract: Learning effective geospatial embeddings is crucial for a series of geospatial applications such as city analytics and earth monitoring. However, learning comprehensive region representations presents two significant challenges: first, the deficiency of effective intra-region feature representation; and second, the difficulty of learning from intricate inter-region dependencies. In this paper, we… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  49. arXiv:2405.13448  [pdf, other

    cs.CL

    Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning

    Authors: Yuanhao Yue, Chengyu Wang, Jun Huang, Peng Wang

    Abstract: The process of instruction tuning aligns pre-trained large language models (LLMs) with open-domain instructions and human-preferred responses. While several studies have explored autonomous approaches to distilling and annotating instructions from more powerful proprietary LLMs, such as ChatGPT, they often neglect the impact of task distributions and the varying difficulty of instructions of the t… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  50. arXiv:2405.13372  [pdf, other

    cs.LG

    Ada-HGNN: Adaptive Sampling for Scalable Hypergraph Neural Networks

    Authors: Shuai Wang, David W. Zhang, Jia-Hong Huang, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

    Abstract: Hypergraphs serve as an effective model for depicting complex connections in various real-world scenarios, from social to biological networks. The development of Hypergraph Neural Networks (HGNNs) has emerged as a valuable method to manage the intricate associations in data, though scalability is a notable challenge due to memory limitations. In this study, we introduce a new adaptive sampling str… ▽ More

    Submitted 14 June, 2024; v1 submitted 22 May, 2024; originally announced May 2024.