
Showing 1–50 of 652 results for author: Zhao, K

  1. arXiv:2409.01555  [pdf, other]

    cs.CV cs.AI

    EA-RAS: Towards Efficient and Accurate End-to-End Reconstruction of Anatomical Skeleton

    Authors: Zhiheng Peng, Kai Zhao, Xiaoran Chen, Li Ma, Siyu Xia, Changjie Fan, Weijian Shang, Wei Jing

    Abstract: Efficient, accurate and low-cost estimation of human skeletal information is crucial for a range of applications such as biology education and human-computer interaction. However, current simple skeleton models, which are typically based on 2D-3D joint points, fall short in terms of anatomical fidelity, restricting their utility in fields. On the other hand, more complex models while anatomically…

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 13 pages, 15 figures

  2. arXiv:2409.00997  [pdf, other]

    cs.CL

    DataSculpt: Crafting Data Landscapes for LLM Post-Training through Multi-objective Partitioning

    Authors: Keer Lu, Zheng Liang, Xiaonan Nie, Da Pan, Shusen Zhang, Keshi Zhao, Weipeng Chen, Zenan Zhou, Guosheng Dong, Wentao Zhang, Bin Cui

    Abstract: The effectiveness of long-context modeling is important for Large Language Models (LLMs) in various applications. Despite their potential, LLMs' efficacy in processing long context does not consistently meet expectations, posing significant challenges for efficient management of prolonged sequences in training. This difficulty is compounded by the scarcity of comprehensive and diverse training dat…

    Submitted 2 September, 2024; originally announced September 2024.

  3. arXiv:2408.14267  [pdf, other]

    cs.LG cs.CV

    1-Bit FQT: Pushing the Limit of Fully Quantized Training to 1-bit

    Authors: Chang Gao, Jianfei Chen, Kang Zhao, Jiaqi Wang, Liping Jing

    Abstract: Fully quantized training (FQT) accelerates the training of deep neural networks by quantizing the activations, weights, and gradients into lower precision. To explore the ultimate limit of FQT (the lowest achievable precision), we make a first attempt to 1-bit FQT. We provide a theoretical analysis of FQT based on Adam and SGD, revealing that the gradient variance influences the convergence of FQT…

    Submitted 26 August, 2024; originally announced August 2024.

  4. arXiv:2408.12133  [pdf, other]

    cs.AI cs.LG

    Self-supervised Learning for Geospatial AI: A Survey

    Authors: Yile Chen, Weiming Huang, Kaiqi Zhao, Yue Jiang, Gao Cong

    Abstract: The proliferation of geospatial data in urban and territorial environments has significantly facilitated the development of geospatial artificial intelligence (GeoAI) across various urban applications. Given the vast yet inherently sparse labeled nature of geospatial data, there is a critical need for techniques that can effectively leverage such data without heavy reliance on labeled datasets. Th…

    Submitted 22 August, 2024; originally announced August 2024.

  5. arXiv:2408.11971  [pdf, other]

    cs.DC

    HoSZp: An Efficient Homomorphic Error-bounded Lossy Compressor for Scientific Data

    Authors: Tripti Agarwal, Sheng Di, Jiajun Huang, Yafan Huang, Ganesh Gopalakrishnan, Robert Underwood, Kai Zhao, Xin Liang, Guanpeng Li, Franck Cappello

    Abstract: Error-bounded lossy compression has been a critical technique to significantly reduce the sheer amounts of simulation datasets for high-performance computing (HPC) scientific applications while effectively controlling the data distortion based on user-specified error bound. In many real-world use cases, users must perform computational operations on the compressed data (a.k.a. homomorphic compress…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 12 pages, 7 figures, 9 tables

  6. arXiv:2408.09347  [pdf, other]

    cs.CV

    S^3D-NeRF: Single-Shot Speech-Driven Neural Radiance Field for High Fidelity Talking Head Synthesis

    Authors: Dongze Li, Kang Zhao, Wei Wang, Yifeng Ma, Bo Peng, Yingya Zhang, Jing Dong

    Abstract: Talking head synthesis is a practical technique with wide applications. Current Neural Radiance Field (NeRF) based approaches have shown their superiority on driving one-shot talking heads with videos or signals regressed from audio. However, most of them failed to take the audio as driven information directly, unable to enjoy the flexibility and availability of speech. Since mapping audio signals…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: ECCV 2024

  7. arXiv:2408.06840  [pdf, other]

    cs.CV

    Dynamic and Compressive Adaptation of Transformers From Images to Videos

    Authors: Guozhen Zhang, Jingyu Liu, Shengming Cao, Xiaotong Zhao, Kevin Zhao, Kai Ma, Limin Wang

    Abstract: Recently, the remarkable success of pre-trained Vision Transformers (ViTs) from image-text matching has sparked an interest in image-to-video adaptation. However, most current approaches retain the full forward pass for each frame, leading to a high computation overhead for processing entire videos. In this paper, we present InTI, a novel approach for compressive image-to-video adaptation using dy…

    Submitted 13 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  8. arXiv:2408.00323  [pdf, other]

    eess.SY

    A Novel Edge Laplacian-based Approach for Adaptive Formation Control of Uncertain Multi-agent Systems with Unified Relative Error Performance

    Authors: Kun Li, Kai Zhao, Yongduan Song, Lihua Xie

    Abstract: For most existing prescribed performance formation control methods, performance requirements are not directly imposed on the relative states between agents but on the consensus error, which lacks a clear physical interpretation of their solution. In this paper, we propose a novel adaptive prescribed performance formation control strategy, capable of guaranteeing prescribed performance on the relat…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 9 pages, 3 figures, submitted to IEEE

  9. arXiv:2407.19349  [pdf]

    q-bio.QM cs.AI

    Predicting T-Cell Receptor Specificity

    Authors: Tengyao Tu, Wei Zeng, Kun Zhao, Zhenyu Zhang

    Abstract: Researching the specificity of TCR contributes to the development of immunotherapy and provides new opportunities and strategies for personalized cancer immunotherapy. Therefore, we established a TCR generative specificity detection framework consisting of an antigen selector and a TCR classifier based on the Random Forest algorithm, aiming to efficiently screen out TCRs and target antigens and ac…

    Submitted 27 July, 2024; originally announced July 2024.

  10. arXiv:2407.19219  [pdf, other]

    astro-ph.SR astro-ph.GA

    Primeval very low-mass stars and brown dwarfs -- VIII. The first age benchmark L subdwarf, a wide companion to a halo white dwarf

    Authors: Z. H. Zhang, R. Raddi, A. J. Burgasser, S. L. Casewell, R. L. Smart, M. C. Galvez-Ortiz, H. R. A. Jones, S. Baig, N. Lodieu, B. Gauza, Ya. V. Pavlenko, Y. F. Jiao, Z. K. Zhao, S. Y. Zhou, D. J. Pinfield

    Abstract: We report the discovery of five white dwarf + ultracool dwarf systems identified as common proper motion wide binaries in the Gaia Catalogue of Nearby Stars. The discoveries include a white dwarf + L subdwarf binary, VVV 1256-62AB, a gravitationally bound system located 75.6(+1.9/-1.8) pc away with a projected separation of 1375(+35/-33) au. The primary is a cool DC white dwarf with a hydrogen dom…

    Submitted 17 August, 2024; v1 submitted 27 July, 2024; originally announced July 2024.

    Comments: 15 pages, 12 figures

  11. arXiv:2407.17285  [pdf, ps, other]

    math.OC

    Second-Order Necessary Conditions, Constraint Qualifications and Exact Penalty for Mathematical Programs with Switching Constraints

    Authors: Jiawei Chen, Luyu Liu, Yibing Lv, Kequan Zhao

    Abstract: In this paper, we investigate second-order necessary conditions and exact penalty of mathematical programs with switching constraints (MPSC). Some new second-order constraint qualifications and second-order quasi-normality are introduced for (MPSC), which are crucial to establish the second-order necessary conditions and the error bound of (MPSC). We explore the relations among these constraint qu…

    Submitted 26 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 25 pages, 1 figure

    MSC Class: 90C30; 90C33; 90C46

  12. arXiv:2407.14326  [pdf, ps, other]

    cs.CV cs.AI

    Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model

    Authors: Kun Zhao, Jakub Prokop, Javier Montalt Tordera, Sadegh Mohammadi

    Abstract: Mammography is crucial for breast cancer surveillance and early diagnosis. However, analyzing mammography images is a demanding task for radiologists, who often review hundreds of mammograms daily, leading to overdiagnosis and overtreatment. Computer-Aided Diagnosis (CAD) systems have been developed to assist in this process, but their capabilities, particularly in lesion segmentation, remained li…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures. Submitted to Deep Generative Models workshop @ MICCAI 2024

  13. arXiv:2407.12490  [pdf]

    cond-mat.mtrl-sci

    An extended Rice model for intergranular fracture

    Authors: Kai Zhao, Yu Ding, Haiyang Yu, Jianying He, Zhiliang Zhang

    Abstract: The plastic events occurring during the process of intergranular fracture in metals are still not well understood due to the complexity of grain boundary (GB) structures and their interactions with crack-tip dislocation plasticity. By considering the local GB structural transformation after dislocation emission from a GB in the Peierls-type Rice-Beltz model, herein we established a semi-analytical…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 55 pages, 21 figures

  14. arXiv:2407.09250  [pdf]

    cs.NI cs.LG

    FedsLLM: Federated Split Learning for Large Language Models over Communication Networks

    Authors: Kai Zhao, Zhaohui Yang, Chongwen Huang, Xiaoming Chen, Zhaoyang Zhang

    Abstract: Addressing the challenges of deploying large language models in wireless communication networks, this paper combines low-rank adaptation technology (LoRA) with the splitfed learning framework to propose the federated split learning for large language models (FedsLLM) framework. The method introduced in this paper utilizes LoRA technology to reduce processing loads by dividing the network into clie…

    Submitted 12 July, 2024; originally announced July 2024.

  15. arXiv:2407.04381  [pdf, other]

    cs.CV cs.AI

    Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection

    Authors: Zhiqiang Yang, Qiu Guan, Keer Zhao, Jianmin Yang, Xinli Xu, Haixia Long, Ying Tang

    Abstract: Due to the effective performance of multi-scale feature fusion, Path Aggregation FPN (PAFPN) is widely employed in YOLO detectors. However, it cannot efficiently and adaptively integrate high-level semantic information with low-level spatial information simultaneously. We propose a new model named MAF-YOLO in this paper, which is a novel object detection framework with a versatile neck named Multi…

    Submitted 5 July, 2024; originally announced July 2024.

  16. arXiv:2407.04267  [pdf, other]

    cs.DC

    A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization

    Authors: Daoce Wang, Pascal Grosset, Jesus Pulido, Tushar M. Athawale, Jiannan Tian, Kai Zhao, Zarija Lukić, Axel Huebl, Zhe Wang, James Ahrens, Dingwen Tao

    Abstract: Multi-resolution methods such as Adaptive Mesh Refinement (AMR) can enhance storage efficiency for HPC applications generating vast volumes of data. However, their applicability is limited and cannot be universally deployed across all applications. Furthermore, integrating lossy compression with multi-resolution techniques to further boost storage efficiency encounters significant barriers. To thi…

    Submitted 25 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: camera-ready version for SC '24

  17. arXiv:2407.01146  [pdf, other]

    eess.IV cs.CV

    Cross-Slice Attention and Evidential Critical Loss for Uncertainty-Aware Prostate Cancer Detection

    Authors: Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Kaifeng Pang, Demetri Terzopoulos, Kyunghyun Sung

    Abstract: Current deep learning-based models typically analyze medical images in either 2D or 3D albeit disregarding volumetric information or suffering sub-optimal performance due to the anisotropic resolution of MR data. Furthermore, providing an accurate uncertainty estimation is beneficial to clinicians, as it indicates how confident a model is about its prediction. We propose a novel 2.5D cross-slice a…

    Submitted 1 July, 2024; originally announced July 2024.

  18. arXiv:2406.20038  [pdf, other]

    cs.CL

    BioMNER: A Dataset for Biomedical Method Entity Recognition

    Authors: Chen Tang, Bohao Yang, Kun Zhao, Bo Lv, Chenghao Xiao, Frank Guerin, Chenghua Lin

    Abstract: Named entity recognition (NER) stands as a fundamental and pivotal task within the realm of Natural Language Processing. Particularly within the domain of Biomedical Method NER, this task presents notable challenges, stemming from the continual influx of domain-specific terminologies in scholarly literature. Current research in Biomedical Method (BioMethod) NER suffers from a scarcity of resources…

    Submitted 28 June, 2024; originally announced June 2024.

  19. arXiv:2406.17962  [pdf, other]

    cs.CL

    Crafting Customisable Characters with LLMs: Introducing SimsChat, a Persona-Driven Role-Playing Agent Framework

    Authors: Bohao Yang, Dong Liu, Chen Tang, Chenghao Xiao, Kun Zhao, Chao Li, Lin Yuan, Guang Yang, Lanxiao Huang, Chenghua Lin

    Abstract: Large Language Models (LLMs) demonstrate a remarkable ability to comprehend human instructions and generate high-quality text. This capability allows LLMs to function as agents that can emulate human beings at a more sophisticated level, beyond the mere replication of basic human behaviours. However, there is a lack of exploring into leveraging LLMs to craft characters from diverse aspects. In thi…

    Submitted 16 August, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  20. arXiv:2406.17911  [pdf, other]

    cs.CL

    X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

    Authors: Kun Zhao, Chenghao Xiao, Chen Tang, Bohao Yang, Kai Ye, Noura Al Moubayed, Liang Zhan, Chenghua Lin

    Abstract: Radiology Report Generation (RRG) has achieved significant progress with the advancements of multimodal generative models. However, the evaluation in the domain suffers from a lack of fair and robust metrics. We reveal that, high performance on RRG with existing lexical-based metrics (e.g. BLEU) might be more of a mirage - a model can get a high BLEU only by learning the template of reports. This…

    Submitted 30 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  21. arXiv:2406.17873  [pdf, other]

    cs.CL cs.AI

    Improving Arithmetic Reasoning Ability of Large Language Models through Relation Tuples, Verification and Dynamic Feedback

    Authors: Zhongtao Miao, Kaiyan Zhao, Yoshimasa Tsuruoka

    Abstract: Current representations used in reasoning steps of large language models can mostly be categorized into two main types: (1) natural language, which is difficult to verify; and (2) non-natural language, usually programming code, which is difficult for people who are unfamiliar with coding to read. In this paper, we propose to use a semi-structured form to represent reasoning steps of large language…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Under review, 25 figures, 8 tables, 29 pages

  22. arXiv:2406.15758  [pdf, other]

    cs.LG cs.DC

    EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive Layer Tuning and Voting

    Authors: Zhongzhi Yu, Zheng Wang, Yuhan Li, Haoran You, Ruijie Gao, Xiaoya Zhou, Sreenidhi Reedy Bommu, Yang Katie Zhao, Yingyan Celine Lin

    Abstract: Efficient adaption of large language models (LLMs) on edge devices is essential for applications requiring continuous and privacy-preserving adaptation and inference. However, existing tuning techniques fall short because of the high computation and memory overheads. To this end, we introduce a computation- and memory-efficient LLM tuning framework, called Edge-LLM, to facilitate affordable and ef…

    Submitted 22 June, 2024; originally announced June 2024.

  23. arXiv:2406.10311  [pdf, other]

    cs.CL cs.AI

    CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language Models

    Authors: Wenjing Zhang, Xuejiao Lei, Zhaoxiang Liu, Meijuan An, Bikun Yang, KaiKai Zhao, Kai Wang, Shiguo Lian

    Abstract: With the profound development of large language models (LLMs), their safety concerns have garnered increasing attention. However, there is a scarcity of Chinese safety benchmarks for LLMs, and the existing safety taxonomies are inadequate, lacking comprehensive safety detection capabilities in authentic Chinese scenarios. In this work, we introduce CHiSafetyBench, a dedicated safety benchmark for e…

    Submitted 1 September, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 16 pages, 5 figures

  24. arXiv:2406.10307  [pdf, other]

    cs.CL cs.AI

    What is the best model? Application-driven Evaluation for Large Language Models

    Authors: Shiguo Lian, Kaikai Zhao, Xinhui Liu, Xuejiao Lei, Bikun Yang, Wenjing Zhang, Kai Wang, Zhaoxiang Liu

    Abstract: General large language models enhanced with supervised fine-tuning and reinforcement learning from human feedback are increasingly popular in academia and industry as they generalize foundation models to various practical tasks in a prompt manner. To assist users in selecting the best model in practical application scenarios, i.e., choosing the model that meets the application requirements while m…

    Submitted 14 June, 2024; originally announced June 2024.

  25. arXiv:2406.09073  [pdf, other]

    cs.LG

    Are we making progress in unlearning? Findings from the first NeurIPS unlearning competition

    Authors: Eleni Triantafillou, Peter Kairouz, Fabian Pedregosa, Jamie Hayes, Meghdad Kurmanji, Kairan Zhao, Vincent Dumoulin, Julio Jacques Junior, Ioannis Mitliagkas, Jun Wan, Lisheng Sun Hosoya, Sergio Escalera, Gintare Karolina Dziugaite, Peter Triantafillou, Isabelle Guyon

    Abstract: We present the findings of the first NeurIPS competition on unlearning, which sought to stimulate the development of novel algorithms and initiate discussions on formal and robust evaluation methodologies. The competition was highly successful: nearly 1,200 teams from across the world participated, and a wealth of novel, imaginative solutions with different characteristics were contributed. In thi…

    Submitted 13 June, 2024; originally announced June 2024.

  26. arXiv:2406.06600  [pdf, other]

    cs.LG cs.AI cs.CL

    HORAE: A Domain-Agnostic Modeling Language for Automating Multimodal Service Regulation

    Authors: Yutao Sun, Mingshuai Chen, Tiancheng Zhao, Kangjia Zhao, He Li, Jintao Chen, Liqiang Lu, Xinkui Zhao, Shuiguang Deng, Jianwei Yin

    Abstract: Artificial intelligence is rapidly encroaching on the field of service regulation. This work presents the design principles behind HORAE, a unified specification language to model multimodal regulation rules across a diverse set of domains. We show how HORAE facilitates an intelligent service regulation pipeline by further exploiting a fine-tuned large language model named HORAE that automates the…

    Submitted 18 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  27. arXiv:2406.06571  [pdf, other]

    cs.CL cs.AI

    SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM

    Authors: Quandong Wang, Yuxuan Yuan, Xiaoyu Yang, Ruike Zhang, Kang Zhao, Wei Liu, Jian Luan, Daniel Povey, Bin Wang

    Abstract: While Large Language Models (LLMs) have achieved remarkable success in various fields, the efficiency of training and inference remains a major challenge. To address this issue, we propose SUBLLM, short for Subsampling-Upsampling-Bypass Large Language Model, an innovative architecture that extends the core decoder-only framework by incorporating subsampling, upsampling, and bypass modules. The sub…

    Submitted 23 August, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures, accepted by ECAI 2024

    ACM Class: I.2.7

  28. arXiv:2406.06388  [pdf, ps, other]

    math.RT

    Simple smooth modules over the Ramond algebra and applications to vertex operator superalgebras

    Authors: Yulu Chen, Ran Shen, Yufeng Yao, Kaiming Zhao

    Abstract: Simple smooth modules over the Virasoro algebra and one of the super-Virasoro algebra named the Neveu-Schwarz algebra were classified. This problem remained unsolved for the other super-Virasoro algebra called the Ramond algebra. In this paper, all simple smooth modules over the Ramond algebra are classified. More precisely, a simple smooth module over the Ramond algebra is either a simple highest…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages

  29. arXiv:2406.06376  [pdf, ps, other]

    math.RA

    Biderivations of Lie algebras

    Authors: Qiufan Chen, Yufeng Yao, Kaiming Zhao

    Abstract: In this paper, we first introduce the concept of symmetric biderivation radicals and characteristic subalgebras of Lie algebras, and study their properties. Based on these results, we precisely determine biderivations of some Lie algebras including finite-dimensional simple Lie algebras over arbitrary fields of characteristic not $2$ or $3$, and the Witt algebras $\mathcal{W}^+_n$ over fields of c…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 14 pages

  30. arXiv:2406.03725  [pdf, other]

    cs.CL

    LLMEmbed: Rethinking Lightweight LLM's Genuine Function in Text Classification

    Authors: Chun Liu, Hongguang Zhang, Kainan Zhao, Xinghai Ju, Lin Yang

    Abstract: With the booming of Large Language Models (LLMs), prompt-learning has become a promising method mainly researched in various research areas. Recently, many attempts based on prompt-learning have been made to improve the performance of text classification. However, most of these methods are based on heuristic Chain-of-Thought (CoT), and tend to be more complex but less efficient. In this paper, we…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: ACL 2024 main conference

  31. arXiv:2406.02249  [pdf, other]

    physics.ins-det nucl-ex

    A novel measurement method for SiPM external crosstalk probability at low temperature

    Authors: Guanda Li, Lei Wang, Xilei Sun, Fang Liu, Cong Guo, Kangkang Zhao, Lei Tian, Zeyuan Yu, Zhilong Hou, Chi Li, Yu Lei, Bin Wang, Rongbin Zhou

    Abstract: Silicon photomultipliers (SiPMs) are being considered as potential replacements for conventional photomultiplier tubes (PMTs). However, a significant disadvantage of SiPMs is crosstalk (CT), wherein photons propagate through other pixels, resulting in secondary avalanches. CT can be categorized into internal crosstalk and external crosstalk based on whether the secondary avalanche occurs within th…

    Submitted 4 June, 2024; originally announced June 2024.

  32. arXiv:2406.01257  [pdf, other]

    cs.LG

    What makes unlearning hard and what to do about it

    Authors: Kairan Zhao, Meghdad Kurmanji, George-Octavian Bărbulescu, Eleni Triantafillou, Peter Triantafillou

    Abstract: Machine unlearning is the problem of removing the effect of a subset of training data (the ''forget set'') from a trained model without damaging the model's utility e.g. to comply with users' requests to delete their data, or remove mislabeled, poisoned or otherwise problematic data. With unlearning research still being at its infancy, many fundamental open questions exist: Are there interpretable…

    Submitted 3 June, 2024; originally announced June 2024.

  33. arXiv:2406.01223  [pdf, other]

    cs.GR

    Report on Methods and Applications for Crafting 3D Humans

    Authors: Lei Liu, Ke Zhao

    Abstract: This paper presents an in-depth exploration of 3D human model and avatar generation technology, propelled by the rapid advancements in large-scale models and artificial intelligence. The paper reviews the comprehensive process of 3D human model generation, from scanning to rendering, and highlights the pivotal role these models play in entertainment, VR, AR, healthcare, and education. We underscor…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.15335

  34. arXiv:2405.19853  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Correlated Electronic Structure and Density-Wave Gap in Trilayer Nickelate La4Ni3O10

    Authors: X. Du, Y. D. Li, Y. T. Cao, C. Y. Pei, M. X. Zhang, W. X. Zhao, K. Y. Zhai, R. Z. Xu, Z. K. Liu, Z. W. Li, J. K. Zhao, G. Li, Y. L. Chen, Y. P. Qi, H. J. Guo, L. X. Yang

    Abstract: The discovery of pressurized superconductivity at 80 K in La3Ni2O7 officially brings nickelates into the family of high-temperature superconductors, which gives rise to not only new insights but also mysteries in the strongly correlated superconductivity. More recently, the sibling compound La4Ni3O10 was also shown to be superconducting below about 25 K under pressure, further boosting the popular…

    Submitted 30 May, 2024; originally announced May 2024.

  35. arXiv:2405.17478  [pdf, other]

    cs.LG stat.ML

    ROSE: Register Assisted General Time Series Forecasting with Decomposed Frequency Learning

    Authors: Yihang Wang, Yuying Qiu, Peng Chen, Kai Zhao, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo

    Abstract: With the increasing collection of time series data from various domains, there arises a strong demand for general time series forecasting models pre-trained on a large number of time-series datasets to support a variety of downstream prediction tasks. Enabling general time series forecasting faces two challenges: how to obtain unified representations from multi-domain time series data, and how to…

    Submitted 24 May, 2024; originally announced May 2024.

  36. arXiv:2405.16456  [pdf, other]

    cs.LG cs.AI

    Dominant Shuffle: A Simple Yet Powerful Data Augmentation for Time-series Prediction

    Authors: Kai Zhao, Zuojie He, Alex Hung, Dan Zeng

    Abstract: Recent studies have suggested frequency-domain Data augmentation (DA) is effective for time series prediction. Existing frequency-domain augmentations disturb the original data with various full-spectrum noises, leading to excess domain gap between augmented and original data. Although impressive performance has been achieved in certain cases, frequency-domain DA has yet to be generalized to time…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: https://kaizhao.net/time-series

  37. arXiv:2405.15924  [pdf, other]

    cs.CL

    SLIDE: A Framework Integrating Small and Large Language Models for Open-Domain Dialogues Evaluation

    Authors: Kun Zhao, Bohao Yang, Chen Tang, Chenghua Lin, Liang Zhan

    Abstract: The long-standing one-to-many problem of gold standard responses in open-domain dialogue systems presents challenges for automatic evaluation metrics. Though prior works have demonstrated some success by applying powerful Large Language Models (LLMs), existing approaches still struggle with the one-to-many problem, and exhibit subpar performance in domain-specific scenarios. We assume the commonse…

    Submitted 29 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL2024 Findings

  38. arXiv:2405.15335  [pdf, other]

    cs.GR

    Challenges and Opportunities in 3D Content Generation

    Authors: Ke Zhao, Andreas Larsen

    Abstract: This paper explores the burgeoning field of 3D content generation within the landscape of Artificial Intelligence Generated Content (AIGC) and large-scale models. It investigates innovative methods like Text-to-3D and Image-to-3D, which translate text or images into 3D objects, reshaping our understanding of virtual and real-world simulations. Despite significant advancements in text and image gen…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Brief report

  39. arXiv:2405.15273  [pdf, other]

    cs.LG

    Towards a General Time Series Anomaly Detector with Adaptive Bottlenecks and Dual Adversarial Decoders

    Authors: Qichao Shentu, Beibu Li, Kai Zhao, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo

    Abstract: Time series anomaly detection plays a vital role in a wide range of applications. Existing methods require training one specific model for each dataset, which exhibits limited generalization capability across different target datasets, hindering anomaly detection performance in various scenarios with scarce training data. Aiming at this problem, we propose constructing a general time series anomal…

    Submitted 2 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  40. arXiv:2405.13954  [pdf, other]

    cs.LG cs.AI cs.CL

    What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions

    Authors: Sang Keun Choe, Hwijeen Ahn, Juhan Bae, Kewen Zhao, Minsoo Kang, Youngseog Chung, Adithya Pratapa, Willie Neiswanger, Emma Strubell, Teruko Mitamura, Jeff Schneider, Eduard Hovy, Roger Grosse, Eric Xing

    Abstract: Large language models (LLMs) are trained on a vast amount of human-written data, but data providers often remain uncredited. In response to this issue, data valuation (or data attribution), which quantifies the contribution or value of each data to the model output, has been discussed as a potential solution. Nevertheless, applying existing data valuation methods to recent LLMs and their vast trai…

    Submitted 22 May, 2024; originally announced May 2024.

  41. arXiv:2405.13190  [pdf, other]

    cs.LG cs.AI

    Interpretable Spatio-Temporal Embedding for Brain Structural-Effective Network with Ordinary Differential Equation

    Authors: Haoteng Tang, Guodong Liu, Siyuan Dai, Kai Ye, Kun Zhao, Wenlu Wang, Carl Yang, Lifang He, Alex Leow, Paul Thompson, Heng Huang, Liang Zhan

    Abstract: The MRI-derived brain network serves as a pivotal instrument in elucidating both the structural and functional aspects of the brain, encompassing the ramifications of diseases and developmental processes. However, prevailing methodologies, often focusing on synchronous BOLD signals from functional MRI (fMRI), may not capture directional influences among brain regions and rarely tackle temporal fun…

    Submitted 21 May, 2024; originally announced May 2024.

  42. arXiv:2405.10664  [pdf, other]

    math.DG

    Uniqueness of tangent flows at infinity for finite-entropy shortening curves

    Authors: Kyeongsu Choi, Dong-Hwi Seo, Wei-Bo Su, Kai-Wei Zhao

    Abstract: In this paper, we prove that an ancient smooth curve shortening flow with finite-entropy embedded in $\mathbb{R}^2$ has a unique tangent flow at infinity. To this end, we show that its rescaled flows backwardly converge to a line with multiplicity $m\geq 3$ exponentially fast in any compact region, unless the flow is a shrinking circle, a static line, a paper clip, or a translating grim reaper. In a…

    Submitted 8 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  43. arXiv:2405.07478  [pdf, other]

    eess.SY

    Coded Event-triggered Control for Nonlinear Systems

    Authors: Ruihang Ji, Shuzhi Sam Ge, Kai Zhao

    Abstract: This paper studies Coded Event-triggered Control (CEC) for a class of nonlinear systems under any initial condition. To reduce the communication burden, the CEC is designed from the encoding-decoding viewpoint, by which only an $m$-length string is transmitted for each communication between the CEC and the actuator. If a more general Entry Capture Problem is encountered, such control design will be rather compl…

    Submitted 13 May, 2024; originally announced May 2024.

  44. arXiv:2405.07303  [pdf, other]

    hep-ex hep-ph physics.ins-det

    Search for solar axions by Primakoff effect with the full dataset of the CDEX-1B Experiment

    Authors: L. T. Yang, S. K. Liu, Q. Yue, K. J. Kang, Y. J. Li, H. P. An, Greeshma C., J. P. Chang, Y. H. Chen, J. P. Cheng, W. H. Dai, Z. Deng, C. H. Fang, X. P. Geng, H. Gong, Q. J. Guo, T. Guo, X. Y. Guo, L. He, J. R. He, J. W. Hu, H. X. Huang, T. C. Huang, L. Jiang, S. Karmakar, et al. (61 additional authors not shown)

    Abstract: We present the first limit on $g_{Aγ}$ coupling constant using the Bragg-Primakoff conversion based on an exposure of 1107.5 kg days of data from the CDEX-1B experiment at the China Jinping Underground Laboratory. The data are consistent with the null signal hypothesis, and no excess signals are observed. Limits of the coupling $g_{Aγ}<2.08\times10^{-9}$ GeV$^{-1}$ (95\% C.L.) are derived for axio…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures

  45. arXiv:2405.00509  [pdf, other]

    astro-ph.HE

    Polarization Perspectives on Hercules X-1: Further Constraining the Geometry

    Authors: Qingchang Zhao, Hancheng Li, Lian Tao, Hua Feng, Shuangnan Zhang, Roland Walter, Mingyu Ge, Hao Tong, Long Ji, Liang Zhang, Jinlu Qu, Yue Huang, Xiang Ma, Shu Zhang, Qianqing Yin, Hongxing Yin, Ruican Ma, Shujie Zhao, Panping Li, Zixu Yang, Hexin Liu, Wei Yu, Yiming Huang, Zexi Li, Yajun Li, et al. (2 additional authors not shown)

    Abstract: We conduct a comprehensive analysis of the accreting X-ray pulsar, Hercules X-1, utilizing data from IXPE and NuSTAR. IXPE performed five observations of Her X-1, consisting of three in the Main-on state and two in the Short-on state. Our time-resolved analysis uncovers the linear correlations between the flux and polarization degree as well as the pulse fraction and polarization degree. Geometry…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted for MNRAS

  46. arXiv:2404.17825  [pdf, other]

    cs.CV

    ODCR: Orthogonal Decoupling Contrastive Regularization for Unpaired Image Dehazing

    Authors: Zhongze Wang, Haitao Zhao, Jingchao Peng, Lujian Yao, Kaijie Zhao

    Abstract: Unpaired image dehazing (UID) holds significant research importance due to the challenges in acquiring haze/clear image pairs with identical backgrounds. This paper proposes a novel method for UID named Orthogonal Decoupling Contrastive Regularization (ODCR). Our method is grounded in the assumption that an image consists of both haze-related features, which influence the degree of haze, and haze-…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  47. arXiv:2404.17099  [pdf, other]

    cs.LG cs.NE

    Unleashing the Potential of Fractional Calculus in Graph Neural Networks with FROND

    Authors: Qiyu Kang, Kai Zhao, Qinxu Ding, Feng Ji, Xuhao Li, Wenfei Liang, Yang Song, Wee Peng Tay

    Abstract: We introduce the FRactional-Order graph Neural Dynamical network (FROND), a new continuous graph neural network (GNN) framework. Unlike traditional continuous GNNs that rely on integer-order differential equations, FROND employs the Caputo fractional derivative to leverage the non-local properties of fractional calculus. This approach enables the capture of long-term dependencies in feature update…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: The Twelfth International Conference on Learning Representations

  48. arXiv:2404.14372  [pdf, other]

    cs.CL cs.AI

    Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph

    Authors: Xiaochen Kev Gao, Feng Yao, Kewen Zhao, Beilei He, Animesh Kumar, Vish Krishnan, Jingbo Shang

    Abstract: Model scaling is becoming the default choice for many language tasks due to the success of large language models (LLMs). However, it can fall short in specific scenarios where simple customized methods excel. In this paper, we delve into the patent approval prediction task and unveil that simple domain-specific graph methods outperform enlarging the model, using the intrinsic dependencies within…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 17 Pages, Under Review

  49. PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer

    Authors: Rui She, Qiyu Kang, Sijie Wang, Wee Peng Tay, Kai Zhao, Yang Song, Tianyu Geng, Yi Xu, Diego Navarro Navarro, Andreas Hartmannsgruber

    Abstract: Point cloud registration is a fundamental technique in 3-D computer vision with applications in graphics, autonomous driving, and robotics. However, registration tasks under challenging conditions, under which noise or perturbations are prevalent, can be difficult. We propose a robust point cloud registration approach that leverages graph neural partial differential equations (PDEs) and heat kerne…

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Transactions on Geoscience and Remote Sensing

  50. arXiv:2404.11934  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall cond-mat.str-el

    Quantum simulation of honeycomb lattice model by high-order moiré pattern

    Authors: Qiang Wan, Chunlong Wu, Xun-Jiang Luo, Shenghao Dai, Cao Peng, Renzhe Li, Shangkun Mo, Keming Zhao, Wen-Xuan Qiu, Hao Zhong, Yiwei Li, Chendong Zhang, Fengcheng Wu, Nan Xu

    Abstract: Moiré superlattices have become an emergent solid-state platform for simulating quantum lattice models. However, in a single moiré device, Hamiltonian parameters like lattice constant, hopping and interaction terms can hardly be manipulated, limiting the controllability and accessibility of moiré quantum simulators. Here, by combining angle-resolved photoemission spectroscopy and theoretical analysi…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 19 pages, 5 figures

    Journal ref: Phys. Rev. B 109, L161102 (2024)