Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 186 results for author: Ye, K

.
  1. arXiv:2409.14961  [pdf, other

    cs.DC

    UELLM: A Unified and Efficient Approach for LLM Inference Serving

    Authors: Yiyuan He, Minxian Xu, Jingfeng Wu, Wanyi Zheng, Kejiang Ye, Chengzhong Xu

    Abstract: In the context of Machine Learning as a Service (MLaaS) clouds, the extensive use of Large Language Models (LLMs) often requires efficient management of significant query loads. When providing real-time inference services, several challenges arise. Firstly, increasing the number of GPUs may lead to a decrease in inference speed due to heightened communication overhead, while an inadequate number o… ▽ More

    Submitted 23 September, 2024; v1 submitted 23 September, 2024; originally announced September 2024.

    Comments: 15 pages, 5 figures, ICSOC 2024

  2. arXiv:2409.14953  [pdf, other

    cs.DC

    MSARS: A Meta-Learning and Reinforcement Learning Framework for SLO Resource Allocation and Adaptive Scaling for Microservices

    Authors: Kan Hu, Linfeng Wen, Minxian Xu, Kejiang Ye

    Abstract: Service Level Objectives (SLOs) aim to set threshold for service time in cloud services to ensure acceptable quality of service (QoS) and user satisfaction. Currently, many studies consider SLOs as a system resource to be allocated, ensuring QoS meets the SLOs. Existing microservice auto-scaling frameworks that rely on SLO resources often utilize complex and computationally intensive models, requi… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 10 pages, 6 figures, IEEE ISPA 2024

  3. arXiv:2409.14434  [pdf, ps, other

    math.DG math.OC

    The sparseness of g-convex functions

    Authors: Yu Wang, Ke Ye

    Abstract: The g-convexity of functions on manifolds is a generalization of the convexity of functions on Rn. It plays an essential role in both differential geometry and non-convex optimization theory. This paper is concerned with g-convex smooth functions on manifolds. We establish criteria for the existence of a Riemannian metric (or connection) with respect to which a given function is g-convex. Using th… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

  4. arXiv:2409.14121  [pdf, other

    cs.SE cs.LG

    CONGRA: Benchmarking Automatic Conflict Resolution

    Authors: Qingyu Zhang, Liangcai Su, Kai Ye, Chenxiong Qian

    Abstract: Resolving conflicts from merging different software versions is a challenging task. To reduce the overhead of manual merging, researchers develop various program analysis-based tools which only solve specific types of conflicts and have a limited scope of application. With the development of language models, researchers treat conflict code as text, which theoretically allows for addressing almost… ▽ More

    Submitted 21 September, 2024; originally announced September 2024.

    ACM Class: D.2; D.3

  5. arXiv:2409.10227  [pdf, other

    physics.optics physics.app-ph

    Programmable multifunctional integrated microwave photonic circuit on thin-film lithium niobate

    Authors: Chuangchuang Wei, Hanke Feng, Kaixuan Ye, Maarten Eijkel, Yvan Klaver, Zhaoxi Chen, Akshay Keloth, Cheng Wang, David Marpaung

    Abstract: Microwave photonics, with its advanced high-frequency signal processing capabilities, is expected to play a crucial role in next-generation wireless communications and radar systems. The realization of highly integrated, high-performance, and multifunctional microwave photonic links will pave the way for its widespread deployment in practical applications, which is a significant challenge. Here, l… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 18 pages, 8 figures, 1 table

  6. arXiv:2409.05093  [pdf, other

    cs.DC

    CloudNativeSim: a toolkit for modeling and simulation of cloud-native applications

    Authors: Jingfeng Wu, Minxian Xu, Yiyuan He, Kejiang Ye, Chengzhong Xu

    Abstract: Cloud-native applications are increasingly becoming popular in modern software design. Employing a microservice-based architecture into these applications is a prevalent strategy that enhances system availability and flexibility. However, cloud-native applications also introduce new challenges, such as frequent inter-service communication and the complexity of managing heterogeneous codebases and… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 24 pages

  7. arXiv:2409.04034  [pdf, ps, other

    math.CO math.AG

    Stability of ranks under field extensions

    Authors: Qiyuan Chen, Ke Ye

    Abstract: This paper studies the stability of tensor ranks under field extensions. Our main contributions are fourfold: (1) We prove that the analytic rank is stable under field extensions. (2) We establish the equivalence between the partition rank vs. analytic rank conjecture and the stability conjecture for partition rank. We also prove that they are equivalent to other two important conjectures. (3) We… ▽ More

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: 18 pages

  8. arXiv:2408.14180  [pdf, other

    cs.CV cs.AI

    I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing

    Authors: Yiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji

    Abstract: Significant progress has been made in the field of Instruction-based Image Editing (IIE). However, evaluating these models poses a significant challenge. A crucial requirement in this field is the establishment of a comprehensive evaluation benchmark for accurately assessing editing results and providing valuable insights for its further development. In response to this need, we propose I2EBench,… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Tech report, 39 pages, 41 figures

  9. arXiv:2408.07595  [pdf, other

    cs.CV

    Progressive Radiance Distillation for Inverse Rendering with Gaussian Splatting

    Authors: Keyang Ye, Qiming Hou, Kun Zhou

    Abstract: We propose progressive radiance distillation, an inverse rendering method that combines physically-based rendering with Gaussian-based radiance field rendering using a distillation progress map. Taking multi-view images as input, our method starts from a pre-trained radiance field guidance, and distills physically-based light and material parameters from the radiance field using an image-fitting p… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  10. arXiv:2408.04453  [pdf, other

    math.AG cs.SC math.GR

    Rational Curves on Real Classical Groups

    Authors: Zijia Li, Ke Ye

    Abstract: This paper is concerned with rational curves on real classical groups. Our contributions are three-fold: (i) We determine the structure of quadratic rational curves on real classical groups. As a consequence, we completely classify quadratic rational curves on $\mathrm{U}_n$, $\mathrm{O}_n(\mathbb{R})$, $\mathrm{O}_{n-1,1}(\mathbb{R})$ and $\mathrm{O}_{n-2,2}(\mathbb{R})$. (ii) We prove a decompos… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 50 pages

    MSC Class: 14H45; 20G20; 26C15; 14L35; 14L30; 70B05

  11. arXiv:2408.04102  [pdf, other

    cs.CV cs.AI

    ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling

    Authors: William Yicheng Zhu, Keren Ye, Junjie Ke, Jiahui Yu, Leonidas Guibas, Peyman Milanfar, Feng Yang

    Abstract: Recognizing and disentangling visual attributes from objects is a foundation to many computer vision applications. While large vision language representations like CLIP had largely resolved the task of zero-shot object recognition, zero-shot visual attribute recognition remains a challenge because CLIP's contrastively-learned vision-language representation cannot effectively capture object-attribu… ▽ More

    Submitted 24 September, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted at ECCV 2024

  12. arXiv:2407.21269  [pdf, other

    cond-mat.mtrl-sci

    Atomic Structure of Self-Buffered BaZr(S,Se)$_3$ Epitaxial Thin Film Interfaces

    Authors: Michael Xu, Kevin Ye, Ida Sadeghi, Rafael Jaramillo, James M. LeBeau

    Abstract: Understanding and controlling the growth of chalcogenide perovskite thin films through interface design is important for tailoring film properties. Here, the film and interface structure of BaZr(S,Se)$_3$ thin films grown on LaAlO$_3$ by molecular beam epitaxy and post-growth anion exchange is resolved using aberration-corrected scanning transmission electron microscopy. Epitaxial films are achiev… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  13. arXiv:2407.21075  [pdf, other

    cs.AI cs.CL cs.LG

    Apple Intelligence Foundation Language Models

    Authors: Tom Gunter, Zirui Wang, Chong Wang, Ruoming Pang, Andy Narayanan, Aonan Zhang, Bowen Zhang, Chen Chen, Chung-Cheng Chiu, David Qiu, Deepak Gopinath, Dian Ang Yap, Dong Yin, Feng Nan, Floris Weers, Guoli Yin, Haoshuo Huang, Jianyu Wang, Jiarui Lu, John Peebles, Ke Ye, Mark Lee, Nan Du, Qibin Chen, Quentin Keunebroek , et al. (130 additional authors not shown)

    Abstract: We present foundation language models developed to power Apple Intelligence features, including a ~3 billion parameter model designed to run efficiently on devices and a large server-based language model designed for Private Cloud Compute. These models are designed to perform a wide range of tasks efficiently, accurately, and responsibly. This report describes the model architecture, the data used… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  14. arXiv:2407.19415  [pdf, other

    cs.MM cs.AI

    Start from Video-Music Retrieval: An Inter-Intra Modal Loss for Cross Modal Retrieval

    Authors: Zeyu Chen, Pengfei Zhang, Kai Ye, Wei Dong, Xin Feng, Yana Zhang

    Abstract: The burgeoning short video industry has accelerated the advancement of video-music retrieval technology, assisting content creators in selecting appropriate music for their videos. In self-supervised training for video-to-music retrieval, the video and music samples in the dataset are separated from the same video work, so they are all one-to-one matches. This does not match the real situation. In… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 10 pages, 7 figures

    ACM Class: I.2; I.4

  15. arXiv:2407.13482  [pdf, ps, other

    math.NA math.DG

    Simple matrix models for the flag, Grassmann, and Stiefel manifolds

    Authors: Lek-Heng Lim, Ke Ye

    Abstract: We derive three families of orthogonally-equivariant matrix submanifold models for the Grassmann, flag, and Stiefel manifolds respectively. These families are exhaustive -- every orthogonally-equivariant submanifold model of the lowest dimension for any of these manifolds is necessarily a member of the respective family, with a small number of exceptions. They have several computationally desirabl… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 17 pages

    MSC Class: 14M15; 65J05; 90C48; 53Z30; 57S25; 22E70

  16. arXiv:2407.12546  [pdf, ps, other

    math.RT math.DG

    Minimal equivariant embeddings of the Grassmannian and flag manifold

    Authors: Lek-Heng Lim, Ke Ye

    Abstract: We show that the flag manifold $\operatorname{Flag}(k_1,\dots, k_p, \mathbb{R}^n)$, with Grassmannian the special case $p=1$, has an $\operatorname{SO}_n(\mathbb{R})$-equivariant embedding in an Euclidean space of dimension $(n-1)(n+2)/2$, two orders of magnitude below the current best known result. We will show that the value $(n-1)(n+2)/2$ is the smallest possible and that any… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 11 pages

    MSC Class: 14M15; 57R40; 57S25; 14R20; 22E46; 22E70

  17. arXiv:2407.10173  [pdf, other

    cs.DC

    StatuScale: Status-aware and Elastic Scaling Strategy for Microservice Applications

    Authors: Linfeng Wen, Minxian Xu, Sukhpal Singh Gill, Muhammad Hafizhuddin Hilman, Satish Narayana Srirama, Kejiang Ye, Chengzhong Xu

    Abstract: Microservice architecture has transformed traditional monolithic applications into lightweight components. Scaling these lightweight microservices is more efficient than scaling servers. However, scaling microservices still faces the challenges resulted from the unexpected spikes or bursts of requests, which are difficult to detect and can degrade performance instantaneously. To address this chall… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 26 pages

    Journal ref: ACM Transactions on Autonomous and Adaptive Systems, 2024

  18. arXiv:2407.10169  [pdf, other

    cs.DC

    DRPC: Distributed Reinforcement Learning Approach for Scalable Resource Provisioning in Container-based Clusters

    Authors: Haoyu Bai, Minxian Xu, Kejiang Ye, Rajkumar Buyya, Chengzhong Xu

    Abstract: Microservices have transformed monolithic applications into lightweight, self-contained, and isolated application components, establishing themselves as a dominant paradigm for application development and deployment in public clouds such as Google and Alibaba. Autoscaling emerges as an efficient strategy for managing resources allocated to microservices' replicas. However, the dynamic and intricat… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 12 pages

    Journal ref: IEEE Transactions on Service Computing, 2024

  19. arXiv:2407.04053  [pdf, other

    cs.DC

    Edge AI: A Taxonomy, Systematic Review and Future Directions

    Authors: Sukhpal Singh Gill, Muhammed Golec, Jianmin Hu, Minxian Xu, Junhui Du, Huaming Wu, Guneet Kaur Walia, Subramaniam Subramanian Murugesan, Babar Ali, Mohit Kumar, Kejiang Ye, Prabal Verma, Surendra Kumar, Felix Cuadrado, Steve Uhlig

    Abstract: Edge Artificial Intelligence (AI) incorporates a network of interconnected systems and devices that receive, cache, process, and analyse data in close communication with the location where the data is captured with AI technology. Recent advancements in AI efficiency, the widespread use of Internet of Things (IoT) devices, and the emergence of edge computing have unlocked the enormous scope of Edge… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Preprint Version, 18 Figures

  20. arXiv:2407.03765  [pdf, ps, other

    cs.RO

    Design and Central Pattern Generator Control of a New Transformable Wheel-Legged Robot

    Authors: Tyler Bishop, Keran Ye, Konstantinos Karydis

    Abstract: This paper introduces a new wheel-legged robot and develops motion controllers based on central pattern generators (CPGs) for the robot to navigate over a range of terrains. A transformable leg-wheel design is considered and characterized in terms of key locomotion characteristics as a function of the design. Kinematic analysis is conducted based on a generalized four-bar mechanism driven by a coa… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: ICRA 2024 in print

  21. arXiv:2406.19377  [pdf, ps, other

    math.OC math.NA

    Grassmannian optimization is NP-hard

    Authors: Zehua Lai, Lek-Heng Lim, Ke Ye

    Abstract: We show that unconstrained quadratic optimization over a Grassmannian $\operatorname{Gr}(k,n)$ is NP-hard. Our results cover all scenarios: (i) when $k$ and $n$ are both allowed to grow; (ii) when $k$ is arbitrary but fixed; (iii) when $k$ is fixed at its lowest possible value $1$. We then deduce the NP-hardness of unconstrained cubic optimization over the Stiefel manifold $\operatorname{V}(k,n)$… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 19 pages

    MSC Class: 03D15; 90C26; 90C23; 65K10; 68Q25; 90C60

  22. arXiv:2406.17911  [pdf, other

    cs.CL

    X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

    Authors: Kun Zhao, Chenghao Xiao, Chen Tang, Bohao Yang, Kai Ye, Noura Al Moubayed, Liang Zhan, Chenghua Lin

    Abstract: Radiology Report Generation (RRG) has achieved significant progress with the advancements of multimodal generative models. However, the evaluation in the domain suffers from a lack of fair and robust metrics. We reveal that, high performance on RRG with existing lexical-based metrics (e.g. BLEU) might be more of a mirage - a model can get a high BLEU only by learning the template of reports. This… ▽ More

    Submitted 30 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  23. arXiv:2406.11821  [pdf, ps, other

    math.DG math.NA math.OC

    Simple matrix expressions for the curvatures of Grassmannian

    Authors: Zehua Lai, Lek-Heng Lim, Ke Ye

    Abstract: We show that modeling a Grassmannian as symmetric orthogonal matrices $\operatorname{Gr}(k,\mathbb{R}^n) \cong\{Q \in \mathbb{R}^{n \times n} : Q^{\scriptscriptstyle\mathsf{T}} Q = I, \; Q^{\scriptscriptstyle\mathsf{T}} = Q,\; \operatorname{tr}(Q)=2k - n\}$ yields exceedingly simple matrix formulas for various curvatures and curvature-related quantities, both intrinsic and extrinsic. These include… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 25 pages

    MSC Class: 15A75; 14M15

  24. arXiv:2406.02479  [pdf

    cs.LG eess.SP eess.SY

    Applying Fine-Tuned LLMs for Reducing Data Needs in Load Profile Analysis

    Authors: Yi Hu, Hyeonjin Kim, Kai Ye, Ning Lu

    Abstract: This paper presents a novel method for utilizing fine-tuned Large Language Models (LLMs) to minimize data requirements in load profile analysis, demonstrated through the restoration of missing data in power system load profiles. A two-stage fine-tuning strategy is proposed to adapt a pre-trained LLMs, i.e., GPT-3.5, for missing data restoration tasks. Through empirical evaluation, we demonstrate t… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  25. arXiv:2405.20560  [pdf, other

    cs.DC

    Collaborative Resource Management and Workloads Scheduling in Cloud-Assisted Mobile Edge Computing across Timescales

    Authors: Lujie Tang, Minxian Xu, Chengzhong Xu, Kejiang Ye

    Abstract: Due to the limited resource capacity of edge servers and the high purchase costs of edge resources, service providers are facing the new challenge of how to take full advantage of the constrained edge resources for Internet of Things (IoT) service hosting and task scheduling to maximize system performance. In this paper, we study the joint optimization problem on service placement, resource provis… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 11 pages, 10 figures

    Journal ref: IEEE ICWS 2024

  26. arXiv:2405.17241  [pdf, other

    cs.CV eess.IV

    NeurTV: Total Variation on the Neural Domain

    Authors: Yisi Luo, Xile Zhao, Kai Ye, Deyu Meng

    Abstract: Recently, we have witnessed the success of total variation (TV) for many imaging applications. However, traditional TV is defined on the original pixel domain, which limits its potential. In this work, we suggest a new TV regularization defined on the neural domain. Concretely, the discrete data is continuously and implicitly represented by a deep neural network (DNN), and we use the derivatives o… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    MSC Class: 94A08; 68U10; 68T45

  27. arXiv:2405.14206  [pdf, other

    cs.CV

    LG-VQ: Language-Guided Codebook Learning

    Authors: Guotao Liang, Baoquan Zhang, Yaowei Wang, Xutao Li, Yunming Ye, Huaibin Wang, Chuyao Luo, Kola Ye, linfeng Luo

    Abstract: Vector quantization (VQ) is a key technique in high-resolution and high-fidelity image synthesis, which aims to learn a codebook to encode an image with a sequence of discrete codes and then generate an image in an auto-regression manner. Although existing methods have shown superior performance, most methods prefer to learn a single-modal codebook (\emph{e.g.}, image), resulting in suboptimal per… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: None

  28. arXiv:2405.13190  [pdf, other

    cs.LG cs.AI

    Interpretable Spatio-Temporal Embedding for Brain Structural-Effective Network with Ordinary Differential Equation

    Authors: Haoteng Tang, Guodong Liu, Siyuan Dai, Kai Ye, Kun Zhao, Wenlu Wang, Carl Yang, Lifang He, Alex Leow, Paul Thompson, Heng Huang, Liang Zhan

    Abstract: The MRI-derived brain network serves as a pivotal instrument in elucidating both the structural and functional aspects of the brain, encompassing the ramifications of diseases and developmental processes. However, prevailing methodologies, often focusing on synchronous BOLD signals from functional MRI (fMRI), may not capture directional influences among brain regions and rarely tackle temporal fun… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  29. arXiv:2405.12635  [pdf, other

    cs.DC

    TempoScale: A Cloud Workloads Prediction Approach Integrating Short-Term and Long-Term Information

    Authors: Linfeng Wen, Minxian Xu, Adel N. Toosi, Kejiang Ye

    Abstract: Cloud native solutions are widely applied in various fields, placing higher demands on the efficient management and utilization of resource platforms. To achieve the efficiency, load forecasting and elastic scaling have become crucial technologies for dynamically adjusting cloud resources to meet user demands and minimizing resource waste. However, existing prediction-based methods lack comprehens… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 11pages, 11 figures, 4 tables

    Journal ref: In proceedings of IEEE CLOUD 2024

  30. arXiv:2405.09554  [pdf, ps, other

    eess.SP cs.IT

    Underdetermined DOA Estimation of Off-Grid Sources Based on the Generalized Double Pareto Prior

    Authors: Yongfeng Huang, Zhendong Chen, Kun Ye, Lang Zhou, Haixin Sun

    Abstract: In this letter, we investigate a new generalized double Pareto based on off-grid sparse Bayesian learning (GDPOGSBL) approach to improve the performance of direction of arrival (DOA) estimation in underdetermined scenarios. The method aims to enhance the sparsity of source signal by utilizing the generalized double Pareto (GDP) prior. Firstly, we employ a first-order linear Taylor expansion to mod… ▽ More

    Submitted 17 May, 2024; v1 submitted 18 April, 2024; originally announced May 2024.

  31. arXiv:2405.09470  [pdf, other

    cs.SD cs.CR cs.LG eess.AS

    Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer

    Authors: Weifei Jin, Yuxin Cao, Junjie Su, Qi Shen, Kai Ye, Derui Wang, Jie Hao, Ziyao Liu

    Abstract: In light of the widespread application of Automatic Speech Recognition (ASR) systems, their security concerns have received much more attention than ever before, primarily due to the susceptibility of Deep Neural Networks. Previous studies have illustrated that surreptitiously crafting adversarial perturbations enables the manipulation of speech recognition systems, resulting in the production of… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted to SecTL (AsiaCCS Workshop) 2024

  32. arXiv:2405.05128  [pdf, ps, other

    math.AG

    Degree of the Grassmannian as an affine variety

    Authors: Lek-Heng Lim, Ke Ye

    Abstract: The degree of the Grassmannian with respect to the Plücker embedding is well-known. However, the Plücker embedding, while ubiquitous in pure mathematics, is almost never used in applied mathematics. In applied mathematics, the Grassmannian is usually embedded as projection matrices… ▽ More

    Submitted 19 July, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 16 pages

    MSC Class: 14E25; 14F45

  33. 3D Gaussian Splatting with Deferred Reflection

    Authors: Keyang Ye, Qiming Hou, Kun Zhou

    Abstract: The advent of neural and Gaussian-based radiance field methods have achieved great success in the field of novel view synthesis. However, specular reflection remains non-trivial, as the high frequency radiance field is notoriously difficult to fit stably and accurately. We present a deferred shading method to effectively render specular reflection with Gaussian splatting. The key challenge comes f… ▽ More

    Submitted 4 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  34. arXiv:2404.10541  [pdf, other

    cs.RO

    MPCOM: Robotic Data Gathering with Radio Mapping and Model Predictive Communication

    Authors: Zhiyou Ji, Guoliang Li, Ruihua Han, Shuai Wang, Bing Bai, Wei Xu, Kejiang Ye, Chengzhong Xu

    Abstract: Robotic data gathering (RDG) is an emerging paradigm that navigates a robot to harvest data from remote sensors. However, motion planning in this paradigm needs to maximize the RDG efficiency instead of the navigation efficiency, for which the existing motion planning methods become inefficient, as they plan robot trajectories merely according to motion factors. This paper proposes radio map guide… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: submit to IROS

  35. arXiv:2404.08175  [pdf, ps, other

    eess.SY

    A Novel Vision Transformer based Load Profile Analysis using Load Images as Inputs

    Authors: Hyeonjin Kim, Yi Hu, Kai Ye, Ning Lu

    Abstract: This paper introduces ViT4LPA, an innovative Vision Transformer (ViT) based approach for Load Profile Analysis (LPA). We transform time-series load profiles into load images. This allows us to leverage the ViT architecture, originally designed for image processing, as a pre-trained image encoder to uncover latent patterns within load data. ViT is pre-trained using an extensive load image dataset,… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  36. arXiv:2403.20031  [pdf, other

    cs.CV

    A Unified Framework for Human-centric Point Cloud Video Understanding

    Authors: Yiteng Xu, Kecheng Ye, Xiao Han, Yiming Ren, Xinge Zhu, Yuexin Ma

    Abstract: Human-centric Point Cloud Video Understanding (PVU) is an emerging field focused on extracting and interpreting human-related features from sequences of human point clouds, further advancing downstream human-centric tasks and applications. Previous works usually focus on tackling one specific task and rely on huge labeled data, which has poor generalization capability. Considering that human has s… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  37. arXiv:2403.09016  [pdf

    cond-mat.mtrl-sci

    A Processing Route to Chalcogenide Perovskites Alloys with Tunable Band Gap via Anion Exchange

    Authors: Kevin Ye, Ida Sadeghi, Michael Xu, Jack Van Sambeek, Tao Cai, Jessica Dong, Rishabh Kothari, James M. LeBeau, R. Jaramillo

    Abstract: We demonstrate synthesis of BaZr(S,Se)3 chalcogenide perovskite alloys by selenization of BaZrS3 thin films. The anion-exchange process produces films with tunable composition and band gap without changing the orthorhombic perovskite crystal structure or the film microstructure. The direct band gap is tunable between 1.5 and 1.9 eV. The alloy films made in this way feature 100x stronger photocondu… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  38. arXiv:2403.08136  [pdf, other

    cs.LO cs.AI

    RoboCertProb: Property Specification for Probabilistic RoboChart Models

    Authors: Kangfeng Ye, Jim Woodcock

    Abstract: RoboChart is a core notation in the RoboStar framework which brings modern modelling and formal verification technologies into software engineering for robotics. It is a timed and probabilistic domain-specific language for robotics and provides a UML-like architectural and state machine modelling. This work presents RoboCertProb for specifying quantitative properties of probabilistic robotic syste… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 24 pages, 10 figures, 4 tables, submitted to the International Journal on Software and Systems Modeling (SoSyM)

  39. arXiv:2403.00169  [pdf, other

    cs.LO cs.FL cs.SE

    Quantitative Assurance and Synthesis of Controllers from Activity Diagrams

    Authors: Kangfeng Ye, Fang Yan, Simos Gerasimou

    Abstract: Probabilistic model checking is a widely used formal verification technique to automatically verify qualitative and quantitative properties for probabilistic models. However, capturing such systems, writing corresponding properties, and verifying them require domain knowledge. This makes it not accessible for researchers and engineers who may not have the required knowledge. Previous studies have… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 43 pages, 29 figures, 5 tables, submitted to Journal of Systems and Software (JSS)

    ACM Class: D.2.4; F.3.1; F.3.2; F.4.3

  40. arXiv:2402.18957  [pdf, other

    cond-mat.mtrl-sci

    Vibrational properties differ between halide and chalcogenide perovskite semiconductors, and it matters for optoelectronic performance

    Authors: K. Ye, M. Menahem, T. Salzillo, F. Knoop, B. Zhao, S. Niu, O. Hellman, J. Ravichandran, R. Jaramillo, O. Yaffe

    Abstract: We report a comparative study of temperature-dependent photoluminescence and structural dynamics of two perovskite semiconductors, the chalcogenide BaZrS$_3$ (BZS) and the halide CsPbBr$_3$ (CPB). These materials have similar crystal structures and direct band gaps, but we find that they have quite distinct optoelectronic and vibrational properties. Both materials exhibit thermally-activated non-r… ▽ More

    Submitted 14 April, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Main text - 12 pages with 5 figures and 1 table. Supplemental text - 16 pages with 6 figures and 5 tables

  41. arXiv:2402.14255  [pdf

    physics.optics

    Observation of temporal topological boundary states of light in a momentum bandgap

    Authors: Yudong Ren, Kangpeng Ye, Qiaolu Chen, Fujia Chen, Li Zhang, Yuang Pan, Wenhao Li, Xinrui Li, Lu Zhang, Hongsheng Chen, Yihao Yang

    Abstract: Topological phases have prevailed across diverse disciplines, spanning electronics, photonics, and acoustics. Hitherto, the understanding of these phases has centred on energy (frequency) bandstructures, showcasing topological boundary states at spatial interfaces. Recent strides have uncovered a unique category of bandstructures characterized by gaps in momentum, referred to as momentum bandgaps… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  42. arXiv:2402.08917  [pdf, other

    cs.DC

    An Interference-aware Approach for Co-located Container Orchestration with Novel Metric

    Authors: Xiang Li, Linfeng Wen, Minxian Xu, Kejiang Ye

    Abstract: Container orchestration technologies are widely employed in cloud computing, facilitating the co-location of online and offline services on the same infrastructure. Online services demand rapid responsiveness and high availability, whereas offline services require extensive computational resources. However, this mixed deployment can lead to resource contention, adversely affecting the performance… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 8 pages

    Journal ref: In the Proceedings of IEEE SmartData 2023

  43. arXiv:2402.04134  [pdf, other

    cs.CC cs.SC math.RA

    A quasi-optimal lower bound for skew polynomial multiplication

    Authors: Qiyuan Chen, Ke Ye

    Abstract: We establish a lower bound for the complexity of multiplying two skew polynomials. The lower bound coincides with the upper bound conjectured by Caruso and Borgne in 2017, up to a log factor. We present algorithms for three special cases, indicating that the aforementioned lower bound is quasi-optimal. In fact, our lower bound is quasi-optimal in the sense of bilinear complexity. In addition, we d… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  44. arXiv:2402.03456  [pdf, other

    cs.CV

    Constrained Multiview Representation for Self-supervised Contrastive Learning

    Authors: Siyuan Dai, Kai Ye, Kun Zhao, Ge Cui, Haoteng Tang, Liang Zhan

    Abstract: Representation learning constitutes a pivotal cornerstone in contemporary deep learning paradigms, offering a conduit to elucidate distinctive features within the latent space and interpret the deep models. Nevertheless, the inherent complexity of anatomical patterns and the random nature of lesion distribution in medical image segmentation pose significant challenges to the disentanglement of rep… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 11 pages, 9 figures, 2 algorithms

  45. arXiv:2401.13160  [pdf, other

    cs.LG cs.CL

    SpacTor-T5: Pre-training T5 Models with Span Corruption and Replaced Token Detection

    Authors: Ke Ye, Heinrich Jiang, Afshin Rostamizadeh, Ayan Chakrabarti, Giulia DeSalvo, Jean-François Kagy, Lazaros Karydas, Gui Citovsky, Sanjiv Kumar

    Abstract: Pre-training large language models is known to be extremely resource intensive and often times inefficient, under-utilizing the information encapsulated in the training text sequences. In this paper, we present SpacTor, a new training procedure consisting of (1) a hybrid objective combining span corruption (SC) and token replacement detection (RTD), and (2) a two-stage curriculum that optimizes th… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 9+13 pages, 5 figures

  46. arXiv:2401.12651  [pdf, other

    physics.optics physics.app-ph

    Brillouin nonlinearity characterizations of a high refractive index silicon oxynitride platform

    Authors: Kaixuan Ye, Akshay Keloth, Yvan Klaver, Alessio Baldazzi, Gioele Piccoli, Matteo Sanna, Lorenzo Pavesi, Mher Ghulinyan, David Marpaung

    Abstract: Silicon oxynitride (SiON) is a low-loss and versatile material for linear and nonlinear photonics applications. Controlling the oxygen-to-nitrogen (O/N) ratio in SiON provides an effective way to engineer its optical and mechanical properties, making it a great platform for the investigation of on-chip optomechanical interactions, especially the stimulated Brillouin scattering (SBS). Here we repor… ▽ More

    Submitted 29 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  47. arXiv:2401.01484  [pdf, other

    cs.LG cs.AI

    Uncertainty Regularized Evidential Regression

    Authors: Kai Ye, Tiejin Chen, Hua Wei, Liang Zhan

    Abstract: The Evidential Regression Network (ERN) represents a novel approach that integrates deep learning with Dempster-Shafer's theory to predict a target and quantify the associated uncertainty. Guided by the underlying theory, specific activation functions must be employed to enforce non-negative values, which is a constraint that compromises model performance by limiting its ability to learn from all… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI 2024 main track

  48. arXiv:2312.13721  [pdf, ps, other

    math.NA

    Bundle-based similarity measurement for positive semidefinite matrices

    Authors: Peng Liu, Ke Ye

    Abstract: Positive semidefinite (PSD) matrices are indispensable in many fields of science. A similarity measurement for such matrices is usually an essential ingredient in the mathematical modelling of a scientific problem. This paper proposes a unified framework to construct similarity measurements for PSD matrices. The framework is obtained by exploring the fiber bundle structure of the cone of PSD matri… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  49. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  50. arXiv:2312.11595  [pdf, other

    cs.CV

    SPIRE: Semantic Prompt-Driven Image Restoration

    Authors: Chenyang Qi, Zhengzhong Tu, Keren Ye, Mauricio Delbracio, Peyman Milanfar, Qifeng Chen, Hossein Talebi

    Abstract: Text-driven diffusion models have become increasingly popular for various image editing tasks, including inpainting, stylization, and object replacement. However, it still remains an open research problem to adopt this language-vision paradigm for more fine-level image processing tasks, such as denoising, super-resolution, deblurring, and compression artifact removal. In this paper, we develop SPI… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Accepted by ECCV 2024; Webpage: https://chenyangqiqi.github.io/tip