Showing 1–8 of 8 results for author: Pu, H

Searching in archive cs.
  1. arXiv:2406.13357  [pdf, other]

    cs.CL cs.SD eess.AS

    Transferable speech-to-text large language model alignment module

    Authors: Boyong Wu, Chao Yan, Haoran Pu

    Abstract: By leveraging the power of Large Language Models (LLMs) and speech foundation models, state-of-the-art speech-text bimodal works can achieve challenging tasks like spoken translation (ST) and question answering (SQA) altogether with much simpler architectures. In this paper, we utilize the capability of Whisper encoder and pre-trained Yi-6B. Empirical results reveal that modal alignment can be achiev…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted by InterSpeech 2024; 5 pages, 2 figures

  2. arXiv:2403.06397  [pdf, other]

    cs.LG cs.AI eess.SY

    DeepSafeMPC: Deep Learning-Based Model Predictive Control for Safe Multi-Agent Reinforcement Learning

    Authors: Xuefeng Wang, Henglin Pu, Hyung Jun Kim, Husheng Li

    Abstract: Safe Multi-agent reinforcement learning (safe MARL) has increasingly gained attention in recent years, emphasizing the need for agents to not only optimize the global return but also adhere to safety requirements through behavioral constraints. Some recent work has integrated control theory with multi-agent reinforcement learning to address the challenge of ensuring safety. However, there have bee…

    Submitted 11 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 8 pages, 5 figures

  3. arXiv:2401.12272  [pdf, other]

    stat.ML cs.LG

    Transfer Learning for Nonparametric Regression: Non-asymptotic Minimax Analysis and Adaptive Procedure

    Authors: T. Tony Cai, Hongming Pu

    Abstract: Transfer learning for nonparametric regression is considered. We first study the non-asymptotic minimax risk for this problem and develop a novel estimator called the confidence thresholding estimator, which is shown to achieve the minimax optimal risk up to a logarithmic factor. Our results demonstrate two unique phenomena in transfer learning: auto-smoothing and super-acceleration, which differe…

    Submitted 22 January, 2024; originally announced January 2024.

  4. arXiv:2312.03775  [pdf, other]

    cs.CV

    FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability

    Authors: Linze Li, Sunqi Fan, Hengjun Pu, Zhaodong Bing, Yao Tang, Tianzhu Ye, Tong Yang, Liangyu Chen, Jiajun Liang

    Abstract: Over recent years, diffusion models have facilitated significant advancements in video generation. Yet, the creation of face-related videos still confronts issues such as low facial fidelity, lack of frame consistency, limited editability and uncontrollable human poses. To address these challenges, we introduce a facial animation generation method that enhances both face identity fidelity and edit…

    Submitted 20 December, 2023; v1 submitted 5 December, 2023; originally announced December 2023.

  5. arXiv:2311.17307  [pdf, other]

    cs.CL cs.AI

    RoKEPG: RoBERTa and Knowledge Enhancement for Prescription Generation of Traditional Chinese Medicine

    Authors: Hua Pu, Jiacong Mi, Shan Lu, Jieyue He

    Abstract: Traditional Chinese medicine (TCM) prescription is the most critical form of TCM treatment, and uncovering the complex nonlinear relationship between symptoms and TCM is of great significance for clinical practice and assisting physicians in diagnosis and treatment. Although there have been some studies on TCM prescription generation, these studies consider a single factor and directly model the s…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 8 pages

  6. arXiv:2310.07944   

    cs.AI

    AutoRepo: A general framework for multi-modal LLM-based automated construction reporting

    Authors: Hongxu Pu, Xincong Yang, Jing Li, Runhao Guo, Heng Li

    Abstract: Ensuring the safety, quality, and timely completion of construction projects is paramount, with construction inspections serving as a vital instrument towards these goals. Nevertheless, the predominantly manual approach of present-day inspections frequently results in inefficiencies and inadequate information management. Such methods often fall short of providing holistic, exhaustive assessments,…

    Submitted 4 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: We believe that keeping this version of the paper publicly available may lead to confusion or misinterpretation regarding our current research direction and findings

  7. arXiv:2212.10341  [pdf, other]

    cs.CL

    CoCo: Coherence-Enhanced Machine-Generated Text Detection Under Data Limitation With Contrastive Learning

    Authors: Xiaoming Liu, Zhaohan Zhang, Yichen Wang, Hang Pu, Yu Lan, Chao Shen

    Abstract: Machine-Generated Text (MGT) detection, a task that discriminates MGT from Human-Written Text (HWT), plays a crucial role in preventing misuse of text generative models, which excel in mimicking human writing style recently. Latest proposed detectors usually take coarse text sequences as input and fine-tune pretrained models with standard cross-entropy loss. However, these methods fail to consider…

    Submitted 20 October, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted by EMNLP 2023 main conference

  8. arXiv:1709.02540  [pdf, other]

    cs.LG

    The Expressive Power of Neural Networks: A View from the Width

    Authors: Zhou Lu, Hongming Pu, Feicheng Wang, Zhiqiang Hu, Liwei Wang

    Abstract: The expressive power of neural networks is important for understanding deep learning. Most existing works consider this problem from the view of the depth of a network. In this paper, we study how width affects the expressiveness of neural networks. Classical results state that depth-bounded (e.g. depth-$2$) networks with suitable activation functions are universal approximators. We show a univers…

    Submitted 1 November, 2017; v1 submitted 8 September, 2017; originally announced September 2017.

    Comments: Accepted by NIPS 2017 (with some typos fixed)