Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–14 of 14 results for author: Quan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16579  [pdf, other

    cs.CL

    Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

    Authors: Shanghaoran Quan

    Abstract: Constructing high-quality query-response pairs from custom corpus is crucial for supervised fine-tuning (SFT) large language models (LLMs) in many applications, like creating domain-specific AI assistants or roleplaying agents. However, sourcing this data through human annotation is costly, and existing automated methods often fail to capture the diverse range of contextual granularity and tend to… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  2. arXiv:2403.01197  [pdf, other

    cs.CL

    DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

    Authors: Shanghaoran Quan

    Abstract: The performance of the reward model (RM) is a critical factor in improving the effectiveness of the large language model (LLM) during alignment fine-tuning. There remain two challenges in RM training: 1) training the same RM using various categories of data may cause its generalization performance to suffer from multi-task disturbance, and 2) the human annotation consistency rate is generally only… ▽ More

    Submitted 27 April, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

    Comments: 23 pages, 8 figures

  3. Ads Recommendation in a Collapsed and Entangled World

    Authors: Junwei Pan, Wei Xue, Ximei Wang, Haibin Yu, Xun Liu, Shijie Quan, Xueming Qiu, Dapeng Liu, Lei Xiao, Jie Jiang

    Abstract: We present Tencent's ads recommendation system and examine the challenges and practices of learning appropriate recommendation representations. Our study begins by showcasing our approaches to preserving prior knowledge when encoding features of diverse types into embedding representations. We specifically address sequence features, numeric features, and pre-trained embedding features. Subsequentl… ▽ More

    Submitted 5 July, 2024; v1 submitted 22 February, 2024; originally announced March 2024.

    Journal ref: SIGKDD 2024

  4. arXiv:2308.13537  [pdf, other

    cs.IR cs.LG

    STEM: Unleashing the Power of Embeddings for Multi-task Recommendation

    Authors: Liangcai Su, Junwei Pan, Ximei Wang, Xi Xiao, Shijie Quan, Xihua Chen, Jie Jiang

    Abstract: Multi-task learning (MTL) has gained significant popularity in recommender systems as it enables simultaneous optimization of multiple objectives. A key challenge in MTL is negative transfer, but existing studies explored negative transfer on all samples, overlooking the inherent complexities within them. We split the samples according to the relative amount of positive feedback among tasks. Surpr… ▽ More

    Submitted 6 January, 2024; v1 submitted 16 August, 2023; originally announced August 2023.

  5. arXiv:2308.11131  [pdf, other

    cs.IR cs.AI

    ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation

    Authors: Jianghao Lin, Rong Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, Shigang Quan, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: With large language models (LLMs) achieving remarkable breakthroughs in natural language processing (NLP) domains, LLM-enhanced recommender systems have received much attention and have been actively explored currently. In this paper, we focus on adapting and empowering a pure large language model for zero-shot and few-shot recommendation tasks. First and foremost, we identify and formulate the li… ▽ More

    Submitted 26 June, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted by WWW 2024. Full and More Readable Version

  6. arXiv:2308.05872  [pdf, other

    cs.CV

    Vision Backbone Enhancement via Multi-Stage Cross-Scale Attention

    Authors: Liang Shang, Yanli Liu, Zhengyang Lou, Shuxue Quan, Nagesh Adluru, Bochen Guan, William A. Sethares

    Abstract: Convolutional neural networks (CNNs) and vision transformers (ViTs) have achieved remarkable success in various vision tasks. However, many architectures do not consider interactions between feature maps from different stages and scales, which may limit their performance. In this work, we propose a simple add-on attention module to overcome these limitations via multi-stage and cross-scale interac… ▽ More

    Submitted 14 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

  7. A Transformer-based Diffusion Probabilistic Model for Heart Rate and Blood Pressure Forecasting in Intensive Care Unit

    Authors: Ping Chang, Huayu Li, Stuart F. Quan, Shuyang Lu, Shu-Fen Wung, Janet Roveda, Ao Li

    Abstract: Background and Objective: Vital sign monitoring in the Intensive Care Unit (ICU) is crucial for enabling prompt interventions for patients. This underscores the need for an accurate predictive system. Therefore, this study proposes a novel deep learning approach for forecasting Heart Rate (HR), Systolic Blood Pressure (SBP), and Diastolic Blood Pressure (DBP) in the ICU. Methods: We extracted… ▽ More

    Submitted 3 April, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

  8. arXiv:2110.10842  [pdf, other

    cs.CV cs.AI eess.IV

    SMOF: Squeezing More Out of Filters Yields Hardware-Friendly CNN Pruning

    Authors: Yanli Liu, Bochen Guan, Qinwen Xu, Weiyi Li, Shuxue Quan

    Abstract: For many years, the family of convolutional neural networks (CNNs) has been a workhorse in deep learning. Recently, many novel CNN structures have been designed to address increasingly challenging tasks. To make them work efficiently on edge devices, researchers have proposed various structured network pruning strategies to reduce their memory and computational cost. However, most of them only foc… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: 11 pages, 4 figures

  9. arXiv:2104.14837  [pdf, other

    cs.CV

    RobustFusion: Robust Volumetric Performance Reconstruction under Human-object Interactions from Monocular RGBD Stream

    Authors: Zhuo Su, Lan Xu, Dawei Zhong, Zhong Li, Fan Deng, Shuxue Quan, Lu Fang

    Abstract: High-quality 4D reconstruction of human performance with complex interactions to various objects is essential in real-world scenarios, which enables numerous immersive VR/AR applications. However, recent advances still fail to provide reliable performance reconstruction, suffering from challenging interaction patterns and severe occlusions, especially for the monocular setting. To fill this gap, i… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: 16 pages, 18 figures. Under review by IEEE TPAMI

  10. arXiv:2012.06734  [pdf, other

    cs.CV

    PoP-Net: Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image

    Authors: Yuliang Guo, Zhong Li, Zekun Li, Xiangyu Du, Shuxue Quan, Yi Xu

    Abstract: In this paper, a real-time method called PoP-Net is proposed to predict multi-person 3D poses from a depth image. PoP-Net learns to predict bottom-up part representations and top-down global poses in a single shot. Specifically, a new part-level representation, called Truncated Part Displacement Field (TPDF), is introduced which enables an explicit fusion process to unify the advantages of bottom-… ▽ More

    Submitted 24 November, 2021; v1 submitted 12 December, 2020; originally announced December 2020.

  11. arXiv:2011.04862  [pdf, other

    cs.CV cs.RO

    On Efficient and Robust Metrics for RANSAC Hypotheses and 3D Rigid Registration

    Authors: Jiaqi Yang, Zhiqiang Huang, Siwen Quan, Qian Zhang, Yanning Zhang, Zhiguo Cao

    Abstract: This paper focuses on developing efficient and robust evaluation metrics for RANSAC hypotheses to achieve accurate 3D rigid registration. Estimating six-degree-of-freedom (6-DoF) pose from feature correspondences remains a popular approach to 3D rigid registration, where random sample consensus (RANSAC) is a de-facto choice to this problem. However, existing metrics for RANSAC hypotheses are eithe… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

  12. arXiv:2008.06655  [pdf, other

    cs.CV

    Object Detection in the Context of Mobile Augmented Reality

    Authors: Xiang Li, Yuan Tian, Fuyao Zhang, Shuxue Quan, Yi Xu

    Abstract: In the past few years, numerous Deep Neural Network (DNN) models and frameworks have been developed to tackle the problem of real-time object detection from RGB images. Ordinary object detection approaches process information from the images only, and they are oblivious to the camera pose with regard to the environment and the scale of the environment. On the other hand, mobile Augmented Reality (… ▽ More

    Submitted 15 August, 2020; originally announced August 2020.

    Comments: accepted to IEEE International Symposium on Mixed and Augmented Reality (ISMAR) 2020

  13. arXiv:2007.10570  [pdf, other

    cs.CV

    3D Correspondence Grouping with Compatibility Features

    Authors: Jiaqi Yang, Jiahao Chen, Zhiqiang Huang, Siwen Quan, Yanning Zhang, Zhiguo Cao

    Abstract: We present a simple yet effective method for 3D correspondence grouping. The objective is to accurately classify initial correspondences obtained by matching local geometric descriptors into inliers and outliers. Although the spatial distribution of correspondences is irregular, inliers are expected to be geometrically compatible with each other. Based on such observation, we propose a novel repre… ▽ More

    Submitted 20 July, 2020; originally announced July 2020.

  14. Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching

    Authors: Jiaqi Yang, Siwen Quan, Peng Wang, Yanning Zhang

    Abstract: Local geometric descriptors remain an essential component for 3D rigid data matching and fusion. The devise of a rotational invariant local geometric descriptor usually consists of two steps: local reference frame (LRF) construction and feature representation. Existing evaluation efforts have mainly been paid on the LRF or the overall descriptor, yet the quantitative comparison of feature represen… ▽ More

    Submitted 29 June, 2019; originally announced July 2019.