Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–5 of 5 results for author: Guan, H

Searching in archive stat. Search in all archives.
.
  1. arXiv:2302.00564  [pdf, other

    cs.LG stat.ML

    Automatically Marginalized MCMC in Probabilistic Programming

    Authors: Jinlin Lai, Javier Burroni, Hui Guan, Daniel Sheldon

    Abstract: Hamiltonian Monte Carlo (HMC) is a powerful algorithm to sample latent variables from Bayesian models. The advent of probabilistic programming languages (PPLs) frees users from writing inference algorithms and lets users focus on modeling. However, many models are difficult for HMC to solve directly, and often require tricks like model reparameterization. We are motivated by the fact that many of… ▽ More

    Submitted 1 June, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted to the 40th International Conference on Machine Learning (ICML 2023)

  2. arXiv:2008.02014  [pdf, other

    cs.LG cs.IR stat.ML

    Optimizing AD Pruning of Sponsored Search with Reinforcement Learning

    Authors: Yijiang Lian, Zhijie Chen, Xin Pei, Shuang Li, Yifei Wang, Yuefeng Qiu, Zhiheng Zhang, Zhipeng Tao, Liang Yuan, Hanju Guan, Kefeng Zhang, Zhigang Li, Xiaochun Liu

    Abstract: Industrial sponsored search system (SSS) can be logically divided into three modules: keywords matching, ad retrieving, and ranking. During ad retrieving, the ad candidates grow exponentially. A query with high commercial value might retrieve a great deal of ad candidates such that the ranking module could not afford. Due to limited latency and computing resources, the candidates have to be pruned… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

  3. arXiv:1911.02079  [pdf, other

    cs.LG cs.IR stat.ML

    Post-Training 4-bit Quantization on Embedding Tables

    Authors: Hui Guan, Andrey Malevich, Jiyan Yang, Jongsoo Park, Hector Yuen

    Abstract: Continuous representations have been widely adopted in recommender systems where a large number of entities are represented using embedding vectors. As the cardinality of the entities increases, the embedding components can easily contain millions of parameters and become the bottleneck in both storage and inference due to large memory consumption. This work focuses on post-training 4-bit quantiza… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: Accepted in MLSys@NeurIPS'19 (http://learningsys.org/neurips19/)

  4. arXiv:1910.14479  [pdf, other

    cs.LG stat.ML

    In-Place Zero-Space Memory Protection for CNN

    Authors: Hui Guan, Lin Ning, Zhen Lin, Xipeng Shen, Huiyang Zhou, Seung-Hwan Lim

    Abstract: Convolutional Neural Networks (CNN) are being actively explored for safety-critical applications such as autonomous vehicles and aerospace, where it is essential to ensure the reliability of inference results in the presence of possible memory faults. Traditional methods such as error correction codes (ECC) and Triple Modular Redundancy (TMR) are CNN-oblivious and incur substantial memory overhead… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

    Comments: Accepted in NeurIPS'19

  5. arXiv:1905.07297  [pdf, other

    cs.LG stat.ML

    MOBA: A multi-objective bounded-abstention model for two-class cost-sensitive problems

    Authors: Hongjiao Guan

    Abstract: Abstaining classifiers have been widely used in cost-sensitive applications to avoid ambiguous classification and reduce the cost of misclassification. Previous abstaining classification models rely on cost information, such as a cost matrix or cost ratio. However, it is difficult to obtain or estimate costs in practical applications. Furthermore, these abstention models are typically restricted t… ▽ More

    Submitted 17 May, 2019; originally announced May 2019.