Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 110 results for author: Shi, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02681  [pdf, other

    cs.LG eess.IV math.OC stat.ML

    Uniform Transformation: Refining Latent Representation in Variational Autoencoders

    Authors: Ye Shi, C. S. George Lee

    Abstract: Irregular distribution in latent space causes posterior collapse, misalignment between posterior and prior, and ill-sampling problem in Variational Autoencoders (VAEs). In this paper, we introduce a novel adaptable three-stage Uniform Transformation (UT) module -- Gaussian Kernel Density Estimation (G-KDE) clustering, non-parametric Gaussian Mixture (GM) Modeling, and Probability Integral Transfor… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 IEEE 20th International Conference on Automation Science and Engineering

  2. arXiv:2406.12171  [pdf, other

    stat.ME stat.AP

    Model Selection for Causal Modeling in Missing Exposure Problems

    Authors: Yuliang Shi, Yeying Zhu, Joel A. Dubin

    Abstract: In causal inference, properly selecting the propensity score (PS) model is a popular topic and has been widely investigated in observational studies. In addition, there is a large literature concerning the missing data problem. However, there are very few studies investigating the model selection issue for causal inference when the exposure is missing at random (MAR). In this paper, we discuss how… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2406.08668  [pdf, other

    stat.ME

    Causal Inference on Missing Exposure via Robust Estimation

    Authors: Yuliang Shi, Yeying Zhu, Joel A. Dubin

    Abstract: How to deal with missing data in observational studies is a common concern for causal inference. When the covariates are missing at random (MAR), multiple approaches have been provided to help solve the issue. However, if the exposure is MAR, few approaches are available and careful adjustments on both missingness and confounding issues are required to ensure a consistent estimate of the true caus… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  4. arXiv:2404.02986  [pdf, other

    cs.LG stat.ML

    Universal Functional Regression with Neural Operator Flows

    Authors: Yaozhong Shi, Angela F. Gao, Zachary E. Ross, Kamyar Azizzadenesheli

    Abstract: Regression on function spaces is typically limited to models with Gaussian process priors. We introduce the notion of universal functional regression, in which we aim to learn a prior distribution over non-Gaussian function spaces that remains mathematically tractable for functional regression. To do this, we develop Neural Operator Flows (OpFlow), an infinite-dimensional extension of normalizing… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  5. arXiv:2310.09583  [pdf, other

    cs.LG stat.ML

    Two Sides of The Same Coin: Bridging Deep Equilibrium Models and Neural ODEs via Homotopy Continuation

    Authors: Shutong Ding, Tianyu Cui, Jingya Wang, Ye Shi

    Abstract: Deep Equilibrium Models (DEQs) and Neural Ordinary Differential Equations (Neural ODEs) are two branches of implicit models that have achieved remarkable success owing to their superior performance and low memory consumption. While both are implicit models, DEQs and Neural ODEs are derived from different mathematical formulations. Inspired by homotopy continuation, we establish a connection betwee… ▽ More

    Submitted 21 December, 2023; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS2023

  6. arXiv:2310.04919  [pdf, other

    stat.ME cs.LG stat.ML

    The Conditional Prediction Function: A Novel Technique to Control False Discovery Rate for Complex Models

    Authors: Yushu Shi, Michael Martens

    Abstract: In modern scientific research, the objective is often to identify which variables are associated with an outcome among a large class of potential predictors. This goal can be achieved by selecting variables in a manner that controls the the false discovery rate (FDR), the proportion of irrelevant predictors among the selections. Knockoff filtering is a cutting-edge approach to variable selection t… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

  7. arXiv:2309.09831  [pdf, other

    math.ST stat.ML

    Pivotal Estimation of Linear Discriminant Analysis in High Dimensions

    Authors: Ethan X. Fang, Yajun Mei, Yuyang Shi, Qunzhi Xu, Tuo Zhao

    Abstract: We consider the linear discriminant analysis problem in the high-dimensional settings. In this work, we propose PANDA(PivotAl liNear Discriminant Analysis), a tuning-insensitive method in the sense that it requires very little effort to tune the parameters. Moreover, we prove that PANDA achieves the optimal convergence rate in terms of both the estimation error and misclassification rate. Our theo… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  8. arXiv:2309.08109  [pdf, other

    stat.ME

    CAT: a conditional association test for microbiome data using a leave-out approach

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert R. Jenq, Christine B. Peterson

    Abstract: In microbiome analysis, researchers often seek to identify taxonomic features associated with an outcome of interest. However, microbiome features are intercorrelated and linked by phylogenetic relationships, making it challenging to assess the association between an individual feature and an outcome. Researchers have developed global tests for the association of microbiome profiles with outcomes… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  9. arXiv:2308.13737  [pdf, other

    stat.AP

    survivalContour: Visualizing predicted survival via colored contour plots

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert R. Jenq, Christine B. Peterson

    Abstract: Advances in survival analysis have facilitated unprecedented flexibility in data modeling, yet there remains a lack of tools for graphically illustrating the influence of continuous covariates on predicted survival outcomes. We propose the utilization of a colored contour plot to depict the predicted survival probabilities over time, and provide a Shiny app and R package as implementations of this… ▽ More

    Submitted 12 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  10. arXiv:2308.13298  [pdf, other

    cs.LG eess.SP stat.ML

    Federated Linear Bandit Learning via Over-the-Air Computation

    Authors: Jiali Wang, Yuning Jiang, Xin Liu, Ting Wang, Yuanming Shi

    Abstract: In this paper, we investigate federated contextual linear bandit learning within a wireless system that comprises a server and multiple devices. Each device interacts with the environment, selects an action based on the received reward, and sends model updates to the server. The primary objective is to minimize cumulative regret across all devices within a finite time horizon. To reduce the commun… ▽ More

    Submitted 28 August, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  11. arXiv:2308.12016  [pdf, ps, other

    stat.ML cs.LG

    MKL-$L_{0/1}$-SVM

    Authors: Bin Zhu, Yijie Shi

    Abstract: This paper presents a Multiple Kernel Learning (abbreviated as MKL) framework for the Support Vector Machine (SVM) with the $(0, 1)$ loss function. Some KKT-like first-order optimality conditions are provided and then exploited to develop a fast ADMM algorithm to solve the nonsmooth nonconvex optimization problem. Numerical experiments on real data sets show that the performance of our MKL-… ▽ More

    Submitted 3 September, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 26 pages in the JMLR template, 3 figures, and 2 tables, submitted to the Journal of Machine Learning Research, with minor text overlap with arXiv: 2303.04445 (conference version). arXiv admin note: text overlap with arXiv:2303.04445

  12. arXiv:2307.16360  [pdf, other

    cs.LG stat.ML

    Probabilistically robust conformal prediction

    Authors: Subhankar Ghosh, Yuanjie Shi, Taha Belkhouja, Yan Yan, Jana Doppa, Brian Jones

    Abstract: Conformal prediction (CP) is a framework to quantify uncertainty of machine learning classifiers including deep neural networks. Given a testing example and a trained classifier, CP produces a prediction set of candidate labels with a user-specified coverage (i.e., true class label is contained with high probability). Almost all the existing work on CP assumes clean testing data and there is not m… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, 2023

    Journal ref: Uncertainty in Artificial Intelligence. PMLR 216:681-690, 2023

  13. arXiv:2303.16852  [pdf, other

    stat.ML cs.LG

    Diffusion Schrödinger Bridge Matching

    Authors: Yuyang Shi, Valentin De Bortoli, Andrew Campbell, Arnaud Doucet

    Abstract: Solving transport problems, i.e. finding a map transporting one given distribution to another, has numerous applications in machine learning. Novel mass transport methods motivated by generative modeling have recently been proposed, e.g. Denoising Diffusion Models (DDMs) and Flow Matching Models (FMMs) implement such a transport through a Stochastic Differential Equation (SDE) or an Ordinary Diffe… ▽ More

    Submitted 11 December, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

  14. arXiv:2303.04445  [pdf, ps, other

    stat.ML cs.LG

    An ADMM Solver for the MKL-$L_{0/1}$-SVM

    Authors: Yijie Shi, Bin Zhu

    Abstract: We formulate the Multiple Kernel Learning (abbreviated as MKL) problem for the support vector machine with the infamous $(0,1)$-loss function. Some first-order optimality conditions are given and then exploited to develop a fast ADMM solver for the nonconvex and nonsmooth optimization problem. A simple numerical experiment on synthetic planar data shows that our MKL-$L_{0/1}$-SVM framework could b… ▽ More

    Submitted 30 March, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures, 2 tables. Submitted to the 62nd IEEE Conference on Decision and Control as a Regular paper, with a shortened version (arXiv version 1) submitted to the 3rd Chinese Conference on Predictive Control and Intelligent Decision (CPCID) as an Extended Abstract

  15. Bayesian Methods in Tensor Analysis

    Authors: Yiyao Shi, Weining Shen

    Abstract: Tensors, also known as multidimensional arrays, are useful data structures in machine learning and statistics. In recent years, Bayesian methods have emerged as a popular direction for analyzing tensor-valued data since they provide a convenient way to introduce sparsity into the model and conduct uncertainty quantification. In this article, we provide an overview of frequentist and Bayesian metho… ▽ More

    Submitted 5 June, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: 32 pages, 8 figures, 2 tables

    Journal ref: Statistics and Its Interface, Vol. 17, No. 2 (2024), pp. 249-274

  16. arXiv:2211.04725  [pdf, other

    stat.ME

    Single Parameter Inference of Non-sparse Logistic Regression Models

    Authors: Yanmei Shi, QiZhang

    Abstract: This paper infers a single parameter in non-sparse logistic regression models. By transforming the null hypothesis into a moment condition, we construct the test statistic and obtain the asymptotic null distribution. Numerical experiments show that our method performs well.

    Submitted 9 November, 2022; originally announced November 2022.

  17. arXiv:2211.03595  [pdf, other

    stat.ML cs.LG

    From Denoising Diffusions to Denoising Markov Models

    Authors: Joe Benton, Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusions are state-of-the-art generative models exhibiting remarkable empirical performance. They work by diffusing the data distribution into a Gaussian distribution and then learning to reverse this noising process to obtain synthetic datapoints. The denoising diffusion relies on approximations of the logarithmic derivatives of the noised data densities using score matching. Such mod… ▽ More

    Submitted 18 February, 2024; v1 submitted 7 November, 2022; originally announced November 2022.

  18. arXiv:2210.06226  [pdf, other

    stat.ML cs.LG

    Alpha-divergence Variational Inference Meets Importance Weighted Auto-Encoders: Methodology and Asymptotics

    Authors: Kamélia Daudel, Joe Benton, Yuyang Shi, Arnaud Doucet

    Abstract: Several algorithms involving the Variational Rényi (VR) bound have been proposed to minimize an alpha-divergence between a target posterior distribution and a variational distribution. Despite promising empirical results, those algorithms resort to biased stochastic gradient descent procedures and thus lack theoretical guarantees. In this paper, we formalize and study the VR-IWAE bound, a generali… ▽ More

    Submitted 19 July, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  19. arXiv:2210.04268  [pdf, other

    stat.ME math.ST

    A Locally Adaptive Shrinkage Approach to False Selection Rate Control in High-Dimensional Classification

    Authors: Bowen Gang, Yuantao Shi, Wenguang Sun

    Abstract: The uncertainty quantification and error control of classifiers are crucial in many high-consequence decision-making scenarios. We propose a selective classification framework that provides an indecision option for any observations that cannot be classified with confidence. The false selection rate (FSR), defined as the expected fraction of erroneous classifications among all definitive classifica… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  20. arXiv:2206.08994  [pdf, other

    stat.ML cs.CV cs.LG math.NA

    Robust Group Synchronization via Quadratic Programming

    Authors: Yunpeng Shi, Cole Wyeth, Gilad Lerman

    Abstract: We propose a novel quadratic programming formulation for estimating the corruption levels in group synchronization, and use these estimates to solve this problem. Our objective function exploits the cycle consistency of the group and we thus refer to our method as detection and estimation of structural consistency (DESC). This general framework can be extended to other algebraic and geometric stru… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to ICML 2022

    MSC Class: 90C26; 90C17; 68Q87; 65C20; 90-08; 60-08 ACM Class: G.1.6; I.4.0

  21. arXiv:2206.08871  [pdf, other

    cs.LG stat.ML

    How Robust is Unsupervised Representation Learning to Distribution Shift?

    Authors: Yuge Shi, Imant Daunhawer, Julia E. Vogt, Philip H. S. Torr, Amartya Sanyal

    Abstract: The robustness of machine learning algorithms to distributions shift is primarily discussed in the context of supervised learning (SL). As such, there is a lack of insight on the robustness of the representations learned from unsupervised methods, such as self-supervised learning (SSL) and auto-encoder based algorithms (AE), to distribution shift. We posit that the input-driven objectives of unsup… ▽ More

    Submitted 16 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  22. arXiv:2206.01704  [pdf, ps, other

    cs.LG eess.SY math.OC stat.ML

    KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed Stability in Nonlinear Dynamical Systems

    Authors: Sahin Lale, Yuanyuan Shi, Guannan Qu, Kamyar Azizzadenesheli, Adam Wierman, Anima Anandkumar

    Abstract: Learning a dynamical system requires stabilizing the unknown dynamics to avoid state blow-ups. However, current reinforcement learning (RL) methods lack stabilization guarantees, which limits their applicability for the control of safety-critical systems. We propose a model-based RL framework with formal stability guarantees, Krasovskii Constrained RL (KCRL), that adopts Krasovskii's family of Lya… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

  23. arXiv:2203.16505  [pdf, other

    cs.CV math.NA stat.ML

    Fast, Accurate and Memory-Efficient Partial Permutation Synchronization

    Authors: Shaohan Li, Yunpeng Shi, Gilad Lerman

    Abstract: Previous partial permutation synchronization (PPS) algorithms, which are commonly used for multi-object matching, often involve computation-intensive and memory-demanding matrix operations. These operations become intractable for large scale structure-from-motion datasets. For pure permutation synchronization, the recent Cycle-Edge Message Passing (CEMP) framework suggests a memory-efficient and f… ▽ More

    Submitted 31 March, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

    MSC Class: 90C26; 90C10; 90C17; 68Q87; 65C20

  24. arXiv:2202.13460  [pdf, other

    stat.ML cs.LG

    Conditional Simulation Using Diffusion Schrödinger Bridges

    Authors: Yuyang Shi, Valentin De Bortoli, George Deligiannidis, Arnaud Doucet

    Abstract: Denoising diffusion models have recently emerged as a powerful class of generative models. They provide state-of-the-art results, not only for unconditional simulation, but also when used to solve conditional simulation problems arising in a wide range of inverse problems. A limitation of these models is that they are computationally intensive at generation time as they require simulating a diffus… ▽ More

    Submitted 26 June, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: 29 pages, 15 figures. UAI 2022 camera-ready version

  25. arXiv:2202.11455  [pdf, other

    cs.LG cs.CV math.ST stat.ML

    On PAC-Bayesian reconstruction guarantees for VAEs

    Authors: Badr-Eddine Chérief-Abdellatif, Yuyang Shi, Arnaud Doucet, Benjamin Guedj

    Abstract: Despite its wide use and empirical successes, the theoretical understanding and study of the behaviour and performance of the variational autoencoder (VAE) have only emerged in the past few years. We contribute to this recent line of work by analysing the VAE's reconstruction ability for unseen test data, leveraging arguments from the PAC-Bayes theory. We provide generalisation bounds on the theor… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 14 pages

    Journal ref: Proceedings of the 25th International Conference on Artificial Intelligence and Statistics (AISTATS) 2022, Valencia, Spain. PMLR: Volume 151

  26. arXiv:2202.06383  [pdf, other

    cs.LG stat.AP

    Surgical Scheduling via Optimization and Machine Learning with Long-Tailed Data

    Authors: Yuan Shi, Saied Mahdian, Jose Blanchet, Peter Glynn, Andrew Y. Shin, David Scheinker

    Abstract: Using data from cardiovascular surgery patients with long and highly variable post-surgical lengths of stay (LOS), we develop a modeling framework to reduce recovery unit congestion. We estimate the LOS and its probability distribution using machine learning models, schedule procedures on a rolling basis using a variety of optimization models, and estimate performance with simulation. The machine… ▽ More

    Submitted 28 November, 2022; v1 submitted 13 February, 2022; originally announced February 2022.

  27. arXiv:2112.07746  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning

    Authors: Kevin Huang, Sahin Lale, Ugo Rosolia, Yuanyuan Shi, Anima Anandkumar

    Abstract: Current state-of-the-art model-based reinforcement learning algorithms use trajectory sampling methods, such as the Cross-Entropy Method (CEM), for planning in continuous control settings. These zeroth-order optimizers require sampling a large number of trajectory rollouts to select an optimal action, which scales poorly for large prediction horizons or high dimensional action spaces. First-order… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  28. arXiv:2111.06985  [pdf, other

    stat.ME

    Bayesian Knockoff Generators for Robust Inference Under Complex Data Structure

    Authors: Michael J. Martens, Anjishnu Banerjee, Xinran Qi, Yushu Shi

    Abstract: The recent proliferation of medical data, such as genetics and electronic health records (EHR), offers new opportunities to find novel predictors of health outcomes. Presented with a large set of candidate features, interest often lies in selecting the ones most likely to be predictive of an outcome for further study such that the goal is to control the false discovery rate (FDR) at a specified le… ▽ More

    Submitted 12 November, 2021; originally announced November 2021.

  29. arXiv:2111.01395  [pdf, other

    cs.LG cs.CR stat.ML

    Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds

    Authors: Yujia Huang, Huan Zhang, Yuanyuan Shi, J Zico Kolter, Anima Anandkumar

    Abstract: Certified robustness is a desirable property for deep neural networks in safety-critical applications, and popular training algorithms can certify robustness of a neural network by computing a global bound on its Lipschitz constant. However, such a bound is often loose: it tends to over-regularize the neural network and degrade its natural accuracy. A tighter Lipschitz bound may provide a better t… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

  30. arXiv:2110.13549  [pdf, other

    stat.ML cs.LG stat.CO

    Online Variational Filtering and Parameter Learning

    Authors: Andrew Campbell, Yuyang Shi, Tom Rainforth, Arnaud Doucet

    Abstract: We present a variational method for online state estimation and parameter learning in state-space models (SSMs), a ubiquitous class of latent variable models for sequential data. As per standard batch variational techniques, we use stochastic gradients to simultaneously optimize a lower bound on the log evidence with respect to both model parameters and a variational approximation of the states' p… ▽ More

    Submitted 14 June, 2022; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: 27 pages, 6 figures. NeurIPS 2021 (Oral); updated references

  31. arXiv:2110.07818  [pdf, other

    q-bio.QM stat.ME

    A novel framework to quantify uncertainty in peptide-tandem mass spectrum matches with application to nanobody peptide identification

    Authors: Chris McKennan, Zhe Sang, Yi Shi

    Abstract: Nanobodies are small antibody fragments derived from camelids that selectively bind to antigens. These proteins have marked physicochemical properties that support advanced therapeutics, including treatments for SARS-CoV-2. To realize their potential, bottom-up proteomics via liquid chromatography-tandem mass spectrometry (LC-MS/MS) has been proposed to identify antigen-specific nanobodies at the… ▽ More

    Submitted 14 October, 2021; originally announced October 2021.

    Comments: 19 pages, 7 figures in the main text; 59 pages, 15 figures including supplement

  32. arXiv:2109.08139  [pdf, ps, other

    eess.SP cs.LG cs.NI stat.ML

    Adversarial Attacks against Deep Learning Based Power Control in Wireless Communications

    Authors: Brian Kim, Yi Shi, Yalin E. Sagduyu, Tugba Erpek, Sennur Ulukus

    Abstract: We consider adversarial machine learning based attacks on power allocation where the base station (BS) allocates its transmit power to multiple orthogonal subcarriers by using a deep neural network (DNN) to serve multiple user equipments (UEs). The DNN that corresponds to a regression model is trained with channel gains as the input and returns transmit powers as the output. While the BS allocates… ▽ More

    Submitted 12 October, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  33. arXiv:2104.12953  [pdf, other

    cs.LG cs.AI stat.ML

    Exploring Uncertainty in Deep Learning for Construction of Prediction Intervals

    Authors: Yuandu Lai, Yucheng Shi, Yahong Han, Yunfeng Shao, Meiyu Qi, Bingshuai Li

    Abstract: Deep learning has achieved impressive performance on many tasks in recent years. However, it has been found that it is still not enough for deep neural networks to provide only point estimates. For high-risk tasks, we need to assess the reliability of the model predictions. This requires us to quantify the uncertainty of model prediction and construct prediction intervals. In this paper, We explor… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

  34. arXiv:2104.09937  [pdf, other

    cs.LG stat.ML

    Gradient Matching for Domain Generalization

    Authors: Yuge Shi, Jeffrey Seely, Philip H. S. Torr, N. Siddharth, Awni Hannun, Nicolas Usunier, Gabriel Synnaeve

    Abstract: Machine learning systems typically assume that the distributions of training and test sets match closely. However, a critical requirement of such systems in the real world is their ability to generalize to unseen domains. Here, we propose an inter-domain gradient matching objective that targets domain generalization by maximizing the inner product between gradients from different domains. Since di… ▽ More

    Submitted 13 July, 2021; v1 submitted 20 April, 2021; originally announced April 2021.

  35. arXiv:2103.13221  [pdf, other

    stat.ME

    Mixed Effects Envelope Models

    Authors: Yuyang Shi, Linquan Ma, Lan Liu

    Abstract: When multiple measures are collected repeatedly over time, redundancy typically exists among responses. The envelope method was recently proposed to reduce the dimension of responses without loss of information in regression with multivariate responses. It can gain substantial efficiency over the standard least squares estimator. In this paper, we generalize the envelope method to mixed effects mo… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  36. arXiv:2011.04868  [pdf, other

    cs.LG math.OC stat.ML

    Neural Network Compression Via Sparse Optimization

    Authors: Tianyi Chen, Bo Ji, Yixin Shi, Tianyu Ding, Biyi Fang, Sheng Yi, Xiao Tu

    Abstract: The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network compression, while most of them are heuristic rule-based or typically not friendly to be incorporated into varying scenarios. On the other hand, sparse optimization yi… ▽ More

    Submitted 11 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

  37. arXiv:2009.03509  [pdf, other

    cs.LG stat.ML

    Masked Label Prediction: Unified Message Passing Model for Semi-Supervised Classification

    Authors: Yunsheng Shi, Zhengjie Huang, Shikun Feng, Hui Zhong, Wenjin Wang, Yu Sun

    Abstract: Graph neural network (GNN) and label propagation algorithm (LPA) are both message passing algorithms, which have achieved superior performance in semi-supervised classification. GNN performs feature propagation by a neural network to make predictions, while LPA uses label propagation across graph adjacency matrix to get results. However, there is still no effective way to directly combine these tw… ▽ More

    Submitted 9 May, 2021; v1 submitted 8 September, 2020; originally announced September 2020.

    Comments: 7 pages, 3 figures and 8 tables; Accepted by IJCAI 2021

  38. arXiv:2008.09990  [pdf, other

    cs.LG stat.ML

    Unsupervised Multi-view Clustering by Squeezing Hybrid Knowledge from Cross View and Each View

    Authors: Junpeng Tan, Yukai Shi, Zhijing Yang, Caizhen Wen, Liang Lin

    Abstract: Multi-view clustering methods have been a focus in recent years because of their superiority in clustering performance. However, typical traditional multi-view clustering algorithms still have shortcomings in some aspects, such as removal of redundant information, utilization of various views and fusion of multi-view features. In view of these problems, this paper proposes a new multi-view cluster… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

  39. arXiv:2008.08060  [pdf

    cs.LG eess.SP stat.ML

    Personalized Deep Learning for Ventricular Arrhythmias Detection on Medical IoT Systems

    Authors: Zhenge Jia, Zhepeng Wang, Feng Hong, Lichuan Ping, Yiyu Shi, Jingtong Hu

    Abstract: Life-threatening ventricular arrhythmias (VA) are the leading cause of sudden cardiac death (SCD), which is the most significant cause of natural death in the US. The implantable cardioverter defibrillator (ICD) is a small device implanted to patients under high risk of SCD as a preventive treatment. The ICD continuously monitors the intracardiac rhythm and delivers shock when detecting the life-t… ▽ More

    Submitted 18 August, 2020; originally announced August 2020.

  40. arXiv:2007.16103  [pdf, other

    cs.LG cs.CV stat.ML

    Learning-based Computer-aided Prescription Model for Parkinson's Disease: A Data-driven Perspective

    Authors: Yinghuan Shi, Wanqi Yang, Kim-Han Thung, Hao Wang, Yang Gao, Yang Pan, Li Zhang, Dinggang Shen

    Abstract: In this paper, we study a novel problem: "automatic prescription recommendation for PD patients." To realize this goal, we first build a dataset by collecting 1) symptoms of PD patients, and 2) their prescription drug provided by neurologists. Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug. Finally, for the new comi… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: IEEE JBHI 2020

  41. arXiv:2007.15812  [pdf, other

    stat.AP

    Sparse tree-based clustering of microbiome data to characterize microbiome heterogeneity in pancreatic cancer

    Authors: Yushu Shi, Liangliang Zhang, Kim-Anh Do, Robert Jenq, Christine Peterson

    Abstract: There is a keen interest in characterizing variation in the microbiome across cancer patients, given increasing evidence of its important role in determining treatment outcomes. Here our goal is to discover subgroups of patients with similar microbiome profiles. We propose a novel unsupervised clustering approach in the Bayesian framework that innovates over existing model-based clustering approac… ▽ More

    Submitted 2 December, 2022; v1 submitted 30 July, 2020; originally announced July 2020.

  42. arXiv:2007.13638  [pdf, other

    cs.CV cs.IT stat.ML

    Message Passing Least Squares Framework and its Application to Rotation Synchronization

    Authors: Yunpeng Shi, Gilad Lerman

    Abstract: We propose an efficient algorithm for solving group synchronization under high levels of corruption and noise, while we focus on rotation synchronization. We first describe our recent theoretically guaranteed message passing algorithm that estimates the corruption levels of the measured group ratios. We then propose a novel reweighted least squares method to estimate the group elements, where the… ▽ More

    Submitted 14 August, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: To Appear in ICML 2020 Proceedings

    MSC Class: 90C26; 90C17; 68Q87; 65C20; 90-08; 60-08 ACM Class: G.1.6; I.4.0

    Journal ref: International Conference on Machine Learning, 8796-8806 (2020)

  43. arXiv:2007.09087  [pdf, ps, other

    cs.LG cs.NE eess.SP stat.ML

    Standing on the Shoulders of Giants: Hardware and Neural Architecture Co-Search with Hot Start

    Authors: Weiwen Jiang, Lei Yang, Sakyasingha Dasgupta, Jingtong Hu, Yiyu Shi

    Abstract: Hardware and neural architecture co-search that automatically generates Artificial Intelligence (AI) solutions from a given dataset is promising to promote AI democratization; however, the amount of time that is required by current co-search frameworks is in the order of hundreds of GPU hours for one target hardware. This inhibits the use of such frameworks on commodity hardware. The root cause of… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

    Comments: 13 pages

  44. arXiv:2007.03213  [pdf, other

    cs.LG stat.ML

    Enabling On-Device CNN Training by Self-Supervised Instance Filtering and Error Map Pruning

    Authors: Yawen Wu, Zhepeng Wang, Yiyu Shi, Jingtong Hu

    Abstract: This work aims to enable on-device training of convolutional neural networks (CNNs) by reducing the computation cost at training time. CNN models are usually trained on high-performance computers and only the trained models are deployed to edge devices. But the statically trained model cannot adapt dynamically in a real environment and may result in low accuracy for new inputs. On-device training… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

  45. arXiv:2007.01179  [pdf, other

    cs.LG stat.ML

    Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models

    Authors: Yuge Shi, Brooks Paige, Philip H. S. Torr, N. Siddharth

    Abstract: Multimodal learning for generative models often refers to the learning of abstract concepts from the commonality of information in multiple modalities, such as vision and language. While it has proven effective for learning generalisable representations, the training of such models often requires a large amount of "related" multimodal data that shares commonality, which can be expensive to come by… ▽ More

    Submitted 21 April, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  46. arXiv:2006.10027  [pdf, other

    eess.IV cs.LG stat.ML

    Deep Learning Meets SAR

    Authors: Xiao Xiang Zhu, Sina Montazeri, Mohsin Ali, Yuansheng Hua, Yuanyuan Wang, Lichao Mou, Yilei Shi, Feng Xu, Richard Bamler

    Abstract: Deep learning in remote sensing has become an international hype, but it is mostly limited to the evaluation of optical data. Although deep learning has been introduced in Synthetic Aperture Radar (SAR) data processing, despite successful first attempts, its huge potential remains locked. In this paper, we provide an introduction to the most relevant deep learning models and concepts, point out po… ▽ More

    Submitted 5 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: article accepted by IEEE Geoscience and Remote Sensing Magazine. Copyright may be transferred without notice, after which this version may no longer be accessible

  47. arXiv:2006.06658  [pdf, other

    cs.CV math.NA math.PR stat.ML

    Robust Multi-object Matching via Iterative Reweighting of the Graph Connection Laplacian

    Authors: Yunpeng Shi, Shaohan Li, Gilad Lerman

    Abstract: We propose an efficient and robust iterative solution to the multi-object matching problem. We first clarify serious limitations of current methods as well as the inappropriateness of the standard iteratively reweighted least squares procedure. In view of these limitations, we suggest a novel and more reliable iterative reweighting strategy that incorporates information from higher-order neighborh… ▽ More

    Submitted 24 October, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    MSC Class: 90C26; 90C10; 90C17; 68Q87; 65C20 ACM Class: G.1.6; I.4.0

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS) 33, 15243--15253 (2020)

  48. arXiv:2006.04259  [pdf, other

    cs.LG stat.ML

    Deep Goal-Oriented Clustering

    Authors: Yifeng Shi, Christopher M. Bender, Junier B. Oliva, Marc Niethammer

    Abstract: Clustering and prediction are two primary tasks in the fields of unsupervised and supervised learning, respectively. Although much of the recent advances in machine learning have been centered around those two tasks, the interdependent, mutually beneficial relationship between them is rarely explored. One could reasonably expect appropriately clustering the data would aid the downstream prediction… ▽ More

    Submitted 15 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 15 pages

  49. arXiv:2005.11716  [pdf, other

    cs.LG cs.CV stat.ML

    Multi-view Alignment and Generation in CCA via Consistent Latent Encoding

    Authors: Yaxin Shi, Yuangang Pan, Donna Xu, Ivor W. Tsang

    Abstract: Multi-view alignment, achieving one-to-one correspondence of multi-view inputs, is critical in many real-world multi-view applications, especially for cross-view data analysis problems. Recently, an increasing number of works study this alignment problem with Canonical Correlation Analysis (CCA). However, existing CCA models are prone to misalign the multiple views due to either the neglect of unc… ▽ More

    Submitted 24 May, 2020; originally announced May 2020.

    Comments: 37 pages, 22 figures

  50. arXiv:2005.02153  [pdf, other

    cs.CV cs.LG stat.ML

    Improving Target-driven Visual Navigation with Attention on 3D Spatial Relationships

    Authors: Yunlian Lv, Ning Xie, Yimin Shi, Zijiao Wang, Heng Tao Shen

    Abstract: Embodied artificial intelligence (AI) tasks shift from tasks focusing on internet images to active settings involving embodied agents that perceive and act within 3D environments. In this paper, we investigate the target-driven visual navigation using deep reinforcement learning (DRL) in 3D indoor scenes, whose navigation task aims to train an agent that can intelligently make a series of decision… ▽ More

    Submitted 29 April, 2020; originally announced May 2020.

    Comments: 12 pages, 9 figures