Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 56 results for author: Hu, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.03734  [pdf, other

    cs.HC cs.AI stat.AP

    FOKE: A Personalized and Explainable Education Framework Integrating Foundation Models, Knowledge Graphs, and Prompt Engineering

    Authors: Silan Hu, Xiaoning Wang

    Abstract: Integrating large language models (LLMs) and knowledge graphs (KGs) holds great promise for revolutionizing intelligent education, but challenges remain in achieving personalization, interactivity, and explainability. We propose FOKE, a Forest Of Knowledge and Education framework that synergizes foundation models, knowledge graphs, and prompt engineering to address these challenges. FOKE introduce… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  2. arXiv:2404.19292  [pdf, other

    cs.IT cs.LG cs.MA stat.ML

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

    Authors: Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

    Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS). These algorithms draw inspiration from foundational concepts in information theory, and are proven to be sample efficient in MARL settings such as two-player zero-sum Markov games (MGs) and multi-player general-sum MGs. For episodic t… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  3. arXiv:2404.09729  [pdf

    eess.SP cs.IT cs.LG stat.ME

    Amplitude-Phase Fusion for Enhanced Electrocardiogram Morphological Analysis

    Authors: Shuaicong Hu, Yanan Wang, Jian Liu, Jingyu Lin, Shengmei Qin, Zhenning Nie, Zhifeng Yao, Wenjie Cai, Cuiwei Yang

    Abstract: Considering the variability of amplitude and phase patterns in electrocardiogram (ECG) signals due to cardiac activity and individual differences, existing entropy-based studies have not fully utilized these two patterns and lack integration. To address this gap, this paper proposes a novel fusion entropy metric, morphological ECG entropy (MEE) for the first time, specifically designed for ECG mor… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages, 12 figures

    ACM Class: I.5.2

  4. arXiv:2404.03701  [pdf, other

    cs.LG stat.ML

    Predictive Analytics of Varieties of Potatoes

    Authors: Fabiana Ferracina, Bala Krishnamoorthy, Mahantesh Halappanavar, Shengwei Hu, Vidyasagar Sathuvalli

    Abstract: We explore the application of machine learning algorithms to predict the suitability of Russet potato clones for advancement in breeding trials. Leveraging data from manually collected trials in the state of Oregon, we investigate the potential of a wide variety of state-of-the-art binary classification models. We conduct a comprehensive analysis of the dataset that includes preprocessing, feature… ▽ More

    Submitted 18 April, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: 19 pages, 3 figures, submitted to Artificial Intelligence in Agriculture

  5. arXiv:2403.05425  [pdf, ps, other

    stat.ML stat.ME

    An Adaptive Dimension Reduction Estimation Method for High-dimensional Bayesian Optimization

    Authors: Shouri Hu, Jiawei Li, Zhibo Cai

    Abstract: Bayesian optimization (BO) has shown impressive results in a variety of applications within low-to-moderate dimensional Euclidean spaces. However, extending BO to high-dimensional settings remains a significant challenge. We address this challenge by proposing a two-step optimization framework. Initially, we identify the effective dimension reduction (EDR) subspace for the objective function using… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: First draft

  6. arXiv:2402.00728  [pdf, other

    cs.LG stat.ML

    Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation

    Authors: Hsiang Hsu, Guihong Li, Shaohan Hu, Chun-Fu, Chen

    Abstract: Predictive multiplicity refers to the phenomenon in which classification tasks may admit multiple competing models that achieve almost-equally-optimal performance, yet generate conflicting outputs for individual samples. This presents significant concerns, as it can potentially result in systemic exclusion, inexplicable discrimination, and unfairness in practical applications. Measuring and mitiga… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: ICLR 2024

  7. arXiv:2309.13557  [pdf, other

    stat.CO math.NA

    Bayesian Parameter Inference for Partially Observed Diffusions using Multilevel Stochastic Runge-Kutta Methods

    Authors: Pierre Del Moral, Shulan Hu, Ajay Jasra, Hamza Ruzayqat, Xinyu Wang

    Abstract: We consider the problem of Bayesian estimation of static parameters associated to a partially and discretely observed diffusion process. We assume that the exact transition dynamics of the diffusion process are unavailable, even up-to an unbiased estimator and that one must time-discretize the diffusion process. In such scenarios it has been shown how one can introduce the multilevel Monte Carlo m… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  8. arXiv:2309.05145  [pdf, other

    cs.LG cs.AI stat.ML

    Outlier Robust Adversarial Training

    Authors: Shu Hu, Zhenhuan Yang, Xin Wang, Yiming Ying, Siwei Lyu

    Abstract: Supervised learning models are challenged by the intrinsic complexities of training data such as outliers and minority subpopulations and intentional attacks at inference time with adversarial samples. While traditional robust learning methods and the recent adversarial training approaches are designed to handle each of the two challenges, to date, no work has been done to develop models that are… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: Accepted by The 15th Asian Conference on Machine Learning (ACML 2023)

  9. arXiv:2308.04158  [pdf, other

    stat.ME

    A Dual Cox Model Theory And Its Applications In Oncology

    Authors: Powei Chen, Siying Hu, Dr. Haojin Zhou

    Abstract: Given the prominence of targeted therapy and immunotherapy in cancer treatment, it becomes imperative to consider heterogeneity in patients' responses to treatments, which contributes greatly to the widely used proportional hazard assumption invalidated as in several clinical trials. To address the challenge, we develop a Dual Cox model theory including a Dual Cox model and a fitting algorithm.… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  10. arXiv:2211.10508  [pdf, other

    stat.ML cs.LG

    Distributionally Robust Survival Analysis: A Novel Fairness Loss Without Demographics

    Authors: Shu Hu, George H. Chen

    Abstract: We propose a general approach for training survival analysis models that minimizes a worst-case error across all subpopulations that are large enough (occurring with at least a user-specified minimum probability). This approach uses a training loss function that does not know any demographic information to treat as sensitive. Despite this, we demonstrate that our proposed approach often scores bet… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

    Comments: Machine Learning for Health (ML4H 2022)

  11. arXiv:2209.00383  [pdf, other

    cs.CV stat.ML

    TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Yuan Yuan, Yuming Du, Maomao Li, Shell Xu Hu, James L Crowley, Dominique Vaufreydaz

    Abstract: In this paper, we describe a graph-based algorithm that uses the features obtained by a self-supervised transformer to detect and segment salient objects in images and videos. With this approach, the image patches that compose an image or video are organised into a fully connected graph, where the edge between each pair of patches is labeled with a similarity score between patches using features l… ▽ More

    Submitted 5 December, 2023; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2202.11539

  12. arXiv:2208.02627  [pdf, ps, other

    stat.ME math.ST

    Modelling multivariate extreme value distributions via Markov trees

    Authors: Shuang Hu, Zuoxiang Peng, Johan Segers

    Abstract: Multivariate extreme value distributions are a common choice for modelling multivariate extremes. In high dimensions, however, the construction of flexible and parsimonious models is challenging. We propose to combine bivariate extreme value distributions into a Markov random field with respect to a tree. Although in general not an extreme value distribution itself, this Markov tree is attracted b… ▽ More

    Submitted 29 July, 2022; originally announced August 2022.

    Comments: 37 pages, 10 figures, 7 tables

    MSC Class: 62G32; 62H22

  13. arXiv:2207.07624  [pdf, other

    cs.LG stat.ML

    Feed-Forward Latent Domain Adaptation

    Authors: Ondrej Bohdal, Da Li, Shell Xu Hu, Timothy Hospedales

    Abstract: We study a new highly-practical problem setting that enables resource-constrained edge devices to adapt a pre-trained model to their local data distributions. Recognizing that device's data are likely to come from multiple latent domains that include a mixture of unlabelled domain-relevant and domain-irrelevant examples, we focus on the comparatively under-studied problem of latent domain adaptati… ▽ More

    Submitted 31 January, 2024; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted at WACV 2024. Project page: https://ondrejbohdal.github.io/cxda

  14. arXiv:2206.13140  [pdf, other

    cs.LG stat.ML

    Compressing Features for Learning with Noisy Labels

    Authors: Yingyi Chen, Shell Xu Hu, Xi Shen, Chunrong Ai, Johan A. K. Suykens

    Abstract: Supervised learning can be viewed as distilling relevant information from input data into feature representations. This process becomes difficult when supervision is noisy as the distilled information might not be relevant. In fact, recent research shows that networks can easily overfit all labels including those that are corrupted, and hence can hardly generalize to clean datasets. In this paper,… ▽ More

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted to TNNLS 2022. Project page: https://yingyichen-cyy.github.io/CompressFeatNoisyLabels/

  15. arXiv:2206.08531  [pdf, ps, other

    stat.ML cs.LG

    Reframed GES with a Neural Conditional Dependence Measure

    Authors: Xinwei Shen, Shengyu Zhu, Jiji Zhang, Shoubo Hu, Zhitang Chen

    Abstract: In a nonparametric setting, the causal structure is often identifiable only up to Markov equivalence, and for the purpose of causal inference, it is useful to learn a graphical representation of the Markov equivalence class (MEC). In this paper, we revisit the Greedy Equivalence Search (GES) algorithm, which is widely cited as a score-based algorithm for learning the MEC of the underlying causal s… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted to UAI 2022

  16. arXiv:2206.07902  [pdf, other

    cs.LG cs.CR stat.ML

    On Privacy and Personalization in Cross-Silo Federated Learning

    Authors: Ziyu Liu, Shengyuan Hu, Zhiwei Steven Wu, Virginia Smith

    Abstract: While the application of differential privacy (DP) has been well-studied in cross-device federated learning (FL), there is a lack of work considering DP and its implications for cross-silo FL, a setting characterized by a limited number of clients each containing many data subjects. In cross-silo FL, usual notions of client-level DP are less suitable as real-world privacy regulations typically con… ▽ More

    Submitted 17 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022, 37 pages

  17. arXiv:2203.11691  [pdf, other

    stat.ML cs.LG econ.EM

    GAM(L)A: An econometric model for interpretable Machine Learning

    Authors: Emmanuel Flachaire, Gilles Hacheme, Sullivan Hué, Sébastien Laurent

    Abstract: Despite their high predictive performance, random forest and gradient boosting are often considered as black boxes or uninterpretable models which has raised concerns from practitioners and regulators. As an alternative, we propose in this paper to use partial linear models that are inherently interpretable. Specifically, this article introduces GAM-lasso (GAMLA) and GAM-autometrics (GAMA), denote… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 47 pages, 12 tables and 7 figures

  18. arXiv:2202.11539  [pdf, other

    cs.CV stat.ML

    Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut

    Authors: Yangtao Wang, Xi Shen, Shell Hu, Yuan Yuan, James Crowley, Dominique Vaufreydaz

    Abstract: Transformers trained with self-supervised learning using self-distillation loss (DINO) have been shown to produce attention maps that highlight salient foreground objects. In this paper, we demonstrate a graph-based approach that uses the self-supervised transformer features to discover an object from an image. Visual tokens are viewed as nodes in a weighted graph with edges representing a connect… ▽ More

    Submitted 24 March, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Journal ref: CVPR 2022 - Conference on Computer Vision and Pattern Recognition, Jun 2022, New Orleans, United States

  19. arXiv:2106.03300  [pdf, other

    cs.LG stat.ML

    Sum of Ranked Range Loss for Supervised Learning

    Authors: Shu Hu, Yiming Ying, Xin Wang, Siwei Lyu

    Abstract: In forming learning objectives, one oftentimes needs to aggregate a set of individual values to a single output. Such cases occur in the aggregate loss, which combines individual losses of a learning model over each training sample, and in the individual loss for multi-label learning, which combines prediction scores over all class labels. In this work, we introduce the sum of ranked range (SoRR)… ▽ More

    Submitted 3 April, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted by Journal of Machine Learning Research (JMLR). arXiv admin note: text overlap with arXiv:2010.01741

  20. arXiv:2106.00925  [pdf, other

    cs.LG stat.ML

    Contrastive ACE: Domain Generalization Through Alignment of Causal Mechanisms

    Authors: Yunqi Wang, Furui Liu, Zhitang Chen, Qing Lian, Shoubo Hu, Jianye Hao, Yik-Chung Wu

    Abstract: Domain generalization aims to learn knowledge invariant across different distributions while semantically meaningful for downstream tasks from multiple source domains, to improve the model's generalization ability on unseen target domains. The fundamental objective is to understand the underlying "invariance" behind these observational distributions and such invariance has been shown to have a clo… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

  21. arXiv:2103.10912  [pdf, other

    stat.AP

    Copula Averaging for Tail Dependence in Insurance Claims Data

    Authors: Sen Hu, Adrian O'Hagan

    Abstract: Analysing dependent risks is an important task for insurance companies. A dependency is reflected in the fact that information about one random variable provides information about the likely distribution of values of another random variable. Insurance companies in particular must investigate such dependencies between different lines of business and the effects that an extreme loss event, such as a… ▽ More

    Submitted 19 March, 2021; originally announced March 2021.

  22. arXiv:2012.04221  [pdf, other

    cs.LG stat.ML

    Ditto: Fair and Robust Federated Learning Through Personalization

    Authors: Tian Li, Shengyuan Hu, Ahmad Beirami, Virginia Smith

    Abstract: Fairness and robustness are two important concerns for federated learning systems. In this work, we identify that robustness to data and model poisoning attacks and fairness, measured as the uniformity of performance across devices, are competing constraints in statistically heterogeneous networks. To address these constraints, we propose employing a simple, general framework for personalized fede… ▽ More

    Submitted 15 June, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted by ICML 2021

  23. arXiv:2010.01741  [pdf, other

    cs.LG stat.ML

    Learning by Minimizing the Sum of Ranked Range

    Authors: Shu Hu, Yiming Ying, Xin Wang, Siwei Lyu

    Abstract: In forming learning objectives, one oftentimes needs to aggregate a set of individual values to a single output. Such cases occur in the aggregate loss, which combines individual losses of a learning model over each training sample, and in the individual loss for multi-label learning, which combines prediction scores over all class labels. In this work, we introduce the sum of ranked range (SoRR)… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

    Comments: Accepted by Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020)

  24. arXiv:2009.04197   

    cs.LG cs.MA stat.ML

    QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning

    Authors: Jian Hu, Seth Austin Harding, Haibin Wu, Siyue Hu, Shih-wei Liao

    Abstract: In Cooperative Multi-Agent Reinforcement Learning (MARL) and under the setting of Centralized Training with Decentralized Execution (CTDE), agents observe and interact with their environment locally and independently. With local observation and random sampling, the randomness in rewards and observations leads to randomness in long-term returns. Existing methods such as Value Decomposition Network… ▽ More

    Submitted 23 February, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

    Comments: There are some experimental errors and experimental unfairness in this paper that will seriously affect the later studies

  25. arXiv:2009.01272  [pdf, other

    cs.LG stat.ML

    Understanding the wiring evolution in differentiable neural architecture search

    Authors: Sirui Xie, Shoukang Hu, Xinjiang Wang, Chunxiao Liu, Jianping Shi, Xunying Liu, Dahua Lin

    Abstract: Controversy exists on whether differentiable neural architecture search methods discover wiring topology effectively. To understand how wiring topology evolves, we study the underlying mechanism of several existing differentiable NAS frameworks. Our investigation is motivated by three observed searching patterns of differentiable NAS: 1) they search by growing instead of pruning; 2) wider networks… ▽ More

    Submitted 25 February, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

    Comments: AISTATS 2021

  26. arXiv:2006.13681  [pdf, other

    cs.CV cs.LG stat.ML

    Multi-view Drone-based Geo-localization via Style and Spatial Alignment

    Authors: Siyi Hu, Xiaojun Chang

    Abstract: In this paper, we focus on the task of multi-view multi-source geo-localization, which serves as an important auxiliary method of GPS positioning by matching drone-view image and satellite-view image with pre-annotated GPS tag. To solve this problem, most existing methods adopt metric loss with an weighted classification block to force the generation of common feature space shared by different vie… ▽ More

    Submitted 8 July, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: 9 pages 9 figures. arXiv admin note: text overlap with arXiv:2002.12186 by other authors

    ACM Class: I.4.7; I.2.10

  27. arXiv:2006.13463  [pdf, other

    cs.LG cs.AI stat.ML

    Graph Policy Network for Transferable Active Learning on Graphs

    Authors: Shengding Hu, Zheng Xiong, Meng Qu, Xingdi Yuan, Marc-Alexandre Côté, Zhiyuan Liu, Jian Tang

    Abstract: Graph neural networks (GNNs) have been attracting increasing popularity due to their simplicity and effectiveness in a variety of fields. However, a large number of labeled data is generally required to train these networks, which could be very expensive to obtain in some domains. In this paper, we study active learning for GNNs, i.e., how to efficiently label the nodes on a graph to reduce the an… ▽ More

    Submitted 23 October, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

    ACM Class: I.2

  28. arXiv:2006.07856  [pdf, other

    cs.LG stat.ML

    The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems

    Authors: Sixu Hu, Yuan Li, Xu Liu, Qinbin Li, Zhaomin Wu, Bingsheng He

    Abstract: This paper presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios with publicly available data sets as different data si… ▽ More

    Submitted 2 March, 2022; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: ACM Transactions on Intelligent Systems and Technology, Vol. 13, No. 4, Article 63

  29. arXiv:2006.04877  [pdf, other

    stat.ME cs.LG stat.CO

    A Causal Direction Test for Heterogeneous Populations

    Authors: Vahid Partovi Nia, Xinlin Li, Masoud Asgharian, Shoubo Hu, Zhitang Chen, Yanhui Geng

    Abstract: A probabilistic expert system emulates the decision-making ability of a human expert through a directional graphical model. The first step in building such systems is to understand data generation mechanism. To this end, one may try to decompose a multivariate distribution into product of several conditionals, and evolving a blackbox machine learning predictive models towards transparent cause-and… ▽ More

    Submitted 27 September, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    MSC Class: 62D20; 62H30

  30. arXiv:2005.00667  [pdf

    stat.AP

    Data-Driven Modeling Reveals the Impact of Stay-at-Home Orders on Human Mobility during the COVID-19 Pandemic in the U.S

    Authors: Chenfeng Xiong, Songhua Hu, Mofeng Yang, Hannah N Younes, Weiyu Luo, Sepehr Ghader, Lei Zhang

    Abstract: One approach to delay the spread of the novel coronavirus (COVID-19) is to reduce human travel by imposing travel restriction policies. It is yet unclear how effective those policies are on suppressing the mobility trend due to the lack of ground truth and large-scale dataset describing human mobility during the pandemic. This study uses real-world location-based service data collected from anonym… ▽ More

    Submitted 4 May, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

  31. arXiv:2004.12696  [pdf, other

    cs.LG stat.ML

    Empirical Bayes Transductive Meta-Learning with Synthetic Gradients

    Authors: Shell Xu Hu, Pablo G. Moreno, Yang Xiao, Xi Shen, Guillaume Obozinski, Neil D. Lawrence, Andreas Damianou

    Abstract: We propose a meta-learning approach that learns from multiple tasks in a transductive setting, by leveraging the unlabeled query set in addition to the support set to generate a more powerful model for each task. To develop our framework, we revisit the empirical Bayes formulation for multi-task learning. The evidence lower bound of the marginal log-likelihood of empirical Bayes decomposes as a su… ▽ More

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: ICLR 2020

  32. arXiv:2002.09128  [pdf, other

    cs.LG stat.ML

    DSNAS: Direct Neural Architecture Search without Parameter Retraining

    Authors: Shoukang Hu, Sirui Xie, Hehui Zheng, Chunxiao Liu, Jianping Shi, Xunying Liu, Dahua Lin

    Abstract: If NAS methods are solutions, what is the problem? Most existing NAS methods require two-stage parameter optimization. However, performance of the same architecture in the two stages correlates poorly. In this work, we propose a new problem definition for NAS, task-specific end-to-end, based on this observation. We argue that given a computer vision task for which a NAS method is expected, this de… ▽ More

    Submitted 31 March, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: To appear in CVPR 2020

  33. arXiv:2002.05582  [pdf, other

    cs.LG stat.ML

    Learning to Predict Error for MRI Reconstruction

    Authors: Shi Hu, Nicola Pezzotti, Max Welling

    Abstract: In healthcare applications, predictive uncertainty has been used to assess predictive accuracy. In this paper, we demonstrate that predictive uncertainty estimated by the current methods does not highly correlate with prediction error by decomposing the latter into random and systematic errors, and showing that the former is equivalent to the variance of the random error. In addition, we observe t… ▽ More

    Submitted 6 July, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: Accepted to MICCAI 2021

  34. arXiv:1910.07629  [pdf, other

    cs.LG cs.CR stat.ML

    A New Defense Against Adversarial Images: Turning a Weakness into a Strength

    Authors: Tao Yu, Shengyuan Hu, Chuan Guo, Wei-Lun Chao, Kilian Q. Weinberger

    Abstract: Natural images are virtually surrounded by low-density misclassified regions that can be efficiently discovered by gradient-guided search --- enabling the generation of adversarial images. While many techniques for detecting these attacks have been proposed, they are easily bypassed when the adversary has full knowledge of the detection mechanism and adapts the attack strategy accordingly. In this… ▽ More

    Submitted 3 December, 2019; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019, 14 pages

  35. arXiv:1907.11216  [pdf, other

    stat.ML cs.LG

    Domain Generalization via Multidomain Discriminant Analysis

    Authors: Shoubo Hu, Kun Zhang, Zhitang Chen, Laiwan Chan

    Abstract: Domain generalization (DG) aims to incorporate knowledge from multiple source domains into a single model that could generalize well on unseen target domains. This problem is ubiquitous in practice since the distributions of the target data may rarely be identical to those of the source data. In this paper, we propose Multidomain Discriminant Analysis (MDA) to address DG of classification tasks in… ▽ More

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: UAI 2019

  36. arXiv:1907.09693  [pdf, other

    cs.LG cs.CR cs.DB stat.ML

    A Survey on Federated Learning Systems: Vision, Hype and Reality for Data Privacy and Protection

    Authors: Qinbin Li, Zeyi Wen, Zhaomin Wu, Sixu Hu, Naibo Wang, Yuan Li, Xu Liu, Bingsheng He

    Abstract: Federated learning has been a hot research topic in enabling the collaborative training of machine learning models among different organizations under the privacy restrictions. As researchers try to support more machine learning models with different privacy-preserving approaches, there is a requirement in developing systems and infrastructures to ease the development of various federated learning… ▽ More

    Submitted 4 December, 2021; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: Accepted to IEEE Transactions on Knowledge and Data Engineering (TKDE)

  37. arXiv:1907.01949  [pdf, other

    cs.LG cs.CV stat.ML

    Supervised Uncertainty Quantification for Segmentation with Multiple Annotations

    Authors: Shi Hu, Daniel Worrall, Stefan Knegt, Bas Veeling, Henkjan Huisman, Max Welling

    Abstract: The accurate estimation of predictive uncertainty carries importance in medical scenarios such as lung node segmentation. Unfortunately, most existing works on predictive uncertainty do not return calibrated uncertainty estimates, which could be used in practice. In this work we exploit multi-grader annotation variability as a source of 'groundtruth' aleatoric uncertainty, which can be treated as… ▽ More

    Submitted 27 May, 2022; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: MICCAI 2019. Fixed a few typos

  38. Topological Techniques in Model Selection

    Authors: Shaoxiong Hu, Hugo Maruri-Aguliar, Zixiang Ma

    Abstract: The LASSO is an attractive regularisation method for linear regression that combines variable selection with an efficient computation procedure. This paper is concerned with enhancing the performance of LASSO for square-free hierarchical polynomial models when combining validation error with a measure of model complexity. The measure of the complexity is the sum of Betti numbers of the model which… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Journal ref: Alg. Stat. 13 (2022) 41-56

  39. arXiv:1904.04699  [pdf, other

    stat.AP

    Bivariate Gamma Mixture of Experts Models for Joint Insurance Claims Modeling

    Authors: Sen Hu, T Brendan Murphy, Adrian O'Hagan

    Abstract: In general insurance, risks from different categories are often modeled independently and their sum is regarded as the total risk the insurer takes on in exchange for a premium. The dependence from multiple risks is generally neglected even when correlation could exist, for example a single car accident may result in claims from multiple risk categories. It is desirable to take the covariance of d… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  40. STFNets: Learning Sensing Signals from the Time-Frequency Perspective with Short-Time Fourier Neural Networks

    Authors: Shuochao Yao, Ailing Piao, Wenjun Jiang, Yiran Zhao, Huajie Shao, Shengzhong Liu, Dongxin Liu, Jinyang Li, Tianshi Wang, Shaohan Hu, Lu Su, Jiawei Han, Tarek Abdelzaher

    Abstract: Recent advances in deep learning motivate the use of deep neural networks in Internet-of-Things (IoT) applications. These networks are modelled after signal processing in the human brain, thereby leading to significant advantages at perceptual tasks such as vision and speech recognition. IoT applications, however, often measure physical phenomena, where the underlying physics (such as inertia, wir… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

  41. arXiv:1812.11027  [pdf, other

    cs.LG stat.ML

    Exploring Weight Symmetry in Deep Neural Networks

    Authors: Xu Shell Hu, Sergey Zagoruyko, Nikos Komodakis

    Abstract: We propose to impose symmetry in neural network parameters to improve parameter usage and make use of dedicated convolution and matrix multiplication routines. Due to significant reduction in the number of parameters as a result of the symmetry constraints, one would expect a dramatic drop in accuracy. Surprisingly, we show that this is not the case, and, depending on network size, symmetry can ha… ▽ More

    Submitted 10 January, 2019; v1 submitted 28 December, 2018; originally announced December 2018.

  42. arXiv:1812.08434  [pdf

    cs.LG cs.AI stat.ML

    Graph Neural Networks: A Review of Methods and Applications

    Authors: Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, Lifeng Wang, Changcheng Li, Maosong Sun

    Abstract: Lots of learning tasks require dealing with graph data which contains rich relation information among elements. Modeling physics systems, learning molecular fingerprints, predicting protein interface, and classifying diseases demand a model to learn from graph inputs. In other domains such as learning from non-structural data like texts and images, reasoning on extracted structures (like the depen… ▽ More

    Submitted 6 October, 2021; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: Published at AI Open 2021

  43. arXiv:1809.08568  [pdf, other

    stat.ML cs.AI cs.LG

    Causal Inference and Mechanism Clustering of A Mixture of Additive Noise Models

    Authors: Shoubo Hu, Zhitang Chen, Vahid Partovi Nia, Laiwan Chan, Yanhui Geng

    Abstract: The inference of the causal relationship between a pair of observed variables is a fundamental problem in science, and most existing approaches are based on one single causal model. In practice, however, observations are often collected from multiple sources with heterogeneous causal models due to certain uncontrollable factors, which renders causal analysis results obtained by a single model skep… ▽ More

    Submitted 11 November, 2018; v1 submitted 23 September, 2018; originally announced September 2018.

    Comments: Published at NIPS 2018

  44. A Kernel Embedding-based Approach for Nonstationary Causal Model Inference

    Authors: Shoubo Hu, Zhitang Chen, Laiwan Chan

    Abstract: Although nonstationary data are more common in the real world, most existing causal discovery methods do not take nonstationarity into consideration. In this letter, we propose a kernel embedding-based approach, ENCI, for nonstationary causal model inference where data are collected from multiple domains with varying distributions. In ENCI, we transform the complicated relation of a cause-effect p… ▽ More

    Submitted 23 September, 2018; originally announced September 2018.

    Comments: Published at Neural Computation

    Journal ref: Neural computation, 30(5), 1394-1425, 2018

  45. arXiv:1809.01471  [pdf, other

    cs.GR cs.LG stat.ML

    Chest X-ray Inpainting with Deep Generative Models

    Authors: Ecem Sogancioglu, Shi Hu, Davide Belli, Bram van Ginneken

    Abstract: Generative adversarial networks have been successfully applied to inpainting in natural images. However, the current state-of-the-art models have not yet been widely adopted in the medical imaging domain. In this paper, we investigate the performance of three recently published deep learning based inpainting models: context encoders, semantic image inpainting, and the contextual attention model, a… ▽ More

    Submitted 29 August, 2018; originally announced September 2018.

    Comments: 9 pages

  46. arXiv:1808.05766  [pdf

    q-bio.OT stat.AP

    The Function Transformation Omics - Funomics

    Authors: Yongshuai Jiang, Jing Xu, Simeng Hu, Di Liu, Linna Zhao, Xu Zhou

    Abstract: There are no two identical leaves in the world, so how to find effective markers or features to distinguish them is an important issue. Function transformation, such as f(x,y) and f(x,y,z), can transform two, three, or multiple input/observation variables (in biology, it generally refers to the observed/measured value of biomarkers, biological characteristics, or other indicators) into a new outpu… ▽ More

    Submitted 17 August, 2018; originally announced August 2018.

  47. arXiv:1805.11793  [pdf, ps, other

    stat.ML cs.LG stat.CO

    Infinite Arms Bandit: Optimality via Confidence Bounds

    Authors: Hock Peng Chan, Shouri Hu

    Abstract: Berry et al. (1997) initiated the development of the infinite arms bandit problem. They derived a regret lower bound of all allocation strategies for Bernoulli rewards with uniform priors, and proposed strategies based on success runs. Bonald and Proutière (2013) proposed a two-target algorithm that achieves the regret lower bound, and extended optimality to Bernoulli rewards with general priors.… ▽ More

    Submitted 21 June, 2020; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: Fourth version

  48. arXiv:1804.04206  [pdf, other

    cs.CV cs.LG stat.ML

    Multi-scale Neural Networks for Retinal Blood Vessels Segmentation

    Authors: Boheng Zhang, Shenglei Huang, Shaohan Hu

    Abstract: Existing supervised approaches didn't make use of the low-level features which are actually effective to this task. And another deficiency is that they didn't consider the relation between pixels, which means effective features are not extracted. In this paper, we proposed a novel convolutional neural network which make sufficient use of low-level features together with high-level features and inv… ▽ More

    Submitted 11 April, 2018; originally announced April 2018.

  49. arXiv:1710.03704  [pdf, other

    stat.AP

    Motor Insurance Accidental Damage Claims Modeling with Factor Collapsing and Bayesian Model Averaging

    Authors: Sen Hu, Adrian O'Hagan, Thomas Brendan Murphy

    Abstract: Accidental damage is a typical component of motor insurance claim. Modeling of this nature generally involves analysis of past claim history and different characteristics of the insured objects and the policyholders. Generalized linear models (GLMs) have become the industry's standard approach for pricing and modeling risks of this nature. However, the GLM approach utilizes a single "best" model o… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

  50. arXiv:1704.04235  [pdf, other

    cs.LG cs.CV stat.ML

    Close Yet Distinctive Domain Adaptation

    Authors: Lingkun Luo, Xiaofang Wang, Shiqiang Hu, Chao Wang, Yuxing Tang, Liming Chen

    Abstract: Domain adaptation is transfer learning which aims to generalize a learning model across training and testing data with different distributions. Most previous research tackle this problem in seeking a shared feature representation between source and target domains while reducing the mismatch of their data distributions. In this paper, we propose a close yet discriminative domain adaptation method,… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: 11pages, 3 figures, ICCV2017