Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–44 of 44 results for author: Clifton, D A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.16264  [pdf, other

    cs.CV

    Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring

    Authors: Shreyank N Gowda, David A. Clifton

    Abstract: Contemporary medical contrastive learning faces challenges from inconsistent semantics and sample pair morphology, leading to dispersed and converging semantic shifts. The variability in text reports, due to multiple authors, complicates semantic consistency. To tackle these issues, we propose a two-step approach. Initially, text reports are converted into a standardized triplet format, laying the… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted in MICCAI-24

  2. arXiv:2407.10086  [pdf, other

    cs.CL cs.AI

    Rapid Biomedical Research Classification: The Pandemic PACT Advanced Categorisation Engine

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Olena Seminog, Rodrigo Furst, Thomas Mendy, Shanthi Levanita, Zaharat Kadri-Alabi, Nusrat Jabin, Daniela Toale, Georgina Humphreys, Emilia Antonio, Adrian Bucher, Alice Norton, David A. Clifton

    Abstract: This paper introduces the Pandemic PACT Advanced Categorisation Engine (PPACE) along with its associated dataset. PPACE is a fine-tuned model developed to automatically classify research abstracts from funded biomedical projects according to WHO-aligned research priorities. This task is crucial for monitoring research trends and identifying gaps in global health preparedness and response. Our appr… ▽ More

    Submitted 19 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  3. arXiv:2407.04752  [pdf, other

    cs.LG cs.CL cs.NE

    SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

    Authors: Xingrun Xing, Boyan Gao, Zheng Zhang, David A. Clifton, Shitao Xiao, Li Du, Guoqi Li, Jiajun Zhang

    Abstract: The recent advancements in large language models (LLMs) with billions of parameters have significantly boosted their performance across various real-world applications. However, the inference processes for these models require substantial energy and computational resources, presenting considerable deployment challenges. In contrast, human brains, which contain approximately 86 billion biological n… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2406.14377  [pdf, other

    cs.LG cs.AI

    Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases Detection

    Authors: Rushuang Zhou, Zijun Liu, Lei Clifton, David A. Clifton, Kannie W. Y. Chan, Yuan-Ting Zhang, Yining Dong

    Abstract: Label scarcity problem is the main challenge that hinders the wide application of deep learning systems in automatic cardiovascular diseases (CVDs) detection using electrocardiography (ECG). Tuning pre-trained models alleviates this problem by transferring knowledge learned from large datasets to downstream small datasets. However, bottlenecks in computational efficiency and CVDs detection perform… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2405.07841  [pdf, other

    cs.LG

    Sample Selection Bias in Machine Learning for Healthcare

    Authors: Vinod Kumar Chauhan, Lei Clifton, Achille Salaün, Huiqi Yvonne Lu, Kim Branson, Patrick Schwab, Gaurav Nigam, David A. Clifton

    Abstract: While machine learning algorithms hold promise for personalised medicine, their clinical adoption remains limited. One critical factor contributing to this restraint is sample selection bias (SSB) which refers to the study population being less representative of the target population, leading to biased and potentially harmful decisions. Despite being well-known in the literature, SSB remains scarc… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 20 pages and 11 figures (under review)

  6. arXiv:2405.00716  [pdf, other

    cs.CL cs.AI

    Large Language Models in the Clinic: A Comprehensive Benchmark

    Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. Clifton

    Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark ClinicBench. We first coll… ▽ More

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  7. arXiv:2401.00579  [pdf, other

    cs.CL cs.AI cs.LG

    Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, David A. Clifton

    Abstract: Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evo… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

    MSC Class: 68T50 ACM Class: I.2.7

  8. arXiv:2311.05112  [pdf

    cs.CL cs.AI

    A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

    Authors: Hongjian Zhou, Fenglin Liu, Boyang Gu, Xinyu Zou, Jinfa Huang, Jinge Wu, Yiru Li, Sam S. Chen, Peilin Zhou, Junling Liu, Yining Hua, Chengfeng Mao, Chenyu You, Xian Wu, Yefeng Zheng, Lei Clifton, Zheng Li, Jiebo Luo, David A. Clifton

    Abstract: Large language models (LLMs), such as ChatGPT, have received substantial attention due to their capabilities for understanding and generating human language. While there has been a burgeoning trend in research focusing on the employment of LLMs in supporting different medical tasks (e.g., enhancing clinical diagnostics and providing medical education), a review of these efforts, particularly their… ▽ More

    Submitted 22 July, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Preprint. Version 6. Update Figures 1-5; Tables 2-3; 31 pages

  9. arXiv:2309.00810  [pdf, other

    cs.CV cs.AI

    RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model

    Authors: Fengxiang Bie, Yibo Yang, Zhongzhu Zhou, Adam Ghanem, Minjia Zhang, Zhewei Yao, Xiaoxia Wu, Connor Holmes, Pareesa Golnari, David A. Clifton, Yuxiong He, Dacheng Tao, Shuaiwen Leon Song

    Abstract: Text-to-image generation (TTI) refers to the usage of models that could process text input and generate high fidelity images based on text descriptions. Text-to-image generation using neural networks could be traced back to the emergence of Generative Adversial Network (GAN), followed by the autoregressive Transformer. Diffusion models are one prominent type of generative model used for the genera… ▽ More

    Submitted 1 September, 2023; originally announced September 2023.

  10. arXiv:2306.10494  [pdf, other

    eess.SP cs.AI

    Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction:A Multi-Dataset Study

    Authors: Rushuang Zhou, Lei Lu, Zijun Liu, Ting Xiang, Zhen Liang, David A. Clifton, Yining Dong, Yuan-Ting Zhang

    Abstract: Electrocardiography (ECG) is a non-invasive tool for predicting cardiovascular diseases (CVDs). Current ECG-based diagnosis systems show promising performance owing to the rapid development of deep learning techniques. However, the label scarcity problem, the co-occurrence of multiple CVDs and the poor performance on unseen datasets greatly hinder the widespread application of deep learning-based… ▽ More

    Submitted 18 June, 2023; originally announced June 2023.

  11. arXiv:2306.06955  [pdf, other

    cs.LG

    A Brief Review of Hypernetworks in Deep Learning

    Authors: Vinod Kumar Chauhan, Jiandong Zhou, Ping Lu, Soheila Molaei, David A. Clifton

    Abstract: Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility, adaptability, dynamism, faster training, information sharing, and model compression. Hypernets have shown promising results in a variety of deep learning problems, inclu… ▽ More

    Submitted 13 July, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: 2 figures and 2 tables -- Accepted to Artificial Intelligence Review

  12. arXiv:2305.15984  [pdf, other

    cs.LG stat.ME

    Dynamic Inter-treatment Information Sharing for Individualized Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Jiandong Zhou, Ghadeer Ghosheh, Soheila Molaei, David A. Clifton

    Abstract: Estimation of individualized treatment effects (ITE) from observational studies is a fundamental problem in causal inference and holds significant importance across domains, including healthcare. However, limited observational datasets pose challenges in reliable ITE estimation as data have to be split among treatment groups to train an ITE learner. While information sharing among treatment groups… ▽ More

    Submitted 12 February, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: accepted to The 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  13. arXiv:2305.03711  [pdf, other

    cs.LG cs.CY

    Medical records condensation: a roadmap towards healthcare data democratisation

    Authors: Yujiang Wang, Anshul Thakur, Mingzhi Dong, Pingchuan Ma, Stavros Petridis, Li Shang, Tingting Zhu, David A. Clifton

    Abstract: The prevalence of artificial intelligence (AI) has envisioned an era of healthcare democratisation that promises every stakeholder a new and better way of life. However, the advancement of clinical AI research is significantly hurdled by the dearth of data democratisation in healthcare. To truly democratise data for AI studies, challenges are two-fold: 1. the sensitive information in clinical data… ▽ More

    Submitted 8 January, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  14. arXiv:2305.03710  [pdf, other

    cs.LG cs.CR

    Data Encoding For Healthcare Data Democratisation and Information Leakage Prevention

    Authors: Anshul Thakur, Tingting Zhu, Vinayak Abrol, Jacob Armstrong, Yujiang Wang, David A. Clifton

    Abstract: The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  15. arXiv:2303.06458  [pdf, other

    cs.CL cs.AI cs.CV

    ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

    Authors: Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton

    Abstract: Natural Language Generation (NLG) accepts input data in the form of images, videos, or text and generates corresponding natural language text as output. Existing NLG methods mainly adopt a supervised approach and rely heavily on coupled data-to-text pairs. However, for many targeted scenarios and for non-English languages, sufficient quantities of labeled data are often not available. To relax the… ▽ More

    Submitted 3 June, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: Accepted by TPAMI (Our code and data are available at https://github.com/yangbang18/ZeroNLG)

  16. arXiv:2302.14679  [pdf, other

    cs.LG cs.CL

    Synthesizing Mixed-type Electronic Health Records using Diffusion Models

    Authors: Taha Ceritli, Ghadeer O. Ghosheh, Vinod Kumar Chauhan, Tingting Zhu, Andrew P. Creagh, David A. Clifton

    Abstract: Electronic Health Records (EHRs) contain sensitive patient information, which presents privacy concerns when sharing such data. Synthetic data generation is a promising solution to mitigate these risks, often relying on deep generative models such as Generative Adversarial Networks (GANs). However, recent studies have shown that diffusion models offer several advantages over GANs, such as generati… ▽ More

    Submitted 10 August, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Page 2, Figure 1 is updated

  17. arXiv:2302.04725  [pdf, other

    cs.CL cs.AI cs.LG

    Lightweight Transformers for Clinical Natural Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton

    Abstract: Specialised pre-trained language models are becoming more frequent in NLP since they can potentially outperform models trained on generic texts. BioBERT and BioClinicalBERT are two examples of such models that have shown promise in medical NLP tasks. Many of these models are overparametrised and resource-intensive, but thanks to techniques like Knowledge Distillation (KD), it is possible to create… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    MSC Class: 68T50 ACM Class: I.2.7

  18. arXiv:2302.01735  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective

    Authors: Chenyu You, Weicheng Dai, Yifei Min, Fenglin Liu, David A. Clifton, S Kevin Zhou, Lawrence Hamilton Staib, James S Duncan

    Abstract: For medical image segmentation, contrastive learning is the dominant practice to improve the quality of visual representations by contrasting semantically similar and dissimilar pairs of samples. This is enabled by the observation that without accessing ground truth labels, negative examples with truly dissimilar anatomical features, if sampled, can significantly improve the performance. In realit… ▽ More

    Submitted 23 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted by Advances in Neural Information Processing Systems (NeurIPS 2023)

  19. arXiv:2211.11427  [pdf, other

    cs.CV

    Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

    Authors: Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen

    Abstract: Most video-and-language representation learning approaches employ contrastive learning, e.g., CLIP, to project the video and text features into a common latent space according to the semantic similarities of text-video pairs. However, such learned shared latent spaces are not often optimal, and the modality gap between visual and textual representation can not be fully eliminated. In this paper, w… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022

  20. arXiv:2210.12777  [pdf, other

    cs.CL cs.LG

    Retrieval-Augmented and Knowledge-Grounded Language Models for Faithful Clinical Medicine

    Authors: Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton

    Abstract: Language models (LMs), including large language models (such as ChatGPT), have the potential to assist clinicians in generating various clinical notes. However, LMs are prone to produce ``hallucinations'', i.e., generated content that is not aligned with facts and knowledge. In this paper, we propose the Re$^3$Writer method with retrieval-augmented generation and knowledge-grounded reasoning to en… ▽ More

    Submitted 21 July, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

  21. arXiv:2210.10530  [pdf, other

    cs.LG cs.AI stat.ME

    Adversarial De-confounding in Individualised Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Soheila Molaei, Marzia Hoque Tania, Anshul Thakur, Tingting Zhu, David A. Clifton

    Abstract: Observational studies have recently received significant attention from the machine learning community due to the increasingly available non-experimental observational data and the limitations of the experimental studies, such as considerable cost, impracticality, small and less representative sample sizes, etc. In observational studies, de-confounding is a fundamental problem of individualised tr… ▽ More

    Submitted 24 January, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: accepted to AISTATS 2023

  22. arXiv:2210.06425  [pdf, other

    cs.CL cs.LG

    MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers

    Authors: Mohammadmahdi Nouriborji, Omid Rohanian, Samaneh Kouchaki, David A. Clifton

    Abstract: Pre-trained Language Models (LMs) have become an integral part of Natural Language Processing (NLP) in recent years, due to their superior performance in downstream applications. In spite of this resounding success, the usability of LMs is constrained by computational and time complexity, along with their increasing size; an issue that has been referred to as `overparameterisation'. Different stra… ▽ More

    Submitted 30 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    MSC Class: 68T50 ACM Class: I.2.7

  23. arXiv:2209.13476  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Mine yOur owN Anatomy: Revisiting Medical Image Segmentation with Extremely Limited Labels

    Authors: Chenyu You, Weicheng Dai, Fenglin Liu, Yifei Min, Haoran Su, Xiaoran Zhang, Xiaoxiao Li, David A. Clifton, Lawrence Staib, James S. Duncan

    Abstract: Recent studies on contrastive learning have achieved remarkable performance solely by leveraging few labels in the context of medical image segmentation. Existing methods mainly focus on instance discrimination and invariant mapping. However, they face three common pitfalls: (1) tailness: medical image data usually follows an implicit long-tail class distribution. Blindly leveraging all pixels in… ▽ More

    Submitted 16 March, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: In this version: Add theoretical analysis and correct some typos

  24. arXiv:2209.03182  [pdf, ps, other

    cs.CL cs.LG

    On the Effectiveness of Compact Biomedical Transformers

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Samaneh Kouchaki, David A. Clifton

    Abstract: Language models pre-trained on biomedical corpora, such as BioBERT, have recently shown promising results on downstream biomedical tasks. Many existing pre-trained models, on the other hand, are resource-intensive and computationally heavy owing to factors such as embedding size, hidden dimension, and number of layers. The natural language processing (NLP) community has developed numerous strategi… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    MSC Class: 68T50

  25. COPER: Continuous Patient State Perceiver

    Authors: Vinod Kumar Chauhan, Anshul Thakur, Odhran O'Donoghue, David A. Clifton

    Abstract: In electronic health records (EHRs), irregular time-series (ITS) occur naturally due to patient health dynamics, reflected by irregular hospital visits, diseases/conditions and the necessity to measure different vitals signs at each visit etc. ITS present challenges in training machine learning algorithms which mostly are built on assumption of coherent fixed dimensional feature space. In this pap… ▽ More

    Submitted 24 November, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 2 figures; presented in IEEE International Conference on Biomedical and Health Informatics (IEEE BHI-2022)

  26. arXiv:2207.11846  [pdf, other

    cs.LG cs.AI

    Mixture of Input-Output Hidden Markov Models for Heterogeneous Disease Progression Modeling

    Authors: Taha Ceritli, Andrew P. Creagh, David A. Clifton

    Abstract: A particular challenge for disease progression modeling is the heterogeneity of a disease and its manifestations in the patients. Existing approaches often assume the presence of a single disease progression characteristics which is unlikely for neurodegenerative disorders such as Parkinson's disease. In this paper, we propose a hierarchical time-series model that can discover multiple disease pro… ▽ More

    Submitted 24 July, 2022; originally announced July 2022.

  27. arXiv:2207.00118  [pdf, other

    cs.LG cs.AI cs.CV

    ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, Sankha Subhra Mukherjee, David A. Clifton, Neil M. Robertson

    Abstract: There is a family of label modification approaches including self and non-self label correction (LC), and output regularisation. They are widely used for training robust deep neural networks (DNNs), but have not been mathematically and thoroughly analysed together. We study them and discover three key issues: (1) We are more interested in adopting Self LC as it leverages its own knowledge and requ… ▽ More

    Submitted 6 September, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: To ease the reading, a summary of changes is put in the beginning. Our source code is available at https://github.com/XinshaoAmosWang/ProSelfLC-AT

  28. arXiv:2206.06488  [pdf, other

    cs.CV cs.LG

    Multimodal Learning with Transformers: A Survey

    Authors: Peng Xu, Xiatian Zhu, David A. Clifton

    Abstract: Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks. Thanks to the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in AI research. This paper presents a comprehensive survey of Transformer techniques oriented at multimodal data. The main contents of this survey in… ▽ More

    Submitted 9 May, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: This paper is accepted by IEEE TPAMI

  29. arXiv:2206.02909  [pdf, other

    eess.SP cs.AI cs.LG

    Self-supervised Learning for Human Activity Recognition Using 700,000 Person-days of Wearable Data

    Authors: Hang Yuan, Shing Chan, Andrew P. Creagh, Catherine Tong, Aidan Acquah, David A. Clifton, Aiden Doherty

    Abstract: Advances in deep learning for human activity recognition have been relatively limited due to the lack of large labelled datasets. In this study, we leverage self-supervised learning techniques on the UK-Biobank activity tracker dataset--the largest of its kind to date--containing more than 700,000 person-days of unlabelled wearable sensor data. Our resulting activity recognition model consistently… ▽ More

    Submitted 20 June, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

    Journal ref: npj Digit. Med. 7, 91 (2024)

  30. arXiv:2205.12070  [pdf, other

    cs.LG cs.AI

    Deep Reinforcement Learning for Multi-class Imbalanced Training

    Authors: Jenny Yang, Rasheed El-Bouri, Odhran O'Donoghue, Alexander S. Lachapelle, Andrew A. S. Soltan, David A. Clifton

    Abstract: With the rapid growth of memory and computing power, datasets are becoming increasingly complex and imbalanced. This is especially severe in the context of clinical data, where there may be one rare event for many cases in the majority class. We introduce an imbalanced classification framework, based on reinforcement learning, for training extremely imbalanced data sets, and extend it for use in m… ▽ More

    Submitted 24 May, 2022; originally announced May 2022.

  31. arXiv:2202.03670  [pdf, other

    cs.CV cs.LG

    How to Understand Masked Autoencoders

    Authors: Shuhao Cao, Peng Xu, David A. Clifton

    Abstract: "Masked Autoencoders (MAE) Are Scalable Vision Learners" revolutionizes the self-supervised learning method in that it not only achieves the state-of-the-art for image pre-training, but is also a milestone that bridges the gap between visual and linguistic masked autoencoding (BERT-style) pre-trainings. However, to our knowledge, to date there are no theoretical perspectives to explain the powerfu… ▽ More

    Submitted 9 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  32. arXiv:2107.01707  [pdf, other

    cs.LG cs.CR cs.DC

    Towards Scheduling Federated Deep Learning using Meta-Gradients for Inter-Hospital Learning

    Authors: Rasheed el-Bouri, Tingting Zhu, David A. Clifton

    Abstract: Given the abundance and ease of access of personal data today, individual privacy has become of paramount importance, particularly in the healthcare domain. In this work, we aim to utilise patient data extracted from multiple hospital data centres to train a machine learning model without sacrificing patient privacy. We develop a scheduling algorithm in conjunction with a student-teacher algorithm… ▽ More

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: 11 pages, 8 figures

  33. arXiv:2106.01489  [pdf, other

    cs.LG cs.AI cs.CV

    Not All Knowledge Is Created Equal: Mutual Distillation of Confident Knowledge

    Authors: Ziyun Li, Xinshao Wang, Di Hu, Neil M. Robertson, David A. Clifton, Christoph Meinel, Haojin Yang

    Abstract: Mutual knowledge distillation (MKD) improves a model by distilling knowledge from another model. However, \textit{not all knowledge is certain and correct}, especially under adverse conditions. For example, label noise usually leads to less reliable models due to undesired memorization \cite{zhang2017understanding,arpit2017closer}. Wrong knowledge misleads the learning rather than helps. This prob… ▽ More

    Submitted 16 November, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2022 Workshop(Trustworthy and Socially Responsible Machine Learning) paper

  34. arXiv:2011.14230  [pdf, other

    eess.SP cs.LG

    CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The process of manually searching for relevant instances in, and extracting information from, clinical databases underpin a multitude of clinical tasks. Such tasks include disease diagnosis, clinical trial recruitment, and continuing medical education. This manual search-and-extract process, however, has been hampered by the growth of large-scale clinical databases and the increased prevalence of… ▽ More

    Submitted 3 October, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: Accepted at Advances in Neural Information Processing Systems (NeurIPS) 2021

  35. arXiv:2011.14227  [pdf, other

    eess.SP cs.LG

    PCPs: Patient Cardiac Prototypes

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Many clinical deep learning algorithms are population-based and difficult to interpret. Such properties limit their clinical utility as population-based findings may not generalize to individual patients and physicians are reluctant to incorporate opaque models into their clinical workflow. To overcome these obstacles, we propose to learn patient-specific embeddings, entitled patient cardiac proto… ▽ More

    Submitted 28 November, 2020; originally announced November 2020.

  36. arXiv:2005.13249  [pdf, other

    cs.LG eess.SP stat.ML

    CLOCS: Contrastive Learning of Cardiac Signals Across Space, Time, and Patients

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The healthcare industry generates troves of unlabelled physiological data. This data can be exploited via contrastive learning, a self-supervised pre-training method that encourages representations of instances to be similar to one another. We propose a family of contrastive learning methods, CLOCS, that encourages representations across space, time, \textit{and} patients to be similar to one anot… ▽ More

    Submitted 16 May, 2021; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: Accepted to ICML 2021

  37. arXiv:2005.03788  [pdf, other

    cs.LG cs.CV stat.ML

    ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson

    Abstract: To train robust deep neural networks (DNNs), we systematically study several target modification approaches, which include output regularisation, self and non-self label correction (LC). Two key issues are discovered: (1) Self LC is the most appealing as it exploits its own knowledge and requires no extra models. However, how to automatically decide the trust degree of a learner as training goes i… ▽ More

    Submitted 2 June, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: ProSelfLC is the first method to trust self knowledge progressively and adaptively. ProSelfLC redirects and promotes entropy minimisation, which is in marked contrast to recent practices of confidence penalty [42, 33, 6]

    Journal ref: CVPR 2021

  38. arXiv:2004.10468  [pdf, other

    cs.LG stat.ML

    SoQal: Selective Oracle Questioning in Active Learning

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Large sets of unlabelled data within the healthcare domain remain underutilized. Active learning offers a way to exploit these datasets by iteratively requesting an oracle (e.g. medical professional) to label instances. This process, which can be costly and time-consuming is overly-dependent upon an oracle. To alleviate this burden, we propose SoQal, a questioning strategy that dynamically determi… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

  39. arXiv:2004.09578  [pdf, other

    cs.LG stat.ML

    CLOPS: Continual Learning of Physiological Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Deep learning algorithms are known to experience destructive interference when instances violate the assumption of being independent and identically distributed (i.i.d). This violation, however, is ubiquitous in clinical settings where data are streamed temporally and from a multitude of physiological sensors. To overcome this obstacle, we propose CLOPS, a replay-based continual learning strategy.… ▽ More

    Submitted 28 November, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  40. arXiv:2004.09557  [pdf, other

    cs.LG stat.ML

    SoQal: Selective Oracle Questioning for Consistency Based Active Learning of Cardiac Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Clinical settings are often characterized by abundant unlabelled data and limited labelled data. This is typically driven by the high burden placed on oracles (e.g., physicians) to provide annotations. One way to mitigate this burden is via active learning (AL) which involves the (a) acquisition and (b) annotation of informative unlabelled instances. Whereas previous work addresses either one of t… ▽ More

    Submitted 18 May, 2022; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: ICML 2022

  41. arXiv:1912.05345  [pdf, other

    eess.SP cs.CV cs.LG

    Severity Detection Tool for Patients with Infectious Disease

    Authors: Girmaw Abebe Tadesse, Tingting Zhu, Nhan Le Nguyen Thanh, Nguyen Thanh Hung, Ha Thi Hai Duong, Truong Huu Khanh, Pham Van Quang, Duc Duong Tran, LamMinh Yen, H Rogier Van Doorn, Nguyen Van Hao, John Prince, Hamza Javed, DaniKiyasseh, Le Van Tan, Louise Thwaites, David A. Clifton

    Abstract: Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low and middle income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

  42. arXiv:1912.00354  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Preserving Patient Privacy while Training a Predictive Model of In-hospital Mortality

    Authors: Pulkit Sharma, Farah E Shamout, David A Clifton

    Abstract: Machine learning models can be used for pattern recognition in medical data in order to improve patient outcomes, such as the prediction of in-hospital mortality. Deep learning models, in particular, require large amounts of data for model training. However, the data is often collected at different hospitals and sharing is restricted due to patient privacy concerns. In this paper, we aimed to demo… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: AI for Social Good Workshop, Neurips 2019, Vancouver, Canada

  43. arXiv:1903.12141  [pdf, other

    cs.LG cs.CV stat.ML

    IMAE for Noise-Robust Learning: Mean Absolute Error Does Not Treat Examples Equally and Gradient Magnitude's Variance Matters

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson

    Abstract: In this work, we study robust deep learning against abnormal training data from the perspective of example weighting built in empirical loss functions, i.e., gradient magnitude with respect to logits, an angle that is not thoroughly studied so far. Consequently, we have two key findings: (1) Mean Absolute Error (MAE) Does Not Treat Examples Equally. We present new observations and insightful analy… ▽ More

    Submitted 1 May, 2023; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: ICLR 2023, RTML Workshop paper. For the source code, based on the requests for academic research and kindness to cite our work, we will release and maintain it in https://github.com/XinshaoAmosWang/DeepCriticalLearning

  44. Fusing Continuous-valued Medical Labels using a Bayesian Model

    Authors: Tingting Zhu, Nic Dunkley, Joachim Behar, David A. Clifton, Gari D. Clifford

    Abstract: With the rapid increase in volume of time series medical data available through wearable devices, there is a need to employ automated algorithms to label data. Examples of labels include interventions, changes in activity (e.g. sleep) and changes in physiology (e.g. arrhythmias). However, automated algorithms tend to be unreliable resulting in lower quality care. Expert annotations are scarce, exp… ▽ More

    Submitted 13 June, 2015; v1 submitted 23 March, 2015; originally announced March 2015.