Showing 1–44 of 44 results for author: Batmanghelich, K

  1. arXiv:2405.12255  [pdf, other]

    eess.IV cs.CV

    Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography

    Authors: Shantanu Ghosh, Clare B. Poynton, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: The lack of large and diverse training data on Computer-Aided Diagnosis (CAD) in breast cancer detection has been one of the concerns that impedes the adoption of the system. Recently, pre-training with large-scale image text datasets via Vision-Language models (VLM) (e.g., CLIP) partially addresses the issue of robustness and data efficiency in computer vision (CV). This paper proposes Mammo-CLIP,…

    Submitted 22 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: MICCAI 2024, early accept, top 11%

  2. arXiv:2310.03559  [pdf, other]

    eess.IV cs.CV

    MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images

    Authors: Yanwu Xu, Li Sun, Wei Peng, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: This paper introduces an innovative methodology for producing high-quality 3D lung CT images guided by textual information. While diffusion-based generative models are increasingly used in medical imaging, current state-of-the-art approaches are limited to low-resolution outputs and underutilize radiology reports' abundant information. The radiology reports can enhance the generation process by pr…

    Submitted 18 June, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2309.16139  [pdf, other]

    cs.CV cs.LG

    Two-Step Active Learning for Instance Segmentation with Uncertainty and Diversity Sampling

    Authors: Ke Yu, Stephen Albro, Giulia DeSalvo, Suraj Kothawade, Abdullah Rashwan, Sasan Tavakkol, Kayhan Batmanghelich, Xiaoqi Yin

    Abstract: Training high-quality instance segmentation models requires an abundance of labeled images with instance masks and classifications, which is often expensive to procure. Active learning addresses this challenge by striving for optimum performance with minimal labeling cost by selecting the most informative and representative images for labeling. Despite its potential, active learning has been less…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: UNCV ICCV 2023

  4. arXiv:2307.13698  [pdf, other]

    cs.CV cs.LG

    Exploring the Lottery Ticket Hypothesis with Explainability Methods: Insights into Sparse Network Performance

    Authors: Shantanu Ghosh, Kayhan Batmanghelich

    Abstract: Discovering a high-performing sparse network within a massive neural network is advantageous for deploying them on devices with limited storage, such as mobile phones. Additionally, model explainability is essential to fostering trust in AI. The Lottery Ticket Hypothesis (LTH) finds a network within a deep network with comparable or superior performance to the original model. However, limited stud…

    Submitted 7 July, 2023; originally announced July 2023.

  5. arXiv:2307.05350  [pdf, other]

    cs.LG cs.CV cs.CY

    Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat

    Authors: Shantanu Ghosh, Ke Yu, Forough Arabshahi, Kayhan Batmanghelich

    Abstract: ML model design either starts with an interpretable model or a Blackbox and explains it post hoc. Blackbox models are flexible but difficult to explain, while interpretable models are inherently explainable. Yet, interpretable models require extensive ML knowledge and tend to be less flexible than their Blackbox variants and to underperform them. This paper aims to blur the distinction between a post h…

    Submitted 12 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: appeared as v5 of arXiv:2302.10289 which was replaced in error, which drifted into a different work, accepted in ICML 2023

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:11360-11397, 2023

  6. arXiv:2306.12511  [pdf, other]

    cs.LG cs.CV

    Semi-Implicit Denoising Diffusion Models (SIDDMs)

    Authors: Yanwu Xu, Mingming Gong, Shaoan Xie, Wei Wei, Matthias Grundmann, Kayhan Batmanghelich, Tingbo Hou

    Abstract: Despite the proliferation of generative models, achieving fast sampling during inference without compromising sample diversity and quality remains challenging. Existing models such as Denoising Diffusion Probabilistic Models (DDPM) deliver high-quality, diverse samples but are slowed by an inherently high number of iterative steps. The Denoising Diffusion Generative Adversarial Networks (DDGAN) at…

    Submitted 10 October, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

  7. arXiv:2305.17303  [pdf, other]

    cs.CV cs.LG

    Distilling BlackBox to Interpretable models for Efficient Transfer Learning

    Authors: Shantanu Ghosh, Ke Yu, Kayhan Batmanghelich

    Abstract: Building generalizable AI models is one of the primary challenges in the healthcare domain. While radiologists rely on generalizable descriptive rules of abnormality, Neural Network (NN) models suffer even with a slight shift in input distribution (e.g., scanner type). Fine-tuning a model to transfer knowledge from one domain to another requires a significant amount of labeled data in the target d…

    Submitted 7 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: 26th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2023, Early accept

  8. arXiv:2305.14571  [pdf, other]

    cs.CL

    From Characters to Words: Hierarchical Pre-trained Language Model for Open-vocabulary Language Understanding

    Authors: Li Sun, Florian Luisier, Kayhan Batmanghelich, Dinei Florencio, Cha Zhang

    Abstract: Current state-of-the-art models for natural language understanding require a preprocessing step to convert raw text into discrete tokens. This process, known as tokenization, relies on a pre-built vocabulary of words or sub-word morphemes. This fixed vocabulary limits the model's robustness to spelling errors and its capacity to adapt to new domains. In this work, we introduce a novel open-vocabular…

    Submitted 29 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023 Main Conference

  9. arXiv:2302.10390  [pdf, other]

    cs.CV cs.AI cs.LG

    DrasCLR: A Self-supervised Framework of Learning Disease-related and Anatomy-specific Representation for 3D Medical Images

    Authors: Ke Yu, Li Sun, Junxiang Chen, Max Reynolds, Tigmanshu Chaudhary, Kayhan Batmanghelich

    Abstract: Large-scale volumetric medical images with annotation are rare, costly, and time prohibitive to acquire. Self-supervised learning (SSL) offers a promising pre-training and feature extraction solution for many downstream tasks, as it only uses unlabeled data. Recently, SSL methods based on instance discrimination have gained popularity in the medical imaging domain. However, SSL pre-trained encoder…

    Submitted 15 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Added some recent references

  10. arXiv:2302.10289  [pdf, other]

    cs.LG cs.CV

    Tackling Shortcut Learning in Deep Neural Networks: An Iterative Approach with Interpretable Models

    Authors: Shantanu Ghosh, Ke Yu, Forough Arabshahi, Kayhan Batmanghelich

    Abstract: We use concept-based interpretable models to mitigate shortcut learning. Existing methods lack interpretability. Beginning with a Blackbox, we iteratively carve out a mixture of interpretable experts (MoIE) and a residual network. Each expert explains a subset of data using First Order Logic (FOL). While explaining a sample, the FOL from biased BB-derived MoIE detects the shortcut effectively. Fin…

    Submitted 7 July, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 2nd Workshop on Spurious Correlations, Invariance, and Stability, ICML 2023

  11. arXiv:2302.09344  [pdf, other]

    cs.LG cs.AI cs.CV

    Beyond Distribution Shift: Spurious Features Through the Lens of Training Dynamics

    Authors: Nihal Murali, Aahlad Puli, Ke Yu, Rajesh Ranganath, Kayhan Batmanghelich

    Abstract: Deep Neural Networks (DNNs) are prone to learning spurious features that correlate with the label during training but are irrelevant to the learning problem. This hurts model generalization and poses problems when deploying them in safety-critical applications. This paper aims to better understand the effects of spurious features through the lens of the learning dynamics of the internal neurons du…

    Submitted 14 October, 2023; v1 submitted 18 February, 2023; originally announced February 2023.

    Comments: Main paper: 12 pages, 2 tables, and 10 figures. Supplementary: 10 pages and 9 figures. Accepted in TMLR23 (https://openreview.net/pdf?id=Tkvmt9nDmB)

  12. arXiv:2210.12196  [pdf, other]

    cs.LG cs.CV

    Augmentation by Counterfactual Explanation -- Fixing an Overconfident Classifier

    Authors: Sumedha Singla, Nihal Murali, Forough Arabshahi, Sofia Triantafyllou, Kayhan Batmanghelich

    Abstract: A highly accurate but overconfident model is ill-suited for deployment in critical applications such as healthcare and autonomous driving. The classification outcome should reflect a high uncertainty on ambiguous in-distribution samples that lie close to the decision boundary. The model should also refrain from making overconfident decisions on samples that lie far outside its training distributio…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted in WACV 2023

  13. arXiv:2208.06361  [pdf, other]

    q-bio.BM cs.LG

    Hyperbolic Molecular Representation Learning for Drug Repositioning

    Authors: Ke Yu, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: Learning accurate drug representations is essential for tasks such as computational drug repositioning. A drug hierarchy is a valuable source that encodes knowledge of relations among drugs in a tree-like structure where drugs that act on the same organs, treat the same disease, or bind to the same biological target are grouped together. However, its utility in learning drug representations has not…

    Submitted 6 July, 2022; originally announced August 2022.

    Comments: Accepted by NeurIPS workshop 2020. arXiv admin note: substantial text overlap with arXiv:2006.00986

  14. arXiv:2207.02957  [pdf, other]

    eess.IV cs.CV cs.LG

    Context-aware Self-supervised Learning for Medical Images Using Graph Neural Network

    Authors: Li Sun, Ke Yu, Kayhan Batmanghelich

    Abstract: Although self-supervised learning enables us to bootstrap the training by exploiting unlabeled data, the generic self-supervised methods for natural images do not sufficiently incorporate the context. For medical images, a desirable method should be sensitive enough to detect deviation from normal-appearing tissue of each anatomical region; here, anatomy is the context. We introduce a novel approa…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted by NeurIPS workshop 2020. arXiv admin note: substantial text overlap with arXiv:2012.06457

  15. arXiv:2206.13737  [pdf, other]

    cs.CV

    Adversarial Consistency for Single Domain Generalization in Medical Image Segmentation

    Authors: Yanwu Xu, Shaoan Xie, Maxwell Reynolds, Matthew Ragoza, Mingming Gong, Kayhan Batmanghelich

    Abstract: An organ segmentation method that can generalize to unseen contrasts and scanner settings can significantly reduce the need for retraining of deep learning models. Domain Generalization (DG) aims to achieve this goal. However, most DG methods for segmentation require training data from multiple domains during training. We propose a novel adversarial domain generalization method for organ segmentat…

    Submitted 29 June, 2022; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: MICCAI 2022 accepted

  16. arXiv:2206.12704  [pdf, other]

    cs.CV cs.LG

    Anatomy-Guided Weakly-Supervised Abnormality Localization in Chest X-rays

    Authors: Ke Yu, Shantanu Ghosh, Zhexiong Liu, Christopher Deible, Kayhan Batmanghelich

    Abstract: Creating a large-scale dataset of abnormality annotation on medical images is a labor-intensive and costly task. Leveraging weak supervision from readily available data such as radiology reports can compensate for the lack of large-scale data for anomaly detection methods. However, most of the current methods only use image-level pathological observations, failing to utilize the relevant anatomy mentions…

    Submitted 25 June, 2022; originally announced June 2022.

    Comments: Accepted by MICCAI 2022

  17. arXiv:2203.12707  [pdf, other]

    cs.CV eess.IV

    Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation

    Authors: Yanwu Xu, Shaoan Xie, Wenhao Wu, Kun Zhang, Mingming Gong, Kayhan Batmanghelich

    Abstract: Unpaired image-to-image translation (I2I) is an ill-posed problem, as an infinite number of translation functions can map the source domain distribution to the target distribution. Therefore, much effort has been put into designing suitable constraints, e.g., cycle consistency (CycleGAN), geometry consistency (GCGAN), and contrastive learning-based constraints (CUTGAN), that help better pose the p…

    Submitted 29 March, 2022; v1 submitted 23 March, 2022; originally announced March 2022.

    Comments: CVPR 2022 accepted paper

  18. CrossMoDA 2021 challenge: Benchmark of Cross-Modality Domain Adaptation techniques for Vestibular Schwannoma and Cochlea Segmentation

    Authors: Reuben Dorent, Aaron Kujawa, Marina Ivory, Spyridon Bakas, Nicola Rieke, Samuel Joutard, Ben Glocker, Jorge Cardoso, Marc Modat, Kayhan Batmanghelich, Arseniy Belkov, Maria Baldeon Calisto, Jae Won Choi, Benoit M. Dawant, Hexin Dong, Sergio Escalera, Yubo Fan, Lasse Hansen, Mattias P. Heinrich, Smriti Joshi, Victoriya Kashtanova, Hyeon Gyu Kim, Satoshi Kondo, Christian N. Kruse, Susana K. Lai-Yuen, et al. (15 additional authors not shown)

    Abstract: Domain Adaptation (DA) has recently raised strong interest in the medical imaging community. While a large variety of DA techniques has been proposed for image segmentation, most of these techniques have been validated either on private datasets or on small publicly available datasets. Moreover, these datasets mostly addressed single-class problems. To tackle these limitations, the Cross-Modality…

    Submitted 14 December, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: In Medical Image Analysis

  19. arXiv:2108.08432  [pdf, other]

    cs.CV

    Box-Adapt: Domain-Adaptive Medical Image Segmentation using Bounding Box Supervision

    Authors: Yanwu Xu, Mingming Gong, Shaoan Xie, Kayhan Batmanghelich

    Abstract: Deep learning has achieved remarkable success in medical image segmentation, but it usually requires a large number of images labeled with fine-grained segmentation masks, and the annotation of these masks can be very expensive and time-consuming. Therefore, recent methods try to use unsupervised domain adaptation (UDA) methods to borrow information from labeled data from other datasets (source dom…

    Submitted 27 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Journal ref: IJCAI Workshop on Weakly Supervised Representation Learning, 2021

  20. arXiv:2107.06098  [pdf, other]

    cs.LG cs.CV

    Using Causal Analysis for Conceptual Deep Learning Explanation

    Authors: Sumedha Singla, Stephen Wallace, Sofia Triantafillou, Kayhan Batmanghelich

    Abstract: Model explainability is essential for the creation of trustworthy Machine Learning models in healthcare. An ideal explanation resembles the decision-making process of a domain expert and is expressed using concepts or terminology that is meaningful to the clinicians. To provide such an explanation, we first associate the hidden units of the classifier to clinically relevant concepts. We take advan…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: 10 pages, 6 figures

  21. arXiv:2106.11230  [pdf, other]

    cs.LG

    Can contrastive learning avoid shortcut solutions?

    Authors: Joshua Robinson, Li Sun, Ke Yu, Kayhan Batmanghelich, Stefanie Jegelka, Suvrit Sra

    Abstract: The generalization of representations learned via contrastive learning depends crucially on what features of the data are extracted. However, we observe that the contrastive loss does not always sufficiently guide which features are extracted, a behavior that can negatively impact the performance on downstream tasks via "shortcuts", i.e., by inadvertently suppressing important predictive features.…

    Submitted 19 December, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  22. arXiv:2101.05145  [pdf, other]

    eess.IV cs.CV cs.LG

    Self-Supervised Vessel Enhancement Using Flow-Based Consistencies

    Authors: Rohit Jena, Sumedha Singla, Kayhan Batmanghelich

    Abstract: Vessel segmentation is an essential task in many clinical applications. Although supervised methods have achieved state-of-the-art performance, acquiring expert annotation is laborious and mostly limited for two-dimensional datasets with a small sample size. On the contrary, unsupervised methods rely on handcrafted features to detect tube-like structures such as vessels. However, those methods require…

    Submitted 22 July, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: Early accept at MICCAI 2021

  23. arXiv:2101.04230  [pdf, other]

    cs.CV eess.IV

    Explaining the Black-box Smoothly- A Counterfactual Approach

    Authors: Sumedha Singla, Motahhare Eslami, Brian Pollack, Stephen Wallace, Kayhan Batmanghelich

    Abstract: We propose a BlackBox Counterfactual Explainer, designed to explain image classification models for medical applications. Classical approaches (e.g., saliency maps) that assess feature importance do not explain "how" imaging features in important anatomical regions are relevant to the classification decision. Our framework explains the decision for a target class by gradually "exaggerating" the se…

    Submitted 18 November, 2022; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: Preprint; accepted in the Medical Image Analysis journal

  24. arXiv:2012.06457  [pdf, other]

    eess.IV cs.CV cs.LG

    Context Matters: Graph-based Self-supervised Representation Learning for Medical Images

    Authors: Li Sun, Ke Yu, Kayhan Batmanghelich

    Abstract: Supervised learning methods require a large volume of annotated datasets. Collecting such datasets is time-consuming and expensive. Until now, very few annotated COVID-19 imaging datasets are available. Although self-supervised learning enables us to bootstrap the training by exploiting unlabeled data, the generic self-supervised methods for natural images do not sufficiently incorporate the conte…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted to AAAI 2021

  25. Hierarchical Amortized Training for Memory-efficient High Resolution 3D GAN

    Authors: Li Sun, Junxiang Chen, Yanwu Xu, Mingming Gong, Ke Yu, Kayhan Batmanghelich

    Abstract: Generative Adversarial Networks (GAN) have many potential medical imaging applications, including data augmentation, domain adaptation, and model explanation. Due to the limited memory of Graphical Processing Units (GPUs), most current 3D GAN models are trained on low-resolution medical images; these models either cannot scale to high-resolution or are prone to patchy artifacts. In this work, we p…

    Submitted 12 September, 2022; v1 submitted 4 August, 2020; originally announced August 2020.

    Comments: Paper accepted to IEEE Journal of Biomedical and Health Informatics, code available at https://github.com/batmanlab/HA-GAN

    Journal ref: in IEEE Journal of Biomedical and Health Informatics, vol. 26, no. 8, pp. 3966-3975, Aug. 2022

  26. arXiv:2006.00986  [pdf, other]

    cs.LG q-bio.MN q-bio.QM stat.ML

    Semi-Supervised Hierarchical Drug Embedding in Hyperbolic Space

    Authors: Ke Yu, Shyam Visweswaran, Kayhan Batmanghelich

    Abstract: Learning accurate drug representation is essential for tasks such as computational drug repositioning and prediction of drug side-effects. A drug hierarchy is a valuable source that encodes human knowledge of drug relations in a tree-like structure where drugs that act on the same organs, treat the same disease, or bind to the same biological target are grouped together. However, its utility in le…

    Submitted 1 June, 2020; originally announced June 2020.

  27. arXiv:1911.00483  [pdf, other]

    cs.LG cs.AI cs.CV

    Explanation by Progressive Exaggeration

    Authors: Sumedha Singla, Brian Pollack, Junxiang Chen, Kayhan Batmanghelich

    Abstract: As machine learning methods see greater adoption and implementation in high stakes applications such as medical image diagnosis, the need for model interpretability and explanation has become more critical. Classical approaches that assess feature importance (e.g. saliency maps) do not explain how and why a particular region of an image is relevant to the prediction. We propose a method that expla…

    Submitted 10 February, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

  28. arXiv:1910.05898  [pdf, other]

    cs.LG stat.ML

    Robust Ordinal VAE: Employing Noisy Pairwise Comparisons for Disentanglement

    Authors: Junxiang Chen, Kayhan Batmanghelich

    Abstract: Recent work by Locatello et al. (2018) has shown that an inductive bias is required to disentangle factors of interest in Variational Autoencoder (VAE). Motivated by a real-world problem, we propose a setting where such bias is introduced by providing pairwise ordinal comparisons between instances, based on the desired factor to be disentangled. For example, a doctor compares pairs of patients bas…

    Submitted 13 October, 2019; originally announced October 2019.

  29. arXiv:1909.00626  [pdf, other]

    eess.IV

    Uncertainty-Driven Semantic Segmentation through Human-Machine Collaborative Learning

    Authors: Mahdyar Ravanbakhsh, Tassilo Klein, Kayhan Batmanghelich, Moin Nabi

    Abstract: Deep learning-based approaches achieve state-of-the-art performance in the majority of image segmentation benchmarks. However, training of such models requires a sizable amount of manual annotations. In order to reduce this effort, we propose a method based on conditional Generative Adversarial Network (cGAN), which addresses segmentation in a semi-supervised setup and in a human-in-the-loop fashi…

    Submitted 2 September, 2019; originally announced September 2019.

    Comments: MIDL 2019 [arXiv:1907.08612]

    Report number: MIDL/2019/ExtendedAbstract/rkgnwY04cV

  30. arXiv:1907.06882  [pdf, other]

    cs.CV

    Learning Depth from Monocular Videos Using Synthetic Data: A Temporally-Consistent Domain Adaptation Approach

    Authors: Yipeng Mou, Mingming Gong, Huan Fu, Kayhan Batmanghelich, Kun Zhang, Dacheng Tao

    Abstract: The majority of state-of-the-art monocular depth estimation methods are supervised learning approaches. The success of such approaches heavily depends on the high-quality depth labels which are expensive to obtain. Some recent methods try to learn depth networks by leveraging unsupervised cues from monocular videos which are easier to acquire but less reliable. In this paper, we propose to resolve thi…

    Submitted 26 November, 2019; v1 submitted 16 July, 2019; originally announced July 2019.

  31. arXiv:1907.02690  [pdf, other]

    cs.LG stat.ML

    Twin Auxiliary Classifiers GAN

    Authors: Mingming Gong, Yanwu Xu, Chunyuan Li, Kun Zhang, Kayhan Batmanghelich

    Abstract: Conditional generative models have enjoyed remarkable progress over the past few years. One of the popular conditional models is Auxiliary Classifier GAN (AC-GAN), which generates highly discriminative images by extending the loss function of GAN with an auxiliary classifier. However, the diversity of the generated samples by AC-GAN tends to decrease as the number of classes increases, hence limiting its…

    Submitted 4 November, 2019; v1 submitted 5 July, 2019; originally announced July 2019.

  32. arXiv:1906.01044  [pdf, other]

    cs.LG stat.ML

    Weakly Supervised Disentanglement by Pairwise Similarities

    Authors: Junxiang Chen, Kayhan Batmanghelich

    Abstract: Recently, research related to unsupervised disentanglement learning with deep generative models has gained substantial popularity. However, without introducing supervision, there is no guarantee that the factors of interest can be successfully recovered. Motivated by a real-world problem, we propose a setting where the user introduces weak supervision by providing similarities between instances…

    Submitted 11 March, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

  33. arXiv:1904.01612  [pdf, other]

    cs.LG stat.ML

    Generative-Discriminative Complementary Learning

    Authors: Yanwu Xu, Mingming Gong, Junxiang Chen, Tongliang Liu, Kun Zhang, Kayhan Batmanghelich

    Abstract: The majority of state-of-the-art deep learning methods are discriminative approaches, which model the conditional distribution of labels given input features. The success of such approaches heavily depends on high-quality labeled instances, which are not easy to obtain, especially as the number of candidate classes increases. In this paper, we study the complementary learning problem. Unlike ordinary…

    Submitted 11 September, 2019; v1 submitted 2 April, 2019; originally announced April 2019.

  34. arXiv:1901.07076  [pdf, other]

    cs.CV

    Robust Angular Local Descriptor Learning

    Authors: Yanwu Xu, Mingming Gong, Tongliang Liu, Kayhan Batmanghelich, Chaohui Wang

    Abstract: In recent years, the learned local descriptors have outperformed handcrafted ones by a large margin, due to the powerful deep convolutional neural network architectures such as L2-Net [1] and triplet based metric learning [2]. However, there are two problems in the current methods, which hinder the overall performance. Firstly, the widely-used margin loss is sensitive to incorrect correspondences…

    Submitted 26 January, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: Accepted by ACCV2018

  35. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  36. arXiv:1810.03256  [pdf, other]

    stat.ML cs.LG

    Deep Diffeomorphic Normalizing Flows

    Authors: Hadi Salman, Payman Yadollahpour, Tom Fletcher, Kayhan Batmanghelich

    Abstract: The Normalizing Flow (NF) models a general probability density by estimating an invertible transformation applied on samples drawn from a known distribution. We introduce a new type of NF, called Deep Diffeomorphic Normalizing Flow (DDNF). A diffeomorphic flow is an invertible function where both the function and its inverse are smooth. We construct the flow using an ordinary differential equation…

    Submitted 22 November, 2018; v1 submitted 7 October, 2018; originally announced October 2018.

  37. arXiv:1809.05852  [pdf, other]

    cs.CV

    Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping

    Authors: Huan Fu, Mingming Gong, Chaohui Wang, Kayhan Batmanghelich, Kun Zhang, Dacheng Tao

    Abstract: Unsupervised domain mapping aims to learn a function to translate domain X to Y by a function GXY in the absence of paired examples. Finding the optimal GXY without paired data is an ill-posed problem, so appropriate constraints are required to obtain reasonable solutions. One of the most prominent constraints is cycle consistency, which enforces the translated image by GXY to be translated back t…

    Submitted 25 November, 2018; v1 submitted 16 September, 2018; originally announced September 2018.

  38. arXiv:1806.11217  [pdf, other]

    cs.CV

    Subject2Vec: Generative-Discriminative Approach from a Set of Image Patches to a Vector

    Authors: Sumedha Singla, Mingming Gong, Siamak Ravanbakhsh, Frank Sciurba, Barnabas Poczos, Kayhan N. Batmanghelich

    Abstract: We propose an attention-based method that aggregates local image features to a subject-level representation for predicting disease severity. In contrast to classical deep learning that requires a fixed dimensional input, our method operates on a set of image patches; hence it can accommodate variable length input image without image resizing. The model learns a clinically interpretable subject-lev…

    Submitted 28 June, 2018; originally announced June 2018.

    Comments: MICCAI 2018

  39. arXiv:1806.02446  [pdf, other]

    cs.CV

    Deep Ordinal Regression Network for Monocular Depth Estimation

    Authors: Huan Fu, Mingming Gong, Chaohui Wang, Kayhan Batmanghelich, Dacheng Tao

    Abstract: Monocular depth estimation, which plays a crucial role in understanding 3D scene geometry, is an ill-posed problem. Recent methods have gained significant improvement by exploring image-level information and hierarchical features from deep convolutional neural networks (DCNNs). These methods model depth estimation as a regression problem and train the regression networks by minimizing mean squared…

    Submitted 6 June, 2018; originally announced June 2018.

    Comments: CVPR 2018

  40. arXiv:1804.04333  [pdf, other]

    stat.ML cs.LG

    Causal Generative Domain Adaptation Networks

    Authors: Mingming Gong, Kun Zhang, Biwei Huang, Clark Glymour, Dacheng Tao, Kayhan Batmanghelich

    Abstract: An essential problem in domain adaptation is to understand and make use of distribution changes across domains. For this purpose, we first propose a flexible Generative Domain Adaptation Network (G-DAN) with specific latent variables to capture changes in the generating process of features across domains. By explicitly modeling the changes, one can even generate data in new domains using the gener…

    Submitted 28 June, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

    Comments: 12 pages

  41. arXiv:1707.09724  [pdf, other]

    stat.ML

    Transfer Learning with Label Noise

    Authors: Xiyu Yu, Tongliang Liu, Mingming Gong, Kun Zhang, Kayhan Batmanghelich, Dacheng Tao

    Abstract: Transfer learning aims to improve learning in a target domain by borrowing knowledge from a related but different source domain. To reduce the distribution shift between source and target domains, recent methods have focused on exploring invariant representations that have similar distributions across domains. However, when learning this invariant knowledge, existing methods assume that the labels i…

    Submitted 7 August, 2018; v1 submitted 31 July, 2017; originally announced July 2017.

  42. arXiv:1706.03768  [pdf, other]

    stat.ME cs.AI stat.ML

    Causal Discovery in the Presence of Measurement Error: Identifiability Conditions

    Authors: Kun Zhang, Mingming Gong, Joseph Ramsey, Kayhan Batmanghelich, Peter Spirtes, Clark Glymour

    Abstract: Measurement error in the observed values of the variables can greatly change the output of various causal discovery methods. This problem has received much attention in multiple fields, but it is not clear to what extent the causal model for the measurement-error-free variables can be identified in the presence of measurement error with unknown variance. In this paper, we study precise sufficient…

    Submitted 10 June, 2017; originally announced June 2017.

    Comments: 15 pages, 5 figures, 1 table

  43. arXiv:1604.00126  [pdf, other]

    cs.CL cs.IR cs.LG stat.ML

    Nonparametric Spherical Topic Modeling with Word Embeddings

    Authors: Kayhan Batmanghelich, Ardavan Saeedi, Karthik Narasimhan, Sam Gershman

    Abstract: Traditional topic models do not account for semantic regularities in language. Recent distributional representations of words exhibit semantic consistency over directional metrics such as cosine similarity. However, neither categorical nor Gaussian observational distributions used in existing topic models are appropriate to leverage such correlations. In this paper, we propose to use the von Mises…

    Submitted 1 April, 2016; originally announced April 2016.

  44. arXiv:1411.6307  [pdf, other]

    cs.LG cs.AI stat.ML

    Diversifying Sparsity Using Variational Determinantal Point Processes

    Authors: Nematollah Kayhan Batmanghelich, Gerald Quon, Alex Kulesza, Manolis Kellis, Polina Golland, Luke Bornn

    Abstract: We propose a novel diverse feature selection method based on determinantal point processes (DPPs). Our model enables one to flexibly define diversity based on the covariance of features (similar to orthogonal matching pursuit) or alternatively based on side information. We introduce our approach in the context of Bayesian sparse regression, employing a DPP as a variational approximation to the tru…

    Submitted 23 November, 2014; originally announced November 2014.

    Comments: 9 pages, 3 figures