Showing 1–50 of 50 results for author: Cohen, J P

Searching in archive cs.
  1. arXiv:2406.06512  [pdf, other]

    cs.CV cs.AI

    Merlin: A Vision Language Foundation Model for 3D Computed Tomography

    Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston, et al. (6 additional authors not shown)

    Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  2. arXiv:2401.12208  [pdf, other]

    cs.CV cs.CL

    CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

    Authors: Zhihong Chen, Maya Varma, Jean-Benoit Delbrouck, Magdalini Paschali, Louis Blankemeier, Dave Van Veen, Jeya Maria Jose Valanarasu, Alaa Youssef, Joseph Paul Cohen, Eduardo Pontes Reis, Emily B. Tsai, Andrew Johnston, Cameron Olsen, Tanishq Mathew Abraham, Sergios Gatidis, Akshay S. Chaudhari, Curtis Langlotz

    Abstract: Chest X-rays (CXRs) are the most frequently performed imaging test in clinical practice. Recent advances in the development of vision-language foundation models (FMs) give rise to the possibility of performing automated CXR interpretation, which can assist physicians with clinical decision-making and improve patient outcomes. However, developing FMs that can accurately interpret CXRs is challengin…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 24 pages, 8 figures

  3. arXiv:2312.02186  [pdf, other]

    cs.CV cs.AI cs.LG

    Identifying Spurious Correlations using Counterfactual Alignment

    Authors: Joseph Paul Cohen, Louis Blankemeier, Akshay Chaudhari

    Abstract: Models driven by spurious correlations often yield poor generalization performance. We propose the counterfactual alignment method to detect and explore spurious correlations of black box classifiers. Counterfactual images generated with respect to one classifier can be input into other classifiers to see if they also induce changes in the outputs of these classifiers. The relationship between the…

    Submitted 1 December, 2023; originally announced December 2023.

  4. arXiv:2304.00487  [pdf, other]

    eess.IV cs.AI cs.CV cs.HC cs.LG

    The Effect of Counterfactuals on Reading Chest X-rays

    Authors: Joseph Paul Cohen, Rupert Brooks, Sovann En, Evan Zucker, Anuj Pareek, Matthew Lungren, Akshay Chaudhari

    Abstract: This study evaluates the effect of counterfactual explanations on the interpretation of chest X-rays. We conduct a reader study with two radiologists assessing 240 chest X-ray predictions to rate their confidence that the model's prediction is correct using a 5 point scale. Half of the predictions are false positives. Each prediction is explained twice, once using traditional attribution methods a…

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: Abstract submitted to CVPR XAI4CV 2023 based on longer version: arXiv:2102.09475

  5. arXiv:2211.14830  [pdf, other]

    eess.IV cs.CV

    Medical Image Segmentation Review: The success of U-Net

    Authors: Reza Azad, Ehsan Khodapanah Aghdam, Amelie Rauland, Yiwei Jia, Atlas Haddadi Avval, Afshin Bozorgpour, Sanaz Karimijafarbigloo, Joseph Paul Cohen, Ehsan Adeli, Dorit Merhof

    Abstract: Automatic medical image segmentation is a crucial topic in the medical domain and successively a critical counterpart in the computer-aided diagnosis paradigm. U-Net is the most widespread image segmentation architecture due to its flexibility, optimized modular design, and success in all medical image modalities. Over the years, the U-Net model achieved tremendous attention from academic and indu…

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: Submitted to the IEEE Transactions on Pattern Analysis and Machine Intelligence Journal

  6. arXiv:2202.02833  [pdf, other]

    eess.IV cs.CV cs.LG

    CheXstray: Real-time Multi-Modal Data Concordance for Drift Detection in Medical Imaging AI

    Authors: Arjun Soin, Jameson Merkow, Jin Long, Joseph Paul Cohen, Smitha Saligrama, Stephen Kaiser, Steven Borg, Ivan Tarapov, Matthew P Lungren

    Abstract: Clinical Artificial Intelligence (AI) applications are rapidly expanding worldwide, and have the potential to impact all areas of medical practice. Medical imaging applications constitute a vast majority of approved clinical AI applications. Though healthcare systems are eager to adopt AI solutions, a fundamental question remains: what happens after the AI model goes into production? We…

    Submitted 17 March, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

    Comments: Added code url

  7. arXiv:2112.13734  [pdf, ps, other]

    cs.CV

    Multi-Domain Balanced Sampling Improves Out-of-Distribution Generalization of Chest X-ray Pathology Prediction Models

    Authors: Enoch Tetteh, Joseph Viviano, Yoshua Bengio, David Krueger, Joseph Paul Cohen

    Abstract: Learning models that generalize under different distribution shifts in medical imaging has been a long-standing research challenge. There have been several proposals for efficient and robust visual representation learning among vision research practitioners, especially in the sensitive and critical biomedical domain. In this paper, we propose an idea for out-of-distribution generalization of chest…

    Submitted 27 December, 2021; v1 submitted 27 December, 2021; originally announced December 2021.

    Comments: MED-NEURIPS 2021

  8. arXiv:2111.00595  [pdf, other]

    eess.IV cs.AI cs.CV

    TorchXRayVision: A library of chest X-ray datasets and models

    Authors: Joseph Paul Cohen, Joseph D. Viviano, Paul Bertin, Paul Morrison, Parsa Torabian, Matteo Guarrera, Matthew P Lungren, Akshay Chaudhari, Rupert Brooks, Mohammad Hashir, Hadrien Bertrand

    Abstract: TorchXRayVision is an open source software library for working with chest X-ray datasets and deep learning models. It provides a common interface and common pre-processing chain for a wide set of publicly available chest X-ray datasets. In addition, a number of classification and representation learning models with different architectures, trained on different data combinations, are available thro…

    Submitted 31 October, 2021; originally announced November 2021.

    Comments: Library source code: https://github.com/mlmed/torchxrayvision

  9. arXiv:2102.09582  [pdf, other]

    cs.CV eess.IV

    Benefits of Linear Conditioning with Metadata for Image Segmentation

    Authors: Andreanne Lemay, Charley Gros, Olivier Vincent, Yaou Liu, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: Medical images are often accompanied by metadata describing the image (vendor, acquisition parameters) and the patient (disease type or severity, demographics, genomics). This metadata is usually disregarded by image segmentation methods. In this work, we adapt a linear conditioning method called FiLM (Feature-wise Linear Modulation) for image segmentation tasks. This FiLM adaptation enables integ…

    Submitted 26 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at MIDL 2021

  10. arXiv:2102.09475  [pdf, other]

    cs.CV cs.AI eess.IV

    Gifsplanation via Latent Shift: A Simple Autoencoder Approach to Counterfactual Generation for Chest X-rays

    Authors: Joseph Paul Cohen, Rupert Brooks, Sovann En, Evan Zucker, Anuj Pareek, Matthew P. Lungren, Akshay Chaudhari

    Abstract: Motivation: Traditional image attribution methods struggle to satisfactorily explain predictions of neural networks. Prediction explanation is important, especially in medical imaging, for avoiding the unintended consequences of deploying AI systems when false positive predictions can impact patient care. Thus, there is a pressing need to develop improved models for model explainability and intros…

    Submitted 24 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Full paper at MIDL2021

  11. arXiv:2010.09984  [pdf, other]

    eess.IV cs.CV

    ivadomed: A Medical Imaging Deep Learning Toolbox

    Authors: Charley Gros, Andreanne Lemay, Olivier Vincent, Lucas Rouhier, Anthime Bucquet, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: ivadomed is an open-source Python package for designing, end-to-end training, and evaluating deep learning models applied to medical imaging data. The package includes APIs, command-line tools, documentation, and tutorials. ivadomed also includes pre-trained models such as spinal tumor segmentation and vertebral labeling. Original features of ivadomed include a data loader that can parse image met…

    Submitted 19 October, 2020; originally announced October 2020.

  12. arXiv:2009.08348  [pdf, other]

    cs.CV

    S2SD: Simultaneous Similarity-based Self-Distillation for Deep Metric Learning

    Authors: Karsten Roth, Timo Milbich, Björn Ommer, Joseph Paul Cohen, Marzyeh Ghassemi

    Abstract: Deep Metric Learning (DML) provides a crucial tool for visual similarity and zero-shot applications by learning generalizing embedding spaces, although recent work in DML has shown strong performance saturation across training objectives. However, generalization capacity is known to scale with the embedding space dimensionality. Unfortunately, high dimensional embeddings also create higher retriev…

    Submitted 4 June, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted to ICML2021

  13. arXiv:2007.13224  [pdf, other]

    eess.IV cs.CV

    Uniformizing Techniques to Process CT scans with 3D CNNs for Tuberculosis Prediction

    Authors: Hasib Zunair, Aimon Rahman, Nabeel Mohammed, Joseph Paul Cohen

    Abstract: A common approach to medical image analysis on volumetric data uses deep 2D convolutional neural networks (CNNs). This is largely attributed to the challenges imposed by the nature of the 3D data: variable volume size, GPU exhaustion during optimization. However, dealing with the individual slices independently in 2D CNNs deliberately discards the depth information which results in poor performanc…

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at the MICCAI 2020 International Workshop on PRedictive Intelligence In MEdicine (PRIME)

  14. arXiv:2007.04250  [pdf, other]

    cs.LG cs.CV stat.ML

    A Benchmark of Medical Out of Distribution Detection

    Authors: Tianshi Cao, Chin-Wei Huang, David Yu-Tung Hui, Joseph Paul Cohen

    Abstract: Motivation: Deep learning models deployed for use on medical tasks can be equipped with Out-of-Distribution Detection (OoDD) methods in order to avoid erroneous predictions. However it is unclear which OoDD method should be used in practice. Specific Problem: Systems trained for one particular domain of images cannot be expected to perform accurately on images of a different domain. These images s…

    Submitted 4 August, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

    Comments: Submitted to Machine Learning for Biomedical Imaging Journal (MELBA)

  15. arXiv:2006.11988  [pdf, other]

    q-bio.QM cs.CV cs.LG eess.IV

    COVID-19 Image Data Collection: Prospective Predictions Are the Future

    Authors: Joseph Paul Cohen, Paul Morrison, Lan Dao, Karsten Roth, Tim Q Duong, Marzyeh Ghassemi

    Abstract: Across the world's coronavirus disease 2019 (COVID-19) hot spots, the need to streamline patient diagnosis and management has become more pressing than ever. As one of the main imaging tools, chest X-rays (CXRs) are common, fast, non-invasive, relatively cheap, and potentially bedside to monitor the progression of the disease. This paper describes the first public COVID-19 image data collection as…

    Submitted 14 December, 2020; v1 submitted 21 June, 2020; originally announced June 2020.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org. Code for baseline experiments can be found here: https://github.com/mlmed/covid-baselines

  16. arXiv:2005.11856  [pdf, other]

    eess.IV cs.LG q-bio.QM stat.AP

    Predicting COVID-19 Pneumonia Severity on Chest X-ray with Deep Learning

    Authors: Joseph Paul Cohen, Lan Dao, Paul Morrison, Karsten Roth, Yoshua Bengio, Beiyi Shen, Almas Abbasi, Mahsa Hoshmand-Kochi, Marzyeh Ghassemi, Haifang Li, Tim Q Duong

    Abstract: Purpose: The need to streamline patient management for COVID-19 has become more pressing than ever. Chest X-rays provide a non-invasive (potentially bedside) tool to monitor the progression of the disease. In this study, we present a severity score prediction model for COVID-19 pneumonia for frontal chest X-ray images. Such a tool can gauge severity of COVID-19 lung infections (and pneumonia in ge…

    Submitted 30 June, 2020; v1 submitted 24 May, 2020; originally announced May 2020.

  17. arXiv:2004.13458  [pdf, other]

    cs.CV

    DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning

    Authors: Timo Milbich, Karsten Roth, Homanga Bharadhwaj, Samarth Sinha, Yoshua Bengio, Björn Ommer, Joseph Paul Cohen

    Abstract: Visual Similarity plays an important role in many computer vision applications. Deep metric learning (DML) is a powerful framework for learning such similarities which not only generalize from training data to identically distributed test distributions, but in particular also translate to unknown test classes. However, its prevailing learning paradigm is class-discriminative supervised training, w…

    Submitted 10 September, 2020; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: published at ECCV 2020

  18. arXiv:2003.11597  [pdf, other]

    eess.IV cs.CV cs.LG q-bio.QM

    COVID-19 Image Data Collection

    Authors: Joseph Paul Cohen, Paul Morrison, Lan Dao

    Abstract: This paper describes the initial COVID-19 open image data collection. It was created by assembling medical images from websites and publications and currently contains 123 frontal view X-rays.

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: Dataset available here: https://github.com/ieee8023/covid-chestxray-dataset

  19. arXiv:2003.04387  [pdf, other]

    eess.IV cs.CV

    Spine intervertebral disc labeling using a fully convolutional redundant counting model

    Authors: Lucas Rouhier, Francisco Perdigon Romero, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: Labeling intervertebral discs is relevant as it notably enables clinicians to understand the relationship between a patient's symptoms (pain, paralysis) and the exact level of spinal cord injury. However manually labeling those discs is a tedious and user-biased task which would benefit from automated methods. While some automated methods already exist for MRI and CT-scan, they are either not publ…

    Submitted 11 March, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: MIDL 2020

  20. arXiv:2003.04377  [pdf, other]

    eess.IV cs.CV cs.LG

    Automatic segmentation of spinal multiple sclerosis lesions: How to generalize across MRI contrasts?

    Authors: Olivier Vincent, Charley Gros, Joseph Paul Cohen, Julien Cohen-Adad

    Abstract: Despite recent improvements in medical image segmentation, the ability to generalize across imaging contrasts remains an open issue. To tackle this challenge, we implement Feature-wise Linear Modulation (FiLM) to leverage physics knowledge within the segmentation model and learn the characteristics of each contrast. Interestingly, a well-optimised U-Net reached the same performance as our FiLMed-U…

    Submitted 3 June, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: Presented at OHBM 2020 (v2-3 : corrected typos)

  21. arXiv:2002.08473  [pdf, other]

    cs.CV

    Revisiting Training Strategies and Generalization Performance in Deep Metric Learning

    Authors: Karsten Roth, Timo Milbich, Samarth Sinha, Prateek Gupta, Björn Ommer, Joseph Paul Cohen

    Abstract: Deep Metric Learning (DML) is arguably one of the most influential lines of research for learning visual similarities with many proposed approaches every year. Although the field benefits from the rapid progress, the divergence in training protocols, architectures, and parameter choices makes an unbiased comparison difficult. To provide a consistent reference point, we revisit the most widely used…

    Submitted 1 August, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: ICML 2020. Main paper 8.25 pages, 26 pages total

  22. arXiv:2002.02582  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Quantifying the Value of Lateral Views in Deep Learning for Chest X-rays

    Authors: Mohammad Hashir, Hadrien Bertrand, Joseph Paul Cohen

    Abstract: Most deep learning models in chest X-ray prediction utilize the posteroanterior (PA) view due to the lack of other views available. PadChest is a large-scale chest X-ray dataset that has almost 200 labels and multiple views available. In this work, we use PadChest to explore multiple approaches to merging the PA and lateral views for predicting the radiological labels associated with the X-ray ima…

    Submitted 6 February, 2020; originally announced February 2020.

    Comments: Under review at MIDL 2020

  23. arXiv:2002.02497  [pdf, other]

    eess.IV cs.LG q-bio.QM stat.ML

    On the limits of cross-domain generalization in automated X-ray prediction

    Authors: Joseph Paul Cohen, Mohammad Hashir, Rupert Brooks, Hadrien Bertrand

    Abstract: This large scale study focuses on quantifying which X-ray diagnostic prediction tasks generalize well across multiple different datasets. We present evidence that the issue of generalization is not due to a shift in the images but instead a shift in the labels. We study the cross-domain performance, agreement between models, and model representations. We find interesting discrepancies between perf…

    Submitted 24 May, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: Full paper at MIDL2020

  24. arXiv:1910.13249  [pdf, other]

    cs.CV cs.HC cs.LG

    Navigation Agents for the Visually Impaired: A Sidewalk Simulator and Experiments

    Authors: Martin Weiss, Simon Chamorro, Roger Girgis, Margaux Luck, Samira E. Kahou, Joseph P. Cohen, Derek Nowrouzezahrai, Doina Precup, Florian Golemo, Chris Pal

    Abstract: Millions of blind and visually-impaired (BVI) people navigate urban environments every day, using smartphones for high-level path-planning and white canes or guide dogs for local information. However, many BVI people still struggle to travel to new places. In our endeavor to create a navigation assistant for the BVI, we found that existing Reinforcement Learning (RL) environments were unsuitable f…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted at CoRL2019. Code & video available at https://mweiss17.github.io/SEVN/

  25. arXiv:1910.09600  [pdf, other]

    q-bio.GN cs.LG q-bio.QM

    Is graph-based feature selection of genes better than random?

    Authors: Mohammad Hashir, Paul Bertin, Martin Weiss, Vincent Frappier, Theodore J. Perkins, Geneviève Boucher, Joseph Paul Cohen

    Abstract: Gene interaction graphs aim to capture various relationships between genes and represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing whether those graphs capture dep…

    Submitted 27 December, 2019; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Accepted to the Machine Learning in Computational Biology (MLCB) meeting 2019. 7 pages. 4 figures. arXiv admin note: substantial text overlap with arXiv:1905.02295

  26. arXiv:1910.09570  [pdf, other]

    q-bio.QM cs.CV eess.SP stat.AP stat.ML

    Icentia11K: An Unsupervised Representation Learning Dataset for Arrhythmia Subtype Discovery

    Authors: Shawn Tan, Guillaume Androz, Ahmad Chamseddine, Pierre Fecteau, Aaron Courville, Yoshua Bengio, Joseph Paul Cohen

    Abstract: We release the largest public ECG dataset of continuous raw signals for representation learning containing 11 thousand patients and 2 billion labelled beats. Our goal is to enable semi-supervised ECG models to be made as well as to discover unknown subtypes of arrhythmia and anomalous ECG signal events. To this end, we propose an unsupervised representation learning task, evaluated in a semi-super…

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: Under Review

  27. arXiv:1910.08636  [pdf, other]

    cs.LG q-bio.QM stat.ML

    The TCGA Meta-Dataset Clinical Benchmark

    Authors: Mandana Samiei, Tobias Würfl, Tristan Deleu, Martin Weiss, Francis Dutil, Thomas Fevens, Geneviève Boucher, Sebastien Lemieux, Joseph Paul Cohen

    Abstract: Machine learning is bringing a paradigm shift to healthcare by changing the process of disease diagnosis and prognosis in clinics and hospitals. This development equips doctors and medical staff with tools to evaluate their hypotheses and hence make more precise decisions. Although most current research in the literature seeks to develop techniques and methods for predicting one particular clinica…

    Submitted 18 October, 2019; originally announced October 2019.

    Comments: 5 Pages, Submitted to MLCB 2019

  28. arXiv:1910.07655  [pdf, other]

    cs.CV cs.LG eess.IV

    Deep Semantic Segmentation of Natural and Medical Images: A Review

    Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Joseph Paul Cohen, Julien Cohen-Adad, Ghassan Hamarneh

    Abstract: The semantic image segmentation task consists of classifying each pixel of an image into an instance, where each instance corresponds to a class. This task is a part of the concept of scene understanding or better explaining the global context of an image. In the medical image analysis domain, image segmentation can be used for image-guided interventions, radiotherapy, or improved radiological dia…

    Submitted 30 March, 2024; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: 45 pages, 16 figures. Accepted for publication in Springer Artificial Intelligence Review

  29. arXiv:1910.00199  [pdf, other]

    cs.CV cs.LG eess.IV

    Saliency is a Possible Red Herring When Diagnosing Poor Generalization

    Authors: Joseph D. Viviano, Becks Simpson, Francis Dutil, Yoshua Bengio, Joseph Paul Cohen

    Abstract: Poor generalization is one symptom of models that learn to predict target variables using spuriously-correlated image features present only in the training distribution instead of the true image features that denote a class. It is often thought that this can be diagnosed visually using attribution (aka saliency) maps. We study if this assumption is correct. In some prediction tasks, such as for me…

    Submitted 10 February, 2021; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: 25 pages, 27 figures, 5 tables, code in paper (https://github.com/josephdviviano/saliency-red-herring). Published at International Conference on Learning Representations (ICLR) 2021. Previously titled "Underwhelming Generalization Improvements from Controlling Feature Attribution"

  30. arXiv:1909.06576  [pdf, ps, other]

    cs.LG stat.ML

    Torchmeta: A Meta-Learning library for PyTorch

    Authors: Tristan Deleu, Tobias Würfl, Mandana Samiei, Joseph Paul Cohen, Yoshua Bengio

    Abstract: The constant introduction of standardized benchmarks in the literature has helped accelerate the recent advances in meta-learning research. They offer a way to get a fair comparison between different algorithms, and the wide range of datasets available allows full control over the complexity of this evaluation. However, for a large majority of code available online, the data pipeline is often sp…

    Submitted 14 September, 2019; originally announced September 2019.

  31. arXiv:1905.02295  [pdf, other]

    q-bio.GN cs.AI cs.LG q-bio.QM

    Analysis of Gene Interaction Graphs as Prior Knowledge for Machine Learning Models

    Authors: Paul Bertin, Mohammad Hashir, Martin Weiss, Vincent Frappier, Theodore J. Perkins, Geneviève Boucher, Joseph Paul Cohen

    Abstract: Gene interaction graphs aim to capture various relationships between genes and can represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more consistent with biological common knowledge. In this work, we focus on assessing how well those graphs captur…

    Submitted 13 January, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

    Comments: Preprint. Under review

  32. arXiv:1904.08534  [pdf, other]

    cs.CV cs.LG eess.IV

    Do Lateral Views Help Automated Chest X-ray Predictions?

    Authors: Hadrien Bertrand, Mohammad Hashir, Joseph Paul Cohen

    Abstract: Most convolutional neural networks in chest radiology use only the frontal posteroanterior (PA) view to make a prediction. However the lateral view is known to help the diagnosis of certain diseases and conditions. The recently released PadChest dataset contains paired PA and lateral views, allowing us to study for which diseases and conditions the performance of a neural network improves when pro…

    Submitted 25 July, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: 3 pages and 1 figure. Under review as extended abstract at MIDL 2019 [arXiv:1907.08612]

    Report number: MIDL/2019/ExtendedAbstract/ryeLXFe494

  33. arXiv:1904.07478  [pdf, other]

    cs.CV cs.LG eess.IV

    GradMask: Reduce Overfitting by Regularizing Saliency

    Authors: Becks Simpson, Francis Dutil, Yoshua Bengio, Joseph Paul Cohen

    Abstract: With too few samples or too many model parameters, overfitting can inhibit the ability to generalise predictions to new data. Within medical imaging, this can occur when features are incorrectly assigned importance such as distinct hospital specific artifacts, leading to poor performance on a new dataset from a different institution without those features, which is undesirable. Most regularization…

    Submitted 16 April, 2019; originally announced April 2019.

  34. arXiv:1901.11210  [pdf, other]

    cs.CV cs.LG q-bio.TO

    Chester: A Web Delivered Locally Computed Chest X-Ray Disease Prediction System

    Authors: Joseph Paul Cohen, Paul Bertin, Vincent Frappier

    Abstract: In order to bridge the gap between Deep Learning researchers and medical professionals we develop a very accessible free prototype system which can be used by medical professionals to understand the reality of Deep Learning tools for chest X-ray diagnostics. The system is designed to be a second opinion where a user can process an image to confirm or aid in their diagnosis. Code and network weight…

    Submitted 2 February, 2020; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: Submitted to MIDL2020

  35. arXiv:1811.10120  [pdf, other]

    cs.HC cs.AI

    A Survey of Mobile Computing for the Visually Impaired

    Authors: Martin Weiss, Margaux Luck, Roger Girgis, Chris Pal, Joseph Paul Cohen

    Abstract: The number of visually impaired or blind (VIB) people in the world is estimated at several hundred million. Based on a series of interviews with the VIB and developers of assistive technology, this paper provides a survey of machine-learning based mobile applications and identifies the most relevant applications. We discuss the functionality of these apps, how they align with the needs and require…

    Submitted 27 November, 2018; v1 submitted 25 November, 2018; originally announced November 2018.

  36. arXiv:1810.03442  [pdf, other]

    q-bio.GN cs.LG stat.ML

    Towards the Latent Transcriptome

    Authors: Assya Trofimov, Francis Dutil, Claude Perreault, Sebastien Lemieux, Yoshua Bengio, Joseph Paul Cohen

    Abstract: In this work we propose a method to compute continuous embeddings for kmers from raw RNA-seq data, without the need for alignment to a reference genome. The approach uses an RNN to transform kmers of the RNA-seq reads into a 2 dimensional representation that is used to predict abundance of each kmer. We report that our model captures information of both DNA sequence similarity as well as DNA seque…

    Submitted 10 December, 2018; v1 submitted 8 October, 2018; originally announced October 2018.

    Comments: 7 figures

  37. arXiv:1810.00045  [pdf, other]

    cs.LG q-bio.NC stat.ML

    Adversarial Domain Adaptation for Stable Brain-Machine Interfaces

    Authors: Ali Farshchian, Juan A. Gallego, Joseph P. Cohen, Yoshua Bengio, Lee E. Miller, Sara A. Solla

    Abstract: Brain-Machine Interfaces (BMIs) have recently emerged as a clinically viable option to restore voluntary movements after paralysis. These devices are based on the ability to extract information about movement intent from neural signals recorded using multi-electrode arrays chronically implanted in the motor cortices of the brain. However, the inherent loss and turnover of recorded neurons requires…

    Submitted 15 January, 2019; v1 submitted 28 September, 2018; originally announced October 2018.

    Comments: 14 pages, 6 figures

  38. arXiv:1806.06975  [pdf, other]

    q-bio.GN cs.CE cs.LG stat.ML

    Towards Gene Expression Convolutions using Gene Interaction Graphs

    Authors: Francis Dutil, Joseph Paul Cohen, Martin Weiss, Georgy Derevyanko, Yoshua Bengio

    Abstract: We study the challenges of applying deep learning to gene expression data. We find experimentally that there exists non-linear signal in the data; however, it is not discovered automatically given the noise and low numbers of samples used in most research. We discuss how gene interaction graphs (same pathway, protein-protein, co-expression, or research paper text association) can be used to impose…

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 4 pages +1 page references, To appear in the International Conference on Machine Learning Workshop on Computational Biology, 2018

  39. arXiv:1806.01984  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning to rank for censored survival data

    Authors: Margaux Luck, Tristan Sylvain, Joseph Paul Cohen, Heloise Cardinal, Andrea Lodi, Yoshua Bengio

    Abstract: Survival analysis is a type of semi-supervised ranking task where the target output (the survival time) is often right-censored. Utilizing this information is a challenge because it is not obvious how to correctly incorporate these censored examples into a model. We study how three categories of loss functions, namely partial likelihood methods, rank methods, and our classification method based on…

    Submitted 8 June, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

  40. arXiv:1805.08841  [pdf, other]

    cs.CV cs.LG

    Distribution Matching Losses Can Hallucinate Features in Medical Image Translation

    Authors: Joseph Paul Cohen, Margaux Luck, Sina Honari

    Abstract: This paper discusses how distribution matching losses, such as those used in CycleGAN, when used to synthesize medical images can lead to mis-diagnosis of medical conditions. It seems appealing to use these new image synthesis methods for translating images from a source to a target domain because they can produce high quality images and some even do not require paired data. However, the basis of…

    Submitted 3 October, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: Published at Medical Image Computing & Computer Assisted Intervention (MICCAI 2018). An abstract is published at the Medical Imaging with Deep Learning Conference (MIDL 2018) as "How to Cure Cancer (in images) with Unpaired Image Translation"

    Journal ref: Medical Image Computing & Computer Assisted Intervention (MICCAI 2018 Oral)

  41. arXiv:1712.04120  [pdf, other]

    stat.ML cs.LG

    GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

    Authors: Alex Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron Courville, Yoshua Bengio

    Abstract: Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet samp…

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  42. arXiv:1707.06684  [pdf, other]

    cs.DL

    ShortScience.org - Reproducing Intuition

    Authors: Joseph Paul Cohen, Henry Z. Lo

    Abstract: We present ShortScience.org, a platform for post-publication discussion of research papers. On ShortScience.org, the research community can read and write summaries of papers in order to increase accessibility and reproducibility. Summaries contain the perspective and insight of other readers, why they liked or disliked it, and their attempt to demystify complicated sections. ShortScience.org has ove…

    Submitted 20 July, 2017; originally announced July 2017.

    Comments: To appear in International Conference on Machine Learning 2017 Workshop on Reproducibility in Machine Learning

  43. arXiv:1703.08710  [pdf, other]

    cs.CV cs.LG stat.ML

    Count-ception: Counting by Fully Convolutional Redundant Counting

    Authors: Joseph Paul Cohen, Genevieve Boucher, Craig A. Glastonbury, Henry Z. Lo, Yoshua Bengio

    Abstract: Counting objects in digital images is a process that should be replaced by machines. This tedious task is time consuming and prone to errors due to fatigue of human annotators. The goal is to have a system that takes as input an image and returns a count of the objects inside and justification for the prediction in the form of object localization. We repose a problem, originally posed by Lempitsky…

    Submitted 23 July, 2017; v1 submitted 25 March, 2017; originally announced March 2017.

    Comments: Under Review

  44. arXiv:1603.04395  [pdf, ps, other]

    cs.NI cs.CY cs.DL

    Academic Torrents: Scalable Data Distribution

    Authors: Henry Z. Lo, Joseph Paul Cohen

    Abstract: As competitions get more popular, transferring ever-larger data sets becomes infeasible and costly. For example, downloading the 157.3 GB 2012 ImageNet data set incurs about $4.33 in bandwidth costs per download. Downloading the full ImageNet data set takes 33 days. ImageNet has since become popular beyond the competition, and many papers and models now revolve around this data set. For sharing su…

    Submitted 14 March, 2016; originally announced March 2016.

    Comments: Presented at Neural Information Processing Systems 2015 Challenges in Machine Learning (CiML) workshop http://ciml.chalearn.org/home/schedule

  45. arXiv:1603.04392  [pdf, other]

    cs.CV

    Rapid building detection using machine learning

    Authors: Joseph Paul Cohen, Wei Ding, Caitlin Kuhlman, Aijun Chen, Liping Di

    Abstract: This work describes algorithms for performing discrete object detection, specifically in the case of buildings, where usually only low quality RGB-only geospatial reflective imagery is available. We utilize new candidate search and feature extraction techniques to reduce the problem to a machine learning (ML) classification task. Here we can harness the complex patterns of contrast features contai…

    Submitted 14 March, 2016; originally announced March 2016.

    Comments: Accepted to be published in Applied Intelligence 2016

  46. arXiv:1602.05931  [pdf, other]

    cs.CV

    RandomOut: Using a convolutional gradient norm to rescue convolutional filters

    Authors: Joseph Paul Cohen, Henry Z. Lo, Wei Ding

    Abstract: Filters in convolutional neural networks are sensitive to their initialization. The random numbers used to initialize filters are a bias and determine if you will "win" and converge to a satisfactory local minimum so we call this The Filter Lottery. We observe that the 28x28 Inception-V3 model without Batch Normalization fails to train 26% of the time when varying the random seed alone. This is a…

    Submitted 29 May, 2017; v1 submitted 18 February, 2016; originally announced February 2016.

    Comments: Extended version of the ICLR 2016 workshop track paper

  47. arXiv:1601.00978  [pdf, other]

    cs.CV

    Crater Detection via Convolutional Neural Networks

    Authors: Joseph Paul Cohen, Henry Z. Lo, Tingting Lu, Wei Ding

    Abstract: Craters are among the most studied geomorphic features in the Solar System because they yield important information about the past and present geological processes and provide information about the relative ages of observed geologic formations. We present a method for automatic crater detection using advanced machine learning to deal with the large amount of satellite imagery collected. The challe…

    Submitted 5 January, 2016; originally announced January 2016.

    Comments: 2 Pages. Submitted to 47th Lunar and Planetary Science Conference (LPSC 2016)

  48. arXiv:1512.00127  [pdf, other]

    cs.DL

    The cost of reading research. A study of Computer Science publication venues

    Authors: Joseph Paul Cohen, Carla Aravena, Wei Ding

    Abstract: What does the cost of academic publishing look like to the common researcher today? Our goal is to convey the current state of academic publishing, specifically in regards to the field of computer science and provide analysis and data to be used as a basis for future studies. We will focus on author and reader costs as they are the primary points of interaction within the publishing world. In this…

    Submitted 30 November, 2015; originally announced December 2015.

  49. arXiv:1505.01303  [pdf, other]

    cs.IR cs.DB

    XTreePath: A generalization of XPath to handle real world structural variation

    Authors: Joseph Paul Cohen, Wei Ding, Abraham Bagherjeiran

    Abstract: We discuss a key problem in information extraction which deals with wrapper failures due to changing content templates. A good proportion of wrapper failures are due to HTML templates changing to cause wrappers to become incompatible after element inclusion or removal in a DOM (Tree representation of HTML). We perform a large-scale empirical analysis of the causes of shift and mathematically quant…

    Submitted 26 December, 2017; v1 submitted 6 May, 2015; originally announced May 2015.

  50. arXiv:1307.7814  [pdf, other]

    cs.NI

    Wireless Message Dissemination via Selective Relay over Bluetooth (MDSRoB)

    Authors: Joseph Paul Cohen

    Abstract: This paper presents a wireless message dissemination method designed with no need to trust other users. This method utilizes modern wireless adaptors' ability to broadcast device name and identification information. Using the scanning features built into Bluetooth and Wifi, messages can be exchanged via their device names. This paper outlines a method of interchanging multiple messages to discovera…

    Submitted 30 July, 2013; originally announced July 2013.