Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–29 of 29 results for author: Valanarasu, J M J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06512  [pdf, other

    cs.CV cs.AI

    Merlin: A Vision Language Foundation Model for 3D Computed Tomography

    Authors: Louis Blankemeier, Joseph Paul Cohen, Ashwin Kumar, Dave Van Veen, Syed Jamal Safdar Gardezi, Magdalini Paschali, Zhihong Chen, Jean-Benoit Delbrouck, Eduardo Reis, Cesar Truyts, Christian Bluethgen, Malte Engmann Kjeldskov Jensen, Sophie Ostmeier, Maya Varma, Jeya Maria Jose Valanarasu, Zhongnan Fang, Zepeng Huo, Zaid Nabulsi, Diego Ardila, Wei-Hung Weng, Edson Amaro Junior, Neera Ahuja, Jason Fries, Nigam H. Shah, Andrew Johnston , et al. (6 additional authors not shown)

    Abstract: Over 85 million computed tomography (CT) scans are performed annually in the US, of which approximately one quarter focus on the abdomen. Given the current radiologist shortage, there is a large impetus to use artificial intelligence to alleviate the burden of interpreting these complex imaging studies. Prior state-of-the-art approaches for automated medical image interpretation leverage vision la… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  2. arXiv:2404.17033  [pdf, other

    cs.CV

    Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation

    Authors: Tanvi Deshpande, Eva Prakash, Elsie Gyang Ross, Curtis Langlotz, Andrew Ng, Jeya Maria Jose Valanarasu

    Abstract: The high cost of creating pixel-by-pixel gold-standard labels, limited expert availability, and presence of diverse tasks make it challenging to generate segmentation labels to train deep learning models for medical imaging tasks. In this work, we present a new approach to overcome the hurdle of costly medical image labeling by leveraging foundation models like Segment Anything Model (SAM) and its… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at MIDL 2024

  3. arXiv:2404.13185  [pdf, other

    eess.IV cs.CV

    Unlocking Robust Segmentation Across All Age Groups via Continual Learning

    Authors: Chih-Ying Liu, Jeya Maria Jose Valanarasu, Camila Gonzalez, Curtis Langlotz, Andrew Ng, Sergios Gatidis

    Abstract: Most deep learning models in medical imaging are trained on adult data with unclear performance on pediatric images. In this work, we aim to address this challenge in the context of automated anatomy segmentation in whole-body Computed Tomography (CT). We evaluate the performance of CT organ segmentation algorithms trained on adult data when applied to pediatric CT volumes and identify substantial… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  4. arXiv:2404.09977  [pdf, other

    cs.CV

    MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

    Authors: Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M Patel

    Abstract: Large diffusion-based Text-to-Image (T2I) models have shown impressive generative powers for text-to-image generation as well as spatially conditioned image generation. For most applications, we can train the model end-toend with paired data to obtain photorealistic generation quality. However, to add an additional task, one often needs to retrain the model from scratch using paired data across al… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  5. arXiv:2404.09976  [pdf, other

    cs.CV

    Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers

    Authors: Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Recently, diffusion transformers have gained wide attention with its excellent performance in text-to-image and text-to-vidoe models, emphasizing the need for transformers as backbone for diffusion models. Transformer-based models have shown better generalization capability compared to CNN-based models for general vision tasks. However, much less has been explored in the existing literature regard… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2401.12208  [pdf, other

    cs.CV cs.CL

    CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

    Authors: Zhihong Chen, Maya Varma, Jean-Benoit Delbrouck, Magdalini Paschali, Louis Blankemeier, Dave Van Veen, Jeya Maria Jose Valanarasu, Alaa Youssef, Joseph Paul Cohen, Eduardo Pontes Reis, Emily B. Tsai, Andrew Johnston, Cameron Olsen, Tanishq Mathew Abraham, Sergios Gatidis, Akshay S. Chaudhari, Curtis Langlotz

    Abstract: Chest X-rays (CXRs) are the most frequently performed imaging test in clinical practice. Recent advances in the development of vision-language foundation models (FMs) give rise to the possibility of performing automated CXR interpretation, which can assist physicians with clinical decision-making and improve patient outcomes. However, developing FMs that can accurately interpret CXRs is challengin… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 24 pages, 8 figures

  7. arXiv:2307.16896  [pdf, other

    cs.CV

    Disruptive Autoencoders: Leveraging Low-level features for 3D Medical Image Pre-training

    Authors: Jeya Maria Jose Valanarasu, Yucheng Tang, Dong Yang, Ziyue Xu, Can Zhao, Wenqi Li, Vishal M. Patel, Bennett Landman, Daguang Xu, Yufan He, Vishwesh Nath

    Abstract: Harnessing the power of pre-training on large-scale datasets like ImageNet forms a fundamental building block for the progress of representation learning-driven solutions in computer vision. Medical images are inherently different from natural images as they are acquired in the form of many modalities (CT, MR, PET, Ultrasound etc.) and contain granulated information like tissue, lesion, organs etc… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Preprint

  8. arXiv:2304.04745  [pdf, other

    cs.CV

    Ambiguous Medical Image Segmentation using Diffusion Models

    Authors: Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel

    Abstract: Collective insights from a group of experts have always proven to outperform an individual's best diagnostic for clinical tasks. For the task of medical image segmentation, existing research on AI-based alternatives focuses more on developing models that can imitate the best individual rather than harnessing the power of expert groups. In this paper, we introduce a single diffusion model-based app… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  9. arXiv:2303.13504  [pdf, other

    cs.CV

    ReBotNet: Fast Real-time Video Enhancement

    Authors: Jeya Maria Jose Valanarasu, Rahul Garg, Andeep Toor, Xin Tong, Weijuan Xi, Andreas Lugmayr, Vishal M. Patel, Anne Menini

    Abstract: Most video restoration networks are slow, have high computational load, and can't be used for real-time video enhancement. In this work, we design an efficient and fast framework to perform real-time video enhancement for practical use-cases like live video calls and video streams. Our proposed method, called Recurrent Bottleneck Mixer Network (ReBotNet), employs a dual-branch framework. The first… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Project Website: https://jeya-maria-jose.github.io/rebotnet-web/

  10. arXiv:2303.11313  [pdf, other

    cs.CV

    CLIP goes 3D: Leveraging Prompt Tuning for Language Grounded 3D Recognition

    Authors: Deepti Hegde, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Vision-Language models like CLIP have been widely adopted for various tasks due to their impressive zero-shot capabilities. However, CLIP is not suitable for extracting 3D geometric features as it was trained on only images and text by natural language supervision. We work on addressing this limitation and propose a new framework termed CG3D (CLIP Goes 3D) where a 3D encoder is learned to exhibit… ▽ More

    Submitted 18 April, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: Website: https://jeya-maria-jose.github.io/cg3d-web/

  11. arXiv:2206.08936  [pdf, other

    eess.IV cs.CV

    Simultaneous Bone and Shadow Segmentation Network using Task Correspondence Consistency

    Authors: Aimon Rahman, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel

    Abstract: Segmenting both bone surface and the corresponding acoustic shadow are fundamental tasks in ultrasound (US) guided orthopedic procedures. However, these tasks are challenging due to minimal and blurred bone surface response in US images, cross-machine discrepancy, imaging artifacts, and low signal-to-noise ratio. Notably, bone shadows are caused by a significant acoustic impedance mismatch between… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted at MICCAI 2022

  12. arXiv:2206.08481  [pdf, other

    eess.IV cs.CV

    Orientation-guided Graph Convolutional Network for Bone Surface Segmentation

    Authors: Aimon Rahman, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Ilker Hacihaliloglu, Vishal M Patel

    Abstract: Due to imaging artifacts and low signal-to-noise ratio in ultrasound images, automatic bone surface segmentation networks often produce fragmented predictions that can hinder the success of ultrasound-guided computer-assisted surgical procedures. Existing pixel-wise predictions often fail to capture the accurate topology of bone tissues due to a lack of supervision to enforce connectivity. In this… ▽ More

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: Accepted at MICCAI 2022

  13. arXiv:2205.15906  [pdf, ps, other

    cs.CV eess.IV

    SAR Despeckling Using Overcomplete Convolutional Networks

    Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Synthetic Aperture Radar (SAR) despeckling is an important problem in remote sensing as speckle degrades SAR images, affecting downstream tasks like detection and segmentation. Recent studies show that convolutional neural networks(CNNs) outperform classical despeckling methods. Traditional CNNs try to increase the receptive field size as the network goes deeper, thus extracting global features. H… ▽ More

    Submitted 31 May, 2022; originally announced May 2022.

    Comments: Accepted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at https://github.com/malshaV/sar_overcomplete

  14. arXiv:2203.15792  [pdf, other

    cs.CV

    Target and Task specific Source-Free Domain Adaptive Image Segmentation

    Authors: Vibashan VS, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Solving the domain shift problem during inference is essential in medical imaging, as most deep-learning based solutions suffer from it. In practice, domain shifts are tackled by performing Unsupervised Domain Adaptation (UDA), where a model is adapted to an unlabelled target domain by leveraging the labelled source data. In medical scenarios, the data comes with huge privacy concerns making it di… ▽ More

    Submitted 10 March, 2023; v1 submitted 29 March, 2022; originally announced March 2022.

  15. arXiv:2203.08216  [pdf, other

    cs.CV

    Interactive Portrait Harmonization

    Authors: Jeya Maria Jose Valanarasu, He Zhang, Jianming Zhang, Yilin Wang, Zhe Lin, Jose Echevarria, Yinglan Ma, Zijun Wei, Kalyan Sunkavalli, Vishal M. Patel

    Abstract: Current image harmonization methods consider the entire background as the guidance for harmonization. However, this may limit the capability for user to choose any specific object/person in the background to guide the harmonization. To enable flexible interaction between user and harmonization, we introduce interactive harmonization, a new setting where the harmonization is performed with respect… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  16. arXiv:2203.05574  [pdf, other

    eess.IV cs.CV

    On-the-Fly Test-time Adaptation for Medical Image Segmentation

    Authors: Jeya Maria Jose Valanarasu, Pengfei Guo, Vibashan VS, Vishal M. Patel

    Abstract: One major problem in deep learning-based solutions for medical imaging is the drop in performance when a model is tested on a data distribution different from the one that it is trained on. Adapting the source model to target data distribution at test-time is an efficient solution for the data-shift problem. Previous methods solve this by adapting the model to target distribution by using techniqu… ▽ More

    Submitted 10 March, 2022; originally announced March 2022.

    Comments: Tech Report

  17. arXiv:2203.04967  [pdf, other

    eess.IV cs.CV

    UNeXt: MLP-based Rapid Medical Image Segmentation Network

    Authors: Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: UNet and its latest extensions like TransUNet have been the leading medical image segmentation methods in recent years. However, these networks cannot be effectively adopted for rapid image segmentation in point-of-care applications as they are parameter-heavy, computationally complex and slow to use. To this end, we propose UNeXt which is a Convolutional multilayer perceptron (MLP) based network… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Tech Report

  18. arXiv:2201.09355  [pdf, ps, other

    cs.CV eess.IV

    Transformer-based SAR Image Despeckling

    Authors: Malsha V. Perera, Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Synthetic Aperture Radar (SAR) images are usually degraded by a multiplicative noise known as speckle which makes processing and interpretation of SAR images difficult. In this paper, we introduce a transformer-based network for SAR image despeckling. The proposed despeckling network comprises of a transformer-based encoder which allows the network to learn global dependencies between different im… ▽ More

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: Submitted to International Geoscience and Remote Sensing Symposium (IGARSS), 2022. Our code is available at https://github.com/malshaV/sar_transformer

  19. arXiv:2111.14813  [pdf, other

    cs.CV

    TransWeather: Transformer-based Restoration of Images Degraded by Adverse Weather Conditions

    Authors: Jeya Maria Jose Valanarasu, Rajeev Yasarla, Vishal M. Patel

    Abstract: Removing adverse weather conditions like rain, fog, and snow from images is an important problem in many applications. Most methods proposed in the literature have been designed to deal with just removing one type of degradation. Recently, a CNN-based method using neural architecture search (All-in-One) was proposed to remove all the weather conditions at once. However, it has a large number of pa… ▽ More

    Submitted 17 June, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: CVPR 2022

  20. arXiv:2109.09609  [pdf, other

    cs.CV

    Fine-Context Shadow Detection using Shadow Removal

    Authors: Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Current shadow detection methods perform poorly when detecting shadow regions that are small, unclear or have blurry edges. In this work, we attempt to address this problem on two fronts. First, we propose a Fine Context-aware Shadow Detection Network (FCSD-Net), where we constraint the receptive field size and focus on low-level features to learn fine context features better. Second, we propose a… ▽ More

    Submitted 26 November, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

  21. arXiv:2109.07701  [pdf, other

    cs.CV cs.LG cs.RO

    SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving

    Authors: Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Road extraction is an essential step in building autonomous navigation systems. Detecting road segments is challenging as they are of varying widths, bifurcated throughout the image, and are often occluded by terrain, cloud, or other weather conditions. Using just convolution neural networks (ConvNets) for this problem is not effective as it is inefficient at capturing distant dependencies between… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Code available at: https://github.com/wgcban/SPIN_RoadMapper.git

    Journal ref: IEEE Conference of Robotics and Automation (ICRA) 2022

  22. arXiv:2107.09011  [pdf, other

    cs.CV

    Image Fusion Transformer

    Authors: Vibashan VS, Jeya Maria Jose Valanarasu, Poojan Oza, Vishal M. Patel

    Abstract: In image fusion, images obtained from different sensors are fused to generate a single image with enhanced information. In recent years, state-of-the-art methods have adopted Convolution Neural Networks (CNNs) to encode meaningful features for image fusion. Specifically, CNN-based methods perform image fusion by fusing local features. However, they do not consider long-range dependencies that are… ▽ More

    Submitted 4 December, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Accepted at ICIP 2022

  23. arXiv:2107.02630  [pdf, other

    cs.CV cs.LG eess.IV

    Hyperspectral Pansharpening Based on Improved Deep Image Prior and Residual Reconstruction

    Authors: Wele Gedara Chaminda Bandara, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Hyperspectral pansharpening aims to synthesize a low-resolution hyperspectral image (LR-HSI) with a registered panchromatic image (PAN) to generate an enhanced HSI with high spectral and spatial resolution. Recently proposed HS pansharpening methods have obtained remarkable results using deep convolutional networks (ConvNets), which typically consist of three steps: (1) up-sampling the LR-HSI, (2)… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  24. arXiv:2106.08886  [pdf, other

    eess.IV cs.CV

    Over-and-Under Complete Convolutional RNN for MRI Reconstruction

    Authors: Pengfei Guo, Jeya Maria Jose Valanarasu, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel

    Abstract: Reconstructing magnetic resonance (MR) images from undersampled data is a challenging problem due to various artifacts introduced by the under-sampling operation. Recent deep learning-based methods for MR image reconstruction usually leverage a generic auto-encoder architecture which captures low-level features at the initial layers and high-level features at the deeper layers. Such networks focus… ▽ More

    Submitted 24 June, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted to MICCAI 2021

  25. arXiv:2102.10662  [pdf, other

    cs.CV

    Medical Transformer: Gated Axial-Attention for Medical Image Segmentation

    Authors: Jeya Maria Jose Valanarasu, Poojan Oza, Ilker Hacihaliloglu, Vishal M. Patel

    Abstract: Over the past decade, Deep Convolutional Neural Networks have been widely adopted for medical image segmentation and shown to achieve adequate performance. However, due to the inherent inductive biases present in the convolutional architectures, they lack understanding of long-range dependencies in the image. Recently proposed Transformer-based architectures that leverage self-attention mechanism… ▽ More

    Submitted 6 July, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Accepted at MICCAI 2021

  26. arXiv:2012.04262  [pdf, other

    cs.CV cs.LG eess.IV

    Overcomplete Representations Against Adversarial Videos

    Authors: Shao-Yuan Lo, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Adversarial robustness of deep neural networks is an extensively studied problem in the literature and various methods have been proposed to defend against adversarial images. However, only a handful of defense methods have been developed for defending against attacked videos. In this paper, we propose a novel Over-and-Under complete restoration network for Defending against adversarial videos (OU… ▽ More

    Submitted 14 June, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted at IEEE International Conference on Image Processing (ICIP) 2021

  27. arXiv:2011.08306  [pdf, other

    cs.CV cs.LG

    Overcomplete Deep Subspace Clustering Networks

    Authors: Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Deep Subspace Clustering Networks (DSC) provide an efficient solution to the problem of unsupervised subspace clustering by using an undercomplete deep auto-encoder with a fully-connected layer to exploit the self expressiveness property. This method uses undercomplete representations of the input data which makes it not so robust and more dependent on pre-training. To overcome this, we propose a… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.

    Comments: WACV 2021

  28. Exploring Overcomplete Representations for Single Image Deraining using CNNs

    Authors: Rajeev Yasarla, Jeya Maria Jose Valanarasu, Vishal M. Patel

    Abstract: Removal of rain streaks from a single image is an extremely challenging problem since the rainy images often contain rain streaks of different size, shape, direction and density. Most recent methods for deraining use a deep network following a generic "encoder-decoder" architecture which captures low-level features across the initial layers and high-level features in the deeper layers. For the tas… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

    Report number: J-STSP-DLIVRC-00060-2020

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, 2020

  29. arXiv:2010.01663  [pdf, other

    eess.IV cs.CV

    KiU-Net: Overcomplete Convolutional Architectures for Biomedical Image and Volumetric Segmentation

    Authors: Jeya Maria Jose Valanarasu, Vishwanath A. Sindagi, Ilker Hacihaliloglu, Vishal M. Patel

    Abstract: Most methods for medical image segmentation use U-Net or its variants as they have been successful in most of the applications. After a detailed analysis of these "traditional" encoder-decoder based approaches, we observed that they perform poorly in detecting smaller structures and are unable to segment boundary regions precisely. This issue can be attributed to the increase in receptive field si… ▽ More

    Submitted 14 October, 2021; v1 submitted 4 October, 2020; originally announced October 2020.

    Comments: Journal Extension of KiU-Net (MICCAI-2020)