
Showing 1–18 of 18 results for author: Chakrabarty, S

Searching in archive eess.
  1. arXiv:2402.09792  [pdf]

    cs.NE cs.AI cs.ET eess.IV

    System-level Impact of Non-Ideal Program-Time of Charge Trap Flash (CTF) on Deep Neural Network

    Authors: S. Shrivastava, A. Biswas, S. Chakrabarty, G. Dash, V. Saraswat, U. Ganguly

    Abstract: Learning of deep neural networks (DNN) using Resistive Processing Unit (RPU) architecture is energy-efficient as it utilizes dedicated neuromorphic hardware and stochastic computation of weight updates for in-memory computing. Charge Trap Flash (CTF) devices can implement RPU-based weight updates in DNNs. However, prior work has shown that the weight updates (V_T) in CTF-based RPU are impacted by…

    Submitted 15 February, 2024; originally announced February 2024.

  2. arXiv:2402.03390  [pdf, other]

    eess.IV cs.AI cs.CV cs.NI

    PixelGen: Rethinking Embedded Camera Systems

    Authors: Kunjun Li, Manoj Gulati, Steven Waskito, Dhairya Shah, Shantanu Chakrabarty, Ambuj Varshney

    Abstract: Embedded camera systems are ubiquitous, representing the most widely deployed example of a wireless embedded system. They capture a representation of the world - the surroundings illuminated by visible or infrared light. Despite their widespread usage, the architecture of embedded camera systems has remained unchanged, which leads to limitations. They visualize only a tiny portion of the world. Ad…

    Submitted 4 February, 2024; originally announced February 2024.

  3. Ultra Low Complexity Deep Learning Based Noise Suppression

    Authors: Shrishti Saha Shetu, Soumitro Chakrabarty, Oliver Thiergart, Edwin Mabande

    Abstract: This paper introduces an innovative method for reducing the computational complexity of deep neural networks in real-time speech enhancement on resource-constrained devices. The proposed approach utilizes a two-stage processing framework, employing channelwise feature reorientation to reduce the computational load of convolutional operations. By combining this with a modified power law compression…

    Submitted 13 December, 2023; originally announced December 2023.

  4. arXiv:2308.01318  [pdf, other]

    eess.IV cs.CV physics.med-ph

    Framing image registration as a landmark detection problem for better representation of clinical relevance

    Authors: Diana Waldmannstetter, Benedikt Wiestler, Julian Schwarting, Ivan Ezhov, Marie Metz, Spyridon Bakas, Bhakti Baheti, Satrajit Chakrabarty, Jan S. Kirschke, Rolf A. Heckemann, Marie Piraud, Florian Kofler, Bjoern H. Menze

    Abstract: Nowadays, registration methods are typically evaluated based on sub-resolution tracking error differences. In an effort to reinfuse this evaluation process with clinical relevance, we propose to reframe image registration as a landmark detection problem. Ideally, landmark-specific detection thresholds are derived from an inter-rater analysis. To approximate this costly process, we propose to compu…

    Submitted 31 July, 2023; originally announced August 2023.

  5. arXiv:2306.00838  [pdf, other]

    q-bio.OT eess.IV

    The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI

    Authors: Ahmed W. Moawad, Anastasia Janas, Ujjwal Baid, Divya Ramakrishnan, Rachit Saluja, Nader Ashraf, Leon Jekel, Raisa Amiruddin, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Sanjay Aneja, Syed Muhammad Anwar, Timothy Bergquist, Evan Calabrese, Veronica Chiang, Verena Chung, Gian Marco Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang, et al. (206 additional authors not shown)

    Abstract: The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and chara…

    Submitted 17 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  6. arXiv:2304.01601  [pdf, other]

    eess.IV cs.CV

    Primitive Simultaneous Optimization of Similarity Metrics for Image Registration

    Authors: Diana Waldmannstetter, Benedikt Wiestler, Julian Schwarting, Ivan Ezhov, Marie Metz, Spyridon Bakas, Bhakti Baheti, Satrajit Chakrabarty, Daniel Rueckert, Jan S. Kirschke, Rolf A. Heckemann, Marie Piraud, Bjoern H. Menze, Florian Kofler

    Abstract: Even though simultaneous optimization of similarity metrics is a standard procedure in the field of semantic segmentation, surprisingly, this is much less established for image registration. To help close this gap in the literature, we investigate in a complex multi-modal 3D setting whether simultaneous optimization of registration metrics, here implemented by means of primitive summation, can b…

    Submitted 12 October, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  7. arXiv:2302.04243  [pdf, other]

    eess.SP

    Simplified markerless stride detection pipeline (sMaSDP) for surface EMG segmentation

    Authors: Rafael Castro Aguiar, Edward Jero, Samit Chakrabarty

    Abstract: People with mobility impairments are often recommended for gait assessment studies to diagnose their condition and to select appropriate physiotherapy to improve their mobility. These studies are often conducted in clinical or lab settings, where subjects are assessed in a foreign environment, which may influence their motivation, coordination and overall mobility. Alternatively, if the subject's…

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: Algorithms available upon fair request

  8. arXiv:2210.03779  [pdf, other]

    eess.IV cs.CV physics.med-ph

    MRI-based classification of IDH mutation and 1p/19q codeletion status of gliomas using a 2.5D hybrid multi-task convolutional neural network

    Authors: Satrajit Chakrabarty, Pamela LaMontagne, Joshua Shimony, Daniel S. Marcus, Aristeidis Sotiras

    Abstract: Isocitrate dehydrogenase (IDH) mutation and 1p/19q codeletion status are important prognostic markers for glioma. Currently, they are determined using invasive procedures. Our goal was to develop artificial intelligence-based methods to non-invasively determine these molecular alterations from MRI. For this purpose, pre-operative MRI scans of 2648 patients with gliomas (grade II-IV) were collected…

    Submitted 7 October, 2022; originally announced October 2022.

  9. arXiv:2210.03151  [pdf, other]

    eess.IV cs.CV cs.LG

    Integrative Imaging Informatics for Cancer Research: Workflow Automation for Neuro-oncology (I3CR-WANO)

    Authors: Satrajit Chakrabarty, Syed Amaan Abidi, Mina Mousa, Mahati Mokkarala, Isabelle Hren, Divya Yadav, Matthew Kelsey, Pamela LaMontagne, John Wood, Michael Adams, Yuzhuo Su, Sherry Thorpe, Caroline Chung, Aristeidis Sotiras, Daniel S. Marcus

    Abstract: Efforts to utilize growing volumes of clinical imaging data to generate tumor evaluations continue to require significant manual data wrangling owing to the data heterogeneity. Here, we propose an artificial intelligence-based solution for the aggregation and processing of multisequence neuro-oncology MRI data to extract quantitative tumor measurements. Our end-to-end framework i) classifies MRI s…

    Submitted 6 October, 2022; originally announced October 2022.

  10. arXiv:2202.00733  [pdf, other]

    eess.AS cs.SD

    New Insights on Target Speaker Extraction

    Authors: Mohamed Elminshawi, Wolfgang Mack, Srikanth Raj Chetupalli, Soumitro Chakrabarty, Emanuël A. P. Habets

    Abstract: Speaker extraction (SE) aims to segregate the speech of a target speaker from a mixture of interfering speakers with the help of auxiliary information. Several forms of auxiliary information have been employed in single-channel SE, such as a speech snippet enrolled from the target speaker or visual information corresponding to the spoken utterance. The effectiveness of the auxiliary information in…

    Submitted 15 September, 2023; v1 submitted 1 February, 2022; originally announced February 2022.

  11. arXiv:2112.06979  [pdf, other]

    eess.IV cs.CV

    The Brain Tumor Sequence Registration (BraTS-Reg) Challenge: Establishing Correspondence Between Pre-Operative and Follow-up MRI Scans of Diffuse Glioma Patients

    Authors: Bhakti Baheti, Satrajit Chakrabarty, Hamed Akbari, Michel Bilello, Benedikt Wiestler, Julian Schwarting, Evan Calabrese, Jeffrey Rudie, Syed Abidi, Mina Mousa, Javier Villanueva-Meyer, Brandon K. K. Fields, Florian Kofler, Russell Takeshi Shinohara, Juan Eugenio Iglesias, Tony C. W. Mok, Albert C. S. Chung, Marek Wodzinski, Artur Jurgas, Niccolo Marini, Manfredo Atzori, Henning Muller, Christoph Grobroehmer, Hanna Siebert, Lasse Hansen, et al. (48 additional authors not shown)

    Abstract: Registration of longitudinal brain MRI scans containing pathologies is challenging due to dramatic changes in tissue appearance. Although there has been progress in developing general-purpose medical image registration techniques, they have not yet attained the requisite precision and reliability for this task, highlighting its inherent complexity. Here we describe the Brain Tumor Sequence Registr…

    Submitted 17 April, 2024; v1 submitted 13 December, 2021; originally announced December 2021.

  12. arXiv:2011.04359  [pdf, ps, other]

    eess.AS cs.CV cs.LG cs.SD eess.IV

    An Empirical Study of Visual Features for DNN based Audio-Visual Speech Enhancement in Multi-talker Environments

    Authors: Shrishti Saha Shetu, Soumitro Chakrabarty, Emanuël A. P. Habets

    Abstract: Audio-visual speech enhancement (AVSE) methods use both audio and visual features for the task of speech enhancement and the use of visual features has been shown to be particularly effective in multi-speaker scenarios. In the majority of deep neural network (DNN) based AVSE methods, the audio and visual data are first processed separately using different sub-networks, and then the learned feature…

    Submitted 9 November, 2020; originally announced November 2020.

  13. arXiv:2003.07529  [pdf, other]

    eess.IV cs.CV

    Cytology Image Analysis Techniques Towards Automation: Systematically Revisited

    Authors: Shyamali Mitra, Nibaran Das, Soumyajyoti Dey, Sukanta Chakrabarty, Mita Nasipuri, Mrinal Kanti Naskar

    Abstract: Cytology is the branch of pathology which deals with the microscopic examination of cells for diagnosis of carcinoma or inflammatory conditions. Automation in cytology started in the early 1950s with the aim to reduce manual efforts in diagnosis of cancer. The influx of intelligent technological units with high computational power and improved specimen collection techniques helped to achieve its…

    Submitted 17 March, 2020; originally announced March 2020.

  14. arXiv:2002.07476  [pdf, other]

    eess.SY

    Model based fractional order controller design for process plants satisfying desired robustness criteria

    Authors: Pushkar Prakash Arya, Sohom Chakrabarty

    Abstract: This paper contributes to the design of a fractional order (FO) internal model controller (IMC) for a first order plus time delay (FOPTD) process model to satisfy a given set of desired robustness specifications in terms of gain margin (Am) and phase margin (Pm). The highlight of the design is the choice of a fractional order (FO) filter in the IMC structure which has two parameters (lambda and be…

    Submitted 18 February, 2020; originally announced February 2020.

    Comments: 16 pages, 6 figures

    Report number: SC-PPA-FOIMC-01

    MSC Class: 93C05; 93D09; 93B51; 93C83; 93B52

  15. Multi-scale aggregation of phase information for reducing computational cost of CNN based DOA estimation

    Authors: Soumitro Chakrabarty, Emanuël A. P. Habets

    Abstract: In a recent work on direction-of-arrival (DOA) estimation of multiple speakers with convolutional neural networks (CNNs), the phase component of short-time Fourier transform (STFT) coefficients of the microphone signal is given as input and small filters are used to learn the phase relations between neighboring microphones. Due to this chosen filter size, $M-1$ convolution layers are required to a…

    Submitted 20 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: text overlap with arXiv:1807.11722

  16. arXiv:1807.11722  [pdf, ps, other]

    eess.AS cs.LG cs.SD

    Multi-Speaker DOA Estimation Using Deep Convolutional Networks Trained with Noise Signals

    Authors: Soumitro Chakrabarty, Emanuël A. P. Habets

    Abstract: Supervised learning based methods for source localization, being data driven, can be adapted to different acoustic conditions via training and have been shown to be robust to adverse acoustic environments. In this paper, a convolutional neural network (CNN) based supervised learning method for estimating the direction-of-arrival (DOA) of multiple speakers is proposed. Multi-speaker DOA estimation…

    Submitted 31 July, 2018; originally announced July 2018.

  17. Classification vs. Regression in Supervised Learning for Single Channel Speaker Count Estimation

    Authors: Fabian-Robert Stöter, Soumitro Chakrabarty, Bernd Edler, Emanuël A. P. Habets

    Abstract: The task of estimating the maximum number of concurrent speakers from single channel mixtures is important for various audio-based applications, such as blind source separation, speaker diarisation, audio surveillance or auditory scene classification. Building upon powerful machine learning methodology, we develop a Deep Neural Network (DNN) that estimates a speaker count. While DNNs efficiently m…

    Submitted 15 February, 2018; v1 submitted 12 December, 2017; originally announced December 2017.

    Comments: Accepted in ICASSP 2018

  18. arXiv:1712.04276  [pdf, other]

    cs.SD eess.AS stat.ML

    Multi-Speaker Localization Using Convolutional Neural Network Trained with Noise

    Authors: Soumitro Chakrabarty, Emanuël A. P. Habets

    Abstract: The problem of multi-speaker localization is formulated as a multi-class multi-label classification problem, which is solved using a convolutional neural network (CNN) based source localization method. Utilizing the common assumption of disjoint speaker activities, we propose a novel method to train the CNN using synthesized noise signals. The proposed localization method is evaluated for two spea…

    Submitted 12 December, 2017; originally announced December 2017.

    Comments: Presented at Machine Learning for Audio Processing (ML4Audio) Workshop at NIPS 2017