
Showing 1–50 of 57 results for author: Joseph, J

Searching in archive cs.
  1. arXiv:2406.04673  [pdf, other]

    cs.CV cs.AI cs.MM eess.AS

    MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models

    Authors: Sanjoy Chowdhury, Sayan Nag, K J Joseph, Balaji Vasan Srinivasan, Dinesh Manocha

    Abstract: Music is a universal language that can communicate emotions and feelings. It forms an essential part of the whole spectrum of creative media, ranging from movies to social media posts. Machine learning models that can synthesize music are predominantly conditioned on textual descriptions of it. Inspired by how musicians compose music not just from a movie script, but also through visualizations, w…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024 as Highlight paper. Webpage: https://schowdhury671.github.io/melfusion_cvpr2024/

  2. arXiv:2405.11511  [pdf, other]

    cs.CV

    Online Action Representation using Change Detection and Symbolic Programming

    Authors: Vishnu S Nair, Sneha Sree, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: This paper addresses the critical need for online action representation, which is essential for various applications like rehabilitation, surveillance, etc. The task can be defined as representation of actions as soon as they happen in a streaming video without access to video frames in the future. Most of the existing methods use predefined window sizes for video segments, which is a restrictive…

    Submitted 19 May, 2024; originally announced May 2024.

  3. arXiv:2403.13655  [pdf, other]

    cs.ET

    A Fully Automated Platform for Evaluating ReRAM Crossbars

    Authors: Rebecca Pelke, Felix Staudigl, Niklas Thomas, Nils Bosbach, Mohammed Hossein, Jose Cubero-Cascante, Leticia Bolzani Poehls, Rainer Leupers, Jan Moritz Joseph

    Abstract: Resistive Random Access Memory (ReRAM) is a promising candidate for implementing Computing-in-Memory (CIM) architectures and neuromorphic circuits. ReRAM cells exhibit significant variability across different memristive devices and cycles, necessitating further improvements in the areas of devices, algorithms, and applications. To achieve this, understanding the stochastic behavior of the differen…

    Submitted 20 March, 2024; originally announced March 2024.

  4. arXiv:2402.06185  [pdf, other]

    cs.CV cs.AI cs.LG

    Development and validation of an artificial intelligence model to accurately predict spinopelvic parameters

    Authors: Edward S. Harake, Joseph R. Linzey, Cheng Jiang, Rushikesh S. Joshi, Mark M. Zaki, Jaes C. Jones, Siri S. Khalsa, John H. Lee, Zachary Wilseck, Jacob R. Joseph, Todd C. Hollon, Paul Park

    Abstract: Objective. Achieving appropriate spinopelvic alignment has been shown to be associated with improved clinical symptoms. However, measurement of spinopelvic radiographic parameters is time-intensive and interobserver reliability is a concern. Automated measurement tools have the promise of rapid and consistent measurements, but existing tools are still limited by some degree of manual user-entry re…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 10 pages, 5 figures, to appear in Journal of Neurosurgery: Spine

  5. arXiv:2401.07671  [pdf, other]

    cs.AR cs.ET cs.LG

    CLSA-CIM: A Cross-Layer Scheduling Approach for Computing-in-Memory Architectures

    Authors: Rebecca Pelke, Jose Cubero-Cascante, Nils Bosbach, Felix Staudigl, Rainer Leupers, Jan Moritz Joseph

    Abstract: The demand for efficient machine learning (ML) accelerators is growing rapidly, driving the development of novel computing concepts such as resistive random access memory (RRAM)-based tiled computing-in-memory (CIM) architectures. CIM allows computation within the memory unit, resulting in faster data processing and reduced power consumption. Efficient compiler algorithms are essential to exploit t…

    Submitted 17 January, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  6. arXiv:2309.03805  [pdf, other]

    cs.AR

    Mapping of CNNs on multi-core RRAM-based CIM architectures

    Authors: Rebecca Pelke, Nils Bosbach, Jose Cubero, Felix Staudigl, Rainer Leupers, Jan Moritz Joseph

    Abstract: RRAM-based multi-core systems improve the energy efficiency and performance of CNNs. Thereby, the distributed parallel execution of convolutional layers causes critical data dependencies that limit the potential speedup. This paper presents synchronization techniques for parallel inference of convolutional layers on RRAM-based CIM architectures. We propose an architecture optimization that enables…

    Submitted 26 October, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

  7. arXiv:2309.00613  [pdf, other]

    cs.CV cs.AI cs.LG

    Iterative Multi-granular Image Editing using Diffusion Models

    Authors: K J Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan

    Abstract: Recent advances in text-guided image synthesis have dramatically changed how creative professionals generate artistic and aesthetically pleasing visual assets. To fully support such creative endeavors, the process should possess the ability to: 1) iteratively edit the generations and 2) control the spatial reach of desired changes (global, local or anything in between). We formalize this pragmatic…

    Submitted 28 October, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  8. parti-gem5: gem5's Timing Mode Parallelised

    Authors: José Cubero-Cascante, Niko Zurstraßen, Jörn Nöller, Rainer Leupers, Jan Moritz Joseph

    Abstract: Detailed timing models are indispensable tools for the design space exploration of Multiprocessor Systems on Chip (MPSoCs). As core counts continue to increase, the complexity in memory hierarchies and interconnect topologies is also growing, making accurate predictions of design decisions more challenging than ever. In this context, the open-source Full System Simulator (FSS) gem5 is a popular ch…

    Submitted 13 May, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: Pre-print of work presented at SAMOS Conference XXIII

    ACM Class: I.6.0

  9. arXiv:2308.02400  [pdf, other]

    cs.AR cs.CY

    Work-in-Progress: A Universal Instrumentation Platform for Non-Volatile Memories

    Authors: Felix Staudigl, Mohammed Hossein, Tobias Ziegler, Hazem Al Indari, Rebecca Pelke, Sebastian Siegel, Dirk J. Wouters, Dominik Sisejkovic, Jan Moritz Joseph, Rainer Leupers

    Abstract: Emerging non-volatile memories (NVMs) represent a disruptive technology that allows a paradigm shift from the conventional von Neumann architecture towards more efficient computing-in-memory (CIM) architectures. Several instrumentation platforms have been proposed to interface NVMs allowing the characterization of single cells and crossbar structures. However, these platforms suffer from low flexi…

    Submitted 3 August, 2023; originally announced August 2023.

  10. arXiv:2307.00910  [pdf, other]

    cs.CV cs.AI

    CoPL: Contextual Prompt Learning for Vision-Language Understanding

    Authors: Koustava Goswami, Srikrishna Karanam, Prateksha Udhayanan, K J Joseph, Balaji Vasan Srinivasan

    Abstract: Recent advances in multimodal learning have resulted in powerful vision-language models, whose representations are generalizable across a variety of downstream tasks. Recently, their generalization ability has been further extended by incorporating trainable prompts, borrowed from the natural language processing literature. While such prompt learning techniques have shown impressive results, we ide…

    Submitted 12 December, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted at AAAI 2024

  11. arXiv:2306.14544  [pdf, other]

    cs.CV

    A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis

    Authors: Aishwarya Agarwal, Srikrishna Karanam, K J Joseph, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan

    Abstract: While recent developments in text-to-image generative models have led to a suite of high-performing methods capable of producing creative imagery from free-form text, there are several limitations. By analyzing the cross-attention representations of these models, we notice two key issues. First, for text prompts that contain multiple concepts, there is a significant amount of pixel-space overlap (…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 15 pages, 16 figures

  12. arXiv:2305.19956  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    MicroSegNet: A Deep Learning Approach for Prostate Segmentation on Micro-Ultrasound Images

    Authors: Hongxu Jiang, Muhammad Imran, Preethika Muralidharan, Anjali Patel, Jake Pensa, Muxuan Liang, Tarik Benidir, Joseph R. Grajo, Jason P. Joseph, Russell Terry, John Michael DiBianco, Li-Ming Su, Yuyin Zhou, Wayne G. Brisbane, Wei Shao

    Abstract: Micro-ultrasound (micro-US) is a novel 29-MHz ultrasound technique that provides 3-4 times higher resolution than traditional ultrasound, potentially enabling low-cost, accurate diagnosis of prostate cancer. Accurate prostate segmentation is crucial for prostate volume measurement, cancer diagnosis, prostate biopsy, and treatment planning. However, prostate segmentation on micro-US is challenging…

    Submitted 25 January, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Journal ref: Computerized Medical Imaging and Graphics (2024): 102326

  13. arXiv:2303.14772  [pdf, other]

    cs.CV

    $Δ$-Patching: A Framework for Rapid Adaptation of Pre-trained Convolutional Networks without Base Performance Loss

    Authors: Chaitanya Devaguptapu, Samarth Sinha, K J Joseph, Vineeth N Balasubramanian, Animesh Garg

    Abstract: Models pre-trained on large-scale datasets are often fine-tuned to support newer tasks and datasets that arrive over time. This process necessitates storing copies of the model over time for each task that the pre-trained model is fine-tuned to. Building on top of recent model patching work, we propose $Δ$-Patching for fine-tuning neural network models in an efficient manner, without the need to s…

    Submitted 21 September, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  14. arXiv:2302.07655  [pdf, other]

    cs.ET

    Fault Injection in Native Logic-in-Memory Computation on Neuromorphic Hardware

    Authors: Felix Staudigl, Thorben Fetz, Rebecca Pelke, Dominik Sisejkovic, Jan Moritz Joseph, Leticia Bolzani Pöhls, Rainer Leupers

    Abstract: Logic-in-memory (LIM) describes the execution of logic gates within memristive crossbar structures, promising to improve performance and energy efficiency. Utilizing only binary values, LIM particularly excels in accelerating binary neural networks, shifting it into the focus of edge applications. Considering its potential, the impact of faults on BNNs accelerated with LIM still lacks investigation.…

    Submitted 15 February, 2023; originally announced February 2023.

  15. Component Segmentation of Engineering Drawings Using Graph Convolutional Networks

    Authors: Wentai Zhang, Joe Joseph, Yue Yin, Liuyue Xie, Tomotake Furuhata, Soji Yamakawa, Kenji Shimada, Levent Burak Kara

    Abstract: We present a data-driven framework to automate the vectorization and machine interpretation of 2D engineering part drawings. In industrial settings, most manufacturing engineers still rely on manual reads to identify the topological and manufacturing requirements from drawings submitted by designers. The interpretation process is laborious and time-consuming, which severely inhibits the efficiency…

    Submitted 14 March, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Preprint accepted to Computers in Industry

  16. arXiv:2208.03767  [pdf, other]

    cs.CV cs.AI cs.LG

    Class-Incremental Learning with Cross-Space Clustering and Controlled Transfer

    Authors: Arjun Ashok, K J Joseph, Vineeth Balasubramanian

    Abstract: In class-incremental learning, the model is expected to learn new classes continually while maintaining knowledge on previous classes. The challenge here lies in preserving the model's ability to effectively represent prior classes in the feature space, while adapting it to represent incoming new classes. We propose two distillation-based objectives for class incremental learning that leverage the…

    Submitted 16 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: Accepted at ECCV 2022; Project Page at http://cscct.github.io/

  17. arXiv:2208.00777  [pdf, other]

    cs.CV cs.LG

    D3Former: Debiased Dual Distilled Transformer for Incremental Learning

    Authors: Abdelrahman Mohamed, Rushali Grandhe, K J Joseph, Salman Khan, Fahad Khan

    Abstract: In the class incremental learning (CIL) setting, groups of classes are introduced to a model in each learning phase. The goal is to learn a unified model performant on all the classes observed so far. Given the recent popularity of Vision Transformers (ViTs) in conventional classification settings, an interesting question is to study their continual learning behaviour. In this work, we develop a Debia…

    Submitted 3 June, 2023; v1 submitted 25 July, 2022; originally announced August 2022.

    Comments: Accepted to CLVision at CVPR 2023

  18. arXiv:2207.11886  [pdf, other]

    eess.IV cs.CV

    Deep learning based non-contact physiological monitoring in Neonatal Intensive Care Unit

    Authors: Nicky Nirlipta Sahoo, Balamurali Murugesan, Ayantika Das, Srinivasa Karthik, Keerthi Ram, Steffen Leonhardt, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: Preterm babies in the Neonatal Intensive Care Unit (NICU) have to undergo continuous monitoring of their cardiac health. Conventional monitoring approaches are contact-based, making the neonates prone to various nosocomial infections. Video-based monitoring approaches have opened up potential avenues for contactless measurement. This work presents a pipeline for remote estimation of cardiopulmonar…

    Submitted 24 July, 2022; originally announced July 2022.

  19. arXiv:2207.11036  [pdf, other]

    cs.AR

    NISTT: A Non-Intrusive SystemC-TLM 2.0 Tracing Tool

    Authors: Nils Bosbach, Lukas Jünger, Jan Moritz Joseph, Rainer Leupers

    Abstract: The increasing complexity of systems-on-a-chip requires the continuous development of electronic design automation tools. Nowadays, the simulation of systems-on-a-chip using virtual platforms is common. Virtual platforms enable hardware/software co-design to shorten the time to market, offer insights into the models, and allow debugging of the simulated hardware. Profiling tools are required to im…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: PREPRINT - accepted by 30th IFIP/IEEE International Conference on Very Large Scale Integration 2022 (VLSI-SoC 2022)

  20. arXiv:2207.10659  [pdf, other]

    cs.CV cs.AI cs.LG

    Novel Class Discovery without Forgetting

    Authors: K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian

    Abstract: Humans possess an innate ability to identify and differentiate instances that they are not familiar with, by leveraging and adapting the knowledge that they have acquired so far. Importantly, they achieve this without deteriorating the performance on their earlier learning. Inspired by this, we identify and formulate a new, pragmatic problem setting of NCDwF: Novel Class Discovery without Forgetti…

    Submitted 21 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  21. arXiv:2206.11613  [pdf, other]

    cs.AR

    EmuNoC: Hybrid Emulation for Fast and Flexible Network-on-Chip Prototyping on FPGAs

    Authors: Yee Yang Tan, Felix Staudigl, Lukas Jünger, Anna Drewes, Rainer Leupers, Jan Moritz Joseph

    Abstract: Networks-on-Chips (NoCs) recently became widely used, from multi-core CPUs to edge-AI accelerators. Emulation on FPGAs promises to accelerate their RTL modeling compared to slow simulations. However, realistic test stimuli are challenging to generate in hardware for diverse applications. In other words, both a fast and flexible design framework is required. The most promising solution is hybrid em…

    Submitted 23 June, 2022; originally announced June 2022.

  22. arXiv:2206.10183  [pdf]

    eess.IV cs.CV

    covEcho Resource constrained lung ultrasound image analysis tool for faster triaging and active learning

    Authors: Jinu Joseph, Mahesh Raveendranatha Panicker, Yale Tung Chen, Kesavadas Chandrasekharan, Vimal Chacko Mondy, Anoop Ayyappan, Jineesh Valakkada, Kiran Vishnu Narayan

    Abstract: Lung ultrasound (LUS) is possibly the only medical imaging modality which could be used for continuous and periodic monitoring of the lung. This is extremely useful in tracking the lung manifestations either during the onset of lung infection or to track the effect of vaccination on lung as in pandemics such as COVID-19. There have been many attempts in automating the classification of severity of…

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: Submitted to Elsevier CMPBUP on Dec 1, 2021

  23. arXiv:2204.10595  [pdf, other]

    cs.CV cs.AI cs.LG

    Spacing Loss for Discovering Novel Categories

    Authors: K J Joseph, Sujoy Paul, Gaurav Aggarwal, Soma Biswas, Piyush Rai, Kai Han, Vineeth N Balasubramanian

    Abstract: Novel Class Discovery (NCD) is a learning paradigm, where a machine learning model is tasked to semantically group instances from unlabeled data, by utilizing labeled instances from a disjoint set of classes. In this work, we first characterize existing NCD approaches into single-stage and two-stage methods based on whether they require access to labeled and unlabeled data together while discoveri…

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted to Continual Learning in Computer Vision Workshop (CLVision) at CVPR 2022

  24. arXiv:2204.01501  [pdf, other]

    cs.ET

    X-Fault: Impact of Faults on Binary Neural Networks in Memristor-Crossbar Arrays with Logic-in-Memory Computation

    Authors: Felix Staudigl, Karl J. X. Sturm, Maximilian Bartel, Thorben Fetz, Dominik Sisejkovic, Jan Moritz Joseph, Leticia Bolzani Pöhls, Rainer Leupers

    Abstract: Memristor-based crossbar arrays represent a promising emerging memory technology to replace conventional memories by offering a high density and enabling computing-in-memory (CIM) paradigms. While analog computing provides the best performance, non-idealities and ADC/DAC conversion limit memristor-based CIM. Logic-in-Memory (LIM) presents another flavor of CIM, in which the memristors are used in…

    Submitted 4 April, 2022; originally announced April 2022.

  25. arXiv:2203.14952  [pdf, other]

    cs.CV cs.AI cs.LG

    Energy-based Latent Aligner for Incremental Learning

    Authors: K J Joseph, Salman Khan, Fahad Shahbaz Khan, Rao Muhammad Anwer, Vineeth N Balasubramanian

    Abstract: Deep learning models tend to forget their earlier knowledge while incrementally learning new tasks. This behavior emerges because the parameter updates optimized for the new tasks may not align well with the updates suitable for older tasks. The resulting latent representation mismatch causes forgetting. In this work, we propose ELI: Energy-based Latent Aligner for Incremental Learning, which firs…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear in CVPR 2022. Code is available in https://github.com/JosephKJ/ELI

  26. arXiv:2201.03954  [pdf, other]

    cs.LG cs.AI

    The Dataset Nutrition Label (2nd Gen): Leveraging Context to Mitigate Harms in Artificial Intelligence

    Authors: Kasia S. Chmielinski, Sarah Newman, Matt Taylor, Josh Joseph, Kemi Thomas, Jessica Yurkofsky, Yue Chelsea Qiu

    Abstract: As the production of and reliance on datasets to produce automated decision-making systems (ADS) increases, so does the need for processes for evaluating and interrogating the underlying data. After launching the Dataset Nutrition Label in 2018, the Data Nutrition Project has made significant updates to the design and purpose of the Label, and is launching an updated Label in late 2020, which is p…

    Submitted 10 March, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

  27. arXiv:2112.01513  [pdf, other]

    cs.CV

    OW-DETR: Open-world Detection Transformer

    Authors: Akshita Gupta, Sanath Narayan, K J Joseph, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

    Abstract: Open-world object detection (OWOD) is a challenging computer vision problem, where the task is to detect a known set of object categories while simultaneously identifying unknown objects. Additionally, the model must incrementally learn new classes that become known in the next training episodes. Distinct from standard object detection, the OWOD setting poses significant challenges for generating…

    Submitted 4 April, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: 16 pages, CVPR 2022 accepted

  28. NeuroHammer: Inducing Bit-Flips in Memristive Crossbar Memories

    Authors: Felix Staudigl, Hazem Al Indari, Daniel Schön, Dominik Sisejkovic, Farhad Merchant, Jan Moritz Joseph, Vikas Rana, Stephan Menzel, Rainer Leupers

    Abstract: Emerging non-volatile memory (NVM) technologies offer unique advantages in energy efficiency, latency, and features such as computing-in-memory. Consequently, emerging NVM technologies are considered an ideal substrate for computation and storage in future-generation neuromorphic platforms. These technologies need to be evaluated for fundamental reliability and security issues. In this paper, we p…

    Submitted 6 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

  29. arXiv:2108.08295  [pdf, other]

    cs.LG cs.AI cs.AR

    AIRCHITECT: Learning Custom Architecture Design and Mapping Space

    Authors: Ananda Samajdar, Jan Moritz Joseph, Matthew Denton, Tushar Krishna

    Abstract: Design space exploration is an important but costly step involved in the design/deployment of custom architectures to squeeze out maximum possible performance and energy efficiency. Conventionally, optimizations require iterative sampling of the design space using simulation or heuristic tools. In this paper we investigate the possibility of learning the optimization task using machine learning an…

    Submitted 16 August, 2021; originally announced August 2021.

  30. arXiv:2103.02603  [pdf, other]

    cs.CV cs.AI cs.LG

    Towards Open World Object Detection

    Authors: K J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian

    Abstract: Humans have a natural instinct to identify unknown object instances in their environments. The intrinsic curiosity about these unknown instances aids in learning about them, when the corresponding knowledge is eventually available. This motivates us to propose a novel computer vision problem called: `Open World Object Detection', where a model is tasked to: 1) identify objects that have not been i…

    Submitted 9 May, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: To appear in CVPR 2021 as an ORAL paper. Code is available in https://github.com/JosephKJ/OWOD

  31. arXiv:2102.05824  [pdf, other]

    cs.LG cs.AI

    Reproducibility Report: La-MAML: Look-ahead Meta Learning for Continual Learning

    Authors: Joel Joseph, Alex Gu

    Abstract: The Continual Learning (CL) problem involves performing well on a sequence of tasks under limited compute. Current algorithms in the domain are either slow, offline or sensitive to hyper-parameters. La-MAML, an optimization-based meta-learning algorithm claims to be better than other replay-based, prior-based and meta-learning based approaches. According to the MER paper [1], metrics to measure pe…

    Submitted 20 May, 2021; v1 submitted 10 February, 2021; originally announced February 2021.

  32. arXiv:2012.12563  [pdf, other]

    cs.AR

    Architecture, Dataflow and Physical Design Implications of 3D-ICs for DNN-Accelerators

    Authors: Jan Moritz Joseph, Ananda Samajdar, Lingjun Zhu, Rainer Leupers, Sung-Kyu Lim, Thilo Pionteck, Tushar Krishna

    Abstract: The everlasting demand for higher computing power for deep neural networks (DNNs) drives the development of parallel computing architectures. 3D integration, in which chips are integrated and connected vertically, can further increase performance because it introduces another level of spatial parallelism. Therefore, we analyze dataflows, performance, area, power and temperature of such 3D-DNN-acce…

    Submitted 18 February, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

  33. arXiv:2010.00352  [pdf, other]

    cs.CV cs.AI

    Meta-Consolidation for Continual Learning

    Authors: K J Joseph, Vineeth N Balasubramanian

    Abstract: The ability to continuously learn and adapt itself to new tasks, without losing grasp of already acquired knowledge is a hallmark of biological learning systems, which current deep learning systems fall short of. In this work, we present a novel methodology for continual learning called MERLIN: Meta-Consolidation for Continual Learning. We assume that weights of a neural network $\boldsymbol ψ$,…

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: Accepted to NeurIPS 2020

  34. arXiv:2009.06420  [pdf, other]

    cs.CV

    Completely Self-Supervised Crowd Counting via Distribution Matching

    Authors: Deepak Babu Sam, Abhinav Agarwalla, Jimmy Joseph, Vishwanath A. Sindagi, R. Venkatesh Babu, Vishal M. Patel

    Abstract: Dense crowd counting is a challenging task that demands millions of head annotations for training models. Though existing self-supervised approaches could learn good representations, they require some labeled data to map these features to the end task of density estimation. We mitigate this issue with the proposed paradigm of complete self-supervision, which does not need even a single labeled ima…

    Submitted 14 September, 2020; originally announced September 2020.

  35. arXiv:2006.09437  [pdf, other]

    cs.LG stat.ML

    A Study of Compositional Generalization in Neural Models

    Authors: Tim Klinger, Dhaval Adjodah, Vincent Marois, Josh Joseph, Matthew Riemer, Alex 'Sandy' Pentland, Murray Campbell

    Abstract: Compositional and relational learning is a hallmark of human intelligence, but one which presents challenges for neural models. One difficulty in the development of such models is the lack of benchmarks with clear compositional and relational task structure on which to systematically evaluate them. In this paper, we introduce an environment called ConceptWorld, which enables the generation of imag…

    Submitted 8 July, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 28 pages

  36. arXiv:2004.08103  [pdf, other]

    eess.SP cs.LG

    RPnet: A Deep Learning approach for robust R Peak detection in noisy ECG

    Authors: Sricharan Vijayarangan, Vignesh R, Balamurali Murugesan, Preejith SP, Jayaraj Joseph, Mohansankar Sivaprakasam

    Abstract: Automatic detection of R-peaks in an Electrocardiogram signal is crucial in a multitude of applications including Heart Rate Variability (HRV) analysis and Cardio Vascular Disease (CVD) diagnosis. Although there have been numerous approaches that have successfully addressed the problem, there has been a notable dip in the performance of these existing detectors on ECG episodes that contain noise an…

    Submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted in EMBC 2020

  37. arXiv:2004.06301  [pdf, other]

    eess.SP cs.LG

    Robust Modelling of Reflectance Pulse Oximetry for SpO$_2$ Estimation

    Authors: Sricharan Vijayarangan, Prithvi Suresh, Preejith SP, Jayaraj Joseph, Mohansankar Sivaprakasam

    Abstract: Continuous monitoring of blood oxygen saturation levels is vital for patients with pulmonary disorders. Traditionally, SpO$_2$ monitoring has been carried out using transmittance pulse oximeters due to its dependability. However, SpO$_2$ measurement from transmittance pulse oximeters is limited to peripheral regions. This becomes a disadvantage at very low temperatures as blood perfusion to the pe…

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: Accepted in EMBC 2020 (EMBS)

  38. arXiv:2004.05399  [pdf, other]

    eess.SP cs.LG

    Interpreting Deep Neural Networks for Single-Lead ECG Arrhythmia Classification

    Authors: Sricharan Vijayarangan, Balamurali Murugesan, Vignesh R, Preejith SP, Jayaraj Joseph, Mohansankar Sivaprakasam

    Abstract: Cardiac arrhythmia is a prevalent and significant cause of morbidity and mortality among cardiac ailments. Early diagnosis is crucial in providing intervention for patients suffering from cardiac arrhythmia. Traditionally, diagnosis is performed by examination of the Electrocardiogram (ECG) by a cardiologist. This method of diagnosis is hampered by the lack of accessibility to expert cardiologists…

    Submitted 11 April, 2020; originally announced April 2020.

    Comments: Accepted in EMBC 2020 (EMBS)

  39. arXiv:2003.08798  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Incremental Object Detection via Meta-Learning

    Authors: K J Joseph, Jathushan Rajasegaran, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian

    Abstract: In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few efforts have been reported to address this limitation, all of which apply variants of knowledge distillation to avoid catastrophic forgetting. We note that alt…

    Submitted 15 December, 2021; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: Published in IEEE Transactions on Pattern Analysis & Machine Intelligence, Nov 2021. Code is available in https://github.com/JosephKJ/iOD

    Journal ref: TPAMI, Nov 2021

  40. arXiv:1912.05670  [pdf, other]

    cs.AR

    Ratatoskr: An open-source framework for in-depth power, performance and area analysis in 3D NoCs

    Authors: Jan Moritz Joseph, Lennart Bamberg, Imad Hajjar, Anna Drewes, Behnam Razi Perjikolaei, Alberto García-Ortiz, Thilo Pionteck

    Abstract: We introduce ratatoskr, an open-source framework for in-depth power, performance and area (PPA) analysis in NoCs for 3D-integrated and heterogeneous System-on-Chips (SoCs). It covers all layers of abstraction by providing a NoC hardware implementation on RT level, a NoC simulator on cycle-accurate level and an application model on transaction level. By this comprehensive approach, ratatoskr can pr…

    Submitted 14 January, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  41. arXiv:1909.13807  [pdf, other]

    cs.AR

    System-level optimization of Network-on-Chips for heterogeneous 3D System-on-Chips

    Authors: Jan Moritz Joseph, Dominik Ermel, Lennart Bamberg, Alberto García-Ortiz, Thilo Pionteck

    Abstract: For a system-level design of Networks-on-Chip for 3D heterogeneous System-on-Chip (SoC), the locations of components, routers and vertical links are determined from an application model and technology parameters. In conventional methods, the two inputs are accounted for separately; here, we define an integrated problem that considers both application model and technology parameters. We show that t…

    Submitted 3 October, 2019; v1 submitted 30 September, 2019; originally announced September 2019.

  42. arXiv:1909.04554  [pdf, other]

    cs.AR

    NoCs in Heterogeneous 3D SoCs: Co-Design of Routing Strategies and Microarchitectures

    Authors: Jan Moritz Joseph, Lennart Bamberg, Dominik Ermel, Behnam Razi Perjikolaei, Anna Drewes, Alberto García-Ortiz, Thilo Pionteck

    Abstract: Heterogeneous 3D System-on-Chips (3D SoCs) are the most promising design paradigm to combine sensing and computing within a single chip. A special characteristic of communication networks in heterogeneous 3D SoCs is the varying latency and throughput in each layer. As shown in this work, this variance drastically degrades the network performance. We contribute a co-design of routing algorithms and…

    Submitted 10 September, 2019; originally announced September 2019.

  43. arXiv:1908.05311  [pdf, other]

    cs.CV

    Conv-MCD: A Plug-and-Play Multi-task Module for Medical Image Segmentation

    Authors: Balamurali Murugesan, Kaushik Sarveswaran, Sharath M Shankaranarayana, Keerthi Ram, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: For the task of medical image segmentation, fully convolutional network (FCN) based architectures have been extensively used with various modifications. A rising trend in these architectures is to employ joint-learning of the target region with an auxiliary task, a method commonly known as multi-task learning. These approaches help impose smoothness and shape priors, which vanilla FCN approaches d…

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: Accepted in MLMI 2019

  44. arXiv:1907.03202  [pdf]

    cs.CL cs.NE

    Evolutionary Algorithm for Sinhala to English Translation

    Authors: J. K. Joseph, W. M. T. Chathurika, A. Nugaliyadde, Y. Mallawarachchi

    Abstract: Machine Translation (MT) is an area of natural language processing that focuses on translating from one language to another. Many approaches, ranging from statistical methods to deep learning, are used to achieve MT. However, these methods require either a large amount of data or a clear understanding of the language. The Sinhala language has less digital text which could be used…

    Submitted 6 July, 2019; originally announced July 2019.

    Comments: The paper was submitted to National Information Technology Conference (2019)

  45. arXiv:1906.08771  [pdf, other]

    cs.LG stat.ML

    Submodular Batch Selection for Training Deep Neural Networks

    Authors: K J Joseph, Vamshi Teja R, Krishnakant Singh, Vineeth N Balasubramanian

    Abstract: Mini-batch gradient descent based methods are the de facto algorithms for training neural network architectures today. We introduce a mini-batch selection strategy based on submodular function maximization. Our novel submodular formulation captures the informativeness of each sample and diversity of the whole subset. We design an efficient, greedy algorithm which can give high-quality solutions to…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: IJCAI 2019

  46. arXiv:1903.12536  [pdf, other]

    cs.LG eess.SP stat.ML

    Deep Network for Capacitive ECG Denoising

    Authors: Vignesh Ravichandran, Balamurali Murugesan, Sharath M Shankaranarayana, Keerthi Ram, Preejith S. P, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: Continuous monitoring of cardiac health under free-living conditions is crucial to provide effective care for patients undergoing postoperative recovery and individuals with high cardiac risk, like the elderly. Capacitive Electrocardiogram (cECG) is one such technology which allows comfortable and long-term monitoring through its ability to measure biopotential in conditions without having skin con…

    Submitted 29 March, 2019; originally announced March 2019.

    Comments: Accepted IEEE MEMEA 2019

  47. arXiv:1903.08912  [pdf, other]

    cs.LG eess.SP stat.ML

    PPGnet: Deep Network for Device Independent Heart Rate Estimation from Photoplethysmogram

    Authors: Shyam A, Vignesh Ravichandran, Preejith S. P, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: Photoplethysmogram (PPG) is increasingly used to provide monitoring of the cardiovascular system under ambulatory conditions. Wearable devices like smartwatches use PPG to allow long-term unobtrusive monitoring of heart rate in free-living conditions. PPG-based heart rate measurement is unfortunately highly susceptible to motion artifacts, particularly when measured from the wrist. Traditional mac…

    Submitted 21 March, 2019; originally announced March 2019.

    Comments: Under review in EMBC 2019

  48. arXiv:1902.04236  [pdf, other]

    eess.SP cs.CV cs.LG

    RespNet: A deep learning model for extraction of respiration from photoplethysmogram

    Authors: Vignesh Ravichandran, Balamurali Murugesan, Vaishali Balakarthikeyan, Sharath M Shankaranarayana, Keerthi Ram, Preejith S. P, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: Respiratory ailments afflict a wide range of people and manifest themselves through conditions like asthma and sleep apnea. Continuous monitoring of chronic respiratory ailments is seldom used outside the intensive care ward due to the large size and cost of the monitoring system. While Electrocardiogram (ECG) based respiration extraction is a validated approach, its adoption is limited by access to…

    Submitted 20 February, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: Under review at EMBC

  49. arXiv:1809.10238  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis

    Authors: K J Joseph, Arghya Pal, Sailaja Rajanala, Vineeth N Balasubramanian

    Abstract: Generating an image from its description is a challenging task worth solving because of its numerous practical applications ranging from image editing to virtual reality. All existing methods use one single caption to generate a plausible image. A single caption by itself can be limited, and may not be able to capture the variety of concepts and behavior that may be present in the image. We propo…

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: To appear in the proceedings of IEEE Winter Conference on Applications of Computer Vision, WACV-2019

  50. arXiv:1809.07499  [pdf, other]

    cs.CV cs.AI cs.LG

    MASON: A Model AgnoStic ObjectNess Framework

    Authors: K J Joseph, Vineeth N Balasubramanian

    Abstract: This paper proposes a simple, yet very effective method to localize dominant foreground objects in an image, to pixel-level precision. The proposed method 'MASON' (Model-AgnoStic ObjectNess) uses a deep convolutional network to generate category-independent and model-agnostic heat maps for any image. The network is not explicitly trained for the task, and hence, can be used off-the-shelf in tandem…

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: Accepted at AutoNUE Workshop, 15th European Conference on Computer Vision (ECCV), September 2018, Munich, Germany