Showing 1–50 of 73 results for author: Chaudhury, S

  1. arXiv:2408.00311  [pdf, other]

    cs.CV

    Translating Imaging to Genomics: Leveraging Transformers for Predictive Modeling

    Authors: Aiman Farooq, Deepak Mishra, Santanu Chaudhury

    Abstract: In this study, we present a novel approach for predicting genomic information from medical imaging modalities using a transformer-based model. We aim to bridge the gap between imaging and genomics data by leveraging transformer networks, allowing for accurate genomic profile predictions from CT/MRI images. Presently most studies rely on the use of whole slide images (WSI) for the association, whic…

    Submitted 1 August, 2024; originally announced August 2024.

  2. arXiv:2407.16908  [pdf, other]

    cs.CL cs.AI cs.LG

    Generation Constraint Scaling Can Mitigate Hallucination

    Authors: Georgios Kollias, Payel Das, Subhajit Chaudhury

    Abstract: Addressing the issue of hallucinations in large language models (LLMs) is a critical challenge. As the cognitive mechanisms of hallucination have been related to memory, here we explore hallucination for LLM that is enabled with explicit memory mechanisms. We empirically demonstrate that by simply scaling the readout vector that constrains generation in a memory-augmented LLM decoder, hallucinatio…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 7 pages; accepted at ICML 2024 Workshop on Large Language Models and Cognition

  3. arXiv:2407.11172  [pdf]

    eess.SY physics.optics

    Micro-Ring Modulator Linearity Enhancement for Analog and Digital Optical Links

    Authors: Sumilak Chaudhury, Karl Johnson, Chengkuan Gao, Bill Lin, Yeshaiahu Fainman, Tzu-Chien Hsueh

    Abstract: An energy/area-efficient low-cost broadband linearity enhancement technique for electro-optic micro-ring modulators (MRM) is proposed to achieve 6.1-dB dynamic linearity improvement in spurious-free-dynamic-range with intermodulation distortions (IMD) and 17.9-dB static linearity improvement in integral nonlinearity over a conventional notch-filter MRM within a 4.8-dB extinction-ratio (ER) full-sc…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 4 pages, 5 figures

  4. arXiv:2407.01619  [pdf, other]

    cs.LG cs.AI cs.DB

    TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes

    Authors: Aamod Khatiwada, Harsha Kokel, Ibrahim Abdelaziz, Subhajit Chaudhury, Julian Dolby, Oktie Hassanzadeh, Zhenhan Huang, Tejaswini Pedapati, Horst Samulowitz, Kavitha Srinivas

    Abstract: Enterprises have a growing need to identify relevant tables in data lakes; e.g. tables that are unionable, joinable, or subsets of each other. Tabular neural models can be helpful for such data discovery tasks. In this paper, we present TabSketchFM, a neural tabular model for data discovery over data lakes. First, we propose a novel pre-training sketch-based approach to enhance the effectiveness o…

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2307.04217

  5. arXiv:2407.01437  [pdf, other]

    cs.CL cs.AI cs.LG

    Needle in the Haystack for Memory Based Large Language Models

    Authors: Elliot Nelson, Georgios Kollias, Payel Das, Subhajit Chaudhury, Soham Dan

    Abstract: Current large language models (LLMs) often perform poorly on simple fact retrieval tasks. Here we investigate if coupling a dynamically adaptable external memory to an LLM can alleviate this problem. For this purpose, we test Larimar, a recently proposed language model architecture which uses an external associative memory, on long-context recall tasks including passkey and needle-in-the-haystack t…

    Submitted 12 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 5 pages; slightly revised abstract

  6. arXiv:2404.10174  [pdf, other]

    cs.CL

    On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

    Authors: Mauricio Gruppi, Soham Dan, Keerthiram Murugesan, Subhajit Chaudhury

    Abstract: Text-based reinforcement learning involves an agent interacting with a fictional environment using observed text and admissible actions in natural language to complete a task. Previous works have shown that agents can succeed in text-based interactive environments even in the complete absence of semantic understanding or other linguistic capabilities. The success of these agents in playing such ga…

    Submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2403.11901  [pdf, other]

    cs.LG cs.AI

    Larimar: Large Language Models with Episodic Memory Control

    Authors: Payel Das, Subhajit Chaudhury, Elliot Nelson, Igor Melnyk, Sarath Swaminathan, Sihui Dai, Aurélie Lozano, Georgios Kollias, Vijil Chenthamarakshan, Jiří Navrátil, Soham Dan, Pin-Yu Chen

    Abstract: Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tunin…

    Submitted 6 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  8. arXiv:2403.10692  [pdf, other]

    cs.CL cs.AI cs.LO

    EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

    Authors: Kinjal Basu, Keerthiram Murugesan, Subhajit Chaudhury, Murray Campbell, Kartik Talamadupula, Tim Klinger

    Abstract: Text-based games (TBGs) have emerged as an important collection of NLP tasks, requiring reinforcement learning (RL) agents to combine natural language understanding with reasoning. A key challenge for agents attempting to solve such tasks is to generalize across multiple games and demonstrate good performance on both seen and unseen objects. Purely deep-RL-based approaches may perform well on seen…

    Submitted 15 March, 2024; originally announced March 2024.

  9. arXiv:2403.06009  [pdf, other]

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy, et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen…

    Submitted 13 June, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  10. arXiv:2402.15491  [pdf, other]

    cs.CL cs.AI

    API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

    Authors: Kinjal Basu, Ibrahim Abdelaziz, Subhajit Chaudhury, Soham Dan, Maxwell Crouse, Asim Munawar, Sadhana Kumaravel, Vinod Muthusamy, Pavan Kapanipathi, Luis A. Lastras

    Abstract: There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks. As such, there is tremendous interest in methods that can acquire sufficient quantities of train and test data that involve calls to tools / APIs. Two lines of research have emerged as the predominant strategies for addressing this cha…

    Submitted 20 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted at ACL'24-main conference

  11. arXiv:2402.07301  [pdf, other]

    cs.CV

    LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis Functions

    Authors: Atharva Pandey, Vishal Yadav, Rajendra Nagar, Santanu Chaudhury

    Abstract: Implicit 3D surface reconstruction of an object from its partial and noisy 3D point cloud scan is the classical geometry processing and 3D computer vision problem. In the literature, various 3D shape representations have been developed, differing in memory efficiency and shape retrieval effectiveness, such as volumetric, parametric, and implicit surfaces. Radial basis functions provide memory-effi…

    Submitted 11 February, 2024; originally announced February 2024.

    Journal ref: AAAI 2024

  12. arXiv:2310.16173  [pdf, other]

    cs.LG

    On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $ε$-Greedy Exploration

    Authors: Shuai Zhang, Hongkang Li, Meng Wang, Miao Liu, Pin-Yu Chen, Songtao Lu, Sijia Liu, Keerthiram Murugesan, Subhajit Chaudhury

    Abstract: This paper provides a theoretical understanding of Deep Q-Network (DQN) with the $\varepsilon$-greedy exploration in deep reinforcement learning. Despite the tremendous empirical achievement of the DQN, its theoretical characterization remains underexplored. First, the exploration strategy is either impractical or ignored in the existing analysis. Second, in contrast to conventional Q-learning alg…

    Submitted 24 October, 2023; originally announced October 2023.

    Journal ref: NeurIPS 2023

  13. arXiv:2307.04217  [pdf, other]

    cs.DB cs.AI

    LakeBench: Benchmarks for Data Discovery over Data Lakes

    Authors: Kavitha Srinivas, Julian Dolby, Ibrahim Abdelaziz, Oktie Hassanzadeh, Harsha Kokel, Aamod Khatiwada, Tejaswini Pedapati, Subhajit Chaudhury, Horst Samulowitz

    Abstract: Within enterprises, there is a growing need to intelligently navigate data lakes, specifically focusing on data discovery. Of particular importance to enterprises is the ability to find related tables in data repositories. These tables can be unionable, joinable, or subsets of each other. There is a dearth of benchmarks for these tasks in the public domain, with related work targeting private data…

    Submitted 9 July, 2023; originally announced July 2023.

  14. arXiv:2307.02689  [pdf, other]

    cs.CL

    Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning

    Authors: Subhajit Chaudhury, Sarathkrishna Swaminathan, Daiki Kimura, Prithviraj Sen, Keerthiram Murugesan, Rosario Uceda-Sosa, Michiaki Tatsubori, Achille Fokoue, Pavan Kapanipathi, Asim Munawar, Alexander Gray

    Abstract: Text-based reinforcement learning agents have predominantly been neural network-based models with embeddings-based representation, learning uninterpretable policies that often do not generalize well to unseen games. On the other hand, neuro-symbolic methods, specifically those that leverage an intermediate formal representation, are gaining significant attention in language understanding tasks. Th…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: ACL 2023

  15. arXiv:2306.10452  [pdf, other]

    cs.CL

    MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

    Authors: Keerthiram Murugesan, Sarathkrishna Swaminathan, Soham Dan, Subhajit Chaudhury, Chulaka Gunasekara, Maxwell Crouse, Diwakar Mahajan, Ibrahim Abdelaziz, Achille Fokoue, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: With the growing interest in large language models, the need for evaluating the quality of machine text compared to reference (typically human-generated) text has become focal attention. Most recent works focus either on task-specific evaluation metrics or study the properties of machine-generated text captured by the existing metrics. In this work, we propose a new evaluation scheme to model huma…

    Submitted 17 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL 2023 (ACL Findings Long)

  16. arXiv:2305.20018  [pdf, other]

    cs.CL cs.AI

    Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency

    Authors: Maxwell Crouse, Ramon Astudillo, Tahira Naseem, Subhajit Chaudhury, Pavan Kapanipathi, Salim Roukos, Alexander Gray

    Abstract: We introduce Logical Offline Cycle Consistency Optimization (LOCCO), a scalable, semi-supervised method for training a neural semantic parser. Conceptually, LOCCO can be viewed as a form of self-learning where the semantic parser being trained is used to generate annotations for unlabeled text that are then used as new supervision. To increase the quality of annotations, our method utilizes a coun…

    Submitted 31 May, 2023; originally announced May 2023.

  17. arXiv:2305.04346  [pdf, other]

    cs.CL cs.AI

    Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing

    Authors: Maxwell Crouse, Pavan Kapanipathi, Subhajit Chaudhury, Tahira Naseem, Ramon Astudillo, Achille Fokoue, Tim Klinger

    Abstract: Nearly all general-purpose neural semantic parsers generate logical forms in a strictly top-down autoregressive fashion. Though such systems have achieved impressive results across a variety of datasets and domains, recent works have called into question whether they are ultimately limited in their ability to compositionally generalize. In this work, we approach semantic parsing from, quite litera…

    Submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL main conference

  18. arXiv:2303.05556  [pdf, other]

    cs.CV

    An Evaluation of Non-Contrastive Self-Supervised Learning for Federated Medical Image Analysis

    Authors: Soumitri Chattopadhyay, Soham Ganguly, Sreejit Chaudhury, Sayan Nag, Samiran Chattopadhyay

    Abstract: Privacy and annotation bottlenecks are two major issues that profoundly affect the practicality of machine learning-based medical image analysis. Although significant progress has been made in these areas, these issues are not yet fully resolved. In this paper, we seek to tackle these concerns head-on and systematically explore the applicability of non-contrastive self-supervised learning (SSL) al…

    Submitted 9 March, 2023; originally announced March 2023.

  19. arXiv:2303.02245  [pdf, other]

    cs.CV

    Exploring Self-Supervised Representation Learning For Low-Resource Medical Image Analysis

    Authors: Soumitri Chattopadhyay, Soham Ganguly, Sreejit Chaudhury, Sayan Nag, Samiran Chattopadhyay

    Abstract: The success of self-supervised learning (SSL) has mostly been attributed to the availability of unlabeled yet large-scale datasets. However, in a specialized domain such as medical imaging which is a lot different from natural images, the assumption of data availability is unrealistic and impractical, as the data itself is scanty and found in small databases, collected for specific prognosis tasks…

    Submitted 28 June, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at IEEE ICIP 2023

  20. arXiv:2210.12624  [pdf, other]

    cs.LG math.OC stat.ML

    Mitigating Gradient Bias in Multi-objective Learning: A Provably Convergent Stochastic Approach

    Authors: Heshan Fernando, Han Shen, Miao Liu, Subhajit Chaudhury, Keerthiram Murugesan, Tianyi Chen

    Abstract: Machine learning problems with multiple objective functions appear either in learning with multiple criteria where learning has to make a trade-off between multiple performance metrics such as fairness, safety and accuracy; or, in multi-task learning where multiple tasks are optimized jointly, sharing inductive bias between them. These problems are often tackled by the multi-objective optimization…

    Submitted 19 March, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: Changed hyper-parameter choice which affects some of the convergence rate results in the paper

  21. arXiv:2206.12871  [pdf, ps, other]

    math.NT math.CO

    A Matrix Analogue of Schur-Siegel-Smyth Trace Problem

    Authors: Srijonee Shabnam Chaudhury

    Abstract: Let $\mathcal{S}$ be the set of all positive-definite, symmetrizable integer matrices with non-zero upper and lower diagonal and $\mathcal{T}$ be the set of all positive-definite real symmetric matrices with non-zero upper diagonal such that all non-zero entries are square-roots of some positive integers and the matrices satisfy a certain cycle condition. In this paper, for any $n \times n$ ma…

    Submitted 23 January, 2024; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: 24 pages

    MSC Class: 15A18 15B36 15B57 11C08

  22. arXiv:2203.10888  [pdf, ps, other]

    quant-ph cs.CR

    Proposal for Quantum Ciphertext-Policy Attribute-Based Encryption

    Authors: Asmita Samanta, Arpita Maitra, Shion Samadder Chaudhury

    Abstract: A Quantum Ciphertext-Policy Attribute-Based Encryption scheme (QCP-ABE) has been presented. In the classical domain, most of the popular ABE schemes are based on the hardness of the Bilinear Diffie-Hellman Exponent problem, which has been proven to be vulnerable against Shor's algorithm. Recently, some quantum safe ABE schemes have been proposed exploiting the Lattice problem. However, no efficient Qu…

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: 12 pages

  23. arXiv:2112.14489  [pdf, ps, other]

    math.NT

    Sums of squares of integer-multiple of an integral element on real bi-quadratic fields

    Authors: Srijonee Shabnam Chaudhury

    Abstract: For any given positive integer $m$ we construct certain totally positive algebraic integers $α$ of a real bi-quadratic field $K$ and obtain some necessary conditions for which $mα$ cannot be represented as a sum of integral squares. We show this for integers that lie in quadratic subfields of $K$ and for integers which are in $K$ but not in any quadratic subfield of $K$. We provide examples in tabular f…

    Submitted 9 February, 2024; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: 33 pages

    MSC Class: 11E25; 11R16; 11R33

  24. arXiv:2110.10973  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    LOA: Logical Optimal Actions for Text-based Interaction Games

    Authors: Daiki Kimura, Subhajit Chaudhury, Masaki Ono, Michiaki Tatsubori, Don Joven Agravante, Asim Munawar, Akifumi Wachi, Ryosuke Kohita, Alexander Gray

    Abstract: We present Logical Optimal Actions (LOA), an action decision architecture of reinforcement learning applications with a neuro-symbolic framework which is a combination of neural network and symbolic knowledge acquisition approach for natural language interaction games. The demonstration for LOA experiments consists of a web-based interactive platform for text-based games and visualization for acqu…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: ACL-IJCNLP 2021 (demo paper)

  25. arXiv:2110.10963  [pdf, other]

    cs.AI cs.CL cs.LG cs.RO

    Neuro-Symbolic Reinforcement Learning with First-Order Logic

    Authors: Daiki Kimura, Masaki Ono, Subhajit Chaudhury, Ryosuke Kohita, Akifumi Wachi, Don Joven Agravante, Michiaki Tatsubori, Asim Munawar, Alexander Gray

    Abstract: Deep reinforcement learning (RL) methods often require many trials before convergence, and no direct interpretability of trained policies is provided. In order to achieve fast convergence and interpretability for the policy in RL, we propose a novel RL method for text-based games with a recent neuro-symbolic framework called Logical Neural Network, which can learn symbolic and interpretable rules…

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021 (main conference)

  26. arXiv:2109.03575  [pdf, other]

    cs.CV cs.LG

    Deriving Explanation of Deep Visual Saliency Models

    Authors: Sai Phani Kumar Malladi, Jayanta Mukhopadhyay, Chaker Larabi, Santanu Chaudhury

    Abstract: Deep neural networks have shown their profound impact on achieving human level performance in visual saliency prediction. However, it is still unclear how they learn the task and what it means in terms of understanding the human visual system. In this work, we develop a technique to derive explainable saliency models from their corresponding deep neural architecture based saliency models by applying h…

    Submitted 8 September, 2021; originally announced September 2021.

  27. arXiv:2108.04558  [pdf, other]

    cs.CV

    Understanding Character Recognition using Visual Explanations Derived from the Human Visual System and Deep Networks

    Authors: Chetan Ralekar, Shubham Choudhary, Tapan Kumar Gandhi, Santanu Chaudhury

    Abstract: Human observers engage in selective information uptake when classifying visual patterns. The same is true of deep neural networks, which currently constitute the best performing artificial vision systems. Our goal is to examine the congruence, or lack thereof, in the information-gathering strategies of the two systems. We have operationalized our investigation as a character recognition task. We h…

    Submitted 29 August, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

  28. arXiv:2106.05387  [pdf, other]

    cs.LG cs.CL

    Eye of the Beholder: Improved Relation Generalization for Text-based Reinforcement Learning Agents

    Authors: Keerthiram Murugesan, Subhajit Chaudhury, Kartik Talamadupula

    Abstract: Text-based games (TBGs) have become a popular proving ground for the demonstration of learning-based agents that make decisions in quasi real-world settings. The crux of the problem for a reinforcement learning agent in such TBGs is identifying the objects in the world, and those objects' relations with that world. While the recent use of text-based resources for increasing an agent's knowledge an…

    Submitted 15 June, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

  29. arXiv:2103.05322  [pdf, ps, other]

    math.NT

    Sums of integral squares in complex bi-quadratic fields and in CM fields

    Authors: Srijonee Shabnam Chaudhury

    Abstract: Let $K$ be a complex bi-quadratic field with ring of integers $\mathcal{O}_{K}$. For $K = \mathbb{Q}(\sqrt{-m}, \sqrt{n})$, where $m \equiv 3 \pmod 4$ and $n \equiv 1 \pmod 4$, we prove that every algebraic integer can be written as a sum of integral squares. Using this, we prove that for any complex bi-quadratic field $K$, every element of $4\mathcal{O}_K$ can be written as a sum of five integra…

    Submitted 9 March, 2021; originally announced March 2021.

    Comments: 11 pages. arXiv admin note: text overlap with arXiv:2005.13870

  30. arXiv:2103.02363  [pdf, other]

    cs.AI

    Reinforcement Learning with External Knowledge by using Logical Neural Networks

    Authors: Daiki Kimura, Subhajit Chaudhury, Akifumi Wachi, Ryosuke Kohita, Asim Munawar, Michiaki Tatsubori, Alexander Gray

    Abstract: Conventional deep reinforcement learning methods are sample-inefficient and usually require a large number of training trials before convergence. Since such methods operate on an unconstrained action set, they can lead to useless actions. A recent neuro-symbolic framework called the Logical Neural Networks (LNNs) can simultaneously provide key-properties of both neural networks and symbolic logic.…

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: KBRL Workshop at IJCAI-PRICAI 2020

  31. Image inpainting using frequency domain priors

    Authors: Hiya Roy, Subhajit Chaudhury, Toshihiko Yamasaki, Tatsuaki Hashimoto

    Abstract: In this paper, we present a novel image inpainting technique using frequency domain information. Prior works on image inpainting predict the missing pixels by training neural networks using only the spatial domain information. However, these methods still struggle to reconstruct high-frequency details for real complex scenes, leading to a discrepancy in color, boundary artifacts, distorted pattern…

    Submitted 3 December, 2020; originally announced December 2020.

  32. arXiv:2010.13839  [pdf, other]

    cs.LG cs.CL

    VisualHints: A Visual-Lingual Environment for Multimodal Reinforcement Learning

    Authors: Thomas Carta, Subhajit Chaudhury, Kartik Talamadupula, Michiaki Tatsubori

    Abstract: We present VisualHints, a novel environment for multimodal reinforcement learning (RL) involving text-based interactions along with visual hints (obtained from the environment). Real-life problems often demand that agents interact with the environment using both natural language information and visual perception towards solving a goal. However, most traditional RL environments either solve pure vi…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: Code is available at http://ibm.biz/VisualHints

  33. arXiv:2009.11896  [pdf, other]

    cs.LG cs.CL stat.ML

    Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games

    Authors: Subhajit Chaudhury, Daiki Kimura, Kartik Talamadupula, Michiaki Tatsubori, Asim Munawar, Ryuki Tachibana

    Abstract: We show that Reinforcement Learning (RL) methods for solving Text-Based Games (TBGs) often fail to generalize on unseen games, especially in small data regimes. To address this issue, we propose Context Relevant Episodic State Truncation (CREST) for irrelevant token removal in observation text for improved generalization. Our method first trains a base model using Q-learning, which typically overf…

    Submitted 24 September, 2020; originally announced September 2020.

    Comments: Accepted to EMNLP 2020

  34. arXiv:2009.01478  [pdf, ps, other]

    cond-mat.mtrl-sci

    Pressure induced emergence of visible luminescence in $Cs_3Bi_2Br_9$: Effect of structural distortion in optical behaviour

    Authors: Debabrata Samanta, Pinku Saha, Bishnupada Ghosh, Sonu Pratap Chaudhury, Sayan Bhattacharya, Swastika Chatterjee, Goutam Dev Mukherjee

    Abstract: We report emergence of photoluminescence at room temperature in trigonal $Cs_3Bi_2Br_9$ at high pressures. Enhancement in intensity with pressure is found to be driven by increase in distortion of $BiBr_6$ octahedra and iso-structural transitions. Electronic band structure calculations show the sample in the high pressure phase to be an indirect band gap semiconductor. The luminescence peak profil…

    Submitted 3 September, 2020; originally announced September 2020.

  35. arXiv:2008.03205  [pdf, other]

    eess.IV cs.CV cs.LG

    Multi-Task Driven Explainable Diagnosis of COVID-19 using Chest X-ray Images

    Authors: Aakarsh Malhotra, Surbhi Mittal, Puspita Majumdar, Saheb Chhabra, Kartik Thakral, Mayank Vatsa, Richa Singh, Santanu Chaudhury, Ashwin Pudrod, Anjali Agrawal

    Abstract: With the increasing number of COVID-19 cases globally, all the countries are ramping up the testing numbers. While the RT-PCR kits are available in sufficient quantity in several countries, others are facing challenges with limited availability of testing kits and processing centers in remote areas. This has motivated researchers to find alternate methods of testing which are reliable, easily accessib…

    Submitted 3 August, 2020; originally announced August 2020.

  36. arXiv:2005.13870  [pdf, ps, other]

    math.NT

    Sums of Integral Squares In Certain Complex Bi-quadratic Fields

    Authors: Srijonee Shabnam Chaudhury

    Abstract: Let K be an algebraic number field and O_K be its ring of integers. Let S_K be the set of elements in O_K which are sums of squares in O_K and s(O_K) the minimal number of squares necessary to represent -1 in O_K. Let g(S_K) be the smallest positive integer t such that every element in S_K is a sum of t squares in O_K. Here K is generated over the field of rational numbers by the square roots of m and -n,…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: 10 pages

  37. arXiv:2003.06646  [pdf, other]

    cs.LG cs.CR cs.CV eess.IV stat.ML

    Investigating Generalization in Neural Networks under Optimally Evolved Training Perturbations

    Authors: Subhajit Chaudhury, Toshihiko Yamasaki

    Abstract: In this paper, we study the generalization properties of neural networks under input perturbations and show that minimal training data corruption by a few pixel modifications can cause drastic overfitting. We propose an evolutionary algorithm to search for optimal pixel perturbations using novel cost function inspired from literature in domain adaptation that explicitly maximizes the generalizatio…

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: Accepted at IEEE ICASSP 2020

  38. Unsupervised Temporal Feature Aggregation for Event Detection in Unstructured Sports Videos

    Authors: Subhajit Chaudhury, Daiki Kimura, Phongtharin Vinayavekhin, Asim Munawar, Ryuki Tachibana, Koji Ito, Yuki Inaba, Minoru Matsumoto, Shuji Kidokoro, Hiroki Ozaki

    Abstract: Image-based sports analytics enable automatic retrieval of key events in a game to speed up the analytics process for human experts. However, most existing methods focus on structured television broadcast video datasets with a straight and fixed camera having minimum variability in the capturing pose. In this paper, we study the case of event detection in sports videos for unstructured environment…

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: Accepted to IEEE International Symposium on Multimedia, 2019

  39. arXiv:2001.05878  [pdf, other]

    cs.CV cs.LG stat.ML

    Assessing Robustness of Deep learning Methods in Dermatological Workflow

    Authors: Sourav Mishra, Subhajit Chaudhury, Hideaki Imaizumi, Toshihiko Yamasaki

    Abstract: This paper aims to evaluate the suitability of current deep learning methods for clinical workflow especially by focusing on dermatology. Although deep learning methods have been attempted to get dermatologist level accuracy in several individual conditions, it has not been rigorously tested for common clinical complaints. Most projects involve data acquired in well-controlled laboratory condition…

    Submitted 17 March, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

    Comments: Accepted in ACM CHIL 2020 Workshop (Oral and poster, without publication)

  40. arXiv:2001.03463  [pdf, other]

    cs.CV

    Compressive sensing based privacy for fall detection

    Authors: Ronak Gupta, Prashant Anand, Santanu Chaudhury, Brejesh Lall, Sanjay Singh

    Abstract: Fall detection holds immense importance in the field of healthcare, where timely detection allows for instant medical assistance. In this context, we propose a 3D ConvNet architecture which consists of 3D Inception modules for fall detection. The proposed architecture is a custom version of Inflated 3D (I3D) architecture, that takes compressed measurements of video sequence as spatio-temporal inpu…

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: accepted in NCVPRIPG 2019

  41. arXiv:1912.07974  [pdf, ps, other]

    cond-mat.soft

    Pulling a folded polymer through a nanopore

    Authors: Bappa Ghosh, Jalal Sarabadani, Srabanti Chaudhury, Tapio Ala-Nissila

    Abstract: We investigate the translocation dynamics of a folded linear polymer which is pulled through a nanopore by an external force. To this end, we generalize the iso-flux tension propagation (IFTP) theory for end-pulled polymer translocation to include the case of two segments of the folded polymer traversing simultaneously through the pore. Our theory is extensively benchmarked with corresponding Molec…

    Submitted 17 December, 2019; originally announced December 2019.

  42. arXiv:1904.06683  [pdf]

    cs.CV

    Lunar surface image restoration using U-net based deep neural networks

    Authors: Hiya Roy, Subhajit Chaudhury, Toshihiko Yamasaki, Danielle DeLatte, Makiko Ohtake, Tatsuaki Hashimoto

    Abstract: Image restoration is a technique that reconstructs a feasible estimate of the original image from the noisy observation. In this paper, we present a U-Net based deep neural network model to restore the missing pixels in the lunar surface image in a context-aware fashion, a task often known as the image inpainting problem. We use the grayscale image of the lunar surface captured by Multiband Imager (…

    Submitted 14 April, 2019; originally announced April 2019.

  43. arXiv:1904.01215  [pdf, other]

    cs.CV

    DSAL-GAN: Denoising based Saliency Prediction with Generative Adversarial Networks

    Authors: Prerana Mukherjee, Manoj Sharma, Megh Makwana, Ajay Pratap Singh, Avinash Upadhyay, Akkshita Trivedi, Brejesh Lall, Santanu Chaudhury

    Abstract: Synthesizing high quality saliency maps from noisy images is a challenging problem in computer vision and has many practical applications. Samples generated by existing techniques for saliency detection cannot handle the noise perturbations smoothly and fail to delineate the salient objects present in the given scene. In this paper, we present a novel end-to-end coupled Denoising based Saliency Pr…

    Submitted 2 April, 2019; originally announced April 2019.

  44. arXiv:1811.03692  [pdf, other]

    cs.CV cs.LG

    Mode matching in GANs through latent space learning and inversion

    Authors: Deepak Mishra, Prathosh A. P., Aravind Jayendran, Varun Srivastava, Santanu Chaudhury

    Abstract: Generative adversarial networks (GANs) have shown remarkable success in the generation of unstructured data, such as natural images. However, discovery and separation of modes in the generated space, essential for several tasks beyond naive data generation, is still a challenge. In this paper, we address the problem of imposing desired modal properties on the generated space using a latent distributi…

    Submitted 24 March, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

  45. arXiv:1810.01108  [pdf, other]

    cs.LG cs.CV stat.ML

    Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning

    Authors: Subhajit Chaudhury, Daiki Kimura, Asim Munawar, Ryuki Tachibana

    Abstract: The growing use of virtual autonomous agents in applications like games and entertainment demands better control policies for natural-looking movements and actions. Unlike the conventional approach of hard-coding motion routines, we propose a deep learning method for obtaining control policies by directly mimicking raw video demonstrations. Previous methods in this domain rely on extracting low-di…

    Submitted 25 October, 2019; v1 submitted 2 October, 2018; originally announced October 2018.

    Comments: Updated the paper to match with version accepted at IEEE MMSP 2019

  46. arXiv:1809.08925  [pdf, other]

    cs.LG cs.AI

    Constrained Exploration and Recovery from Experience Shaping

    Authors: Tu-Hoa Pham, Giovanni De Magistris, Don Joven Agravante, Subhajit Chaudhury, Asim Munawar, Ryuki Tachibana

    Abstract: We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding undesirable actions or states, associated with lower rewards or penalties. The construction and balancing of different reward components can be difficult in the presence…

    Submitted 21 September, 2018; originally announced September 2018.

    Comments: Code: https://github.com/IBM/constrained-rl

  47. arXiv:1807.01990  [pdf, other]

    cs.CV

    Transfer Learning From Synthetic To Real Images Using Variational Autoencoders For Precise Position Detection

    Authors: Tadanobu Inoue, Subhajit Chaudhury, Giovanni De Magistris, Sakyasingha Dasgupta

    Abstract: Capturing and labeling camera images in the real world is an expensive task, whereas synthesizing labeled images in a simulation environment is easy for collecting large-scale image data. However, learning from only synthetic images may not achieve the desired performance in the real world due to a gap between synthetic and real images. We propose a method that transfers learned detection of an ob…

    Submitted 4 July, 2018; originally announced July 2018.

    Comments: Copyright 2018 IEEE - Accepted at ICIP 2018, Athens, Greece, October 7-10, 2018. Video: https://youtu.be/30vji7nJibA. arXiv admin note: text overlap with arXiv:1709.06762

  48. arXiv:1806.08523  [pdf, ps, other]

    cs.CV

    Focusing on What is Relevant: Time-Series Learning and Understanding using Attention

    Authors: Phongtharin Vinayavekhin, Subhajit Chaudhury, Asim Munawar, Don Joven Agravante, Giovanni De Magistris, Daiki Kimura, Ryuki Tachibana

    Abstract: This paper is a contribution towards the interpretability of deep learning models in different applications of time-series. We propose a temporal attention layer that is capable of selecting the relevant information to perform various tasks, including data completion, key-frame detection and classification. The method uses the whole input sequence to calculate an attention value for each time step…

    Submitted 22 June, 2018; originally announced June 2018.

    Comments: To appear in ICPR 2018

  49. arXiv:1806.01267  [pdf, other]

    cs.LG cs.AI cs.RO stat.ML

    Internal Model from Observations for Reward Shaping

    Authors: Daiki Kimura, Subhajit Chaudhury, Ryuki Tachibana, Sakyasingha Dasgupta

    Abstract: Reinforcement learning methods require careful design involving a reward function to obtain the desired action policy for a given task. In the absence of hand-crafted reward functions, prior work on the topic has proposed several methods for reward estimation by using expert state trajectories and action pairs. However, there are cases where complete or good action information cannot be obtained f…

    Submitted 14 October, 2018; v1 submitted 2 June, 2018; originally announced June 2018.

    Comments: 7 pages, 6 figures, ICML workshop (ALA 2018)

  50. arXiv:1805.00223  [pdf, other]

    cs.CV

    Localization: A Missing Link in the Pipeline of Object Matching and Registration

    Authors: Deepak Mishra, Rajeev Ranjan, Santanu Chaudhury, Mukul Sarkar, Arvinder Singh Soin

    Abstract: Image registration is the process of aligning two or more images of the same objects using geometric transformation. Most of the existing approaches work on the assumption of location invariance. These approaches require object-centric images to perform matching. Further, in the absence of intensity level symmetry between the corresponding points in two images, the learning based registration approaches rel…

    Submitted 11 January, 2019; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: 11 pages, 6 figures