
Showing 1–47 of 47 results for author: Sarkar, M

Searching in archive cs.
  1. arXiv:2406.14436  [pdf, other]

    cs.CV cs.RO

    Video Generation with Learned Action Prior

    Authors: Meenakshi Sarkar, Devansh Bhardwaj, Debasish Ghose

    Abstract: Stochastic video generation is particularly challenging when the camera is mounted on a moving platform, as camera motion interacts with observed image pixels, creating complex spatio-temporal dynamics and making the problem partially observable. Existing methods typically address this by focusing on raw pixel-level image reconstruction without explicitly modelling camera motion dynamics. We propo…

    Submitted 20 June, 2024; originally announced June 2024.

  2. The Impact of Machine Learning on Society: An Analysis of Current Trends and Future Implications

    Authors: Md Kamrul Hossain Siam, Manidipa Bhattacharjee, Shakik Mahmud, Md. Saem Sarkar, Md. Masud Rana

    Abstract: Machine learning (ML) is a rapidly evolving field of technology that has the potential to greatly impact society in a variety of ways. However, there are also concerns about the potential negative effects of ML on society, such as job displacement and privacy issues. This research aimed to conduct a comprehensive analysis of the current and future impact of ML on society. The research included…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 12 pages

  3. arXiv:2404.05439  [pdf, other]

    cs.CV

    Action-conditioned video data improves predictability

    Authors: Meenakshi Sarkar, Debasish Ghose

    Abstract: Long-term video generation and prediction remain challenging tasks in computer vision, particularly in partially observable scenarios where cameras are mounted on moving platforms. The interaction between observed image frames and the motion of the recording agent introduces additional complexities. To address these issues, we introduce the Action-Conditioned Video Generation (ACVG) framework, a n…

    Submitted 8 April, 2024; originally announced April 2024.

  4. arXiv:2402.14702  [pdf, other]

    cs.CL

    InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

    Authors: Somnath Banerjee, Maulindu Sarkar, Punyajoy Saha, Binny Mathew, Animesh Mukherjee

    Abstract: Recently, influence functions present an apparatus for achieving explainability for deep neural models by quantifying the perturbation of individual train instances that might impact a test prediction. Our objectives in this paper are twofold. First we incorporate influence functions as a feedback into the model to improve its performance. Second, in a dataset extension exercise, using influence f…

    Submitted 9 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at LREC-COLING 2024 (Long Paper)

  5. arXiv:2401.13968  [pdf, other]

    cs.LG cs.AI

    Dynamic Long-Term Time-Series Forecasting via Meta Transformer Networks

    Authors: Muhammad Anwar Ma'sum, MD Rasel Sarkar, Mahardhika Pratama, Savitha Ramasamy, Sreenatha Anavatti, Lin Liu, Habibullah, Ryszard Kowalczyk

    Abstract: A reliable long-term time-series forecaster is highly demanded in practice but comes across many challenges such as low computational and memory footprints as well as robustness against dynamic learning environments. This paper proposes Meta-Transformer Networks (MANTRA) to deal with the dynamic long-term time-series forecasting tasks. MANTRA relies on the concept of fast and slow learners where a…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Under Consideration in IEEE Transactions on Artificial Intelligence

  6. arXiv:2310.17808  [pdf, other]

    quant-ph cs.ET

    A Novel Fast Path Planning Approach for Mobile Devices using Hybrid Quantum Ant Colony Optimization Algorithm

    Authors: Mayukh Sarkar, Jitesh Pradhan, Anil Kumar Singh, Hathiram Nenavath

    Abstract: With IoT systems' increasing scale and complexity, maintenance of a large number of nodes using stationary devices is becoming increasingly difficult. Hence, mobile devices are being employed that can traverse through a set of target locations and provide the necessary services. In order to reduce energy consumption and time requirements, the devices are required to traverse following a Hamiltonia…

    Submitted 25 October, 2023; originally announced October 2023.

  7. arXiv:2308.11239  [pdf, other]

    cs.CV

    LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training

    Authors: Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Learning object segmentation in image and video datasets without human supervision is a challenging problem. Humans easily identify moving salient objects in videos using the gestalt principle of common fate, which suggests that what moves together belongs together. Building upon this idea, we propose a self-supervised object discovery approach that leverages motion and appearance information to p…

    Submitted 2 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted to British Machine Vision Conference (BMVC) 2023

  8. arXiv:2307.04392  [pdf, other]

    cs.CV

    FODVid: Flow-guided Object Discovery in Videos

    Authors: Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy

    Abstract: Segmentation of objects in a video is challenging due to the nuances such as motion blurring, parallax, occlusions, changes in illumination, etc. Instead of addressing these nuances separately, we focus on building a generalizable solution that avoids overfitting to the individual intricacies. Such a solution would also help us save enormous resources involved in human annotation of video corpora.…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: CVPR 2023 (L3D-IVU workshop)

  9. arXiv:2306.16503  [pdf, other]

    cs.LG cs.AI

    SARC: Soft Actor Retrospective Critic

    Authors: Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence.…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at RLDM 2022

  10. arXiv:2306.15852  [pdf, other]

    cs.RO cs.CV

    Action-conditioned Deep Visual Prediction with RoAM, a new Indoor Human Motion Dataset for Autonomous Robots

    Authors: Meenakshi Sarkar, Vinayak Honkote, Dibyendu Das, Debasish Ghose

    Abstract: With the increasing adoption of robots across industries, it is crucial to focus on developing advanced algorithms that enable robots to anticipate, comprehend, and plan their actions effectively in collaboration with humans. We introduce the Robot Autonomous Motion (RoAM) video dataset, which is collected with a custom-made turtlebot3 Burger robot in a variety of indoor environments recording var…

    Submitted 27 June, 2023; originally announced June 2023.

  11. arXiv:2305.17523  [pdf]

    cs.LG q-fin.PM

    A Comparative Analysis of Portfolio Optimization Using Mean-Variance, Hierarchical Risk Parity, and Reinforcement Learning Approaches on the Indian Stock Market

    Authors: Jaydip Sen, Aditya Jaiswal, Anshuman Pathak, Atish Kumar Majee, Kushagra Kumar, Manas Kumar Sarkar, Soubhik Maji

    Abstract: This paper presents a comparative analysis of the performances of three portfolio optimization approaches. Three approaches of portfolio optimization that are considered in this work are the mean-variance portfolio (MVP), hierarchical risk parity (HRP) portfolio, and reinforcement learning-based portfolio. The portfolios are trained and tested over several stock data and their performances are com…

    Submitted 27 May, 2023; originally announced May 2023.

    Comments: The report is 52 pages long. It is based on the capstone project done in the post graduate course of data science in Praxis Business School, Kolkata, India, of the Autumn Batch, 2022

  12. arXiv:2303.15122  [pdf, other]

    cs.CV

    Parameter Efficient Local Implicit Image Function Network for Face Segmentation

    Authors: Mausoom Sarkar, Nikitha SR, Mayur Hemani, Rishabh Jain, Balaji Krishnamurthy

    Abstract: Face parsing is defined as the per-pixel labeling of images containing human faces. The labels are defined to identify key facial regions like eyes, lips, nose, hair, etc. In this work, we make use of the structural consistency of the human face to propose a lightweight face-parsing method using a Local Implicit Function network, FP-LIIF. We propose a simple architecture having a convolutional enc…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  13. arXiv:2302.07089  [pdf]

    quant-ph cs.ET

    Novel Design of Quantum Circuits for Representation of Grayscale Images

    Authors: Mayukh Sarkar

    Abstract: The advent of Quantum Computing has influenced researchers around the world to solve multitudes of computational problems with the promising technology. Feasibility of solutions for computational problems, and representation of various information, may allow quantum computing to replace classical computers in the near future. One such challenge is the representation of digital images in quantum compute…

    Submitted 9 February, 2023; originally announced February 2023.

  14. arXiv:2301.06928  [pdf, other]

    cs.LG cs.AI

    Towards Estimating Transferability using Hard Subsets

    Authors: Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom Sarkar

    Abstract: As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning. In this work, we propose HASTE (HArd Subset TransfErability), a new strategy to estimate the transferability of a source model to a pa…

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: First three authors contributed equally

  15. arXiv:2211.10157  [pdf, other]

    cs.CV cs.AI

    UMFuse: Unified Multi View Fusion for Human Editing applications

    Authors: Rishabh Jain, Mayur Hemani, Duygu Ceylan, Krishna Kumar Singh, Jingwan Lu, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Numerous pose-guided human editing methods have been explored by the vision community due to their extensive practical applications. However, most of these methods still use an image-to-image formulation in which a single image is given as input to produce an edited image as output. This objective becomes ill-defined in cases when the target pose differs significantly from the input pose. Existing…

    Submitted 28 March, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 8 pages, 6 figures

    ACM Class: I.4; I.5

  16. arXiv:2211.08540  [pdf, other]

    cs.CV cs.AI

    VGFlow: Visibility guided Flow Network for Human Reposing

    Authors: Rishabh Jain, Krishna Kumar Singh, Mayur Hemani, Jingwan Lu, Mausoom Sarkar, Duygu Ceylan, Balaji Krishnamurthy

    Abstract: The task of human reposing involves generating a realistic image of a person standing in an arbitrary conceivable pose. There are multiple difficulties in generating perceptually accurate images, and existing methods suffer from limitations in preserving texture, maintaining pattern coherence, respecting cloth boundaries, handling occlusions, manipulating skin generation, etc. These difficulties a…

    Submitted 28 March, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Selected for publication in CVPR2023

    ACM Class: I.4; I.5

  17. arXiv:2209.06584  [pdf, other]

    cs.CV

    One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text

    Authors: Abhinav Java, Shripad Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Active consumption of digital documents has yielded scope for research in various applications, including search. Traditionally, searching within a document has been cast as a text matching problem ignoring the rich layout and visual cues commonly present in structured documents, forms, etc. To that end, we ask a mostly unexplored question: "Can we search for other similar snippets present in a ta…

    Submitted 12 September, 2022; originally announced September 2022.

  18. arXiv:2207.02964  [pdf, other]

    cs.LG cs.AI

    Mitigating shortage of labeled data using clustering-based active learning with diversity exploration

    Authors: Xuyang Yan, Shabnam Nazmi, Biniam Gebru, Mohd Anwar, Abdollah Homaifar, Mrinmoy Sarkar, Kishor Datta Gupta

    Abstract: In this paper, we proposed a new clustering-based active learning framework, namely Active Learning using a Clustering-based Sampling (ALCS), to address the shortage of labeled data. ALCS employs a density-based clustering approach to explore the cluster structure from the data without requiring exhaustive parameter tuning. A bi-cluster boundary-based sample query procedure is introduced to improv…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: Accepted by the ICML 2022 Workshop on Adaptive Experimental Design and Active Learning in the Real World

  19. Modeling and Optimization of a Longitudinally-Distributed Global Solar Grid

    Authors: Harsh Vardhan, Neal M Sarkar, Himanshu Neema

    Abstract: Our simulation-based experiments are aimed to demonstrate a use case on the feasibility of fulfillment of global energy demand by primarily relying on solar energy through the integration of a longitudinally-distributed grid. These experiments demonstrate the availability of simulation technologies, good approximation models of grid components, and data for simulation. We also experimented with in…

    Submitted 11 June, 2022; originally announced June 2022.

  20. arXiv:2203.13721  [pdf, other]

    cs.CV eess.IV

    Salt Detection Using Segmentation of Seismic Image

    Authors: Mrinmoy Sarkar

    Abstract: In this project, a state-of-the-art deep convolution neural network (DCNN) is presented to segment seismic images for salt detection below the earth's surface. Detection of salt location is very important for starting mining. Hence, a seismic image is used to detect the exact salt location under the earth's surface. However, precisely detecting the exact location of salt deposits is difficult. The…

    Submitted 25 March, 2022; originally announced March 2022.

  21. arXiv:2111.13974  [pdf, other]

    cs.CL

    Exploring Transformer Based Models to Identify Hate Speech and Offensive Content in English and Indo-Aryan Languages

    Authors: Somnath Banerjee, Maulindu Sarkar, Nancy Agrawal, Punyajoy Saha, Mithun Das

    Abstract: Hate speech is considered to be one of the major issues currently plaguing online social media. Repeated and repetitive exposure to hate speech has been shown to create physiological effects on the target users. Thus, hate speech, in all its forms, should be addressed on these platforms in order to maintain good health. In this paper, we explored several Transformer based machine learning models f…

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: Accepted in FIRE'21 (Track HASOC - English and Indo-Aryan Languages)

  22. arXiv:2111.13157  [pdf, other]

    cs.CV cs.AI

    DA$^{\textbf{2}}$-Net : Diverse & Adaptive Attention Convolutional Neural Network

    Authors: Abenezer Girma, Abdollah Homaifar, M Nabil Mahmoud, Xuyang Yan, Mrinmoy Sarkar

    Abstract: Standard Convolutional Neural Network (CNN) designs rarely focus on the importance of explicitly capturing diverse features to enhance the network's performance. Instead, most existing methods follow an indirect approach of increasing or tuning the networks' depth and width, which in many cases significantly increases the computational cost. Inspired by a biological visual system, we propose a Div…

    Submitted 25 November, 2021; originally announced November 2021.

  23. arXiv:2111.11692  [pdf, other]

    cs.MA

    Status-quo policy gradient in Multi-Agent Reinforcement Learning

    Authors: Pinkesh Badjatiya, Mausoom Sarkar, Nikaash Puri, Jayakumar Subramanian, Abhishek Sinha, Siddharth Singh, Balaji Krishnamurthy

    Abstract: Individual rationality, which involves maximizing expected individual returns, does not always lead to high-utility individual or group outcomes in multi-agent problems. For instance, in multi-agent social dilemmas, Reinforcement Learning (RL) agents trained to maximize individual rewards converge to a low-utility mutually harmful equilibrium. In contrast, humans evolve useful strategies in such s…

    Submitted 23 November, 2021; originally announced November 2021.

  24. arXiv:2111.08169  [pdf, other]

    cs.LG

    A Supervised Feature Selection Method For Mixed-Type Data using Density-based Feature Clustering

    Authors: Xuyang Yan, Mrinmoy Sarkar, Biniam Gebru, Shabnam Nazmi, Abdollah Homaifar

    Abstract: Feature selection methods are widely used to address the high computational overheads and curse of dimensionality in classifying high-dimensional data. Most conventional feature selection methods focus on handling homogeneous features, while real-world datasets usually have a mixture of continuous and discrete features. Some recent mixed-type feature selection studies only select features with hig…

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: 6 pages, 3 figures, 4 tables, accepted by the IEEE SMC 2021

  25. arXiv:2111.05413  [pdf, other]

    cs.RO

    A Framework for eVTOL Performance Evaluation in Urban Air Mobility Realm

    Authors: Mrinmoy Sarkar, Xuyang Yan, Abenezer Girma, Abdollah Homaifar

    Abstract: In this paper, we developed a generalized simulation framework for the evaluation of electric vertical takeoff and landing vehicles (eVTOLs) in the context of Unmanned Aircraft Systems (UAS) Traffic Management (UTM) and under the concept of Urban Air Mobility (UAM). Unlike most existing studies, the proposed framework combines the utilization of UTM and eVTOLs to develop a realistic UAM testing pl…

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: 7 pages, 9 figures, Submitted to ICRA 2022 conference

  26. arXiv:2109.03813  [pdf, other]

    cs.AI

    Video2Skill: Adapting Events in Demonstration Videos to Skills in an Environment using Cyclic MDP Homomorphisms

    Authors: Sumedh A Sontakke, Sumegh Roychowdhury, Mausoom Sarkar, Nikaash Puri, Balaji Krishnamurthy, Laurent Itti

    Abstract: Humans excel at learning long-horizon tasks from demonstrations augmented with textual commentary, as evidenced by the burgeoning popularity of tutorial videos online. Intuitively, this capability can be separated into 2 distinct subtasks - first, dividing a long-horizon demonstration sequence into semantically meaningful events; second, adapting such events into meaningful behaviors in one's own…

    Submitted 9 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

  27. arXiv:2107.04419  [pdf, other]

    cs.LG

    Form2Seq : A Framework for Higher-Order Form Structure Extraction

    Authors: Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: Document structure extraction has been a widely researched area for decades with recent works performing it as a semantic segmentation task over document images using fully-convolution networks. Such methods are limited by image resolution due to which they fail to disambiguate structures in dense regions which appear commonly in forms. To mitigate this, we propose Form2Seq, a novel sequence-to-se…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: This paper has been presented at EMNLP 2020

  28. arXiv:2107.04396  [pdf, other]

    cs.CV

    Multi-Modal Association based Grouping for Form Structure Extraction

    Authors: Milan Aggarwal, Mausoom Sarkar, Hiresh Gupta, Balaji Krishnamurthy

    Abstract: Document structure extraction has been a widely researched area for decades. Recent work in this direction has been deep learning-based, mostly focusing on extracting structure using fully convolution NN through semantic segmentation. In this work, we present a novel multi-modal approach for form structure extraction. Given simple elements such as textruns and widgets, we extract higher-order stru…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: This work has been accepted and presented at WACV 2020

  29. arXiv:2106.11823  [pdf, other]

    cs.LG cs.AI

    A Clustering-based Framework for Classifying Data Streams

    Authors: Xuyang Yan, Abdollah Homaifar, Mrinmoy Sarkar, Abenezer Girma, Edward Tunstel

    Abstract: The non-stationary nature of data streams strongly challenges traditional machine learning techniques. Although some solutions have been proposed to extend traditional machine learning techniques for handling data streams, these approaches either require an initial label set or rely on specialized design parameters. The overlap among classes and the labeling of data streams constitute other major…

    Submitted 22 June, 2021; originally announced June 2021.

    Comments: This paper has been accepted by IJCAI 2021

  30. arXiv:2104.09571  [pdf, ps, other]

    cs.NI

    Medium Access Strategies for Integrated Access and Backhaul at mmWaves Unlicensed Spectrum

    Authors: Biswa P. S. Sahoo, Styabrata Swain, Hung-Yu Wei, Mahasweta Sarkar

    Abstract: The unlicensed spectrum is recently considered one of the defining solutions to meet the steadily growing traffic demand. This, in turn, has led to the enhancement for LTE in Release-13 to enable Licensed-Assisted Access (LAA) operations. The design of the medium access control (MAC) protocol for the LAA system to harmonically coexist with the incumbent WLAN system operating in an unlicensed band…

    Submitted 22 March, 2021; originally announced April 2021.

    Comments: 6 pages, 6 figures, conference paper, Accepted for publication in Wireless Telecommunications Symposium (WTS), San Francisco, USA, April 2021

  31. arXiv:2010.02556  [pdf, other]

    cs.LG cs.AI cs.CL

    SHERLock: Self-Supervised Hierarchical Event Representation Learning

    Authors: Sumegh Roychowdhury, Sumedh A. Sontakke, Nikaash Puri, Mausoom Sarkar, Milan Aggarwal, Pinkesh Badjatiya, Balaji Krishnamurthy, Laurent Itti

    Abstract: Temporal event representations are an essential aspect of learning among humans. They allow for succinct encoding of the experiences we have through a variety of sensory inputs. Also, they are believed to be arranged hierarchically, allowing for an efficient representation of complex long-horizon experiences. Additionally, these representations are acquired in a self-supervised manner. Analogously…

    Submitted 22 August, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at ICPR '22

  32. arXiv:2009.01485  [pdf, other]

    cs.CV cs.AI

    SAC: Semantic Attention Composition for Text-Conditioned Image Retrieval

    Authors: Surgan Jandial, Pinkesh Badjatiya, Pranit Chawla, Ayush Chopra, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: The ability to efficiently search for images is essential for improving the user experiences across various products. Incorporating user feedback, via multi-modal inputs, to navigate visual search can help tailor retrieved results to specific user queries. We focus on the task of text-conditioned image retrieval that utilizes support text feedback alongside a reference image to retrieve images tha…

    Submitted 19 October, 2021; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: Surgan Jandial, Pinkesh Badjatiya, Pranit Chawla, and Ayush Chopra contributed equally to this work. Work accepted at WACV 2022

  33. arXiv:2006.13593  [pdf, other]

    cs.CV

    Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks

    Authors: Surgan Jandial, Ayush Chopra, Mausoom Sarkar, Piyush Gupta, Balaji Krishnamurthy, Vineeth Balasubramanian

    Abstract: Deep neural networks (DNNs) are powerful learning machines that have enabled breakthroughs in several domains. In this work, we introduce a new retrospective loss to improve the training of deep neural network models by utilizing the prior experience available in past model states during training. Minimizing the retrospective loss, along with the task-specific loss, pushes the parameter state at t…

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: Accepted at KDD 2020; The first two authors contributed equally

  34. arXiv:2001.05458  [pdf, other]

    cs.AI cs.GT cs.LG

    Inducing Cooperative behaviour in Sequential-Social dilemmas through Multi-Agent Reinforcement Learning using Status-Quo Loss

    Authors: Pinkesh Badjatiya, Mausoom Sarkar, Abhishek Sinha, Siddharth Singh, Nikaash Puri, Jayakumar Subramanian, Balaji Krishnamurthy

    Abstract: In social dilemma situations, individual rationality leads to sub-optimal group outcomes. Several human engagements can be modeled as sequential (multi-step) social dilemmas. However, in contrast to humans, Deep Reinforcement Learning agents trained to optimize individual rewards in sequential social dilemmas converge to selfish, mutually harmful behavior. We introduce a status-quo loss (SQLoss)…

    Submitted 13 February, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  35. arXiv:1912.09428  [pdf, other]

    eess.SP cs.LG cs.SD eess.AS stat.ML

    Location Forensics Analysis Using ENF Sequences Extracted from Power and Audio Recordings

    Authors: Dhiman Chowdhury, Mrinmoy Sarkar

    Abstract: Electrical network frequency (ENF) is the signature of a power distribution grid which represents the nominal frequency (50 or 60 Hz) of a power system network. Due to load variations in a power grid, ENF sequences experience fluctuations. These ENF variations are inherently located in a multimedia signal which is recorded close to the grid or directly from the mains power line. Therefore, a multi…

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: 5 pages, 5 figures, conference paper

  36. arXiv:1911.12170  [pdf, other]

    cs.CV

    Document Structure Extraction using Prior based High Resolution Hierarchical Semantic Segmentation

    Authors: Mausoom Sarkar, Milan Aggarwal, Arneh Jain, Hiresh Gupta, Balaji Krishnamurthy

    Abstract: Structure extraction from document images has been a long-standing research topic due to its high impact on a wide range of practical applications. In this paper, we share our findings on employing a hierarchical semantic segmentation network for this task of structure extraction. We propose a prior based deep hierarchical CNN network architecture that enables document structure extraction using v…

    Submitted 17 September, 2020; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: This work has been accepted at ECCV 2020

  37. arXiv:1909.00237  [pdf, ps, other]

    cs.NE q-bio.QM

    Triclustering of Gene Expression Microarray Data Using Coarse-Grained Parallel Genetic Algorithm

    Authors: Shubhankar Mohapatra, Moumita Sarkar, Anjali Mohapatra, Bhawani Sankar Biswal

    Abstract: Microarray data analysis is one of the major areas of research in the field of computational biology. Numerous techniques like clustering and biclustering are often applied to microarray data to extract meaningful outcomes which play key roles in practical healthcare affairs like disease identification, drug discovery, etc. But these techniques become obsolete when time as another factor is considered…

    Submitted 31 August, 2019; originally announced September 2019.

    Journal ref: Springer Lecture Notes in Networks and Systems 2016 - 2020

  38. arXiv:1906.10182  [pdf, other]

    cs.RO cs.CV cs.LG

    Planning Robot Motion using Deep Visual Prediction

    Authors: Meenakshi Sarkar, Prabhu Pradhan, Debasish Ghose

    Abstract: In this paper, we introduce a novel framework that can learn to make visual predictions about the motion of a robotic agent from raw video frames. Our proposed motion prediction network (PROM-Net) can learn in a completely unsupervised manner and efficiently predict up to 10 frames in the future. Moreover, unlike any other motion prediction models, it is lightweight and once trained it can be easi…

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: 7th ICAPS Workshop on Planning and Robotics (PlanRob), 2019

  39. arXiv:1810.05394  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    Sequential Learning of Movement Prediction in Dynamic Environments using LSTM Autoencoder

    Authors: Meenakshi Sarkar, Debasish Ghose

    Abstract: Predicting movement of objects while the action of learning agent interacts with the dynamics of the scene still remains a key challenge in robotics. We propose a multi-layer Long Short Term Memory (LSTM) autoencoder network that predicts future frames for a robot navigating in a dynamic environment with moving obstacles. The autoencoder network is composed of a state and action conditioned decode…

    Submitted 12 October, 2018; originally announced October 2018.

    Comments: 4 pages

    MSC Class: 68T05

  40. Internet of Things: Technology, Applications and Standardization

    Authors: Jaydip Sen, Moonkun Lee, Sunghyeon Lee, Yeongbok Choe, Menachem Domb, Arpan Pal, Hemant Kumar Rath, Samar Shailendra, Abhijan Bhattacharyya, Albena Mihovska, Mahasweta Sarkar, Hyun Jung Lee, Myungho Kim, Alexandru Averian

    Abstract: The term "Internet of Things" (IoT) refers to an ecosystem of interconnected physical objects and devices that are accessible through the Internet and can communicate with each other. The main strength of the IoT vision is the high impact it has created and will continue to do so on several aspects of the everyday life and behavior of its potential users. This book presents some of the state-of-th…

    Submitted 25 August, 2018; originally announced August 2018.

    Comments: The book contains 137 pages. It is published by IntechOpen, London, United Kingdom in August 2018. Print ISBN 978-1-78923-548-7, Online ISBN 978-1-78923-549-4

  41. arXiv:1805.00223  [pdf, other]

    cs.CV

    Localization: A Missing Link in the Pipeline of Object Matching and Registration

    Authors: Deepak Mishra, Rajeev Ranjan, Santanu Chaudhury, Mukul Sarkar, Arvinder Singh Soin

    Abstract: Image registration is a process of aligning two or more images of the same objects using geometric transformation. Most of the existing approaches work on the assumption of location invariance. These approaches require object-centric images to perform matching. Further, in the absence of intensity level symmetry between the corresponding points in two images, the learning based registration approaches rel…

    Submitted 11 January, 2019; v1 submitted 1 May, 2018; originally announced May 2018.

    Comments: 11 pages, 6 figures

  42. arXiv:1804.08454  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG

    Attention Based Natural Language Grounding by Navigating Virtual Environment

    Authors: Akilesh B, Abhishek Sinha, Mausoom Sarkar, Balaji Krishnamurthy

    Abstract: In this work, we focus on the problem of grounding language by training an agent to follow a set of natural language instructions and navigate to a target object in an environment. The agent receives visual information through raw pixels and a natural language instruction telling what task needs to be achieved and is trained in an end-to-end way. We develop an attention mechanism for multi-modal f… ▽ More

    Submitted 21 December, 2018; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: Accepted at WACV 2019. Also at NeurIPS 2017 workshop on Visually-Grounded Interaction and Language (ViGIL)

  43. arXiv:1801.03318  [pdf, other]

    cs.CV

    Unsupervised Despeckling

    Authors: Deepak Mishra, Santanu Chaudhury, Mukul Sarkar, Arvinder Singh Soin

    Abstract: Contrast and quality of ultrasound images are adversely affected by the excessive presence of speckle. However, being an inherent imaging property, speckle helps in tissue characterization and tracking. Thus, despeckling of the ultrasound images requires the reduction of speckle extent without any oversmoothing. In this letter, we aim to address the despeckling problem using an unsupervised deep a… ▽ More

    Submitted 10 January, 2018; originally announced January 2018.

  44. arXiv:1705.02338  [pdf, other]

    cs.ET

    Exploiting OxRAM Resistive Switching for Dynamic Range Improvement of CMOS Image Sensors

    Authors: Ashwani Kumar, Mukul Sarkar, Manan Suri

    Abstract: We present a unique application of OxRAM devices in CMOS Image Sensors (CIS) for dynamic range (DR) improvement. We propose a modified 3T-APS (Active Pixel Sensor) circuit that incorporates OxRAM in 1T-1R configuration. DR improvement is achieved by resistive compression of the pixel output signal through autonomous programming of OxRAM device resistance during exposure. We show that by carefully… ▽ More

    Submitted 5 May, 2017; originally announced May 2017.

  45. arXiv:1704.04959  [pdf, other]

    cs.LG

    Introspection: Accelerating Neural Network Training By Learning Weight Evolution

    Authors: Abhishek Sinha, Mausoom Sarkar, Aahitagni Mukherjee, Balaji Krishnamurthy

    Abstract: Neural Networks are function approximators that have achieved state-of-the-art accuracy in numerous machine learning tasks. In spite of their great success in terms of accuracy, their large training time makes it difficult to use them for various tasks. In this paper, we explore the idea of learning weight evolution pattern from a simple network for accelerating training of novel neural networks.… ▽ More

    Submitted 17 April, 2017; originally announced April 2017.

  46. A Novel Reconfigurable Architecture of a DSP Processor for Efficient Mapping of DSP Functions using Field Programmable DSP Arrays

    Authors: Amitabha Sinha, Mitrava Sarkar, Soumojit Acharyya, Suranjan Chakraborty

    Abstract: Development of modern integrated circuit technologies makes it feasible to develop cheaper, faster and smaller special purpose signal processing function circuits. Digital Signal processing functions are generally implemented either on ASICs with inflexibility, or on FPGAs with bottlenecks of relatively smaller utilization factor or lower speed compared to ASIC. Field Programmable DSP Array (FPDA)… ▽ More

    Submitted 1 June, 2013; originally announced June 2013.

    Comments: 8 Pages, 12 Figures, ACM SIGARCH Computer Architecture News. arXiv admin note: substantial text overlap with arXiv:1305.3251

    MSC Class: 68R01

    Journal ref: ACM SIGARCH Computer Architecture News, Volume 41 Issue 2, May 2013, Pages 1-8

  47. Field Programmable DSP Arrays - A Novel Reconfigurable Architecture for Efficient Realization of Digital Signal Processing Functions

    Authors: Amitabha Sinha, Soumojit Acharyya, Suranjan Chakraborty, Mitrava Sarkar

    Abstract: Digital Signal Processing functions are widely used in real time high speed applications. Those functions are generally implemented either on ASICs with inflexibility, or on FPGAs with bottlenecks of relatively smaller utilization factor or lower speed compared to ASIC. The proposed reconfigurable DSP processor is redolent to FPGA, but with basic fixed Common Modules (CMs) (like adders, subtractor… ▽ More

    Submitted 13 May, 2013; originally announced May 2013.

    Comments: 18 pages, 17 figures. This paper has been published into Signal & Image Processing : An International Journal (SIPIJ - AIRCC) Vol.4, No.2, April 2013. http://airccse.org/journal/sipij/current2013.html

    Journal ref: Signal & Image Processing : An International Journal (SIPIJ - AIRCC) Vol.4, No.2, April 2013