Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 103 results for author: Gupta, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.15444  [pdf, other

    cs.CL

    Investigating the Robustness of LLMs on Math Word Problems

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra

    Abstract: Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experim… ▽ More

    Submitted 30 May, 2024; originally announced June 2024.

  2. arXiv:2406.14236  [pdf, other

    quant-ph cs.DC

    NAC-QFL: Noise Aware Clustered Quantum Federated Learning

    Authors: Himanshu Sahu, Hari Prabhat Gupta

    Abstract: Recent advancements in quantum computing, alongside successful deployments of quantum communication, hold promises for revolutionizing mobile networks. While Quantum Machine Learning (QML) presents opportunities, it contends with challenges like noise in quantum devices and scalability. Furthermore, the high cost of quantum communication constrains the practical application of QML in real-world sc… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2405.16129  [pdf, other

    cs.CL

    iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

    Authors: Harshit Gupta, Manav Chaudhary, Tathagata Raha, Shivansh Subramanian, Vasudeva Varma

    Abstract: This paper describes our approach for SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense. The BRAINTEASER task comprises multiple-choice Question Answering designed to evaluate the models' lateral thinking capabilities. It consists of Sentence Puzzle and Word Puzzle subtasks that require models to defy default common-sense associations and exhibit unconventional thinking. We propo… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  4. arXiv:2405.11192  [pdf, other

    cs.CL cs.SI

    BrainStorm @ iREL at SMM4H 2024: Leveraging Translation and Topical Embeddings for Annotation Detection in Tweets

    Authors: Manav Chaudhary, Harshit Gupta, Vasudeva Varma

    Abstract: The proliferation of LLMs in various NLP tasks has sparked debates regarding their reliability, particularly in annotation tasks where biases and hallucinations may arise. In this shared task, we address the challenge of distinguishing annotations made by LLMs from those made by human domain experts in the context of COVID-19 symptom detection from tweets in Latin American Spanish. This paper pres… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Submitted to SMM4H, colocated at ACL 2024

  5. arXiv:2405.07499  [pdf, other

    quant-ph cs.ET

    Distributed Quantum Computation with Minimum Circuit Execution Time over Quantum Networks

    Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to ex… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2405.00222  [pdf, other

    quant-ph cs.NI

    Optimized Distribution of Entanglement Graph States in Quantum Networks

    Authors: Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantu… ▽ More

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, 13 figures

  7. arXiv:2404.07164  [pdf, other

    cs.AR cs.AI cs.DC cs.LG

    Analysis of Distributed Optimization Algorithms on a Real Processing-In-Memory System

    Authors: Steve Rhyner, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Jiawei Jiang, Ataberk Olgun, Harshita Gupta, Ce Zhang, Onur Mutlu

    Abstract: Machine Learning (ML) training on large-scale datasets is a very expensive and time-consuming workload. Processor-centric architectures (e.g., CPU, GPU) commonly used for modern ML training workloads are limited by the data movement bottleneck, i.e., due to repeatedly accessing the training dataset. As a result, processor-centric systems suffer from performance degradation and high energy consumpt… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  8. arXiv:2402.06689  [pdf, other

    q-fin.ST cs.LG

    A Study on Stock Forecasting Using Deep Learning and Statistical Models

    Authors: Himanshu Gupta, Aditya Jaiswal

    Abstract: Predicting a fast and accurate model for stock price forecasting is been a challenging task and this is an active area of research where it is yet to be found which is the best way to forecast the stock price. Machine learning, deep learning and statistical analysis techniques are used here to get the accurate result so the investors can see the future trend and maximize the return of investment i… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  9. arXiv:2402.03796  [pdf, other

    cs.CV cs.AI cs.LG

    Face Detection: Present State and Research Directions

    Authors: Purnendu Prabhat, Himanshu Gupta, Ajeet Kumar Vishwakarma

    Abstract: The majority of computer vision applications that handle images featuring humans use face detection as a core component. Face detection still has issues, despite much research on the topic. Face detection's accuracy and speed might yet be increased. This review paper shows the progress made in this area as well as the substantial issues that still need to be tackled. The paper provides research di… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  10. arXiv:2401.14521  [pdf

    cs.LG cs.AI

    Towards Interpretable Physical-Conceptual Catchment-Scale Hydrological Modeling using the Mass-Conserving-Perceptron

    Authors: Yuan-Heng Wang, Hoshin V. Gupta

    Abstract: We investigate the applicability of machine learning technologies to the development of parsimonious, interpretable, catchment-scale hydrologic models using directed-graph architectures based on the mass-conserving perceptron (MCP) as the fundamental computational unit. Here, we focus on architectural complexity (depth) at a single location, rather than universal applicability (breadth) across lar… ▽ More

    Submitted 22 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 65 pages, 8 Figures, 4 Tables, 1 Supplementary Material

  11. arXiv:2312.13234  [pdf, other

    cs.LG

    Position Paper: Bridging the Gap Between Machine Learning and Sensitivity Analysis

    Authors: Christian A. Scholbeck, Julia Moosbauer, Giuseppe Casalicchio, Hoshin Gupta, Bernd Bischl, Christian Heumann

    Abstract: We argue that interpretations of machine learning (ML) models or the model-building process can bee seen as a form of sensitivity analysis (SA), a general methodology used to explain complex systems in many fields such as environmental modeling, engineering, or economics. We address both researchers and practitioners, calling attention to the benefits of a unified SA-based view of explanations in… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  12. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  13. arXiv:2311.09564  [pdf, other

    cs.CL cs.AI

    LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

    Authors: Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral

    Abstract: Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been systematically explored. Assessing these models on long sequences is crucial since prior work in the general domain has demonstrated performance degradation of LLMs on longer texts. Motivated by this,… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages

  14. arXiv:2310.17876  [pdf, other

    cs.CL

    TarGEN: Targeted Data Generation with Large Language Models

    Authors: Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra

    Abstract: The rapid advancement of large language models (LLMs) has sparked interest in data synthesis techniques, aiming to generate diverse and high-quality synthetic datasets. However, these synthetic datasets often suffer from a lack of diversity and added noise. In this paper, we present TarGEN, a multi-step prompting strategy for generating high-quality synthetic datasets utilizing a LLM. An advantage… ▽ More

    Submitted 30 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 tables, 5 figures, 5 pages references, 17 pages appendix

  15. arXiv:2310.08644  [pdf

    cs.LG cs.AI

    A Mass-Conserving-Perceptron for Machine Learning-Based Modeling of Geoscientific Systems

    Authors: Yuan-Heng Wang, Hoshin V. Gupta

    Abstract: Although decades of effort have been devoted to building Physical-Conceptual (PC) models for predicting the time-series evolution of geoscientific systems, recent work shows that Machine Learning (ML) based Gated Recurrent Neural Network technology can be used to develop models that are much more accurate. However, the difficulty of extracting physical understanding from ML-based models complicate… ▽ More

    Submitted 12 May, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 68 pages, 7 figures in the main text, 10 figures, and 10 tables in the supplementary materials

  16. arXiv:2309.07330  [pdf, other

    cs.CV

    Automated Assessment of Critical View of Safety in Laparoscopic Cholecystectomy

    Authors: Yunfan Li, Himanshu Gupta, Haibin Ling, IV Ramakrishnan, Prateek Prasanna, Georgios Georgakis, Aaron Sasson

    Abstract: Cholecystectomy (gallbladder removal) is one of the most common procedures in the US, with more than 1.2M procedures annually. Compared with classical open cholecystectomy, laparoscopic cholecystectomy (LC) is associated with significantly shorter recovery period, and hence is the preferred method. However, LC is also associated with an increase in bile duct injuries (BDIs), resulting in significa… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

  17. arXiv:2309.06545  [pdf, other

    cs.CR cs.AR

    Evaluating Homomorphic Operations on a Real-World Processing-In-Memory System

    Authors: Harshita Gupta, Mayank Kabra, Juan Gómez-Luna, Konstantinos Kanellopoulos, Onur Mutlu

    Abstract: Computing on encrypted data is a promising approach to reduce data security and privacy risks, with homomorphic encryption serving as a facilitator in achieving this goal. In this work, we accelerate homomorphic operations using the Processing-in- Memory (PIM) paradigm to mitigate the large memory capacity and frequent data movement requirements. Using a real-world PIM system, we accelerate the Br… ▽ More

    Submitted 3 October, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: This work will be presented at IISWC 2023

  18. A Dataset of Inertial Measurement Units for Handwritten English Alphabets

    Authors: Hari Prabhat Gupta, Rahul Mishra

    Abstract: This paper presents an end-to-end methodology for collecting datasets to recognize handwritten English alphabets by utilizing Inertial Measurement Units (IMUs) and leveraging the diversity present in the Indian writing style. The IMUs are utilized to capture the dynamic movement patterns associated with handwriting, enabling more accurate recognition of alphabets. The Indian context introduces var… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 10 pages, 12 figures

  19. arXiv:2307.02409  [pdf, other

    cs.DC

    Utility-Aware Load Shedding for Real-time Video Analytics at the Edge

    Authors: Enrique Saurez, Harshit Gupta, Henriette Roger, Sukanya Bhowmik, Umakishore Ramachandran, Kurt Rothermel

    Abstract: Real-time video analytics typically require video frames to be processed by a query to identify objects or activities of interest while adhering to an end-to-end frame processing latency constraint. Such applications impose a continuous and heavy load on backend compute and network infrastructure because of the need to stream and process all video frames. Video data has inherent redundancy and doe… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: This work was supported by the German Research Foundation (DFG) under the research grant "PRECEPT II" (RO 1086/19-2 and BH 154/1-2)

  20. arXiv:2306.08872  [pdf, other

    cs.CL cs.AI

    Neural models for Factual Inconsistency Classification with Explanations

    Authors: Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma

    Abstract: Factual consistency is one of the most important requirements when editing high quality documents. It is extremely important for automatic text generation systems like summarization, question answering, dialog modeling, and language modeling. Still, automated factual inconsistency detection is rather under-studied. Existing work has focused on (a) finding fake news keeping a knowledge base in cont… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: ECML-PKDD 2023

  21. arXiv:2306.05539  [pdf, other

    cs.CL

    Instruction Tuned Models are Quick Learners

    Authors: Himanshu Gupta, Saurabh Arjun Sawant, Swaroop Mishra, Mutsumi Nakamura, Arindam Mitra, Santosh Mashetty, Chitta Baral

    Abstract: Instruction tuning of language models has demonstrated the ability to enhance model generalization to unseen tasks via in-context learning using a few examples. However, typical supervised learning still requires a plethora of downstream training data for finetuning. Often in real-world situations, there is a scarcity of data available for finetuning, falling somewhere between few shot inference a… ▽ More

    Submitted 17 May, 2023; originally announced June 2023.

    Comments: 9 pages, 5 figures, 19 Tables (inclusing appendix), 12 pages of Appendix

  22. arXiv:2306.04207  [pdf, ps, other

    cs.DC

    Resource Aware Clustering for Tackling the Heterogeneity of Participants in Federated Learning

    Authors: Rahul Mishra, Hari Prabhat Gupta, Garvit Banga

    Abstract: Federated Learning is a training framework that enables multiple participants to collaboratively train a shared model while preserving data privacy and minimizing communication overhead. The heterogeneity of devices and networking resources of the participants delay the training and aggregation in federated learning. This paper proposes a federated learning approach to manoeuvre the heterogeneity… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 13 pages, 4 figures

  23. arXiv:2306.00195  [pdf, other

    quant-ph cs.ET

    Distributing Quantum Circuits Using Teleportations

    Authors: Ranjani G Sundaram, Himanshu Gupta

    Abstract: Scalability is currently one of the most sought-after objectives in the field of quantum computing. Distributing a quantum circuit across a quantum network is one way to facilitate large computations using current quantum computers. In this paper, we consider the problem of distributing a quantum circuit across a network of heterogeneous quantum computers, while minimizing the number of teleportat… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  24. arXiv:2305.16357  [pdf, other

    cs.CL

    EDM3: Event Detection as Multi-task Text Generation

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral

    Abstract: Event detection refers to identifying event occurrences in a text and comprises of two subtasks; event identification and classification. We present EDM3, a novel approach for Event Detection that formulates three generative tasks: identification, classification, and combined detection. We show that EDM3 helps to learn transferable knowledge that can be leveraged to perform Event Detection and its… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, 10 tables, 5 Page appendix

  25. arXiv:2305.05079  [pdf, other

    cs.CL

    A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution

    Authors: Neeraj Varshney, Himanshu Gupta, Eric Robertson, Bing Liu, Chitta Baral

    Abstract: State-of-the-art natural language processing models have been shown to achieve remarkable performance in 'closed-world' settings where all the labels in the evaluation set are known at training time. However, in real-world settings, 'novel' instances that do not belong to any known class are often observed. This renders the ability to deal with novelties crucial. To initiate a systematic research… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  26. arXiv:2304.12483  [pdf, other

    cs.CV

    Towards Realistic Generative 3D Face Models

    Authors: Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando de la Torre

    Abstract: In recent years, there has been significant progress in 2D generative face models fueled by applications such as animation, synthetic data generation, and digital avatars. However, due to the absence of 3D information, these 2D models often struggle to accurately disentangle facial attributes like pose, expression, and illumination, limiting their editing capabilities. To address this limitation,… ▽ More

    Submitted 26 October, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Preprint

  27. arXiv:2304.08302  [pdf

    cs.NI

    Fog Computing& IoT: Overview, Architecture and Applications

    Authors: Harshit Gupta, Dr. Ajay Kumar Bharti

    Abstract: Fog computing is an emerging technology in the field of network services where data transfer from one device to another to perform some kind of activity. Fog computing is an extended concept of cloud computing. It works in-between the Internet of Things (IoT) and cloud data centers and reduces the communication gaps. Fog computing has made possible to have decreased latency and low network congest… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 5 pages, 2 figures

  28. arXiv:2302.08624  [pdf, other

    cs.CL cs.LG

    InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis

    Authors: Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral

    Abstract: We introduce InstructABSA, an instruction learning paradigm for Aspect-Based Sentiment Analysis (ABSA) subtasks. Our method introduces positive, negative, and neutral examples to each training sample, and instruction tune the model (Tk-Instruct) for ABSA subtasks, yielding significant performance improvements. Experimental results on the Sem Eval 2014, 15, and 16 datasets demonstrate that Instruct… ▽ More

    Submitted 13 November, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 4 pages, 3 figures, 9 tables, 9 appendix pages

  29. arXiv:2301.04027  [pdf

    cs.LG cs.CE physics.ao-ph physics.geo-ph

    Differentiable modeling to unify machine learning and physical models and advance Geosciences

    Authors: Chaopeng Shen, Alison P. Appling, Pierre Gentine, Toshiyuki Bandai, Hoshin Gupta, Alexandre Tartakovsky, Marco Baity-Jesi, Fabrizio Fenicia, Daniel Kifer, Li Li, Xiaofeng Liu, Wei Ren, Yi Zheng, Ciaran J. Harman, Martyn Clark, Matthew Farthing, Dapeng Feng, Praveen Kumar, Doaa Aboelyazeed, Farshid Rahmani, Hylke E. Beck, Tadd Bindas, Dipankar Dwivedi, Kuai Fang, Marvin Höge , et al. (5 additional authors not shown)

    Abstract: Process-Based Modeling (PBM) and Machine Learning (ML) are often perceived as distinct paradigms in the geosciences. Here we present differentiable geoscientific modeling as a powerful pathway toward dissolving the perceived barrier between them and ushering in a paradigm shift. For decades, PBM offered benefits in interpretability and physical consistency but struggled to efficiently leverage lar… ▽ More

    Submitted 26 December, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Journal ref: Nat Rev Earth Environ 4, 552-567 (2023)

  30. A Roadmap to Domain Knowledge Integration in Machine Learning

    Authors: Himel Das Gupta, Victor S. Sheng

    Abstract: Many machine learning algorithms have been developed in recent years to enhance the performance of a model in different aspects of artificial intelligence. But the problem persists due to inadequate data and resources. Integrating knowledge in a machine learning model can help to overcome these obstacles up to a certain degree. Incorporating knowledge is a complex task though because of various fo… ▽ More

    Submitted 12 December, 2022; originally announced December 2022.

  31. arXiv:2211.08182  [pdf, other

    cs.CV cs.RO

    Grasping the Inconspicuous

    Authors: Hrishikesh Gupta, Stefan Thalhammer, Markus Leitner, Markus Vincze

    Abstract: Transparent objects are common in day-to-day life and hence find many applications that require robot grasping. Many solutions toward object grasping exist for non-transparent objects. However, due to the unique visual properties of transparent objects, standard 3D sensors produce noisy or distorted measurements. Modern approaches tackle this problem by either refining the noisy depth measurements… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  32. arXiv:2210.11762  [pdf, other

    cs.CL

    Detecting Unintended Social Bias in Toxic Language Datasets

    Authors: Nihar Sahoo, Himanshu Gupta, Pushpak Bhattacharyya

    Abstract: With the rise of online hate speech, automatic detection of Hate Speech, Offensive texts as a natural language processing task is getting popular. However, very little research has been done to detect unintended social bias from these toxic language datasets. This paper introduces a new dataset ToxicBias curated from the existing dataset of Kaggle competition named "Jigsaw Unintended Bias in Toxic… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  33. arXiv:2210.07471  [pdf, other

    cs.CL

    "John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility

    Authors: Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral

    Abstract: In current NLP research, large-scale language models and their abilities are widely being discussed. Some recent works have also found notable failures of these models. Often these failure examples involve complex reasoning abilities. This work focuses on a simple commonsense ability, reasoning about when an action (or its effect) is feasible. To this end, we introduce FeasibilityQA, a question-an… ▽ More

    Submitted 2 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL 2023

  34. arXiv:2210.01588  [pdf, other

    cs.CV

    Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images

    Authors: Sushant Lenka, Pratyush Kerhalkar, Pranav Shetty, Harsh Gupta, Bhavam Vidyarthi, Ujjwal Verma

    Abstract: Identification of regions affected by floods is a crucial piece of information required for better planning and management of post-disaster relief and rescue efforts. Traditionally, remote sensing images are analysed to identify the extent of damage caused by flooding. The data acquired from sensors onboard earth observation satellites are analyzed to detect the flooded regions, which can be affec… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

  35. arXiv:2209.15560  [pdf, ps, other

    cs.LG cs.NE

    Designing and Training of Lightweight Neural Networks on Edge Devices using Early Halting in Knowledge Distillation

    Authors: Rahul Mishra, Hari Prabhat Gupta

    Abstract: Automated feature extraction capability and significant performance of Deep Neural Networks (DNN) make them suitable for Internet of Things (IoT) applications. However, deploying DNN on edge devices becomes prohibitive due to the colossal computation, energy, and storage requirements. This paper presents a novel approach for designing and training lightweight DNN using large-size DNN. The approach… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 figures, 11 tables

  36. arXiv:2209.01417  [pdf, ps, other

    cs.LG

    Suppressing Noise from Built Environment Datasets to Reduce Communication Rounds for Convergence of Federated Learning

    Authors: Rahul Mishra, Hari Prabhat Gupta, Tanima Dutta, Sajal K. Das

    Abstract: Smart sensing provides an easier and convenient data-driven mechanism for monitoring and control in the built environment. Data generated in the built environment are privacy sensitive and limited. Federated learning is an emerging paradigm that provides privacy-preserving collaboration among multiple participants for model training without sharing private and limited data. The noisy labels in the… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 11 pages, 5 figures

  37. arXiv:2209.01338  [pdf, ps, other

    cs.LG

    FedAR+: A Federated Learning Approach to Appliance Recognition with Mislabeled Data in Residential Buildings

    Authors: Ashish Gupta, Hari Prabhat Gupta, Sajal K. Das

    Abstract: With the enhancement of people's living standards and rapid growth of communication technologies, residential environments are becoming smart and well-connected, increasing overall energy consumption substantially. As household appliances are the primary energy consumers, their recognition becomes crucial to avoid unattended usage, thereby conserving energy and making smart environments more susta… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: 11 pages, 9 figures, 4 tables

  38. arXiv:2207.04996  [pdf, other

    quant-ph cs.IT

    Construction of non-CSS quantum codes using measurements on cluster states

    Authors: Swayangprabha Shaw, Harsh Gupta, Shahid Mehraj Shah, Ankur Raina

    Abstract: The Measurement-based quantum computation provides an alternate model for quantum computation compared to the well-known gate-based model. It uses qubits prepared in a specific entangled state followed by single-qubit measurements. The stabilizers of cluster states are well defined because of their graph structure. We exploit this graph structure extensively to design non-CSS codes using measureme… ▽ More

    Submitted 24 January, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 7 Pages, 4 Figures, Two Algorithms, One table

  39. Knowledge Distillation via Weighted Ensemble of Teaching Assistants

    Authors: Durga Prasad Ganta, Himel Das Gupta, Victor S. Sheng

    Abstract: Knowledge distillation in machine learning is the process of transferring knowledge from a large model called the teacher to a smaller model called the student. Knowledge distillation is one of the techniques to compress the large network (teacher) to a smaller network (student) that can be deployed in small devices such as mobile phones. When the network size gap between the teacher and student i… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:1902.03393 by other authors

  40. arXiv:2206.10028  [pdf, other

    cs.RO cs.AI

    Intention-Aware Navigation in Crowds with Extended-Space POMDP Planning

    Authors: Himanshu Gupta, Bradley Hayes, Zachary Sunberg

    Abstract: This paper presents a hybrid online Partially Observable Markov Decision Process (POMDP) planning system that addresses the problem of autonomous navigation in the presence of multi-modal uncertainty introduced by other agents in the environment. As a particular example, we consider the problem of autonomous navigation in dense crowds of pedestrians and among obstacles. Popular approaches to this… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  41. arXiv:2206.07198  [pdf, other

    cs.CV

    Surgical Phase Recognition in Laparoscopic Cholecystectomy

    Authors: Yunfan Li, Vinayak Shenoy, Prateek Prasanna, I. V. Ramakrishnan, Haibin Ling, Himanshu Gupta

    Abstract: Automatic recognition of surgical phases in surgical videos is a fundamental task in surgical workflow analysis. In this report, we propose a Transformer-based method that utilizes calibrated confidence scores for a 2-stage inference pipeline, which dynamically switches between a baseline model and a separately trained transition model depending on the calibrated confidence level. Our method outpe… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  42. arXiv:2206.06437  [pdf, other

    cs.ET quant-ph

    Distribution of Quantum Circuits Over General Quantum Networks

    Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Near-term quantum computers can hold only a small number of qubits. One way to facilitate large-scale quantum computations is through a distributed network of quantum computers. In this work, we consider the problem of distributing quantum programs represented as quantum circuits across a quantum network of heterogeneous quantum computers, in a way that minimizes the overall communication cost req… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

  43. arXiv:2205.15951  [pdf, other

    cs.CL cs.CY cs.LG

    Hollywood Identity Bias Dataset: A Context Oriented Bias Analysis of Movie Dialogues

    Authors: Sandhya Singh, Prapti Roy, Nihar Sahoo, Niteesh Mallela, Himanshu Gupta, Pushpak Bhattacharyya, Milind Savagaonkar, Nidhi, Roshni Ramnani, Anutosh Maitra, Shubhashis Sengupta

    Abstract: Movies reflect society and also hold power to transform opinions. Social biases and stereotypes present in movies can cause extensive damage due to their reach. These biases are not always found to be the need of storyline but can creep in as the author's bias. Movie production houses would prefer to ascertain that the bias present in a script is the story's demand. Today, when deep learning model… ▽ More

    Submitted 1 June, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

  44. Pre-Distribution of Entanglements in Quantum Networks

    Authors: Mohammad Ghaderibaneh, Himanshu Gupta, C. R. Ramakrishnan, Ertai Luo

    Abstract: Quantum network communication is challenging, as the No-Cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the low probability… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 11 pages, 9 figures

  45. arXiv:2203.02585  [pdf, other

    cs.NI

    NFSlicer: Data Movement Optimization for Shallow Network Functions

    Authors: Anirudh Sarma, Hamed Seyedroudbari, Harshit Gupta, Umakishore Ramachandran, Alexandros Daglis

    Abstract: Network Function (NF) deployments on commodity servers have become ubiquitous in datacenters and enterprise settings. Many commonly used NFs such as firewalls, load balancers and NATs are shallow - i.e., they only examine the packet's header, despite the entire packet being transferred on and off the server. As a result, the gap between moved and inspected data when handling large packets exceeds… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 13 pages, 16 figures

  46. arXiv:2202.09256  [pdf, other

    cs.NI

    Traffic-Aware Dynamic Functional Split for 5G Cloud Radio Access Networks

    Authors: Himank Gupta, Antony Franklin A, Mayank Kumar, Bheemarjuna Reddy Tamma

    Abstract: The recent adaption of virtualization technologies in the next generation mobile network enables 5G base station to be segregated into a Radio Unit (RU), a Distributed Unit (DU), and a Central Unit (CU) to support Cloud based Radio Access Networks (C-RAN). RU and DU are connected through a fronthaul link. In contrast, CU and DU are connected through a midhaul link. Although virtualization of CU gi… ▽ More

    Submitted 16 May, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

  47. arXiv:2201.13145  [pdf, other

    stat.ML cs.LG

    Assessment of DeepONet for reliability analysis of stochastic nonlinear dynamical systems

    Authors: Shailesh Garg, Harshit Gupta, Souvik Chakraborty

    Abstract: Time dependent reliability analysis and uncertainty quantification of structural system subjected to stochastic forcing function is a challenging endeavour as it necessitates considerable computational time. We investigate the efficacy of recently proposed DeepONet in solving time dependent reliability analysis and uncertainty quantification of systems subjected to stochastic loading. Unlike conve… ▽ More

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: 21 pages

  48. DeepAlloc: CNN-Based Approach to Efficient Spectrum Allocation in Shared Spectrum Systems

    Authors: Mohammad Ghaderibaneh, Caitao Zhan, Himanshu Gupta

    Abstract: Shared spectrum systems facilitate spectrum allocation to unlicensed users without harming the licensed users; they offer great promise in optimizing spectrum utility, but their management (in particular, efficient spectrum allocation to unlicensed users) is challenging. A significant shortcoming of current allocation methods is that they are either done very conservatively to ensure correctness,… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: 15 pages, 16 figures

  49. DeepMTL Pro: Deep Learning Based MultipleTransmitter Localization and Power Estimation

    Authors: Caitao Zhan, Mohammad Ghaderibaneh, Pranjal Sahu, Himanshu Gupta

    Abstract: In this paper, we address the problem of Multiple Transmitter Localization (MTL). MTL is to determine the locations of potential multiple transmitters in a field, based on readings from a distributed set of sensors. In contrast to the widely studied single transmitter localization problem, the MTL problem has only been studied recently in a few works. MTL is of great significance in many applicati… ▽ More

    Submitted 22 March, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: 38 pages, 27 figures. This is the final revision verison of a journal paper submitted to Pervasive and Mobile Computing (PMC). This is an extension of an accepted paper at IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM 2021)

  50. Efficient Quantum Network Communication using Optimized Entanglement-Swapping Trees

    Authors: Mohammad Ghaderibaneh, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Quantum network communication is challenging, as the No-cloning theorem in quantum regime makes many classical techniques inapplicable. For long-distance communication, the only viable communication approach is teleportation of quantum states, which requires a prior distribution of entangled pairs (EPs) of qubits. Establishment of EPs across remote nodes can incur significant latency due to the lo… ▽ More

    Submitted 4 April, 2024; v1 submitted 21 December, 2021; originally announced December 2021.