Showing 1–50 of 688 results for author: Gupta, S

Searching in archive cs.
  1. arXiv:2406.19299  [pdf, other]

    cs.CV

    PNeRV: A Polynomial Neural Representation for Videos

    Authors: Sonam Gupta, Snehal Singh Tomar, Grigorios G Chrysos, Sukhendu Das, A. N. Rajagopalan

    Abstract: Extracting Implicit Neural Representations (INRs) on video data poses unique challenges due to the additional temporal dimension. In the context of videos, INRs have predominantly relied on a frame-only parameterization, which sacrifices the spatiotemporal continuity observed in pixel-level (spatial) representations. To mitigate this, we introduce Polynomial Neural Representation for Videos (PNeRV…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 25 pages, 17 figures, published at TMLR, Feb 2024

  2. arXiv:2406.19102  [pdf, other]

    cs.CL cs.AI cs.IR

    Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs

    Authors: Lokesh Mishra, Sohayl Dhibi, Yusik Kim, Cesar Berrospi Ramis, Shubham Gupta, Michele Dolfi, Peter Staar

    Abstract: Environment, Social, and Governance (ESG) KPIs assess an organization's performance on issues such as climate change, greenhouse gas emissions, water consumption, waste management, human rights, diversity, and policies. ESG reports convey this valuable quantitative information through tables. Unfortunately, extracting this information is difficult due to high variability in the table structure as…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted at the NLP4Climate workshop in the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024)

  3. arXiv:2406.17713  [pdf, other]

    cs.NE

    Multi-objective Binary Differential Approach with Parameter Tuning for Discovering Business Process Models: MoD-ProM

    Authors: Sonia Deshmukh, Shikha Gupta, Naveen Kumar

    Abstract: Process discovery approaches analyze the business data to automatically uncover structured information, known as a process model. The quality of a process model is measured using quality dimensions -- completeness (replay fitness), preciseness, simplicity, and generalization. Traditional process discovery algorithms usually output a single process model. A single model may not accurately capture t…

    Submitted 25 June, 2024; originally announced June 2024.

  4. arXiv:2406.15958  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Bone Fracture Classification using Transfer Learning

    Authors: Shyam Gupta, Dhanisha Sharma

    Abstract: The manual examination of X-ray images for fractures is a time-consuming process that is prone to human error. In this work, we introduce a robust yet simple training loop for the classification of fractures, which significantly outperforms existing methods. Our method achieves superior performance in less than ten epochs and utilizes the latest dataset to deliver the best-performing model for thi…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: code is publicly available at - https://github.com/shyamgupta196/Bone-Fracture-Classification

  5. arXiv:2406.14706  [pdf]

    cs.ET cs.AR

    SWANN: Shuffling Weights in Crossbar Arrays for Enhanced DNN Accuracy in Deeply Scaled Technologies

    Authors: Jeffry Victor, Dong Eun Kim, Chunguang Wang, Kaushik Roy, Sumeet Gupta

    Abstract: Deep neural network (DNN) accelerators employing crossbar arrays capable of in-memory computing (IMC) are highly promising for neural computing platforms. However, in deeply scaled technologies, interconnect resistance severely impairs IMC robustness, leading to a drop in the system accuracy. To address this problem, we propose SWANN - a technique based on shuffling weights in crossbar arrays whic…

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2406.14398  [pdf, other]

    cs.CV

    ATAC-Net: Zoomed view works better for Anomaly Detection

    Authors: Shaurya Gupta, Neil Gautam, Anurag Malyala

    Abstract: The application of deep learning in visual anomaly detection has gained widespread popularity due to its potential use in quality control and manufacturing. Current standard methods are Unsupervised, where a clean dataset is utilised to detect deviations and flag anomalies during testing. However, incorporating a few samples when the type of anomalies is known beforehand can significantly enhance…

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2406.14330  [pdf, other]

    quant-ph cs.DS math.OC

    Promise of Graph Sparsification and Decomposition for Noise Reduction in QAOA: Analysis for Trapped-Ion Compilations

    Authors: Jai Moondra, Philip C. Lotshaw, Greg Mohler, Swati Gupta

    Abstract: We develop new approximate compilation schemes that significantly reduce the expense of compiling the Quantum Approximate Optimization Algorithm (QAOA) for solving the Max-Cut problem. Our main focus is on compilation with trapped-ion simulators using Pauli-$X$ operations and all-to-all Ising Hamiltonian $H_\text{Ising}$ evolution generated by Molmer-Sorensen or optical dipole force interactions,…

    Submitted 20 June, 2024; originally announced June 2024.

    MSC Class: 81P68

  8. arXiv:2406.11784  [pdf, other]

    cs.CL cs.AI

    MDCR: A Dataset for Multi-Document Conditional Reasoning

    Authors: Peter Baile Chen, Yi Zhang, Chunwei Liu, Sejal Gupta, Yoon Kim, Michael Cafarella

    Abstract: The same real-life questions posed to different individuals may lead to different answers based on their unique situations. For instance, whether a student is eligible for a scholarship depends on eligibility conditions, such as major or degree required. ConditionalQA was proposed to evaluate models' capability of reading a document and answering eligibility questions, considering unmentioned cond…

    Submitted 17 June, 2024; originally announced June 2024.

  9. arXiv:2406.10528  [pdf, other]

    cs.LG

    Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training

    Authors: Akul Malhotra, Sumeet Kumar Gupta

    Abstract: Improving the hardware efficiency of deep neural network (DNN) accelerators with techniques such as quantization and sparsity enhancement have shown an immense promise. However, their inference accuracy in non-ideal real-world settings (such as in the presence of hardware faults) is yet to be systematically analyzed. In this work, we investigate the impact of memory faults on activation-sparse qua…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2301.00675

  10. arXiv:2406.10422  [pdf, other]

    eess.AS cs.SD eess.SP

    Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice

    Authors: Shubham Gupta, Mirco Ravanelli, Pascal Germain, Cem Subakan

    Abstract: In this paper, we propose Phoneme Discretized Saliency Maps (PDSM), a discretization algorithm for saliency maps that takes advantage of phoneme boundaries for explainable detection of AI-generated voice. We experimentally show with two different Text-to-Speech systems (i.e., Tacotron2 and Fastspeech2) that the proposed algorithm produces saliency maps that result in more faithful explanations com…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  11. arXiv:2406.10090  [pdf, other]

    cs.LG

    Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis

    Authors: Zhang Chen, Luca Demetrio, Srishti Gupta, Xiaoyi Feng, Zhaoqiang Xia, Antonio Emanuele Cinà, Maura Pintor, Luca Oneto, Ambra Demontis, Battista Biggio, Fabio Roli

    Abstract: Thanks to their extensive capacity, over-parameterized neural networks exhibit superior predictive capabilities and generalization. However, having a large parameter space is considered one of the main suspects of the neural networks' vulnerability to adversarial example -- input samples crafted ad-hoc to induce a desired misclassification. Relevant literature has claimed contradictory remarks in…

    Submitted 14 June, 2024; originally announced June 2024.

    MSC Class: 68T10

    ACM Class: I.5

  12. arXiv:2406.09208  [pdf, other]

    cs.AR

    Python-based DSL for generating Verilog model of Synchronous Digital Circuits

    Authors: Mandar Datar, Dhruva S. Hegde, Vendra Durga Prasad, Manish Prajapati, Neralla Manikanta, Devansh Gupta, Janampalli Pavanija, Pratyush Pare, Akash, Shivam Gupta, Sachin B. Patkar

    Abstract: We have designed a Python-based Domain Specific Language (DSL) for modeling synchronous digital circuits. In this DSL, hardware is modeled as a collection of transactions -- running in series, parallel, and loops. When the model is executed by a Python interpreter, synthesizable and behavioural Verilog is generated as output, which can be integrated with other RTL designs or directly used for FPGA…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 9 pages, 13 figures

  13. arXiv:2406.06608  [pdf, other]

    cs.CL cs.AI

    The Prompt Report: A Systematic Survey of Prompting Techniques

    Authors: Sander Schulhoff, Michael Ilie, Nishant Balepur, Konstantine Kahadze, Amanda Liu, Chenglei Si, Yinheng Li, Aayush Gupta, HyoJung Han, Sevien Schulhoff, Pranav Sandeep Dulepet, Saurav Vidyadhara, Dayeon Ki, Sweta Agrawal, Chau Pham, Gerson Kroiz, Feileen Li, Hudson Tao, Ashay Srivastava, Hevander Da Costa, Saloni Gupta, Megan L. Rogers, Inna Goncearenco, Giuseppe Sarli, Igor Galynker, et al. (6 additional authors not shown)

    Abstract: Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a p…

    Submitted 16 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  14. arXiv:2406.00924  [pdf, ps, other]

    cs.LG cs.DS math.ST stat.ML

    Faster Diffusion-based Sampling with Randomized Midpoints: Sequential and Parallel

    Authors: Shivam Gupta, Linda Cai, Sitan Chen

    Abstract: In recent years, there has been a surge of interest in proving discretization bounds for diffusion models. These works show that for essentially any data distribution, one can approximately sample in polynomial time given a sufficiently accurate estimate of its score functions at different noise levels. In this work, we propose a new discretization scheme for diffusion models inspired by Shen and…

    Submitted 2 June, 2024; originally announced June 2024.

  15. arXiv:2405.18193  [pdf, other]

    cs.LG cs.CV

    In-Context Symmetries: Self-Supervised Learning through Contextual World Models

    Authors: Sharut Gupta, Chenyu Wang, Yifei Wang, Tommi Jaakkola, Stefanie Jegelka

    Abstract: At the core of self-supervised learning for vision is the idea of learning invariant or equivariant representations with respect to a set of data transformations. This approach, however, introduces strong inductive biases, which can render the representations fragile in downstream tasks that do not conform to these symmetries. In this work, drawing insights from world models, we propose to instead…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 32 pages, 24 tables and 11 figures

  16. arXiv:2405.17469  [pdf, other]

    cs.LG cs.AI cs.CY cs.PF

    A Dataset for Research on Water Sustainability

    Authors: Pranjol Sen Gupta, Md Rajib Hossen, Pengfei Li, Shaolei Ren, Mohammad A. Islam

    Abstract: Freshwater scarcity is a global problem that requires collective efforts across all industry sectors. Nevertheless, a lack of access to operational water footprint data bars many applications from exploring optimization opportunities hidden within the temporal and spatial variations. To break this barrier into research in water sustainability, we build a dataset for operation direct water usage in…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by ACM e-Energy 2024

  17. arXiv:2405.15372  [pdf, other]

    cs.DS cs.GT

    When far is better: The Chamberlin-Courant approach to obnoxious committee selection

    Authors: Sushmita Gupta, Tanmay Inamdar, Pallavi Jain, Daniel Lokshtanov, Fahad Panolan, Saket Saurabh

    Abstract: Classical work on metric space based committee selection problem interprets distance as ``near is better''. In this work, motivated by real-life situations, we interpret distance as ``far is better''. Formally stated, we initiate the study of ``obnoxious'' committee scoring rules when the voters' preferences are expressed via a metric space. To this end, we propose a model where large distances im…

    Submitted 24 May, 2024; originally announced May 2024.

  18. arXiv:2405.15254  [pdf, other]

    stat.ML cs.AI cs.LG

    Novel Kernel Models and Exact Representor Theory for Neural Networks Beyond the Over-Parameterized Regime

    Authors: Alistair Shilton, Sunil Gupta, Santu Rana, Svetha Venkatesh

    Abstract: This paper presents two models of neural-networks and their training applicable to neural networks of arbitrary width, depth and topology, assuming only finite-energy neural activations; and a novel representor theory for neural networks in terms of a matrix-valued kernel. The first model is exact (un-approximated) and global, casting the neural network as an elements in a reproducing kernel Banac…

    Submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.15227  [pdf, other]

    cs.RO

    Neural Elevation Models for Terrain Mapping and Path Planning

    Authors: Adam Dai, Shubh Gupta, Grace Gao

    Abstract: This work introduces Neural Elevations Models (NEMos), which adapt Neural Radiance Fields to a 2.5D continuous and differentiable terrain model. In contrast to traditional terrain representations such as digital elevation models, NEMos can be readily generated from imagery, a low-cost data source, and provide a lightweight representation of terrain through an implicit continuous and differentiable…

    Submitted 24 May, 2024; originally announced May 2024.

  20. arXiv:2405.11458  [pdf, other]

    cs.AI eess.SY

    CPS-LLM: Large Language Model based Safe Usage Plan Generator for Human-in-the-Loop Human-in-the-Plant Cyber-Physical System

    Authors: Ayan Banerjee, Aranyak Maity, Payal Kamboj, Sandeep K. S. Gupta

    Abstract: We explore the usage of large language models (LLM) in human-in-the-loop human-in-the-plant cyber-physical systems (CPS) to translate a high-level prompt into a personalized plan of actions, and subsequently convert that plan into a grounded inference of sequential decision-making automated by a real-world CPS controller to achieve a control goal. We show that it is relatively straightforward to c…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted for publication in AAAI 2024, Planning for Cyber Physical Systems

  21. arXiv:2405.06049  [pdf, other]

    cs.CV cs.CR cs.LG

    BB-Patch: BlackBox Adversarial Patch-Attack using Zeroth-Order Optimization

    Authors: Satyadwyoom Kumar, Saurabh Gupta, Arun Balaji Buduru

    Abstract: Deep Learning has become popular due to its vast applications in almost all domains. However, models trained using deep learning are prone to failure for adversarial samples and carry a considerable risk in sensitive applications. Most of these adversarial attack strategies assume that the adversary has access to the training data, the model parameters, and the input during deployment, hence, focu…

    Submitted 9 May, 2024; originally announced May 2024.

  22. arXiv:2405.05736  [pdf, other]

    cs.LG cs.IR

    Optimal Baseline Corrections for Off-Policy Contextual Bandits

    Authors: Shashank Gupta, Olivier Jeunen, Harrie Oosterhuis, Maarten de Rijke

    Abstract: The off-policy learning paradigm allows for recommender systems and general ranking applications to be framed as decision-making problems, where we aim to learn decision policies that optimize an unbiased offline estimate of an online reward metric. With unbiasedness comes potentially high variance, and prevalent methods exist to reduce estimation variance. These methods typically make use of cont…

    Submitted 9 May, 2024; originally announced May 2024.

  23. arXiv:2405.00970  [pdf, other]

    cs.CL cs.AI cs.HC

    How Can I Get It Right? Using GPT to Rephrase Incorrect Trainee Responses

    Authors: Jionghao Lin, Zifei Han, Danielle R. Thomas, Ashish Gurung, Shivang Gupta, Vincent Aleven, Kenneth R. Koedinger

    Abstract: One-on-one tutoring is widely acknowledged as an effective instructional method, conditioned on qualified tutors. However, the high demand for qualified tutors remains a challenge, often necessitating the training of novice tutors (i.e., trainees) to ensure effective tutoring. Research suggests that providing timely explanatory feedback can facilitate the training process for trainees. However, it…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: International Journal of Artificial Intelligence in Education

  24. arXiv:2405.00577  [pdf]

    cs.LG eess.SP q-bio.NC

    Discovering robust biomarkers of neurological disorders from functional MRI using graph neural networks: A Review

    Authors: Yi Hao Chan, Deepank Girish, Sukrit Gupta, Jing Xia, Chockalingam Kasi, Yinan He, Conghao Wang, Jagath C. Rajapakse

    Abstract: Graph neural networks (GNN) have emerged as a popular tool for modelling functional magnetic resonance imaging (fMRI) datasets. Many recent studies have reported significant improvements in disorder classification performance via more sophisticated GNN designs and highlighted salient features that could be potential biomarkers of the disorder. In this review, we provide an overview of how GNN and…

    Submitted 1 May, 2024; originally announced May 2024.

  25. arXiv:2405.00554  [pdf, other]

    cs.IR

    A First Look at Selection Bias in Preference Elicitation for Recommendation

    Authors: Shashank Gupta, Harrie Oosterhuis, Maarten de Rijke

    Abstract: Preference elicitation explicitly asks users what kind of recommendations they would like to receive. It is a popular technique for conversational recommender systems to deal with cold-starts. Previous work has studied selection bias in implicit feedback, e.g., clicks, and in some forms of explicit feedback, i.e., ratings on items. Despite the fact that the extreme sparsity of preference elicitati…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: Accepted at the CONSEQUENCES'23 workshop at RecSys '23

  26. arXiv:2404.19281  [pdf, other]

    cs.RO

    Audio-Visual Traffic Light State Detection for Urban Robots

    Authors: Sagar Gupta, Akansel Cosgun

    Abstract: We present a multimodal traffic light state detection using vision and sound, from the viewpoint of a quadruped robot navigating in urban settings. This is a challenging problem because of the visual occlusions and noise from robot locomotion. Our method combines features from raw audio with the ratios of red and green pixels within bounding boxes, identified by established vision-based detectors.…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Submitted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024

  27. arXiv:2404.18400  [pdf, other]

    cs.LG cs.AI cs.CL cs.NE

    LLM-SR: Scientific Equation Discovery via Programming with Large Language Models

    Authors: Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy

    Abstract: Mathematical equations have been unreasonably effective in describing complex natural phenomena across various scientific disciplines. However, discovering such insightful equations from data presents significant challenges due to the necessity of navigating extremely high-dimensional combinatorial and nonlinear hypothesis spaces. Traditional methods of equation discovery, commonly known as symbol…

    Submitted 2 June, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

  28. arXiv:2404.17607  [pdf, other]

    cs.IR cs.AI cs.CL cs.LG cs.SI

    Utilizing Large Language Models to Identify Reddit Users Considering Vaping Cessation for Digital Interventions

    Authors: Sai Krishna Revanth Vuruma, Dezhi Wu, Saborny Sen Gupta, Lucas Aust, Valerie Lookingbill, Caleb Henry, Yang Ren, Erin Kasson, Li-Shiun Chen, Patricia Cavazos-Rehg, Dian Hu, Ming Huang

    Abstract: The widespread adoption of social media platforms globally not only enhances users' connectivity and communication but also emerges as a vital channel for the dissemination of health-related information, thereby establishing social media data as an invaluable organic data resource for public health research. The surge in popularity of vaping or e-cigarette use in the United States and other countr…

    Submitted 25 April, 2024; originally announced April 2024.

  29. arXiv:2404.16687  [pdf, other]

    cs.CV

    NTIRE 2024 Quality Assessment of AI-Generated Content Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Guangtao Zhai, Chunyi Li, Tengchuan Kou, Wei Sun, Haoning Wu, Yixuan Gao, Yuqin Cao, Zicheng Zhang, Xiele Wu, Radu Timofte, Fei Peng, Huiyuan Fu, Anlong Ming, Chuanming Wang, Huadong Ma, Shuai He, Zifei Dou, Shu Chen, Huacong Zhang, Haiyi Xie, Chengwei Wang, Baoying Chen, Jishen Zeng, et al. (89 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2024 Quality Assessment of AI-Generated Content Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2024. This challenge is to address a major challenge in the field of image and video processing, namely, Image Quality Assessment (IQA) and Video Quality Assessment (VQA) for AI-Generated Conte…

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  30. arXiv:2404.15549  [pdf, other]

    cs.CL cs.AI

    PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models

    Authors: Shashi Kant Gupta, Aditya Basu, Mauro Nievas, Jerrin Thomas, Nathan Wolfrath, Adhitya Ramamurthi, Bradley Taylor, Anai N. Kothari, Regina Schwind, Therica M. Miller, Sorena Nadaf-Rahrov, Yanshan Wang, Hrituraj Singh

    Abstract: Clinical trial matching is the task of identifying trials for which patients may be potentially eligible. Typically, this task is labor-intensive and requires detailed verification of patient electronic health records (EHRs) against the stringent inclusion and exclusion criteria of clinical trials. This process is manual, time-intensive, and challenging to scale up, resulting in many patients miss…

    Submitted 26 April, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 Pages, 8 Figures, Supplementary Work Attached

  31. arXiv:2404.13191  [pdf, other]

    cs.RO

    Action Contextualization: Adaptive Task Planning and Action Tuning using Large Language Models

    Authors: Sthithpragya Gupta, Kunpeng Yao, Loïc Niederhauser, Aude Billard

    Abstract: Large Language Models (LLMs) present a promising frontier in robotic task planning by leveraging extensive human knowledge. Nevertheless, the current literature often overlooks the critical aspects of adaptability and error correction within robotic systems. This work aims to overcome this limitation by enabling robots to modify their motion strategies and select the most suitable task plans based…

    Submitted 19 April, 2024; originally announced April 2024.

  32. arXiv:2404.11752  [pdf]

    cs.CL cs.CY

    Mapping Violence: Developing an Extensive Framework to Build a Bangla Sectarian Expression Dataset from Social Media Interactions

    Authors: Nazia Tasnim, Sujan Sen Gupta, Md. Istiak Hossain Shihab, Fatiha Islam Juee, Arunima Tahsin, Pritom Ghum, Kanij Fatema, Marshia Haque, Wasema Farzana, Prionti Nasir, Ashique KhudaBukhsh, Farig Sadeque, Asif Sushmit

    Abstract: Communal violence in online forums has become extremely prevalent in South Asia, where many communities of different cultures coexist and share resources. These societies exhibit a phenomenon characterized by strong bonds within their own groups and animosity towards others, leading to conflicts that frequently escalate into violent confrontations. To address this issue, we have developed the firs…

    Submitted 17 April, 2024; originally announced April 2024.

  33. arXiv:2404.09067  [pdf, other]

    cs.CV cs.AI

    Exploring Explainability in Video Action Recognition

    Authors: Avinab Saha, Shashank Gupta, Sravan Kumar Ankireddy, Karl Chahine, Joydeep Ghosh

    Abstract: Image Classification and Video Action Recognition are perhaps the two most foundational tasks in computer vision. Consequently, explaining the inner workings of trained deep neural networks is of prime importance. While numerous efforts focus on explaining the decisions of trained deep neural networks in image classification, exploration in the domain of its temporal version, video action recognit…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: 6 pages, 10 figures, Accepted to the 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024

  34. arXiv:2404.07308  [pdf, other]

    cs.LG

    Spatial Transfer Learning for Estimating PM2.5 in Data-poor Regions

    Authors: Shrey Gupta, Yongbee Park, Jianzhao Bi, Suyash Gupta, Andreas Züfle, Avani Wildani, Yang Liu

    Abstract: Air pollution, especially particulate matter 2.5 (PM2.5), is a pressing concern for public health and is difficult to estimate in developing countries (data-poor regions) due to a lack of ground sensors. Transfer learning models can be leveraged to solve this problem, as they use alternate data sources to gain knowledge (i.e., data from data-rich regions). However, current transfer learning method…

    Submitted 22 June, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted for publication at ECML-PKDD 2024

  35. arXiv:2404.06680  [pdf, other]

    cs.CL

    Onco-Retriever: Generative Classifier for Retrieval of EHR Records in Oncology

    Authors: Shashi Kant Gupta, Aditya Basu, Bradley Taylor, Anai Kothari, Hrituraj Singh

    Abstract: Retrieving information from EHR systems is essential for answering specific questions about patient journeys and improving the delivery of clinical care. Despite this fact, most EHR systems still rely on keyword-based searches. With the advent of generative large language models (LLMs), retrieving information can lead to better search and summarization capabilities. Such retrievers can also feed R…

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 18 pages

  36. arXiv:2403.20145  [pdf, other]

    cs.CL

    Fine-tuning Large Language Models for Automated Diagnostic Screening Summaries

    Authors: Manjeet Yadav, Nilesh Kumar Sahu, Mudita Chaturvedi, Snehil Gupta, Haroon R Lone

    Abstract: Improving mental health support in developing countries is a pressing need. One potential solution is the development of scalable, automated systems to conduct diagnostic screenings, which could help alleviate the burden on mental health professionals. In this work, we evaluate several state-of-the-art Large Language Models (LLMs), with and without fine-tuning, on our custom dataset for generating…

    Submitted 4 April, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  37. arXiv:2403.16428  [pdf, other]

    cs.CV

    Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

    Authors: Zicong Fan, Takehiko Ohkawa, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Liu Zheng, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao

    Abstract: We interact with the world with our hands and see it through our own (egocentric) perspective. A holistic 3D understanding of such interactions from egocentric views is important for tasks in robotics, AR/VR, action recognition and motion generation. Accurately reconstructing such interactions in 3D is challenging due to heavy occlusion, viewpoint bias, camera distortion, and motion blur from the…

    Submitted 25 March, 2024; originally announced March 2024.

  38. arXiv:2403.16336  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Predictive Inference in Multi-environment Scenarios

    Authors: John C. Duchi, Suyash Gupta, Kuanhao Jiang, Pragya Sur

    Abstract: We address the challenge of constructing valid confidence intervals and sets in problems of prediction across multiple environments. We investigate two types of coverage suitable for these problems, extending the jackknife and split-conformal methods to show how to obtain distribution-free coverage in such non-traditional, hierarchical data-generating scenarios. Our contributions also include exte…

    Submitted 24 March, 2024; originally announced March 2024.

  39. arXiv:2403.10948  [pdf, other]

    cs.RO

    Real-to-Sim Adaptation via High-Fidelity Simulation to Control a Wheeled-Humanoid Robot with Unknown Dynamics

    Authors: Donghoon Baek, Youngwoo Sim, Amartya Purushottam, Saurabh Gupta, Joao Ramos

    Abstract: Model-based controllers using a linearized model around the system's equilibrium point is a common approach in the control of a wheeled humanoid due to their less computational load and ease of stability analysis. However, controlling a wheeled humanoid robot while it lifts an unknown object presents significant challenges, primarily due to the lack of knowledge in object dynamics. This paper pres…

    Submitted 16 March, 2024; originally announced March 2024.

  40. arXiv:2403.10270  [pdf, other]

    math.FA cs.DM math-ph math.SP

    Discrete functional inequalities on lattice graphs

    Authors: Shubham Gupta

    Abstract: In this thesis, we study problems at the interface of analysis and discrete mathematics. We discuss analogues of well known Hardy-type inequalities and Rearrangement inequalities on the lattice graphs $\mathbb{Z}^d$, with a particular focus on behaviour of sharp constants and optimizers. In the first half of the thesis, we analyse Hardy inequalities on $\mathbb{Z}^d$, first for $d=1$ and then for…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: PhD thesis, Imperial College London, 140 pages

  41. arXiv:2403.08297  [pdf]

    physics.optics cs.HC

    Semi-Transparent Image Sensors for Eye-Tracking Applications

    Authors: Gabriel Mercier, Emre O. Polat, Shengtai Shi, Shuchi Gupta, Gerasimos Konstantatos, Stijn Goossens, Frank H. L. Koppens

    Abstract: Image sensors hold a pivotal role in society due to their ability to capture vast amounts of information. Traditionally, image sensors are opaque due to light absorption in both the pixels and the read-out electronics that are stacked on top of each other. Making image sensors visibly transparent would have a far-reaching impact in numerous areas such as human-computer interfaces, smart displays,…

    Submitted 13 March, 2024; originally announced March 2024.

  42. arXiv:2403.08176  [pdf]

    cs.DL cs.CY

    Sentiment-aware Enhancements of PageRank-based Citation Metric, Impact Factor, and H-index for Ranking the Authors of Scholarly Articles

    Authors: Shikha Gupta, Animesh Kumar

    Abstract: Heretofore, the only way to evaluate an author has been frequency-based citation metrics that assume citations to be of a neutral sentiment. However, considering the sentiment behind citations aids in a better understanding of the viewpoints of fellow researchers for the scholarly output of an author.

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: The paper has been accepted for publication in Computer Science journal: http://journals.agh.edu.pl/csci

  43. arXiv:2403.05931  [pdf]

    cs.CL cs.LG

    Thread Detection and Response Generation using Transformers with Prompt Optimisation

    Authors: Kevin Joshua T, Arnav Agarwal, Shriya Sanjay, Yash Sarda, John Sahaya Rani Alex, Saurav Gupta, Sushant Kumar, Vishwanath Kamath

    Abstract: Conversational systems are crucial for human-computer interaction, managing complex dialogues by identifying threads and prioritising responses. This is especially vital in multi-party conversations, where precise identification of threads and strategic response prioritisation ensure efficient dialogue management. To address these challenges an end-to-end model that identifies threads and prioriti…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures, submitted to 2024 IEEE International Conference on Signal Processing and Communications (SPCOM)

    ACM Class: I.2.7; I.2.6

  44. arXiv:2403.04265  [pdf, other]

    cs.GT cs.DS

    Conflict and Fairness in Resource Allocation

    Authors: Susobhan Bandopadhyay, Aritra Banik, Sushmita Gupta, Pallavi Jain, Abhishek Sahu, Saket Saurabh, Prafullkumar Tale

    Abstract: In the standard model of fair allocation of resources to agents, every agent has some utility for every resource, and the goal is to assign resources to agents so that the agents' welfare is maximized. Motivated by job scheduling, interest in this problem dates back to the work of Deuermeyer et al. [SIAM J. on Algebraic Discrete Methods'82]. Recent works consider the compatibility between resource…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2309.04995

  45. arXiv:2403.00584  [pdf, other]

    cs.IR cs.LG

    Generalized User Representations for Transfer Learning

    Authors: Ghazal Fazelnia, Sanket Gupta, Claire Keum, Mark Koh, Ian Anderson, Mounia Lalmas

    Abstract: We present a novel framework for user representation in large-scale recommender systems, aiming at effectively representing diverse user taste in a generalized manner. Our approach employs a two-stage methodology combining representation learning and transfer learning. The representation learning model uses an autoencoder that compresses various user features into a representation space. In the se…

    Submitted 1 March, 2024; originally announced March 2024.

  46. arXiv:2402.19462  [pdf, other]

    cond-mat.mtrl-sci cs.CL physics.app-ph

    Accelerating materials discovery for polymer solar cells: Data-driven insights enabled by natural language processing

    Authors: Pranav Shetty, Aishat Adeboye, Sonakshi Gupta, Chao Zhang, Rampi Ramprasad

    Abstract: We present a simulation of various active learning strategies for the discovery of polymer solar cell donor/acceptor pairs using data extracted from the literature spanning $\sim$20 years by a natural language processing pipeline. While data-driven methods have been well established to discover novel materials faster than Edisonian trial-and-error approaches, their benefits have not been quantifie…

    Submitted 21 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  47. arXiv:2402.17768  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning

    Authors: Xiaoyu Zhang, Matthew Chang, Pranav Kumar, Saurabh Gupta

    Abstract: A common failure mode for policies trained with imitation is compounding execution errors at test time. When the learned policy encounters states that are not present in the expert demonstrations, the policy fails, leading to degenerate behavior. The Dataset Aggregation, or DAgger approach to this problem simply collects more data to cover these failure states. However, in practice, this is often…

    Submitted 5 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by Robotics: Science and Systems (RSS) 2024. For the project website with video, see https://sites.google.com/view/diffusion-meets-dagger

  48. arXiv:2402.17767  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Opening Cabinets and Drawers in the Real World using a Commodity Mobile Manipulator

    Authors: Arjun Gupta, Michelle Zhang, Rishik Sathua, Saurabh Gupta

    Abstract: Pulling open cabinets and drawers presents many difficult technical challenges in perception (inferring articulation parameters for objects from onboard sensors), planning (producing motion plans that conform to tight task constraints), and control (making and maintaining contact while applying forces on the environment). In this work, we build an end-to-end system that enables a commodity mobile…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Project webpage: https://arjung128.github.io/opening-cabinets-and-drawers

  49. arXiv:2402.17343  [pdf, other]

    cs.LG stat.ML

    Enhanced Bayesian Optimization via Preferential Modeling of Abstract Properties

    Authors: Arun Kumar A V, Alistair Shilton, Sunil Gupta, Santu Rana, Stewart Greenhill, Svetha Venkatesh

    Abstract: Experimental (design) optimization is a key driver in designing and discovering new products and processes. Bayesian Optimization (BO) is an effective tool for optimizing expensive and black-box experimental design processes. While Bayesian optimization is a principled data-driven approach to experimental optimization, it learns everything from scratch and could greatly benefit from the expertise…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 19 Pages, 6 Figures

  50. arXiv:2402.16237  [pdf, other]

    cs.LG cs.AI

    Active Level Set Estimation for Continuous Search Space with Theoretical Guarantee

    Authors: Giang Ngo, Dang Nguyen, Dat Phan-Trong, Sunil Gupta

    Abstract: A common problem encountered in many real-world applications is level set estimation where the goal is to determine the region in the function domain where the function is above or below a given threshold. When the function is black-box and expensive to evaluate, the level sets need to be found in a minimum set of function evaluations. Existing methods often assume a discrete search space with a f…

    Submitted 25 February, 2024; originally announced February 2024.