Showing 1–50 of 471 results for author: Gupta, V

  1. arXiv:2408.05937  [pdf, other]

    astro-ph.HE astro-ph.CO

    The impact of the FREDDA dedispersion algorithm on $H_0$ estimations with FRBs

    Authors: Jordan Hoffmann, Clancy W. James, Hao Qiu, Marcin Glowacki, Keith W. Bannister, Vivek Gupta, Jason X. Prochaska, Apurba Bera, Adam T. Deller, Kelly Gourdji, Lachlan Marnoch, Stuart D. Ryder, Danica R. Scott, Ryan M. Shannon, Nicolas Tejos

    Abstract: Fast radio bursts (FRBs) are transient radio signals of extragalactic origins that are subjected to propagation effects such as dispersion and scattering. It follows then that these signals hold information regarding the medium they have traversed and are hence useful as cosmological probes of the Universe. Recently, FRBs were used to make an independent measure of the Hubble Constant $H_0$, promi…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 8 pages, 6 figures, Published in MNRAS

  2. arXiv:2408.00582  [pdf, other]

    hep-ex physics.ins-det

    First Measurement of the Total Inelastic Cross-Section of Positively-Charged Kaons on Argon at Energies Between 5.0 and 7.5 GeV

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, C. Andreopoulos, M. Andreotti, et al. (1341 additional authors not shown)

    Abstract: ProtoDUNE Single-Phase (ProtoDUNE-SP) is a 770-ton liquid argon time projection chamber that operated in a hadron test beam at the CERN Neutrino Platform in 2018. We present a measurement of the total inelastic cross section of charged kaons on argon as a function of kaon energy using 6 and 7 GeV/$c$ beam momentum settings. The flux-weighted average of the extracted inelastic cross section at each…

    Submitted 1 August, 2024; originally announced August 2024.

    Report number: CERN-EP-2024-211, FERMILAB-PUB-24-0216-V

  3. arXiv:2407.21783  [pdf, other]

    cs.AI cs.CL cs.CV

    The Llama 3 Herd of Models

    Authors: Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, Arun Rao, Aston Zhang, Aurelien Rodriguez, Austen Gregerson, Ava Spataru, Baptiste Roziere, Bethany Biron, Binh Tang, et al. (510 additional authors not shown)

    Abstract: Modern artificial intelligence (AI) systems are powered by foundation models. This paper presents a new set of foundation models, called Llama 3. It is a herd of language models that natively support multilinguality, coding, reasoning, and tool usage. Our largest model is a dense Transformer with 405B parameters and a context window of up to 128K tokens. This paper presents an extensive empirical…

    Submitted 15 August, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

  4. arXiv:2407.16030  [pdf, other]

    cs.CL cs.AI cs.DB cs.LG

    Enhancing Temporal Understanding in LLMs for Semi-structured Tables

    Authors: Irwin Deng, Kushagra Dixit, Vivek Gupta, Dan Roth

    Abstract: Temporal reasoning over tabular data presents substantial challenges for large language models (LLMs), as evidenced by recent research. In this study, we conduct a comprehensive analysis of temporal datasets to pinpoint the specific limitations of LLMs. Our investigation leads to enhancements in TempTabQA, a dataset specifically designed for tabular temporal question answering. We provide critical…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Total Pages 18, Total Tables 6, Total figures 7

  5. arXiv:2407.15452  [pdf, other]

    cs.LG cs.DC cs.SI

    GraphScale: A Framework to Enable Machine Learning over Billion-node Graphs

    Authors: Vipul Gupta, Xin Chen, Ruoyun Huang, Fanlong Meng, Jianjun Chen, Yujun Yan

    Abstract: Graph Neural Networks (GNNs) have emerged as powerful tools for supervised machine learning over graph-structured data, while sampling-based node representation learning is widely utilized in unsupervised learning. However, scalability remains a major challenge in both supervised and unsupervised learning for large graphs (e.g., those with over 1 billion nodes). The scalability bottleneck largely…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Published in the Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024), 8 Pages, 12 Figures

    Journal ref: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024), October 21-25, 2024, Boise, ID, USA

  6. arXiv:2407.14933  [pdf, other]

    cs.CL cs.AI cs.LG

    Consent in Crisis: The Rapid Decline of the AI Data Commons

    Authors: Shayne Longpre, Robert Mahari, Ariel Lee, Campbell Lund, Hamidah Oderinwale, William Brannon, Nayan Saxena, Naana Obeng-Marnu, Tobin South, Cole Hunter, Kevin Klyman, Christopher Klamm, Hailey Schoelkopf, Nikhil Singh, Manuel Cherep, Ahmad Anis, An Dinh, Caroline Chitongo, Da Yin, Damien Sileo, Deividas Mataciunas, Diganta Misra, Emad Alghamdi, Enrico Shippole, Jianguo Zhang, et al. (24 additional authors not shown)

    Abstract: General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first, large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training corpora. Our audit of 14,000 web domains provides an expansive view of crawlable web data and how co…

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

    Comments: 41 pages (13 main), 5 figures, 9 tables

  7. arXiv:2407.11229  [pdf, other]

    cs.CL cs.AI cs.CV cs.HC cs.LG

    Unraveling the Truth: Do LLMs really Understand Charts? A Deep Dive into Consistency and Robustness

    Authors: Srija Mukhopadhyay, Adnan Qidwai, Aparna Garimella, Pritika Ramu, Vivek Gupta, Dan Roth

    Abstract: Chart question answering (CQA) is a crucial area of Visual Language Understanding. However, the robustness and consistency of current Visual Language Models (VLMs) in this field remain under-explored. This paper evaluates state-of-the-art VLMs on comprehensive datasets, developed specifically for this study, encompassing diverse question categories and chart formats. We investigate two key aspects…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 22 pages, 7 Tables, 3 Figures, 25 examples

  8. arXiv:2407.11014  [pdf, other]

    cs.CL cs.AI cs.MA

    Geode: A Zero-shot Geospatial Question-Answering Agent with Explicit Reasoning and Precise Spatio-Temporal Retrieval

    Authors: Devashish Vikas Gupta, Azeez Syed Ali Ishaqui, Divya Kiran Kadiyala

    Abstract: Large language models (LLMs) have shown promising results in learning and contextualizing information from different forms of data. Recent advancements in foundational models, particularly those employing self-attention mechanisms, have significantly enhanced our ability to comprehend the semantics of diverse data types. One such area that could highly benefit from multi-modality is in understandi…

    Submitted 26 June, 2024; originally announced July 2024.

  9. arXiv:2407.10380  [pdf, other]

    cs.CV cs.AI cs.CL cs.IR

    NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models

    Authors: Pranshu Pandya, Agney S Talwarr, Vatsal Gupta, Tushar Kataria, Vivek Gupta, Dan Roth

    Abstract: Cognitive textual and visual reasoning tasks, such as puzzles, series, and analogies, demand the ability to quickly reason, decipher, and evaluate patterns both textually and spatially. While LLMs and VLMs, through extensive training on large amounts of human-curated data, have attained a high level of pseudo-human intelligence in some common sense reasoning tasks, they still struggle with more co…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures, 5 tables

  10. arXiv:2407.10339  [pdf, other]

    hep-ex astro-ph.HE astro-ph.IM astro-ph.SR nucl-ex physics.ins-det

    Supernova Pointing Capabilities of DUNE

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, et al. (1340 additional authors not shown)

    Abstract: The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electr…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 25 pages, 16 figures

    Report number: FERMILAB-PUB-24-0319-LBNF

  11. arXiv:2407.10088  [pdf, other]

    physics.flu-dyn

    Predictability of weakly turbulent systems from spatially sparse observations using data assimilation and machine learning

    Authors: Vikrant Gupta, Yuanqing Chen, Minping Wan

    Abstract: We apply two data assimilation (DA) methods, a smoother and a filter, and a model-free machine learning (ML) shallow network to forecast two weakly turbulent systems. We analyse the effect of the spatial sparsity of observations on accuracy of the predictions obtained from these data-driven methods. Based on the results, we divide the spatial sparsity levels in three zones. First is the good-predi…

    Submitted 14 July, 2024; originally announced July 2024.

  12. arXiv:2407.08221  [pdf, other]

    cs.CV

    GAURA: Generalizable Approach for Unified Restoration and Rendering of Arbitrary Views

    Authors: Vinayak Gupta, Rongali Simhachala Venkata Girish, Mukund Varma T, Ayush Tewari, Kaushik Mitra

    Abstract: Neural rendering methods can achieve near-photorealistic image synthesis of scenes from posed input images. However, when the images are imperfect, e.g., captured in very low-light conditions, state-of-the-art methods fail to reconstruct high-quality 3D scenes. Recent approaches have tried to address this limitation by modeling various degradation processes in the image formation model; however, t…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: European Conference on Computer Vision (ECCV) 2024

  13. arXiv:2407.05952  [pdf, other]

    cs.DB cs.AI cs.CL cs.LG

    H-STAR: LLM-driven Hybrid SQL-Text Adaptive Reasoning on Tables

    Authors: Nikhil Abhyankar, Vivek Gupta, Dan Roth, Chandan K. Reddy

    Abstract: Tabular reasoning involves interpreting unstructured queries against structured tables, requiring a synthesis of textual understanding and symbolic reasoning. Existing methods rely on either of the approaches and are constrained by their respective limitations. Textual reasoning excels in semantic interpretation unlike symbolic reasoning (SQL logic), but falls short in mathematical reasoning where…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: 13 pages, 14 tables, 9 figures

  14. arXiv:2406.19470  [pdf, other]

    cs.CL

    Changing Answer Order Can Decrease MMLU Accuracy

    Authors: Vipul Gupta, David Pantoja, Candace Ross, Adina Williams, Megan Ung

    Abstract: As large language models (LLMs) have grown in prevalence, particular benchmarks have become essential for the evaluation of these models and for understanding model capabilities. Most commonly, we use test accuracy averaged across multiple subtasks in order to rank models on leaderboards, to determine which model is best for our purposes. In this paper, we investigate the robustness of the accurac…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Short paper, 9 pages

  15. arXiv:2406.19237  [pdf, other]

    cs.CL cs.CV cs.IR cs.LG

    FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts

    Authors: Shubhankar Singh, Purvi Chaurasia, Yerram Varun, Pranshu Pandya, Vatsal Gupta, Vivek Gupta, Dan Roth

    Abstract: Existing benchmarks for visual question answering lack in visual grounding and complexity, particularly in evaluating spatial reasoning skills. We introduce FlowVQA, a novel benchmark aimed at assessing the capabilities of visual question-answering multimodal language models in reasoning with flowcharts as visual contexts. FlowVQA comprises 2,272 carefully generated and human-verified flowchart im…

    Submitted 28 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted in ACL 2024 (Findings), 21 pages, 7 figures, 9 Tables

  16. arXiv:2406.18128  [pdf, other]

    physics.flu-dyn math-ph

    Stokes' paradox in rarefied gases: A perspective through the method of fundamental solutions

    Authors: Himanshi, Anirudh Singh Rana, Vinay Kumar Gupta

    Abstract: In the realm of fluid dynamics, a curious and counterintuitive phenomenon is Stokes' paradox. While Stokes equations -- used for modeling slow and steady flows -- lead to a meaningful solution to the problem of slow and steady flow past a sphere, they fail to yield a non-trivial solution to the problem of slow and steady flow past an infinitely long cylinder (a two-dimensional problem essentially)…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 28 Pages, 16 figures

    MSC Class: 76P05; 35E05

  17. arXiv:2406.16964  [pdf, other]

    cs.LG cs.AI

    Are Language Models Actually Useful for Time Series Forecasting?

    Authors: Mingtian Tan, Mike A. Merrill, Vinayak Gupta, Tim Althoff, Thomas Hartvigsen

    Abstract: Large language models (LLMs) are being applied to time series tasks, particularly time series forecasting. However, are language models actually useful for time series? After a series of ablation studies on three recent and popular LLM-based time series forecasting methods, we find that removing the LLM component or replacing it with a basic attention layer does not degrade the forecasting results…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 25 pages, 8 figures and 20 tables

  18. arXiv:2406.16253  [pdf, other]

    cs.CL

    LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

    Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, et al. (15 additional authors not shown)

    Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th…

    Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  19. arXiv:2406.10889  [pdf, other]

    cs.CV cs.AI cs.LG

    VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

    Authors: Darshana Saravanan, Darshan Singh, Varun Gupta, Zeeshan Khan, Vineet Gandhi, Makarand Tapaswi

    Abstract: Compositionality is a fundamental aspect of vision-language understanding and is especially required for videos since they contain multiple entities (e.g. persons, actions, and scenes) interacting dynamically over time. Existing benchmarks focus primarily on perception capabilities. However, they do not study binding, the ability of a model to associate entities through appropriate relationships.…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 26 pages, 17 figures, 3 tables

  20. arXiv:2406.10085  [pdf, other]

    cs.CL

    Enhancing Question Answering on Charts Through Effective Pre-training Tasks

    Authors: Ashim Gupta, Vivek Gupta, Shuo Zhang, Yujie He, Ning Zhang, Shalin Shah

    Abstract: To completely understand a document, the use of textual information is not enough. Understanding visual cues, such as layouts and charts, is also required. While the current state-of-the-art approaches for document understanding (both OCR-based and OCR-free) work well, a thorough analysis of their capabilities and limitations has not yet been performed. Therefore, in this work, we addresses the li…

    Submitted 14 June, 2024; originally announced June 2024.

  21. arXiv:2406.00968  [pdf, other]

    cs.RO cs.HC

    Evaluating MEDIRL: A Replication and Ablation Study of Maximum Entropy Deep Inverse Reinforcement Learning for Human Social Navigation

    Authors: Vinay Gupta, Nihal Gunukula

    Abstract: In this study, we enhance the Maximum Entropy Deep Inverse Reinforcement Learning (MEDIRL) framework, targeting its application in human robot interaction (HRI) for modeling pedestrian behavior in crowded environments. Our work is grounded in the pioneering research by Fahad, Chen, and Guo, and aims to elevate MEDIRL's efficacy in real world HRI settings. We replicated the original MEDIRL model an…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: 14 pages, 13 figures

  22. arXiv:2405.19772  [pdf, other]

    math.FA

    New Exponential operators connected with a^2+x^2: a generalization of Post-Widder and Ismail May

    Authors: Vijay Gupta, Anjali

    Abstract: The present study offers a general exponential operator connected with a^2+x^2; for positive real "a". We estimate the asymptotic formula for simultaneous and ordinary approximation of the constructed operator. In the last section, we graphically interpret the created operator's convergence to two periodic functions "x sin(x)" and "-x/2*cos(pi*x)". We also consider the limiting case a tends to 0;…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 12 pages, 14 figures

    MSC Class: 41A25 and 41A35

  23. arXiv:2405.16752  [pdf, other]

    cs.LG cs.AI

    Model Ensembling for Constrained Optimization

    Authors: Ira Globus-Harris, Varun Gupta, Michael Kearns, Aaron Roth

    Abstract: There is a long history in machine learning of model ensembling, beginning with boosting and bagging and continuing to the present day. Much of this history has focused on combining models for classification and regression, but recently there is interest in more complex settings such as ensembling policies in reinforcement learning. Strong connections have also emerged between ensembling and multi…

    Submitted 26 May, 2024; originally announced May 2024.

  24. arXiv:2405.15046  [pdf, other]

    math.CO cs.DM

    On the minimum spectral radius of connected graphs of given order and size

    Authors: Sebastian M. Cioabă, Vishal Gupta, Celso Marques

    Abstract: In this paper, we study a question of Hong from 1993 related to the minimum spectral radii of the adjacency matrices of connected graphs of given order and size. Hong asked if it is true that among all connected graphs of given number of vertices $n$ and number of edges $e$, the graphs having minimum spectral radius (the minimizer graphs) must be almost regular, meaning that the difference between…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 19 pages, 6 figures

    MSC Class: 05C50; 15A18

  25. Fair Evaluation of Federated Learning Algorithms for Automated Breast Density Classification: The Results of the 2022 ACR-NCI-NVIDIA Federated Learning Challenge

    Authors: Kendall Schmidt, Benjamin Bearce, Ken Chang, Laura Coombs, Keyvan Farahani, Marawan Elbatele, Kaouther Mouhebe, Robert Marti, Ruipeng Zhang, Yao Zhang, Yanfeng Wang, Yaojun Hu, Haochao Ying, Yuyang Xu, Conrad Testagrose, Mutlu Demirer, Vikash Gupta, Ünal Akünal, Markus Bujotzek, Klaus H. Maier-Hein, Yi Qin, Xiaomeng Li, Jayashree Kalpathy-Cramer, Holger R. Roth

    Abstract: The correct interpretation of breast density is important in the assessment of breast cancer risk. AI has been shown capable of accurately predicting breast density, however, due to the differences in imaging characteristics across mammography systems, models built using data from one system do not generalize well to other systems. Though federated learning (FL) has emerged as a way to improve the…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 figures

    Journal ref: Medical Image Analysis Volume 95, July 2024, 103206

  26. arXiv:2405.12403  [pdf, other]

    astro-ph.HE astro-ph.SR

    Searching for gravitational wave optical counterparts with the Zwicky Transient Facility: summary of O4a

    Authors: Tomás Ahumada, Shreya Anand, Michael W. Coughlin, Vaidehi Gupta, Mansi M. Kasliwal, Viraj R. Karambelkar, Robert D. Stein, Gaurav Waratkar, Vishwajeet Swain, Theophile Jegou du Laz, Akash Anumarlapudi, Igor Andreoni, Mattia Bulla, Gokul P. Srinivasaragavan, Andrew Toivonen, Avery Wold, Eric C. Bellm, S. Bradley Cenko, David L. Kaplan, Jesper Sollerman, Varun Bhalerao, Daniel Perley, Anirudh Salgundi, Aswin Suresh, K-Ryan Hinds, et al. (27 additional authors not shown)

    Abstract: During the first half of the fourth observing run (O4a) of the International Gravitational Wave Network (IGWN), the Zwicky Transient Facility (ZTF) conducted a systematic search for kilonova (KN) counterparts to binary neutron star (BNS) and neutron star-black hole (NSBH) merger candidates. Here, we present a comprehensive study of the five high-significance (FAR < 1 per year) BNS and NSBH candida…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: submitted

  27. arXiv:2405.07439  [pdf, other]

    astro-ph.IM astro-ph.HE

    A Fast Radio Burst monitor with a Compact All-Sky Phased Array (CASPA)

    Authors: R. Luo, R. D. Ekers, G. Hobbs, A. Dunning, C. W. James, M. E. Lower, V. Gupta, A. Zic, M. Sokolowski, C. Phillips, A. T. Deller, L. Staveley-Smith

    Abstract: Fast Radio Bursts (FRBs) are short-duration radio transients that occur at random times in host galaxies distributed all over the sky. Large field of view instruments can play a critical role in the blind search for rare FRBs. We present a concept for an all-sky FRB monitor using a compact all-sky phased array (CASPA), which can efficiently achieve an extremely large field of view of $\sim10^4$ sq…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Submitted to PASA, comments welcome

  28. arXiv:2405.00908  [pdf]

    cs.CV cs.AI cs.LG

    Transformer-Based Self-Supervised Learning for Histopathological Classification of Ischemic Stroke Clot Origin

    Authors: K. Yeh, M. S. Jabal, V. Gupta, D. F. Kallmes, W. Brinjikji, B. S. Erdal

    Abstract: Background and Purpose: Identifying the thromboembolism source in ischemic stroke is crucial for treatment and secondary prevention yet is often undetermined. This study describes a self-supervised deep learning approach in digital pathology of emboli for classifying ischemic stroke clot origin from histopathological images. Methods: The dataset included whole slide images (WSI) from the STRIP AI…

    Submitted 1 May, 2024; originally announced May 2024.

  29. arXiv:2404.15546  [pdf, ps, other]

    math.CO math.OC

    Modular Forms in Combinatorial Optimization

    Authors: Varsha Gupta

    Abstract: Combinatorial optimization problems, such as the Asymmetric Traveling Salesman Problem (ATSP), find applications across various domains including logistics, genome sequencing, and robotics. Despite their extensive applications, there have not been significant advancements in deriving optimal solutions for these problems. The lack of theoretical understanding owing to the complex structure of these…

    Submitted 23 April, 2024; originally announced April 2024.

  30. arXiv:2404.15540  [pdf, ps, other]

    physics.flu-dyn

    A Unified Framework for Total Variation Regularized Optimization in Fluid Dynamics and Related Physical Systems

    Authors: Varsha Gupta

    Abstract: An optimization framework is presented for minimizing the energy functional developed around a generalized equation governing physical systems such as fluid dynamics, particle transport, phase transition, and other related systems. The convexity of the energy functional is investigated to derive the necessary conditions for a smooth and global optimum solution. Furthermore, the Total Variation (TV…

    Submitted 23 April, 2024; originally announced April 2024.

  31. arXiv:2404.11757  [pdf, other]

    cs.CL

    Language Models Still Struggle to Zero-shot Reason about Time Series

    Authors: Mike A. Merrill, Mingtian Tan, Vinayak Gupta, Tom Hartvigsen, Tim Althoff

    Abstract: Time series are critical for decision-making in fields like finance and healthcare. Their importance has driven a recent influx of works passing time series into language models, leading to non-trivial forecasting on some datasets. But it remains unknown whether non-trivial forecasting implies that language models can reason about time series. To address this gap, we generate a first-of-its-kind e…

    Submitted 17 April, 2024; originally announced April 2024.

  32. Improvement in Semantic Address Matching using Natural Language Processing

    Authors: Vansh Gupta, Mohit Gupta, Jai Garg, Nitesh Garg

    Abstract: Address matching is an important task for many businesses especially delivery and take out companies which help them to take out a certain address from their data warehouse. Existing solution uses similarity of strings, and edit distance algorithms to find out the similar addresses from the address database, but these algorithms could not work effectively with redundant, unstructured, or incomplet…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 5 pages, 7 tables, 2021 2nd International Conference for Emerging Technology (INCET)

    Journal ref: 2021 2nd International Conference for Emerging Technology (INCET), Belagavi, India, 2021, pp. 1-5

  33. Designing an Intelligent Parcel Management System using IoT & Machine Learning

    Authors: Mohit Gupta, Nitesh Garg, Jai Garg, Vansh Gupta, Devraj Gautam

    Abstract: Parcels delivery is a critical activity in railways. More importantly, each parcel must be thoroughly checked and sorted according to its destination address. We require an efficient and robust IoT system capable of doing all of these tasks with great precision and minimal human interaction. This paper discusses, We created a fully-fledged solution using IoT and machine learning to assist trains i…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 6 pages, 6 figures, 2022 IEEE IAS Global Conference on Emerging Technologies (GlobConET)

    Journal ref: 2022 IEEE IAS Global Conference on Emerging Technologies (GlobConET), Arad, Romania, 2022, pp. 751-756

  34. arXiv:2404.07461  [pdf, other]

    cs.CL cs.AI

    "Confidently Nonsensical?": A Critical Survey on the Perspectives and Challenges of 'Hallucinations' in NLP

    Authors: Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson

    Abstract: We investigate how hallucination in large language models (LLM) is characterized in peer-reviewed literature using a critical examination of 103 publications across NLP research. Through a comprehensive review of sociological and technological literature, we identify a lack of agreement with the term `hallucination.' Additionally, we conduct a survey with 171 practitioners from the field of NLP an…

    Submitted 10 April, 2024; originally announced April 2024.

  35. arXiv:2404.06959  [pdf, ps, other]

    math.OA math.FA

    Regular inclusions of simple unital $C^*$-algebras

    Authors: Keshab Chandra Bakshi, Ved Prakash Gupta

    Abstract: We prove that an inclusion $\mathcal{B} \subset \mathcal{A}$ of simple unital $C^*$-algebras with a finite-index conditional expectation is regular if and only if there exists a finite group $G$ that admits a cocycle action $(α,σ)$ on the intermediate $C^*$-subalgebra $\mathcal{C}$ generated by $\mathcal{B}$ and its centralizer $\mathcal{C}_\mathcal{A}(\mathcal{B})$ such that $\mathcal{B}$ is oute…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 16 pages

  36. arXiv:2404.06751  [pdf, other]

    cs.CY

    Leveraging open-source models for legal language modeling and analysis: a case study on the Indian constitution

    Authors: Vikhyath Gupta, Srinivasa Rao P

    Abstract: In recent years, the use of open-source models has gained immense popularity in various fields, including legal language modelling and analysis. These models have proven to be highly effective in tasks such as summarizing legal documents, extracting key information, and even predicting case outcomes. This has revolutionized the legal industry, enabling lawyers, researchers, and policymakers to qui…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures

  37. arXiv:2403.05967  [pdf, ps, other]

    math.OA

    On various notions of distance between subalgebras of operator algebras

    Authors: Ved Prakash Gupta, Sumit Kumar

    Abstract: Given any irreducible inclusion $\mB \subset \mA$ of unital $C^*$-algebras with a finite-index conditional expectation $E: \mA \to \mB$, we show that the set of $E$-compatible intermediate $C^*$-subalgebras is finite, thereby generalizing a finiteness result of Ino and Watatani (from \cite{IW}). A finiteness result for a certain collection of intermediate $C^*$-subalgebras of a non-irreducible inc…

    Submitted 9 March, 2024; originally announced March 2024.

  38. arXiv:2403.04007  [pdf, other]

    cs.LG math.OC

    Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

    Authors: Wesley A. Suttle, Vipul K. Sharma, Krishna C. Kosaraju, S. Sivaranjani, Ji Liu, Vijay Gupta, Brian M. Sadler

    Abstract: We develop provably safe and convergent reinforcement learning (RL) algorithms for control of nonlinear dynamical systems, bridging the gap between the hard safety guarantees of control theory and the convergence guarantees of RL theory. Recent advances at the intersection of control and RL follow a two-stage, safety filter approach to enforcing hard safety constraints: model-free RL is used to le…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 20 pages, 7 figures

  39. arXiv:2403.03212  [pdf, other]

    physics.ins-det hep-ex

    Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

    Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmi…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 47 pages, 41 figures

    Report number: FERMILAB-PUB-24-0073-LBNF

  40. arXiv:2403.01151  [pdf, ps, other]

    math.CO math.DG

    A Ricci flow on graphs from effective resistance

    Authors: Aleyah Dawkins, Vishal Gupta, Mark Kempton, William Linz, Jeremy Quail, Harry Richman, Zachary Stier

    Abstract: In this paper, we introduce a new notion of curvature on the edges of a graph that is defined in terms of effective resistances. We call this the Ricci–Foster curvature. We study the Ricci flow resulting from this curvature. We prove the existence of solutions to Ricci flow on short time intervals, and prove that Ricci flow preserves graphs with nonnegative (resp. positive) curvature.

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: 14 pages, 8 figures, comments welcome!

    MSC Class: 05C10; 53E20; 05C22; 53A70; 53C21; 94C15

  41. arXiv:2403.01037  [pdf, other]

    math.CO

    Node resistance curvature in Cartesian graph products

    Authors: Aleyah Dawkins, Vishal Gupta, Mark Kempton, William Linz, Jeremy Quail, Harry Richman, Zachary Stier

    Abstract: Devriendt and Lambiotte recently introduced the node resistance curvature, a notion of graph curvature based on the effective resistance matrix. In this paper, we begin the study of the behavior of the node resistance curvature under the operation of the Cartesian graph product. We study the natural question of global positivity of node resistance curvature of the Cartesian product of posit…

    Submitted 1 March, 2024; originally announced March 2024.

    MSC Class: 05C99; 05C81

  42. arXiv:2402.17108  [pdf, ps, other]

    cs.GT cs.DS cs.LG

    Repeated Contracting with Multiple Non-Myopic Agents: Policy Regret and Limited Liability

    Authors: Natalie Collina, Varun Gupta, Aaron Roth

    Abstract: We study a repeated contracting setting in which a Principal adaptively chooses amongst $k$ Agents at each of $T$ rounds. The Agents are non-myopic, and so a mechanism for the Principal induces a $T$-round extensive form game amongst the Agents. We give several results aimed at understanding an under-explored aspect of contract theory -- the game induced when choosing an Agent to contract with. Fi…

    Submitted 26 February, 2024; originally announced February 2024.

  43. arXiv:2402.11755  [pdf, other]

    cs.LG cs.CL cs.CR cs.PL

    SPML: A DSL for Defending Language Models Against Prompt Attacks

    Authors: Reshabh K Sharma, Vinayak Gupta, Dan Grossman

    Abstract: Large language models (LLMs) have profoundly transformed natural language applications, with a growing reliance on instruction-based definitions for designing chatbots. However, post-deployment the chatbot definitions are fixed and are vulnerable to attacks by malicious users, emphasizing the need to prevent unethical applications and financial losses. Existing studies explore user prompts' impact…

    Submitted 18 February, 2024; originally announced February 2024.

  44. arXiv:2402.11194  [pdf, other]

    cs.CL

    Evaluating LLMs' Mathematical Reasoning in Financial Document Question Answering

    Authors: Pragya Srivastava, Manuj Malik, Vivek Gupta, Tanuja Ganu, Dan Roth

    Abstract: Large Language Models (LLMs) excel in natural language understanding, but their capability for complex mathematical reasoning with an amalgamation of structured tables and unstructured text is uncertain. This study explores LLMs' mathematical reasoning on four financial tabular question-answering datasets: TATQA, FinQA, ConvFinQA, and Multihiertt. Through extensive experiments with various models…

    Submitted 29 February, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: 25 pages, 17 figures

  45. arXiv:2402.09658  [pdf]

    eess.IV cs.CV

    Towards Precision Cardiovascular Analysis in Zebrafish: The ZACAF Paradigm

    Authors: Amir Mohammad Naderi, Jennifer G. Casey, Mao-Hsiang Huang, Rachelle Victorio, David Y. Chiang, Calum MacRae, Hung Cao, Vandana A. Gupta

    Abstract: Quantifying cardiovascular parameters like ejection fraction in zebrafish as a host of biological investigations has been extensively studied. Since current manual monitoring techniques are time-consuming and fallible, several image processing frameworks have been proposed to automate the process. Most of these works rely on supervised deep-learning architectures. However, supervised methods tend…

    Submitted 14 February, 2024; originally announced February 2024.

  46. arXiv:2402.08747  [pdf, other]

    cs.GT eess.SY

    Rationality of Learning Algorithms in Repeated Normal-Form Games

    Authors: Shivam Bajaj, Pranoy Das, Yevgeniy Vorobeychik, Vijay Gupta

    Abstract: Many learning algorithms are known to converge to an equilibrium for specific classes of games if the same learning algorithm is adopted by all agents. However, when the agents are self-interested, a natural question is whether agents have a strong incentive to adopt an alternative learning algorithm that yields them greater individual utility. We capture such incentives as an algorithm's rational…

    Submitted 13 February, 2024; originally announced February 2024.

  47. arXiv:2402.04632  [pdf, other]

    cs.CV cs.GR

    GSN: Generalisable Segmentation in Neural Radiance Field

    Authors: Vinayak Gupta, Rahul Goel, Sirikonda Dhawal, P. J. Narayanan

    Abstract: Traditional Radiance Field (RF) representations capture details of a specific scene and must be trained afresh on each scene. Semantic feature fields have been added to RFs to facilitate several segmentation tasks. Generalised RF representations learn the principles of view interpolation. A generalised RF can render new views of an unknown and untrained scene, given a few views. We present a way t…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted at the Main Technical Track of AAAI 2024

  48. arXiv:2402.04146  [pdf, other]

    stat.ML cs.LG

    Interpretable Multi-Source Data Fusion Through Latent Variable Gaussian Process

    Authors: Sandipp Krishnan Ravi, Yigitcan Comlek, Wei Chen, Arjun Pathak, Vipul Gupta, Rajnikant Umretiya, Andrew Hoffman, Ghanshyam Pilania, Piyush Pandita, Sayan Ghosh, Nathaniel Mckeever, Liping Wang

    Abstract: With the advent of artificial intelligence (AI) and machine learning (ML), various science and engineering communities have leveraged data-driven surrogates to model complex systems from numerous sources of information (data). This proliferation has led to a significant reduction in the cost and time involved in developing superior systems designed to perform specific functionalities. A high…

    Submitted 15 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 27 pages, 10 figures, 3 supplementary figures, 2 supplementary tables

  49. arXiv:2402.03256  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Decision-Focused Learning with Directional Gradients

    Authors: Michael Huang, Vishal Gupta

    Abstract: We propose a novel family of decision-aware surrogate losses, called Perturbation Gradient (PG) losses, for the predict-then-optimize framework. The key idea is to connect the expected downstream decision loss with the directional derivative of a particular plug-in objective, and then approximate this derivative using zeroth order gradient techniques. Unlike the original decision loss which is typ…

    Submitted 23 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  50. arXiv:2402.02589  [pdf, other]

    stat.AP

    Prospective Prediction of Body Mass Index Trajectories using Multi-task Gaussian Processes

    Authors: Arthur Leroy, Varsha Gupta, Mya Thway Tint, Delicia Ooi Shu Qin, Keith M. Godfrey, Fabian Yap, Leck Ngee, Yung Seng Lee, Johan G. Eriksson, Navin Michael, Mauricio A. Alvarez, Dennis Wang

    Abstract: Clinicians often investigate the body mass index (BMI) trajectories of children to assess their growth with respect to their peers, as well as to anticipate future growth and disease risk. While retrospective modelling of BMI trajectories has been an active area of research, prospective prediction of continuous BMI trajectories from historical growth data has not been well investigated. Using weig…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 17 pages, 9 figures, 5 tables