Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 640 results for author: Kumar, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18571  [pdf, other

    cs.CV

    UltraCortex: Submillimeter Ultra-High Field 9.4 T1 Brain MR Image Collection and Manual Cortical Segmentations

    Authors: Lucas Mahler, Julius Steiglechner, Benjamin Bender, Tobias Lindig, Dana Ramadan, Jonas Bause, Florian Birk, Rahel Heule, Edyta Charyasz, Michael Erb, Vinod Jangir Kumar, Gisela E Hagberg, Pascal Martin, Gabriele Lohmann, Klaus Scheffler

    Abstract: The UltraCortex repository (https://www.ultracortex.org) houses magnetic resonance imaging data of the human brain obtained at an ultra-high field strength of 9.4 T. It contains 86 structural MR images with spatial resolutions ranging from 0.6 to 0.8 mm. Additionally, the repository includes segmentations of 12 brains into gray and white matter compartments. These segmentations have been independe… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2406.17249  [pdf, other

    cs.RO

    SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation

    Authors: Xu Liu, Jiuzhou Lei, Ankit Prabhu, Yuezhan Tao, Igor Spasojevic, Pratik Chaudhari, Nikolay Atanasov, Vijay Kumar

    Abstract: This paper develops a real-time decentralized metric-semantic Simultaneous Localization and Mapping (SLAM) approach that leverages a sparse and lightweight object-based representation to enable a heterogeneous robot team to autonomously explore 3D environments featuring indoor, urban, and forested areas without relying on GPS. We use a hierarchical metric-semantic representation of the environment… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Preliminary release

  3. arXiv:2406.10824  [pdf, other

    cs.CL

    Citation-Based Summarization of Landmark Judgments

    Authors: Purnima Bindal, Vikas Kumar, Vasudha Bhatnagar, Parikshet Sirohi, Ashwini Siwal

    Abstract: Landmark judgments are of prime importance in the Common Law System because of their exceptional jurisprudence and frequent references in other judgments. In this work, we leverage contextual references available in citing judgments to create an extractive summary of the target judgment. We evaluate the proposed algorithm on two datasets curated from the judgments of Indian Courts and find the res… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted for publication at ICON 2023

  4. arXiv:2406.09631  [pdf, other

    cs.RO

    Optimal Convex Cover as Collision-free Space Approximation for Trajectory Generation

    Authors: Yuwei Wu, Igor Spasojevic, Pratik Chaudhari, Vijay Kumar

    Abstract: We propose an online iterative algorithm to find a suitable convex cover to under-approximate the free space for autonomous navigation to delineate Safe Flight Corridors (SFC). The convex cover consists of a set of polytopes such that the union of the polytopes represents obstacle-free space, allowing us to find trajectories for robots that lie within the convex cover. In order to find the SFC tha… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.08641  [pdf, ps, other

    cs.SD cs.CL eess.AS

    ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

    Authors: Jiatong Shi, Shih-Heng Wang, William Chen, Martijn Bartelds, Vanya Bannihatti Kumar, Jinchuan Tian, Xuankai Chang, Dan Jurafsky, Karen Livescu, Hung-yi Lee, Shinji Watanabe

    Abstract: ML-SUPERB evaluates self-supervised learning (SSL) models on the tasks of language identification and automatic speech recognition (ASR). This benchmark treats the models as feature extractors and uses a single shallow downstream model, which can be fine-tuned for a downstream task. However, real-world use cases may require different configurations. This paper presents ML-SUPERB~2.0, which is a ne… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by Interspeech 2024

  6. arXiv:2406.06570  [pdf, other

    cs.CL

    Review of Computational Epigraphy

    Authors: Vishal Kumar

    Abstract: Computational Epigraphy refers to the process of extracting text from stone inscription, transliteration, interpretation, and attribution with the aid of computational methods. Traditional epigraphy methods are time consuming, and tend to damage the stone inscriptions while extracting text. Additionally, interpretation and attribution are subjective and can vary between different epigraphers. Howe… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2406.06461  [pdf, other

    cs.CL

    Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

    Authors: Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun

    Abstract: A diverse array of reasoning strategies has been proposed to elicit the capabilities of large language models. However, in this paper, we point out that traditional evaluations which focus solely on performance metrics miss a key factor: the increased effectiveness due to additional compute. By overlooking this aspect, a skewed view of strategy efficiency is often presented. This paper introduces… ▽ More

    Submitted 14 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  8. arXiv:2406.00724  [pdf, other

    cs.HC cs.RO

    Exploring Child-Robot Interaction in Individual and Group settings in India

    Authors: Gayathri Manikutty, Sai Ankith Potapragada, Devasena Pasupuleti, Mahesh S. Unnithan, Arjun Venugopal, Pranav Prabha, Arunav H., Vyshnavi Anil Kumar, Rthuraj P. R., Rao R Bhavani

    Abstract: This study evaluates the effectiveness of child-robot interactions with the HaKsh-E social robot in India, examining both individual and group interaction settings. The research centers on game-based interactions designed to teach hand hygiene to children aged 7-11. Utilizing video analysis, rubric assessments, and post-study questionnaires, the study gathered data from 36 participants. Findings i… ▽ More

    Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 6 pages, 6 figures, Accepted for presentation at ICRAS 2024 (https://www.icras.org/)

  9. arXiv:2405.18649  [pdf, other

    cs.CL cs.AI cs.SE

    Training LLMs to Better Self-Debug and Explain Code

    Authors: Nan Jiang, Xiaopeng Li, Shiqi Wang, Qiang Zhou, Soneya Binta Hossain, Baishakhi Ray, Varun Kumar, Xiaofei Ma, Anoop Deoras

    Abstract: In the domain of code generation, self-debugging is crucial. It allows LLMs to refine their generated code based on execution feedback. This is particularly important because generating correct solutions in one attempt proves challenging for complex tasks. Prior works on self-debugging mostly focus on prompting methods by providing LLMs with few-shot examples, which work poorly on small open-sourc… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  10. arXiv:2405.15123  [pdf, other

    cs.CY

    Probeable Problems for Beginner-level Programming-with-AI Contests

    Authors: Mrigank Pawagi, Viraj Kumar

    Abstract: To broaden participation, competitive programming contests may include beginner-level problems that do not require knowledge of advanced Computer Science concepts (e.g., algorithms and data structures). However, since most participants have easy access to AI code-generation tools, these problems often become trivial to solve. For beginner-friendly programming contests that do not prohibit the use… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 14 pages, 9 figures, ICER 2024

    ACM Class: D.2.1; K.4.m

  11. arXiv:2405.10391  [pdf, other

    cs.RO cs.AI eess.IV

    Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance

    Authors: Anish Bhattacharya, Nishanth Rao, Dhruv Parikh, Pratik Kunapuli, Nikolai Matni, Vijay Kumar

    Abstract: We demonstrate the capabilities of an attention-based end-to-end approach for high-speed quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional vision-based navigation via independent mapping, p… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 10 figures, 3 tables

  12. arXiv:2405.07169  [pdf, other

    cs.RO

    Challenges and Opportunities for Large-Scale Exploration with Air-Ground Teams using Semantics

    Authors: Fernando Cladera, Ian D. Miller, Zachary Ravichandran, Varun Murali, Jason Hughes, M. Ani Hsieh, C. J. Taylor, Vijay Kumar

    Abstract: One common and desirable application of robots is exploring potentially hazardous and unstructured environments. Air-ground collaboration offers a synergistic approach to addressing such exploration challenges. In this paper, we demonstrate a system for large-scale exploration using a team of aerial and ground robots. Our system uses semantics as lingua franca, and relies on fully opportunistic co… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 6 pages, 5 figres

  13. arXiv:2405.06641  [pdf, other

    cs.IT

    On Existence of Latency Optimal Uncoded Storage Schemes in Geo-Distributed Data Storage Systems

    Authors: Srivathsa Acharya, P. Vijay Kumar, Viveck R. Cadambe

    Abstract: We consider the problem of geographically distributed data storage in a network of servers (or nodes) where the nodes are connected to each other via communication links having certain round-trip times (RTTs). Each node serves a specific set of clients, where a client can request for any of the files available in the distributed system. The parent node provides the requested file if available loca… ▽ More

    Submitted 13 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  14. arXiv:2405.06621  [pdf, other

    cs.IT

    On Streaming Codes for Simultaneously Correcting Burst and Random Erasures

    Authors: Shobhit Bhatnagar, Biswadip Chakraborty, P. Vijay Kumar

    Abstract: Streaming codes are packet-level codes that recover dropped packets within a strict decoding-delay constraint. We study streaming codes over a sliding-window (SW) channel model which admits only those erasure patterns which allow either a single burst erasure of $\le b$ packets along with $\le e$ random packet erasures, or else, $\le a$ random packet erasures, in any sliding-window of $w$ time slo… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  15. arXiv:2405.06606  [pdf, other

    cs.IT

    On Streaming Codes for Burst and Random Errors

    Authors: Shobhit Bhatnagar, P. Vijay Kumar

    Abstract: Streaming codes (SCs) are packet-level codes that recover erased packets within a strict decoding-delay deadline. Streaming codes for various packet erasure channel models such as sliding-window (SW) channel models that admit random or burst erasures in any SW of a fixed length have been studied in the literature, and the optimal rate as well as rate-optimal code constructions of SCs over such cha… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  16. arXiv:2405.02770  [pdf, other

    cs.LG

    PhilHumans: Benchmarking Machine Learning for Personal Health

    Authors: Vadim Liventsev, Vivek Kumar, Allmin Pradhap Singh Susaiyah, Zixiu Wu, Ivan Rodin, Asfand Yaar, Simone Balloccu, Marharyta Beraziuk, Sebastiano Battiato, Giovanni Maria Farinella, Aki Härmä, Rim Helaoui, Milan Petkovic, Diego Reforgiato Recupero, Ehud Reiter, Daniele Riboni, Raymond Sterling

    Abstract: The use of machine learning in Healthcare has the potential to improve patient outcomes as well as broaden the reach and affordability of Healthcare. The history of other application areas indicates that strong benchmarks are essential for the development of intelligent systems. We present Personal Health Interfaces Leveraging HUman-MAchine Natural interactions (PhilHumans), a holistic suite of be… ▽ More

    Submitted 16 May, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

  17. arXiv:2404.16060  [pdf

    cs.HC physics.ed-ph physics.optics

    Pocket Schlieren: a background oriented schlieren imaging platform on a smartphone

    Authors: Diganta Rabha, Vimod Kumar, Akshay Kumar, Dinesh Saini, Manish Kumar

    Abstract: Background-oriented schlieren (BOS) is a powerful technique for flow visualization. Nevertheless, the widespread dissemination of BOS is impeded by its dependence on scientific cameras, computing hardware, and dedicated analysis software. In this work, we aim to democratize BOS by providing a smartphone based scientific tool called "Pocket Schlieren". Pocket Schlieren enables users to directly cap… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 24 pages, 6 figures, 4 Supplementary figures

  18. arXiv:2404.10830  [pdf, other

    cs.CL cs.AI cs.LG

    Fewer Truncations Improve Language Modeling

    Authors: Hantian Ding, Zijian Wang, Giovanni Paolini, Varun Kumar, Anoop Deoras, Dan Roth, Stefano Soatto

    Abstract: In large language model training, input documents are typically concatenated together and then split into sequences of equal length to avoid padding tokens. Despite its efficiency, the concatenation approach compromises data integrity -- it inevitably breaks many documents into incomplete pieces, leading to excessive truncations that hinder the model from learning to compose logically coherent and… ▽ More

    Submitted 2 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: ICML 2024

  19. arXiv:2404.07880  [pdf, other

    cs.RO

    Multi-Robot Target Tracking with Sensing and Communication Danger Zones

    Authors: Jiazhen Liu, Peihan Li, Yuwei Wu, Gaurav S. Sukhatme, Vijay Kumar, Lifeng Zhou

    Abstract: Multi-robot target tracking finds extensive applications in different scenarios, such as environmental surveillance and wildfire management, which require the robustness of the practical deployment of multi-robot systems in uncertain and dangerous environments. Traditional approaches often focus on the performance of tracking accuracy with no modeling and assumption of the environments, neglecting… ▽ More

    Submitted 20 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  20. arXiv:2404.07602  [pdf, other

    cs.CV cs.LG

    Attention based End to end network for Offline Writer Identification on Word level data

    Authors: Vineet Kumar, Suresh Sundaram

    Abstract: Writer identification due to its widespread application in various fields has gained popularity over the years. In scenarios where optimum handwriting samples are available, whether they be in the form of a single line, a sentence, or an entire page, writer identification algorithms have demonstrated noteworthy levels of accuracy. However, in scenarios where only a limited number of handwritten sa… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  21. arXiv:2404.06352  [pdf, other

    cs.CV cs.RO

    DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning

    Authors: Senthil Yogamani, David Unger, Venkatraman Narayanan, Varun Ravi Kumar

    Abstract: Semantic segmentation is an effective way to perform scene understanding. Recently, segmentation in 3D Bird's Eye View (BEV) space has become popular as its directly used by drive policy. However, there is limited work on BEV segmentation for surround-view fisheye cameras, commonly used in commercial vehicles. As this task has no real-world public dataset and existing synthetic datasets do not han… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  22. arXiv:2404.00769  [pdf, other

    cs.RO

    An Active Perception Game for Robust Autonomous Exploration

    Authors: Siming He, Yuezhan Tao, Igor Spasojevic, Vijay Kumar, Pratik Chaudhari

    Abstract: We formulate active perception for an autonomous agent that explores an unknown environment as a two-player zero-sum game: the agent aims to maximize information gained from the environment while the environment aims to minimize the information gained by the agent. In each episode, the environment reveals a set of actions with their potentially erroneous information gain. In order to select the be… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  23. arXiv:2403.17067  [pdf, other

    cs.RO

    Trajectory Optimization with Global Yaw Parameterization for Field-of-View Constrained Autonomous Flight

    Authors: Yuwei Wu, Yuezhan Tao, Igor Spasojevic, Vijay Kumar

    Abstract: Trajectory generation for quadrotors with limited field-of-view sensors has numerous applications such as aerial exploration, coverage, inspection, videography, and target tracking. Most previous works simplify the task of optimizing yaw trajectories by either aligning the heading of the robot with its velocity, or potentially restricting the feasible space of candidate trajectories by using a lim… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  24. arXiv:2403.16592  [pdf, other

    cs.CL

    TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques

    Authors: Ashok Urlana, Aditya Saibewar, Bala Mallikarjunarao Garlapati, Charaka Vinayak Kumar, Ajeet Kumar Singh, Srinivasa Rao Chalamala

    Abstract: The Large Language Models (LLMs) exhibit remarkable ability to generate fluent content across a wide spectrum of user queries. However, this capability has raised concerns regarding misinformation and personal information leakage. In this paper, we present our methods for the SemEval2024 Task8, aiming to detect machine-generated text across various domains in both mono-lingual and multi-lingual co… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: 8 pages, 1 Figure

    ACM Class: I.2.7

  25. arXiv:2403.16338  [pdf, other

    cs.CV cs.AI

    Impact of Video Compression Artifacts on Fisheye Camera Visual Perception Tasks

    Authors: Madhumitha Sakthi, Louis Kerofsky, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Autonomous driving systems require extensive data collection schemes to cover the diverse scenarios needed for building a robust and safe system. The data volumes are in the order of Exabytes and have to be stored for a long period of time (i.e., more than 10 years of the vehicle's life cycle). Lossless compression doesn't provide sufficient compression ratios, hence, lossy video compression has b… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  26. arXiv:2403.15989  [pdf, other

    cs.LG cs.AI cs.CE

    Knowledge-guided Machine Learning: Current Trends and Future Prospects

    Authors: Anuj Karpatne, Xiaowei Jia, Vipin Kumar

    Abstract: This paper presents an overview of scientific modeling and discusses the complementary strengths and weaknesses of ML methods for scientific modeling in comparison to process-based models. It also provides an introduction to the current state of research in the emerging field of scientific knowledge-guided machine learning (KGML) that aims to use both scientific knowledge and data in ML frameworks… ▽ More

    Submitted 1 May, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  27. arXiv:2403.15522  [pdf, other

    cs.CR cs.CV

    Medical Image Data Provenance for Medical Cyber-Physical System

    Authors: Vijay Kumar, Kolin Paul

    Abstract: Continuous advancements in medical technology have led to the creation of affordable mobile imaging devices suitable for telemedicine and remote monitoring. However, the rapid examination of large populations poses challenges, including the risk of fraudulent practices by healthcare professionals and social workers exchanging unverified images via mobile applications. To mitigate these risks, this… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    ACM Class: I.4.9; I.4.m; K.6.5; J.3; J.7

  28. arXiv:2403.11228  [pdf

    cs.NI

    Routing Algorithms

    Authors: Ujjwal Sinha, Vikas Kumar, Shubham Kumar Singh

    Abstract: Routing algorithms play a crucial role in the efficient transmission of data within computer networks by determining the optimal paths for packet forwarding. This paper presents a comprehensive exploration of routing algorithms, focusing on their fundamental principles, classification, challenges, recent advancements, and practical applications. Beginning with an overview of the significance of ro… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  29. arXiv:2403.10944  [pdf, other

    cs.HC cs.AI

    Human Centered AI for Indian Legal Text Analytics

    Authors: Sudipto Ghosh, Devanshu Verma, Balaji Ganesan, Purnima Bindal, Vikas Kumar, Vasudha Bhatnagar

    Abstract: Legal research is a crucial task in the practice of law. It requires intense human effort and intellectual prudence to research a legal case and prepare arguments. Recent boom in generative AI has not translated to proportionate rise in impactful legal applications, because of low trustworthiness and and the scarcity of specialized datasets for training Large Language Models (LLMs). This position… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 7 pages, 7 figures

  30. arXiv:2403.04900  [pdf, other

    eess.SY cs.RO math.OC

    Almost Global Asymptotic Trajectory Tracking for Fully-Actuated Mechanical Systems on Homogeneous Riemannian Manifolds

    Authors: Jake Welde, Vijay Kumar

    Abstract: In this work, we address the design of tracking controllers that drive a mechanical system's state asymptotically towards a reference trajectory. Motivated by aerospace and robotics applications, we consider fully-actuated systems evolving on the broad class of homogeneous spaces (encompassing all vector spaces, Lie groups, and spheres of any finite dimension). In this setting, the transitive acti… ▽ More

    Submitted 9 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Preprint. To appear in IEEE Control Systems Letters

  31. arXiv:2403.04786  [pdf, other

    cs.CR cs.CL

    Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models

    Authors: Arijit Ghosh Chowdhury, Md Mofijul Islam, Vaibhav Kumar, Faysal Hossain Shezan, Vaibhav Kumar, Vinija Jain, Aman Chadha

    Abstract: Large Language Models (LLMs) have become a cornerstone in the field of Natural Language Processing (NLP), offering transformative capabilities in understanding and generating human-like text. However, with their rising prominence, the security and vulnerability aspects of these models have garnered significant attention. This paper presents a comprehensive survey of the various forms of attacks ta… ▽ More

    Submitted 23 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  32. arXiv:2403.01481  [pdf, other

    cs.CL

    Infusing Knowledge into Large Language Models with Contextual Prompts

    Authors: Kinshuk Vasisht, Balaji Ganesan, Vikas Kumar, Vasudha Bhatnagar

    Abstract: Knowledge infusion is a promising method for enhancing Large Language Models for domain-specific NLP tasks rather than pre-training models over large data from scratch. These augmented LLMs typically depend on additional pre-training or knowledge prompts from an existing knowledge graph, which is impractical in many applications. In contrast, knowledge infusion directly from relevant documents is… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 5 pages, 1 figure, In Proceedings of ICON 2023

  33. arXiv:2403.00975  [pdf, other

    cs.LG cs.AI math.FA stat.AP

    Equipment Health Assessment: Time Series Analysis for Wind Turbine Performance

    Authors: Jana Backhus, Aniruddha Rajendra Rao, Chandrasekar Venkatraman, Abhishek Padmanabhan, A. Vinoth Kumar, Chetan Gupta

    Abstract: In this study, we leverage SCADA data from diverse wind turbines to predict power output, employing advanced time series methods, specifically Functional Neural Networks (FNN) and Long Short-Term Memory (LSTM) networks. A key innovation lies in the ensemble of FNN and LSTM models, capitalizing on their collective learning. This ensemble approach outperforms individual models, ensuring stable and a… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 19 Pages, 17 Figures, 3 Tables, Submitted at Applied Sciences (MDPI)

  34. arXiv:2402.14558  [pdf, other

    cs.CL

    LLMs with Industrial Lens: Deciphering the Challenges and Prospects -- A Survey

    Authors: Ashok Urlana, Charaka Vinayak Kumar, Ajeet Kumar Singh, Bala Mallikarjunarao Garlapati, Srinivasa Rao Chalamala, Rahul Mishra

    Abstract: Large language models (LLMs) have become the secret ingredient driving numerous industrial applications, showcasing their remarkable versatility across a diverse spectrum of tasks. From natural language processing and sentiment analysis to content generation and personalized recommendations, their unparalleled adaptability has facilitated widespread adoption across industries. This transformative… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 25 pages, 7 figures

  35. arXiv:2402.13957  [pdf

    cs.SD cs.LG eess.AS

    Advancing Audio Fingerprinting Accuracy Addressing Background Noise and Distortion Challenges

    Authors: Navin Kamuni, Sathishkumar Chintala, Naveen Kunchakuri, Jyothi Swaroop Arlagadda Narasimharaju, Venkat Kumar

    Abstract: Audio fingerprinting, exemplified by pioneers like Shazam, has transformed digital audio recognition. However, existing systems struggle with accuracy in challenging conditions, limiting broad applicability. This research proposes an AI and ML integrated audio fingerprinting algorithm to enhance accuracy. Built on the Dejavu Project's foundations, the study emphasizes real-world scenario simulatio… ▽ More

    Submitted 1 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Journal ref: 2024 IEEE 18th International Conference on Semantic Computing (ICSC), Laguna Hills, CA, USA, 2024, pp. 341-345

  36. arXiv:2402.13192  [pdf, other

    math.PR cs.PF

    Spatial Queues with Nearest Neighbour Shifts

    Authors: B. R. Vinay Kumar, Lasse Leskelä

    Abstract: In this work we study multi-server queues on a Euclidean space. Consider $N$ servers that are distributed uniformly in $[0,1]^d$. Customers (users) arrive at the servers according to independent Poisson processes of intensity $λ$. However, they probabilistically decide whether to join the queue they arrived at, or move to one of the nearest neighbours. The strategy followed by the customers affect… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: A part of this work was accepted to the conference International Teletraffic Congress (ITC 35) held between 3--5 October 2023 in Turin, Italy

    MSC Class: 60K30; 05C80

  37. arXiv:2402.12098  [pdf, other

    cs.CV cs.AI

    Towards Explainable LiDAR Point Cloud Semantic Segmentation via Gradient Based Target Localization

    Authors: Abhishek Kuriyal, Vaibhav Kumar

    Abstract: Semantic Segmentation (SS) of LiDAR point clouds is essential for many applications, such as urban planning and autonomous driving. While much progress has been made in interpreting SS predictions for images, interpreting point cloud SS predictions remains a challenge. This paper introduces pGS-CAM, a novel gradient-based method for generating saliency maps in neural network activation layers. Ins… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  38. arXiv:2402.06826  [pdf, other

    cs.CV cs.RO

    Neural Rendering based Urban Scene Reconstruction for Autonomous Driving

    Authors: Shihao Shen, Louis Kerofsky, Varun Ravi Kumar, Senthil Yogamani

    Abstract: Dense 3D reconstruction has many applications in automated driving including automated annotation validation, multimodal data augmentation, providing ground truth annotations for systems lacking LiDAR, as well as enhancing auto-labeling accuracy. LiDAR provides highly accurate but sparse depth, whereas camera images enable estimation of dense depth but noisy particularly at long ranges. In this pa… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in Electronic Imaging, Autonomous Vehicles and Machines 2024. Qualitative results are shared in https://youtu.be/EK47fYJiY3M

  39. arXiv:2401.15875  [pdf, other

    cs.CV

    Combining Satellite and Weather Data for Crop Type Mapping: An Inverse Modelling Approach

    Authors: Praveen Ravirathinam, Rahul Ghosh, Ankush Khandelwal, Xiaowei Jia, David Mulla, Vipin Kumar

    Abstract: Accurate and timely crop mapping is essential for yield estimation, insurance claims, and conservation efforts. Over the years, many successful machine learning models for crop mapping have been developed that use just the multi-spectral imagery from satellites to predict crop type over the area of interest. However, these traditional methods do not account for the physical processes that govern c… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 10 pages, SIAM International Conference on Data Mining (SDM24)

  40. arXiv:2401.15562  [pdf, other

    eess.SP cs.IT

    A Survey on Integrated Sensing and Communication with Intelligent Metasurfaces: Trends, Challenges, and Opportunities

    Authors: Ahmed Magbool, Vaibhav Kumar, Qingqing Wu, Marco Di Renzo, Mark F. Flanagan

    Abstract: The emergence of various technologies demanding both high data rates and precise sensing performance, such as autonomous vehicles and internet of things devices, has propelled an increasing popularity of integrated sensing and communication (ISAC) in recent years. ISAC offers an efficient framework for communication and sensing where both functionalities are carried out in a shared spectrum, utili… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Submitted to IEEE for possible publication

  41. arXiv:2401.08420  [pdf, other

    cs.CL

    Ask the experts: sourcing high-quality datasets for nutritional counselling through Human-AI collaboration

    Authors: Simone Balloccu, Ehud Reiter, Vivek Kumar, Diego Reforgiato Recupero, Daniele Riboni

    Abstract: Large Language Models (LLMs), with their flexible generation abilities, can be powerful data sources in domains with few or no available corpora. However, problems like hallucinations and biases limit such applications. In this case study, we pick nutrition counselling, a domain lacking any public resource, and show that high-quality datasets can be gathered by combining LLMs, crowd-workers and nu… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  42. arXiv:2401.07960  [pdf, other

    cs.CR

    ADMIn: Attacks on Dataset, Model and Input. A Threat Model for AI Based Software

    Authors: Vimal Kumar, Juliette Mayo, Khadija Bahiss

    Abstract: Machine learning (ML) and artificial intelligence (AI) techniques have now become commonplace in software products and services. When threat modelling a system, it is therefore important that we consider threats unique to ML and AI techniques, in addition to threats to our software. In this paper, we present a threat model that can be used to systematically uncover threats to AI based software. Th… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    ACM Class: K.6.5

  43. arXiv:2401.04960  [pdf, other

    cs.RO cs.LG eess.SY

    Why Change Your Controller When You Can Change Your Planner: Drag-Aware Trajectory Generation for Quadrotor Systems

    Authors: Hanli Zhang, Anusha Srikanthan, Spencer Folk, Vijay Kumar, Nikolai Matni

    Abstract: Motivated by the increasing use of quadrotors for payload delivery, we consider a joint trajectory generation and feedback control design problem for a quadrotor experiencing aerodynamic wrenches. Unmodeled aerodynamic drag forces from carried payloads can lead to catastrophic outcomes. Prior work model aerodynamic effects as residual dynamics or external disturbances in the control problem leadin… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 10 pages, 3 figures. Submitted to L4DC 2024

  44. arXiv:2401.04855  [pdf, other

    cs.RO cs.LG

    LPAC: Learnable Perception-Action-Communication Loops with Applications to Coverage Control

    Authors: Saurav Agarwal, Ramya Muthukrishnan, Walker Gosrich, Vijay Kumar, Alejandro Ribeiro

    Abstract: Coverage control is the problem of navigating a robot swarm to collaboratively monitor features or a phenomenon of interest not known a priori. The problem is challenging in decentralized settings with robots that have limited communication and sensing capabilities. We propose a learnable Perception-Action-Communication (LPAC) architecture for the problem, wherein a convolution neural network (CNN… ▽ More

    Submitted 8 February, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  45. arXiv:2312.10617  [pdf, other

    cs.CL cs.LG

    Deep dive into language traits of AI-generated Abstracts

    Authors: Vikas Kumar, Amisha Bharti, Devanshu Verma, Vasudha Bhatnagar

    Abstract: Generative language models, such as ChatGPT, have garnered attention for their ability to generate human-like writing in various fields, including academic research. The rapid proliferation of generated texts has bolstered the need for automatic identification to uphold transparency and trust in the information. However, these generated texts closely resemble human writing and often have subtle di… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted for Cods-Comad Conference

  46. GuardRails: Automated Suggestions for Clarifying Ambiguous Purpose Statements

    Authors: Mrigank Pawagi, Viraj Kumar

    Abstract: Before implementing a function, programmers are encouraged to write a purpose statement i.e., a short, natural-language explanation of what the function computes. A purpose statement may be ambiguous i.e., it may fail to specify the intended behaviour when two or more inequivalent computations are plausible on certain inputs. Our paper makes four contributions. First, we propose a novel heuristic… ▽ More

    Submitted 3 May, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Journal ref: Proceedings of the 16th Annual ACM India Compute Conference (2023) 55-60

  47. arXiv:2312.00775  [pdf, other

    cs.RO cs.CV cs.LG

    Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans

    Authors: Homanga Bharadhwaj, Abhinav Gupta, Vikash Kumar, Shubham Tulsiani

    Abstract: We pursue the goal of developing robots that can interact zero-shot with generic unseen objects via a diverse repertoire of manipulation skills and show how passive human videos can serve as a rich source of data for learning such generalist robots. Unlike typical robot learning approaches which directly learn how a robot should act from interaction data, we adopt a factorized approach that can le… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Preprint. Under Review

  48. arXiv:2311.08788  [pdf, other

    cs.CL cs.AI cs.LG

    X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects

    Authors: Minqian Liu, Ying Shen, Zhiyang Xu, Yixin Cao, Eunah Cho, Vaibhav Kumar, Reza Ghanadan, Lifu Huang

    Abstract: Natural Language Generation (NLG) typically involves evaluating the generated text in various aspects (e.g., consistency and naturalness) to obtain a comprehensive assessment. However, multi-aspect evaluation remains challenging as it may require the evaluator to generalize to any given evaluation aspect even if it's absent during training. In this paper, we introduce X-Eval, a two-stage instructi… ▽ More

    Submitted 13 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: NAACL 2024 Main Conference. 20 pages, 6 figures, 17 tables

  49. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  50. arXiv:2310.08004  [pdf, other

    cs.CC quant-ph

    On the Rational Degree of Boolean Functions and Applications

    Authors: Vishnu Iyer, Siddhartha Jain, Matt Kovacs-Deak, Vinayak M. Kumar, Luke Schaeffer, Daochen Wang, Michael Whitmeyer

    Abstract: We study a natural complexity measure of Boolean functions known as the (exact) rational degree. For total functions $f$, it is conjectured that $\mathrm{rdeg}(f)$ is polynomially related to $\mathrm{deg}(f)$, where $\mathrm{deg}(f)$ is the Fourier degree. Towards this conjecture, we show that symmetric functions have rational degree at least $\mathrm{deg}(f)/2$ and monotone functions have rationa… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 17 pages, 3 figures