Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 156 results for author: Prakash, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18090  [pdf, other

    cs.FL

    On the Minimisation of Deterministic and History-Deterministic Generalised (co)Büchi Automata

    Authors: Antonio Casares, Olivier Idir, Denis Kuperberg, Corto Mascle, Aditya Prakash

    Abstract: We present a polynomial-time algorithm minimising the number of states of history-deterministic generalised coBüchi automata, building on the work of Abu Radi and Kupferman on coBüchi automata. On the other hand, we establish that the minimisation problem for both deterministic and history-deterministic generalised Büchi automata is NP-complete, as well as the problem of minimising at the same tim… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  2. arXiv:2407.12777  [pdf, other

    cs.CV cs.GR

    Generalizable Human Gaussians for Sparse View Synthesis

    Authors: Youngjoong Kwon, Baole Fang, Yixing Lu, Haoye Dong, Cheng Zhang, Francisco Vicente Carrasco, Albert Mosella-Montoro, Jianjin Xu, Shingo Takagi, Daeil Kim, Aayush Prakash, Fernando De la Torre

    Abstract: Recent progress in neural rendering has brought forth pioneering methods, such as NeRF and Gaussian Splatting, which revolutionize view rendering across various domains like AR/VR, gaming, and content creation. While these methods excel at interpolating {\em within the training data}, the challenge of generalizing to new scenes and objects from very sparse views persists. Specifically, modeling 3D… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  3. arXiv:2407.08620  [pdf, other

    cs.FL

    History-Determinism vs Fair Simulation

    Authors: Udi Boker, Thomas A. Henzinger, Karoliina Lehtinen, Aditya Prakash

    Abstract: An automaton is history-deterministic if its nondeterminism can be resolved on the fly, only using the prefix of the word read so far. This mild form of nondeterminism has attracted particular attention for its applications in synthesis problems. An automaton $A$ is guidable with respect to a class $C$ of automata if it can fairly simulate every automaton in $C$ whose language is contained in that… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Full version of the paper accepted at CONCUR 2024

  4. arXiv:2407.02657  [pdf, other

    cs.LG stat.ME

    Large Scale Hierarchical Industrial Demand Time-Series Forecasting incorporating Sparsity

    Authors: Harshavardhan Kamarthi, Aditya B. Sasanur, Xinjie Tong, Xingyu Zhou, James Peters, Joe Czyzyk, B. Aditya Prakash

    Abstract: Hierarchical time-series forecasting (HTSF) is an important problem for many real-world business applications where the goal is to simultaneously forecast multiple time-series that are related to each other via a hierarchical relation. Recent works, however, do not address two important challenges that are typically observed in many demand forecasting applications at large companies. First, many t… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at KDD 2024

  5. arXiv:2407.02641  [pdf, other

    cs.LG cs.AI

    Learning Graph Structures and Uncertainty for Accurate and Calibrated Time-series Forecasting

    Authors: Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodriguez, Chao Zhang, B Aditya Prakash

    Abstract: Multi-variate time series forecasting is an important problem with a wide range of applications. Recent works model the relations between time-series as graphs and have shown that propagating information over the relation graph can improve time series forecasting. However, in many cases, relational information is not available or is noisy and reliable. Moreover, most works ignore the underlying un… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  6. arXiv:2406.12747  [pdf, other

    cs.LG cs.AI

    TSI-Bench: Benchmarking Time Series Imputation

    Authors: Wenjie Du, Jun Wang, Linglong Qian, Yiyuan Yang, Fanxing Liu, Zepu Wang, Zina Ibrahim, Haoxin Liu, Zhiyuan Zhao, Yingjie Zhou, Wenjia Wang, Kaize Ding, Yuxuan Liang, B. Aditya Prakash, Qingsong Wen

    Abstract: Effective imputation is a crucial preprocessing step for time series analysis. Despite the development of numerous deep learning algorithms for time series imputation, the community lacks standardized and comprehensive benchmark platforms to effectively evaluate imputation performance across different settings. Moreover, although many deep learning forecasting algorithms have demonstrated excellen… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.09130  [pdf, other

    cs.LG cs.AI

    Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning

    Authors: Haoxin Liu, Harshavardhan Kamarthi, Lingkai Kong, Zhiyuan Zhao, Chao Zhang, B. Aditya Prakash

    Abstract: Time-series forecasting (TSF) finds broad applications in real-world scenarios. Due to the dynamic nature of time-series data, it is crucial to equip TSF models with out-of-distribution (OOD) generalization abilities, as historical training data and future test data can have different distributions. In this paper, we aim to alleviate the inherent OOD problem in TSF via invariant learning. We ident… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages

    ACM Class: H.0

  8. arXiv:2406.08627  [pdf, other

    cs.LG cs.CL

    Time-MMD: A New Multi-Domain Multimodal Dataset for Time Series Analysis

    Authors: Haoxin Liu, Shangqing Xu, Zhiyuan Zhao, Lingkai Kong, Harshavardhan Kamarthi, Aditya B. Sasanur, Megha Sharma, Jiaming Cui, Qingsong Wen, Chao Zhang, B. Aditya Prakash

    Abstract: Time series data are ubiquitous across a wide range of real-world domains. While real-world time series analysis (TSA) requires human experts to integrate numerical series data with multimodal domain-specific knowledge, most existing TSA models rely solely on numerical data, overlooking the significance of information beyond numerical series. This oversight is due to the untapped potential of text… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  9. arXiv:2406.04273  [pdf, other

    cs.CV cs.AI

    ELFS: Enhancing Label-Free Coreset Selection via Clustering-based Pseudo-Labeling

    Authors: Haizhong Zheng, Elisa Tsai, Yifu Lu, Jiachen Sun, Brian R. Bartoldson, Bhavya Kailkhura, Atul Prakash

    Abstract: High-quality human-annotated data is crucial for modern deep learning pipelines, yet the human annotation process is both costly and time-consuming. Given a constrained human labeling budget, selecting an informative and representative data subset for labeling can significantly reduce human annotation effort. Well-performing state-of-the-art (SOTA) coreset selection methods require ground-truth la… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  10. arXiv:2405.14899  [pdf, other

    cs.CL cs.AI cs.LG

    DETAIL: Task DEmonsTration Attribution for Interpretable In-context Learning

    Authors: Zijian Zhou, Xiaoqiang Lin, Xinyi Xu, Alok Prakash, Daniela Rus, Bryan Kian Hsiang Low

    Abstract: In-context learning (ICL) allows transformer-based language models that are pre-trained on general text to quickly learn a specific task with a few "task demonstrations" without updating their parameters, significantly boosting their flexibility and generality. ICL possesses many distinct characteristics from conventional machine learning, thereby requiring new approaches to interpret this learnin… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  11. arXiv:2404.17530  [pdf, other

    cs.FL

    Lookahead Games and Efficient Determinisation of History-Deterministic Büchi Automata

    Authors: Rohan Acharya, Marcin Jurdziński, Aditya Prakash

    Abstract: Our main technical contribution is a polynomial-time determinisation procedure for history-deterministic Büchi automata, which settles an open question of Kuperberg and Skrzypczak, 2015. A key conceptual contribution is the lookahead game, which is a variant of Bagnol and Kuperberg's token game, in which Adam is given a fixed lookahead. We prove that the lookahead game is equivalent to the 1-token… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Full version of paper accepted at ICALP 2024

  12. From Batch to Stream: Automatic Generation of Online Algorithms

    Authors: Ziteng Wang, Shankara Pailoor, Aaryan Prakash, Yuepeng Wang, Isil Dillig

    Abstract: Online streaming algorithms, tailored for continuous data processing, offer substantial benefits but are often more intricate to design than their offline counterparts. This paper introduces a novel approach for automatically synthesizing online streaming algorithms from their offline versions. In particular, we propose a novel methodology, based on the notion of relational function signature (RFS… ▽ More

    Submitted 8 May, 2024; v1 submitted 6 April, 2024; originally announced April 2024.

    ACM Class: D.3.0

    Journal ref: Proc. ACM Program. Lang. 8, PLDI, Article 188 (June 2024), 32 pages

  13. arXiv:2403.19852  [pdf, other

    cs.LG cs.SI physics.soc-ph q-bio.PE

    A Review of Graph Neural Networks in Epidemic Modeling

    Authors: Zewen Liu, Guancheng Wan, B. Aditya Prakash, Max S. Y. Lau, Wei Jin

    Abstract: Since the onset of the COVID-19 pandemic, there has been a growing interest in studying epidemiological models. Traditional mechanistic models mathematically describe the transmission mechanisms of infectious diseases. However, they often suffer from limitations of oversimplified or fixed assumptions, which could cause sub-optimal predictive power and inefficiency in capturing complex relation inf… ▽ More

    Submitted 21 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  14. arXiv:2403.16428  [pdf, other

    cs.CV

    Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

    Authors: Zicong Fan, Takehiko Ohkawa, Linlin Yang, Nie Lin, Zhishan Zhou, Shihao Zhou, Jiajun Liang, Zhong Gao, Xuanyang Zhang, Xue Zhang, Fei Li, Zheng Liu, Feng Lu, Karim Abou Zeid, Bastian Leibe, Jeongwan On, Seungryul Baek, Aditya Prakash, Saurabh Gupta, Kun He, Yoichi Sato, Otmar Hilliges, Hyung Jin Chang, Angela Yao

    Abstract: We interact with the world with our hands and see it through our own (egocentric) perspective. A holistic 3Dunderstanding of such interactions from egocentric views is important for tasks in robotics, AR/VR, action recognition and motion generation. Accurately reconstructing such interactions in 3D is challenging due to heavy occlusion, viewpoint bias, camera distortion, and motion blur from the h… ▽ More

    Submitted 5 August, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to ECCV 2024

  15. arXiv:2403.05578  [pdf, other

    cs.HC cs.AI cs.CV cs.IR cs.LG

    Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners

    Authors: Shanu Vashishtha, Abhinav Prakash, Lalitesh Morishetti, Kaushiki Nag, Yokila Arora, Sushant Kumar, Kannan Achan

    Abstract: Text-to-image models such as stable diffusion have opened a plethora of opportunities for generating art. Recent literature has surveyed the use of text-to-image models for enhancing the work of many creative artists. Many e-commerce platforms employ a manual process to generate the banners, which is time-consuming and has limitations of scalability. In this work, we demonstrate the use of text-to… ▽ More

    Submitted 28 February, 2024; originally announced March 2024.

    Comments: 10 pages

  16. arXiv:2403.04781  [pdf

    cs.CR cs.CV cs.LG eess.IV

    Selective Encryption using Segmentation Mask with Chaotic Henon Map for Multidimensional Medical Images

    Authors: S Arut Prakash, Aditya Ganesh Kumar, Prabhu Shankar K. C., Lithicka Anandavel, Aditya Lakshmi Narayanan

    Abstract: A user-centric design and resource optimization should be at the center of any technology or innovation. The user-centric perspective gives the developer the opportunity to develop with task-based optimization. The user in the medical image field is a medical professional who analyzes the medical images and gives their diagnosis results to the patient. This scheme, having the medical professional… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  17. arXiv:2402.16132  [pdf, other

    cs.CL cs.AI

    LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting

    Authors: Haoxin Liu, Zhiyuan Zhao, Jindong Wang, Harshavardhan Kamarthi, B. Aditya Prakash

    Abstract: Time-series forecasting (TSF) finds broad applications in real-world scenarios. Prompting off-the-shelf Large Language Models (LLMs) demonstrates strong zero-shot TSF capabilities while preserving computational efficiency. However, existing prompting methods oversimplify TSF as language next-token predictions, overlooking its dynamic nature and lack of integration with state-of-the-art prompt stra… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures, 3 tables, 2 page references, 2 page appendix

  18. arXiv:2402.15911  [pdf, other

    cs.CR cs.CL

    PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails

    Authors: Neal Mangaokar, Ashish Hooda, Jihye Choi, Shreyas Chandrashekaran, Kassem Fawaz, Somesh Jha, Atul Prakash

    Abstract: Large language models (LLMs) are typically aligned to be harmless to humans. Unfortunately, recent work has shown that such models are susceptible to automated jailbreak attacks that induce them to generate harmful content. More recent LLMs often incorporate an additional layer of defense, a Guard Model, which is a second LLM that is designed to check and moderate the output response of the primar… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  19. arXiv:2402.12280  [pdf, other

    cs.CL cs.AI

    Adaptive Skeleton Graph Decoding

    Authors: Shuowei Jin, Yongji Wu, Haizhong Zheng, Qingzhao Zhang, Matthew Lentz, Z. Morley Mao, Atul Prakash, Feng Qian, Danyang Zhuo

    Abstract: Large language models (LLMs) have seen significant adoption for natural language tasks, owing their success to massive numbers of model parameters (e.g., 70B+); however, LLM inference incurs significant computation and memory costs. Recent approaches propose parallel decoding strategies, such as Skeleton-of-Thought (SoT), to improve performance by breaking prompts down into sub-problems that can b… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  20. arXiv:2402.06126  [pdf, other

    cs.CL cs.AI cs.LG

    Learn To be Efficient: Build Structured Sparsity in Large Language Models

    Authors: Haizhong Zheng, Xiaoyan Bai, Xueshen Liu, Z. Morley Mao, Beidi Chen, Fan Lai, Atul Prakash

    Abstract: Large Language Models (LLMs) have achieved remarkable success with their billion-level parameters, yet they incur high inference overheads. The emergence of activation sparsity in LLMs provides a natural approach to reduce this cost by involving only parts of the parameters for inference. However, existing methods only focus on utilizing this naturally formed activation sparsity in a post-training… ▽ More

    Submitted 3 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

  21. arXiv:2402.03358  [pdf, other

    cs.SI cs.AI cs.DS cs.LG

    A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation

    Authors: Mohammad Hashemi, Shengbo Gong, Juntong Ni, Wenqi Fan, B. Aditya Prakash, Wei Jin

    Abstract: Many real-world datasets can be naturally represented as graphs, spanning a wide range of domains. However, the increasing complexity and size of graph datasets present significant challenges for analysis and computation. In response, graph reduction, or graph summarization, has gained prominence for simplifying large graphs while preserving essential properties. In this survey, we aim to provide… ▽ More

    Submitted 29 June, 2024; v1 submitted 28 January, 2024; originally announced February 2024.

    Comments: Accepted by IJCAI 2024 (This ArXiv version is a long version of our IJCAI paper)

  22. arXiv:2401.12437  [pdf, other

    cs.GT

    Convex-Concave Zero-sum Markov Stackelberg Games

    Authors: Denizalp Goktas, Arjun Prakash, Amy Greenwald

    Abstract: Zero-sum Markov Stackelberg games can be used to model myriad problems, in domains ranging from economics to human robot interaction. In this paper, we develop policy gradient methods that solve these games in continuous state and action settings using noisy gradient estimates computed from observed trajectories of play. When the games are convex-concave, we prove that our algorithms converge to S… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  23. arXiv:2312.06594  [pdf, other

    cs.CV cs.AI cs.LG

    Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops

    Authors: Aditya Prakash, Arjun Gupta, Saurabh Gupta

    Abstract: Objects undergo varying amounts of perspective distortion as they move across a camera's field of view. Models for predicting 3D from a single image often work with crops around the object of interest and ignore the location of the object in the camera's field of view. We note that ignoring this location information further exaggerates the inherent ambiguity in making 3D inferences from 2D images… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project Page: https://ap229997.github.io/projects/ambiguity/

  24. arXiv:2312.06583  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    3D Hand Pose Estimation in Egocentric Images in the Wild

    Authors: Aditya Prakash, Ruisen Tu, Matthew Chang, Saurabh Gupta

    Abstract: We present WildHands, a method for 3D hand pose estimation in egocentric images in the wild. This is challenging due to (a) lack of 3D hand pose annotations for images in the wild, and (b) a form of perspective distortion-induced shape ambiguity that arises in the analysis of crops around hands. For the former, we use auxiliary supervision on in-the-wild data in the form of segmentation masks & gr… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project page: https://ap229997.github.io/projects/hands/

  25. arXiv:2311.11413  [pdf, other

    cs.LG

    Large Pre-trained time series models for cross-domain Time series analysis tasks

    Authors: Harshavardhan Kamarthi, B. Aditya Prakash

    Abstract: Large pre-trained models have been vital in recent advancements in domains like language and vision, making model training for individual downstream tasks more efficient and provide superior performance. However, tackling time-series analysis tasks usually involves designing and training a separate model from scratch leveraging training data and domain expertise specific to the task. We tackle a s… ▽ More

    Submitted 11 July, 2024; v1 submitted 19 November, 2023; originally announced November 2023.

    Comments: 16 pages, 5 Figures, 4 Tables

  26. arXiv:2311.07841  [pdf, other

    cs.LG cs.SI

    PEMS: Pre-trained Epidemic Time-series Models

    Authors: Harshavardhan Kamarthi, B. Aditya Prakash

    Abstract: Providing accurate and reliable predictions about the future of an epidemic is an important problem for enabling informed public health decisions. Recent works have shown that leveraging data-driven solutions that utilize advances in deep learning methods to learn from past data of an epidemic often outperform traditional mechanistic models. However, in many cases, the past data is sparse and may… ▽ More

    Submitted 19 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: 18 pages

  27. arXiv:2310.13498  [pdf, other

    cs.FL

    Checking History-Determinism is NP-hard for Parity Automata

    Authors: Aditya Prakash

    Abstract: We show that the problem of checking if a given nondeterministic parity automaton simulates another given nondeterministic parity automaton is NP-hard. We then adapt the techniques used for this result to show that the problem of checking history-determinism for a given parity automaton is NP-hard. This is an improvement from Kuperberg and Skrzypczak's previous lower bound of solving parity games… ▽ More

    Submitted 23 January, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: Full version of the paper accepted at FoSSaCS 2024. Some minor editorial changes from the previous version following suggestions from anonymous reviewers

  28. arXiv:2310.11569   

    cs.LG cs.AI

    When Rigidity Hurts: Soft Consistency Regularization for Probabilistic Hierarchical Time Series Forecasting

    Authors: Harshavardhan Kamarthi, Lingkai Kong, Alexander Rodríguez, Chao Zhang, B. Aditya Prakash

    Abstract: Probabilistic hierarchical time-series forecasting is an important variant of time-series forecasting, where the goal is to model and forecast multivariate time-series that have underlying hierarchical relations. Most methods focus on point predictions and do not provide well-calibrated probabilistic forecasts distributions. Recent state-of-art probabilistic forecasting methods also impose hierarc… ▽ More

    Submitted 19 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: I have submitted this paper as a revision of arXiv:2206.07940. Apologies for the confusion

  29. arXiv:2310.07506  [pdf, other

    cs.CV cs.LG

    Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation

    Authors: Haizhong Zheng, Jiachen Sun, Shutong Wu, Bhavya Kailkhura, Zhuoqing Mao, Chaowei Xiao, Atul Prakash

    Abstract: Given a real-world dataset, data condensation (DC) aims to synthesize a small synthetic dataset that captures the knowledge of a natural dataset while being usable for training models with comparable accuracy. Recent works propose to enhance DC with data parameterization, which condenses data into very compact parameterized data containers instead of images. The intuition behind data parameterizat… ▽ More

    Submitted 18 July, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Journal ref: ECCV 2024

  30. arXiv:2310.06077  [pdf, other

    cs.LG cs.AI

    Performative Time-Series Forecasting

    Authors: Zhiyuan Zhao, Alexander Rodriguez, B. Aditya Prakash

    Abstract: Time-series forecasting is a critical challenge in various domains and has witnessed substantial progress in recent years. Many real-life scenarios, such as public health, economics, and social applications, involve feedback loops where predictions can influence the predicted outcome, subsequently altering the target variable's distribution. This phenomenon, known as performativity, introduces the… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 12 pages (7 main text, 2 reference, 3 appendix), 3 figures, 4 tables

  31. arXiv:2309.06252  [pdf, other

    cs.RO

    Predicting Routine Object Usage for Proactive Robot Assistance

    Authors: Maithili Patel, Aswin Prakash, Sonia Chernova

    Abstract: Proactivity in robot assistance refers to the robot's ability to anticipate user needs and perform assistive actions without explicit requests. This requires understanding user routines, predicting consistent activities, and actively seeking information to predict inconsistent behaviors. We propose SLaTe-PRO (Sequential Latent Temporal model for Predicting Routine Object usage), which improves upo… ▽ More

    Submitted 28 January, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  32. arXiv:2308.16559  [pdf

    cs.HC

    VisAhoi: Towards a Library to Generate and Integrate Visualization Onboarding Using High-level Visualization Grammars

    Authors: Christina Stoiber, Daniela Moitzi, Holger Stitz, Florian Grassinger, Anto Silviya Geo Prakash, Dominic Girardi, Marc Streit, Wolfgang Aigner

    Abstract: Visualization onboarding supports users in reading, interpreting, and extracting information from visual data representations. General-purpose onboarding tools and libraries are applicable for explaining a wide range of graphical user interfaces but cannot handle specific visualization requirements. This paper describes a first step towards developing an onboarding library called VisAhoi, which is… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  33. arXiv:2308.05889  [pdf, other

    cs.LG cs.AI

    DF2: Distribution-Free Decision-Focused Learning

    Authors: Lingkai Kong, Wenhao Mu, Jiaming Cui, Yuchen Zhuang, B. Aditya Prakash, Bo Dai, Chao Zhang

    Abstract: Decision-focused learning (DFL) has recently emerged as a powerful approach for predict-then-optimize problems by customizing a predictive model to a downstream optimization task. However, existing end-to-end DFL methods are hindered by three significant bottlenecks: model mismatch error, sample average approximation error, and gradient approximation error. Model mismatch error stems from the misa… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

    Comments: 24 pages

  34. arXiv:2307.16331  [pdf, other

    cs.LG cs.CR

    Theoretically Principled Trade-off for Stateful Defenses against Query-Based Black-Box Attacks

    Authors: Ashish Hooda, Neal Mangaokar, Ryan Feng, Kassem Fawaz, Somesh Jha, Atul Prakash

    Abstract: Adversarial examples threaten the integrity of machine learning systems with alarming success rates even under constrained black-box conditions. Stateful defenses have emerged as an effective countermeasure, detecting potential attacks by maintaining a buffer of recent queries and detecting new queries that are too similar. However, these defenses fundamentally pose a trade-off between attack dete… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 2nd AdvML Frontiers Workshop at ICML 2023

  35. arXiv:2307.11833  [pdf, other

    cs.CE cs.LG

    PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks

    Authors: Zhiyuan Zhao, Xueying Ding, B. Aditya Prakash

    Abstract: Physics-Informed Neural Networks (PINNs) have emerged as a promising deep learning framework for approximating numerical solutions to partial differential equations (PDEs). However, conventional PINNs, relying on multilayer perceptrons (MLP), neglect the crucial temporal dependencies inherent in practical physics systems and thus fail to propagate the initial condition constraints globally and acc… ▽ More

    Submitted 7 May, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: 17 pages (including 9 pages of main text, 3 pages of references, and 5 pages of appendix), 9 figures, 7 tables

  36. arXiv:2307.08849  [pdf, other

    cs.AI cs.LG

    Autoregressive Diffusion Model for Graph Generation

    Authors: Lingkai Kong, Jiaming Cui, Haotian Sun, Yuchen Zhuang, B. Aditya Prakash, Chao Zhang

    Abstract: Diffusion-based graph generative models have recently obtained promising results for graph generation. However, existing diffusion-based graph generative models are mostly one-shot generative models that apply Gaussian diffusion in the dequantized adjacency matrix space. Such a strategy can suffer from difficulty in model training, slow sampling speed, and incapability of incorporating constraints… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 18 pages

  37. arXiv:2306.04527  [pdf, other

    eess.IV cs.CV cs.LG

    ContriMix: Scalable stain color augmentation for domain generalization without domain labels in digital pathology

    Authors: Tan H. Nguyen, Dinkar Juyal, Jin Li, Aaditya Prakash, Shima Nofallah, Chintan Shah, Sai Chowdary Gullapally, Limin Yu, Michael Griffin, Anand Sampat, John Abel, Justin Lee, Amaro Taylor-Weiner

    Abstract: Differences in staining and imaging procedures can cause significant color variations in histopathology images, leading to poor generalization when deploying deep-learning models trained from a different data source. Various color augmentation methods have been proposed to generate synthetic images during training to make models more robust, eliminating the need for stain normalization during test… ▽ More

    Submitted 8 March, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

  38. arXiv:2306.00349  [pdf, other

    cs.CV cs.LG

    CALICO: Self-Supervised Camera-LiDAR Contrastive Pre-training for BEV Perception

    Authors: Jiachen Sun, Haizhong Zheng, Qingzhao Zhang, Atul Prakash, Z. Morley Mao, Chaowei Xiao

    Abstract: Perception is crucial in the realm of autonomous driving systems, where bird's eye view (BEV)-based architectures have recently reached state-of-the-art performance. The desirability of self-supervised representation learning stems from the expensive and laborious process of annotating 2D and 3D data. Although previous research has investigated pretraining methods for both LiDAR and camera-based 3… ▽ More

    Submitted 27 November, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  39. arXiv:2305.16301  [pdf, other

    cs.CV cs.LG cs.RO

    Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos

    Authors: Matthew Chang, Aditya Prakash, Saurabh Gupta

    Abstract: The analysis and use of egocentric videos for robotic tasks is made challenging by occlusion due to the hand and the visual mismatch between the human hand and a robot end-effector. In this sense, the human hand presents a nuisance. However, often hands also provide a valuable signal, e.g. the hand pose may suggest what kind of object is being held. In this work, we propose to extract a factored r… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: for project website with video, see https://matthewchang.github.io/vidm/

  40. arXiv:2305.03036  [pdf, other

    cs.CV cs.AI cs.LG

    Learning Hand-Held Object Reconstruction from In-The-Wild Videos

    Authors: Aditya Prakash, Matthew Chang, Matthew Jin, Saurabh Gupta

    Abstract: Prior works for reconstructing hand-held objects from a single image rely on direct 3D shape supervision which is challenging to gather in real world at scale. Consequently, these approaches do not generalize well when presented with novel objects in in-the-wild settings. While 3D supervision is a major bottleneck, there is an abundance of in-the-wild raw video data showing hand-object interaction… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Project Webpage: https://ap229997.github.io/projects/horse/

  41. arXiv:2305.02401  [pdf, other

    cs.CV cs.LG

    Synthetic DOmain-Targeted Augmentation (S-DOTA) Improves Model Generalization in Digital Pathology

    Authors: Sai Chowdary Gullapally, Yibo Zhang, Nitin Kumar Mittal, Deeksha Kartik, Sandhya Srinivasan, Kevin Rose, Daniel Shenker, Dinkar Juyal, Harshith Padigela, Raymond Biju, Victor Minden, Chirag Maheshwari, Marc Thibault, Zvi Goldstein, Luke Novak, Nidhi Chandra, Justin Lee, Aaditya Prakash, Chintan Shah, John Abel, Darren Fahy, Amaro Taylor-Weiner, Anand Sampat

    Abstract: Machine learning algorithms have the potential to improve patient outcomes in digital pathology. However, generalization of these tools is currently limited by sensitivity to variations in tissue preparation, staining procedures and scanning equipment that lead to domain shift in digitized slides. To overcome this limitation and improve model generalization, we studied the effectiveness of two Syn… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  42. arXiv:2304.12483  [pdf, other

    cs.CV

    Towards Realistic Generative 3D Face Models

    Authors: Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando de la Torre

    Abstract: In recent years, there has been significant progress in 2D generative face models fueled by applications such as animation, synthetic data generation, and digital avatars. However, due to the absence of 3D information, these 2D models often struggle to accurately disentangle facial attributes like pose, expression, and illumination, limiting their editing capabilities. To address this limitation,… ▽ More

    Submitted 26 October, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Preprint

  43. arXiv:2303.13814  [pdf, other

    cs.CV

    Multimodal Adaptive Fusion of Face and Gait Features using Keyless attention based Deep Neural Networks for Human Identification

    Authors: Ashwin Prakash, Thejaswin S, Athira Nambiar, Alexandre Bernardino

    Abstract: Biometrics plays a significant role in vision-based surveillance applications. Soft biometrics such as gait is widely used with face in surveillance tasks like person recognition and re-identification. Nevertheless, in practical scenarios, classical fusion techniques respond poorly to changes in individual users and in the external environment. To this end, we propose a novel adaptive multi-biomet… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

    Comments: -

  44. Stateful Defenses for Machine Learning Models Are Not Yet Secure Against Black-box Attacks

    Authors: Ryan Feng, Ashish Hooda, Neal Mangaokar, Kassem Fawaz, Somesh Jha, Atul Prakash

    Abstract: Recent work has proposed stateful defense models (SDMs) as a compelling strategy to defend against a black-box attacker who only has query access to the model, as is common for online machine learning platforms. Such stateful defenses aim to defend against black-box attacks by tracking the query history and detecting and rejecting queries that are "similar" and thus preventing black-box attacks fr… ▽ More

    Submitted 26 September, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: ACM CCS 2023

  45. arXiv:2302.11052  [pdf, other

    cs.IR cs.AI cs.LG

    Que2Engage: Embedding-based Retrieval for Relevant and Engaging Products at Facebook Marketplace

    Authors: Yunzhong He, Yuxin Tian, Mengjiao Wang, Feier Chen, Licheng Yu, Maolong Tang, Congcong Chen, Ning Zhang, Bin Kuang, Arul Prakash

    Abstract: Embedding-based Retrieval (EBR) in e-commerce search is a powerful search retrieval technique to address semantic matches between search queries and products. However, commercial search engines like Facebook Marketplace Search are complex multi-stage systems optimized for multiple business objectives. At Facebook Marketplace, search retrieval focuses on matching search queries with relevant produc… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted by WWW'2023

  46. arXiv:2302.10527  [pdf, other

    cs.IR cs.AI cs.LG

    HierCat: Hierarchical Query Categorization from Weakly Supervised Data at Facebook Marketplace

    Authors: Yunzhong He, Cong Zhang, Ruoyan Kong, Chaitanya Kulkarni, Qing Liu, Ashish Gandhe, Amit Nithianandan, Arul Prakash

    Abstract: Query categorization at customer-to-customer e-commerce platforms like Facebook Marketplace is challenging due to the vagueness of search intent, noise in real-world data, and imbalanced training data across languages. Its deployment also needs to consider challenges in scalability and downstream integration in order to translate modeling advances into better search result relevance. In this paper… ▽ More

    Submitted 21 February, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted by WWW'2023

  47. arXiv:2302.06227  [pdf, other

    eess.AS cs.SD

    Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages

    Authors: Sudhanshu Srivastava, Ishika Gupta, Anusha Prakash, Jom Kuriakose, Hema A. Murthy

    Abstract: Hidden-Markov-model (HMM) based text-to-speech (HTS) offers flexibility in speaking styles along with fast training and synthesis while being computationally less intense. HTS performs well even in low-resource scenarios. The primary drawback is that the voice quality is poor compared to that of E2E systems. A hybrid approach combining HMM-based feature generation and neural-network-based HiFi-GAN… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 5 pages, 5 figures

  48. arXiv:2212.05614  [pdf, other

    cs.CR cs.AR

    Generic Tagging for RISC-V Binaries

    Authors: David Demicco, Matthew Cole, Gokturk Yuksek, Ravi Theja Gollapudi, Aravind Prakash, Kanad Ghose, Zerksis Umrigar

    Abstract: With the widespread popularity of RISC-V -- an open-source ISA -- custom hardware security solutions targeting specific defense needs are gaining popularity. These solutions often require specialized compilers that can insert metadata (called tags) into the generated binaries, and/or extend the RISC-V ISA with new instructions. Developing such compilers can be a tedious and time-consuming process.… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  49. arXiv:2211.13837  [pdf, other

    cs.LG physics.soc-ph

    End-to-End Stochastic Optimization with Energy-Based Model

    Authors: Lingkai Kong, Jiaming Cui, Yuchen Zhuang, Rui Feng, B. Aditya Prakash, Chao Zhang

    Abstract: Decision-focused learning (DFL) was recently proposed for stochastic optimization problems that involve unknown parameters. By integrating predictive modeling with an implicitly differentiable optimization layer, DFL has shown superior performance to the standard two-stage predict-then-optimize pipeline. However, most existing DFL methods are only applicable to convex problems or a subset of nonco… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022 Oral

  50. arXiv:2211.01338  [pdf, other

    eess.AS cs.CL cs.MM cs.SD eess.IV

    Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

    Authors: Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya , et al. (2 additional authors not shown)

    Abstract: Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.