Showing 1–26 of 26 results for author: Hegde, N

Searching in archive cs.
  1. arXiv:2407.19985  [pdf, other]

    cs.CV cs.AI cs.LG

    Mixture of Nested Experts: Adaptive Processing of Visual Tokens

    Authors: Gagan Jain, Nidhi Hegde, Aditya Kusupati, Arsha Nagrani, Shyamal Buch, Prateek Jain, Anurag Arnab, Sujoy Paul

    Abstract: The visual medium (images and videos) naturally contains a large amount of information redundancy, thereby providing a great opportunity for leveraging efficiency in processing. While Vision Transformer (ViT) based models scale effectively to large data regimes, they fail to capitalize on this inherent redundancy, leading to higher computational costs. Mixture of Experts (MoE) networks demonstrate…

    Submitted 30 July, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

  2. arXiv:2405.01010  [pdf, other]

    cs.LG stat.ML

    Efficient and Adaptive Posterior Sampling Algorithms for Bandits

    Authors: Bingshan Hu, Zhiming Huang, Tianyue H. Zhang, Mathias Lécuyer, Nidhi Hegde

    Abstract: We study Thompson Sampling-based algorithms for stochastic bandits with bounded rewards. As the existing problem-dependent regret bound for Thompson Sampling with Gaussian priors [Agrawal and Goyal, 2017] is vacuous when $T \le 288 e^{64}$, we derive a more practical bound that tightens the coefficient of the leading term from $288 e^{64}$ to $1270$. Additionally, motivated by large-scale real-wo…

    Submitted 2 May, 2024; originally announced May 2024.

  3. arXiv:2401.05422  [pdf, ps, other]

    eess.SP cs.AI

    Machine Learning (ML)-assisted Beam Management in millimeter (mm)Wave Distributed Multiple Input Multiple Output (D-MIMO) systems

    Authors: Karthik R M, Dhiraj Nagaraja Hegde, Muris Sarajlic, Abhishek Sarkar

    Abstract: Beam management (BM) protocols are critical for establishing and maintaining connectivity between network radio nodes and User Equipments (UEs). In Distributed Multiple Input Multiple Output systems (D-MIMO), a number of access points (APs), coordinated by a central processing unit (CPU), serve a number of UEs. At mmWave frequencies, the problem of finding the best AP and beam to serve the UEs is…

    Submitted 30 December, 2023; originally announced January 2024.

  4. arXiv:2309.14389  [pdf, other]

    cs.CV cs.AI

    Analyzing the Efficacy of an LLM-Only Approach for Image-based Document Question Answering

    Authors: Nidhi Hegde, Sujoy Paul, Gagan Madan, Gaurav Aggarwal

    Abstract: Recent document question answering models consist of two key components: the vision encoder, which captures layout and visual elements in images, and a Large Language Model (LLM) that helps contextualize questions to the image and supplements them with external world knowledge to generate accurate answers. However, the relative contributions of the vision encoder and the language model in these ta…

    Submitted 25 September, 2023; originally announced September 2023.

  5. arXiv:2306.17173  [pdf, other]

    cs.NI

    Photon: A Cross Platform P2P Data Transfer Application

    Authors: Abhilash Shreedhar Hegde, Amruta Narayana Hegde, Adeep Krishna Keelar, Ananya Mathur

    Abstract: Modern computing requires efficient and dependable data transport. Current solutions like Bluetooth, SMS (Short Message Service), and Email have their restrictions on efficiency, file size, compatibility, and cost. In order to facilitate direct communication and resource sharing amongst linked devices, this research study offers a cross-platform peer-to-peer (P2P) data transmission solution that t…

    Submitted 16 June, 2023; originally announced June 2023.

  6. arXiv:2306.06823  [pdf, other]

    cs.CV cs.CL

    Weakly supervised information extraction from inscrutable handwritten document images

    Authors: Sujoy Paul, Gagan Madan, Akankshya Mishra, Narayan Hegde, Pradeep Kumar, Gaurav Aggarwal

    Abstract: State-of-the-art information extraction methods are limited by OCR errors. They work well for printed text in form-like documents, but unstructured, handwritten documents still remain a challenge. Adapting existing models to domain-specific training data is quite expensive, because of two factors, 1) limited availability of the domain-specific documents (such as handwritten prescriptions, lab note…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted at ICDAR 2023

  7. arXiv:2304.11983  [pdf, other]

    cs.DC

    Protecting Locks Against Unbalanced Unlock()

    Authors: Vivek Shahare, Milind Chabbi, Nikhil Hegde

    Abstract: The lock is a building-block synchronization primitive that enables mutually exclusive access to shared data in shared-memory parallel programs. Mutual exclusion is typically achieved by guarding the code that accesses the shared data with a pair of lock() and unlock() operations. Concurrency bugs arise when this ordering of operations is violated. In this paper, we study a particular pattern of m…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: Paper Accepted to the 35th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA 23)

  8. arXiv:2302.02453  [pdf, ps, other]

    cs.CL cs.CY

    FineDeb: A Debiasing Framework for Language Models

    Authors: Akash Saravanan, Dhruv Mullick, Habibur Rahman, Nidhi Hegde

    Abstract: As language models are increasingly included in human-facing machine learning tools, bias against demographic subgroups has gained attention. We propose FineDeb, a two-phase debiasing framework for language models that starts with contextual debiasing of embeddings learned by pretrained language models. The model is then fine-tuned on a language modeling objective. Our results show that FineDeb of…

    Submitted 5 February, 2023; originally announced February 2023.

    Comments: Poster presentation at AAAI 2023: The Workshop on Artificial Intelligence for Social Good 2023 (https://amulyayadav.github.io/AI4SG2023/)

  9. arXiv:2211.13508  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

    Authors: Benjamin Kiefer, Matej Kristan, Janez Perš, Lojze Žust, Fabio Poiesi, Fabio Augusto de Alcantara Andrade, Alexandre Bernardino, Matthew Dawkins, Jenni Raitoharju, Yitong Quan, Adem Atmaca, Timon Höfer, Qiming Zhang, Yufei Xu, Jing Zhang, Dacheng Tao, Lars Sommer, Raphael Spraul, Hangyue Zhao, Hongpu Zhang, Yanyun Zhao, Jan Lukas Augustin, Eui-ik Jeon, Impyeong Lee, Luca Zedda, et al. (48 additional authors not shown)

    Abstract: The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicles (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detec…

    Submitted 28 November, 2022; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: MaCVi 2023 was part of WACV 2023. This report (38 pages) discusses the competition as part of MaCVi

  10. arXiv:2207.05777  [pdf, other]

    cs.LG cs.AI math.OC stat.ML

    Long Term Fairness for Minority Groups via Performative Distributionally Robust Optimization

    Authors: Liam Peet-Pare, Nidhi Hegde, Alona Fyshe

    Abstract: Fairness researchers in machine learning (ML) have coalesced around several fairness criteria which provide formal definitions of what it means for an ML model to be fair. However, these criteria have some serious limitations. We identify four key shortcomings of these formal fairness criteria, and aim to help to address them by extending performative prediction to include a distributionally robus…

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: From a submission to Responsible Decision Making in Dynamic Environments Workshop at ICML 2022

  11. arXiv:2206.08653  [pdf, other]

    cs.LG cs.AI cs.CV

    All Mistakes Are Not Equal: Comprehensive Hierarchy Aware Multi-label Predictions (CHAMP)

    Authors: Ashwin Vaswani, Gaurav Aggarwal, Praneeth Netrapalli, Narayan G Hegde

    Abstract: This paper considers the problem of Hierarchical Multi-Label Classification (HMC), where (i) several labels can be present for each example, and (ii) labels are related via a domain-specific hierarchy tree. Guided by the intuition that all mistakes are not equal, we present Comprehensive Hierarchy Aware Multi-label Predictions (CHAMP), a framework that penalizes a misprediction depending on its se…

    Submitted 17 June, 2022; originally announced June 2022.

  12. Leveraging Clinically Relevant Biometric Constraints To Supervise A Deep Learning Model For The Accurate Caliper Placement To Obtain Sonographic Measurements Of The Fetal Brain

    Authors: Hari Shankar, Adithya Narayan, Shefali Jain, Divya Singh, Pooja Vyas, Nivedita Hegde, Purbayan Kar, Abhi Lad, Jens Thang, Jagruthi Atada, Duy Nguyen, PS Roopa, Akhila Vasudeva, Prathima Radhakrishnan, Sripad Krishna Devalla

    Abstract: Multiple studies have demonstrated that obtaining standardized fetal brain biometry from mid-trimester ultrasonography (USG) examination is key for the reliable assessment of fetal neurodevelopment and the screening of central nervous system (CNS) anomalies. Obtaining these measurements is highly subjective, expertise-driven, and requires years of training experience, limiting quality prenatal car…

    Submitted 31 July, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted for presentation at 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI)

  13. arXiv:2203.11992  [pdf, other]

    cs.LG stat.ML

    Resonance in Weight Space: Covariate Shift Can Drive Divergence of SGD with Momentum

    Authors: Kirby Banman, Liam Peet-Pare, Nidhi Hegde, Alona Fyshe, Martha White

    Abstract: Most convergence guarantees for stochastic gradient descent with momentum (SGDm) rely on iid sampling. Yet, SGDm is often used outside this regime, in settings with temporally correlated input samples such as continual learning and reinforcement learning. Existing work has shown that SGDm with a decaying step-size can converge under Markovian temporal correlation. In this work, we show that SGDm u…

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: In International Conference on Learning Representations. 2021

  14. arXiv:2202.13553  [pdf, other]

    eess.IV cs.CV cs.LG

    Towards A Device-Independent Deep Learning Approach for the Automated Segmentation of Sonographic Fetal Brain Structures: A Multi-Center and Multi-Device Validation

    Authors: Abhi Lad, Adithya Narayan, Hari Shankar, Shefali Jain, Pooja Punjani Vyas, Divya Singh, Nivedita Hegde, Jagruthi Atada, Jens Thang, Saw Shier Nee, Arunkumar Govindarajan, Roopa PS, Muralidhar V Pai, Akhila Vasudeva, Prathima Radhakrishnan, Sripad Krishna Devalla

    Abstract: Quality assessment of prenatal ultrasonography is essential for the screening of fetal central nervous system (CNS) anomalies. The interpretation of fetal brain structures is highly subjective, expertise-driven, and requires years of training experience, limiting quality prenatal care for all pregnant mothers. With recent advancement in Artificial Intelligence (AI), specifically deep learning (DL)…

    Submitted 28 February, 2022; originally announced February 2022.

    Comments: SPIE Medical Imaging 2022: Computer Aided Diagnosis (12033-75), 11 pages, 7 figures

  15. arXiv:2106.14815  [pdf, other]

    cs.LG cs.AI cs.CR

    Feature Importance Guided Attack: A Model Agnostic Adversarial Attack

    Authors: Gilad Gressel, Niranjan Hegde, Archana Sreekumar, Rishikumar Radhakrishnan, Kalyani Harikumar, Anjali S., Krishnashree Achuthan

    Abstract: Research in adversarial learning has primarily focused on homogeneous unstructured datasets, which often map into the problem space naturally. Inverting a feature space attack on heterogeneous datasets into the problem space is much more challenging, particularly the task of finding the perturbation to perform. This work presents a formal search strategy: the `Feature Importance Guided Attack' (FI…

    Submitted 13 January, 2023; v1 submitted 28 June, 2021; originally announced June 2021.

  16. arXiv:2102.07929  [pdf, other]

    cs.LG

    Near-Optimal Algorithms for Differentially Private Online Learning in a Stochastic Environment

    Authors: Bingshan Hu, Zhiming Huang, Nishant A. Mehta, Nidhi Hegde

    Abstract: In this paper, we study differentially private online learning problems in a stochastic environment under both bandit and full information feedback. For differentially private stochastic bandits, we propose both UCB and Thompson Sampling-based algorithms that are anytime and achieve the optimal $O \left(\sum_{j: Δ_j>0} \frac{\ln(T)}{\min \left\{Δ_j, ε\right\}} \right)$ instance-dependent regret bo…

    Submitted 30 May, 2024; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 40 pages. New in v3: (i) Removed Hybrid-UCB (although its analysis is correct to our knowledge); (ii) Added Lazy-DP-TS from UAI 2022 paper of Hu and Hegde (2022)

  17. Interpretable Survival Prediction for Colorectal Cancer using Deep Learning

    Authors: Ellery Wulczyn, David F. Steiner, Melissa Moran, Markus Plass, Robert Reihs, Fraser Tan, Isabelle Flament-Auvigne, Trissia Brown, Peter Regitnig, Po-Hsuan Cameron Chen, Narayan Hegde, Apaar Sadhwani, Robert MacDonald, Benny Ayalew, Greg S. Corrado, Lily H. Peng, Daniel Tse, Heimo Müller, Zhaoyang Xu, Yun Liu, Martin C. Stumpe, Kurt Zatloukal, Craig H. Mermel

    Abstract: Deriving interpretable prognostic features from deep-learning-based prognostic histopathology models remains a challenge. In this study, we developed a deep learning system (DLS) for predicting disease specific survival for stage II and III colorectal cancer using 3,652 cases (27,300 slides). When evaluated on two validation datasets containing 1,239 cases (9,340 slides) and 738 cases (7,140 slide…

    Submitted 17 November, 2020; originally announced November 2020.

    Journal ref: Nature Partner Journal Digital Medicine (2021)

  18. arXiv:2002.02513  [pdf, other]

    cs.MA cs.AI cs.LG

    Multi Type Mean Field Reinforcement Learning

    Authors: Sriram Ganapathi Subramanian, Pascal Poupart, Matthew E. Taylor, Nidhi Hegde

    Abstract: Mean field theory provides an effective way of scaling multiagent reinforcement learning algorithms to environments with many agents that can be abstracted by a virtual mean agent. In this paper, we extend mean field multiagent algorithms to multiple types. The types enable the relaxation of a core assumption in mean field reinforcement learning, which is that all agents in the environment are pla…

    Submitted 21 June, 2022; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: The paper appears in the proceedings of International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) 2020. Revised version has some typos corrected

  19. arXiv:1902.02960  [pdf]

    cs.HC cs.CY

    Human-Centered Tools for Coping with Imperfect Algorithms during Medical Decision-Making

    Authors: Carrie J. Cai, Emily Reif, Narayan Hegde, Jason Hipp, Been Kim, Daniel Smilkov, Martin Wattenberg, Fernanda Viegas, Greg S. Corrado, Martin C. Stumpe, Michael Terry

    Abstract: Machine learning (ML) is increasingly being used in image retrieval systems for medical decision making. One application of ML is to retrieve visually similar medical images from past patients (e.g. tissue from biopsies) to reference when making a medical decision with a new patient. However, no algorithm can perfectly capture an expert's ideal notion of similarity for every case: an image that is…

    Submitted 8 February, 2019; originally announced February 2019.

  20. Similar Image Search for Histopathology: SMILY

    Authors: Narayan Hegde, Jason D. Hipp, Yun Liu, Michael E. Buck, Emily Reif, Daniel Smilkov, Michael Terry, Carrie J. Cai, Mahul B. Amin, Craig H. Mermel, Phil Q. Nelson, Lily H. Peng, Greg S. Corrado, Martin C. Stumpe

    Abstract: The increasing availability of large institutional and public histopathology image datasets is enabling the searching of these datasets for diagnosis, research, and education. Though these datasets typically have associated metadata such as diagnosis or clinical notes, even carefully curated datasets rarely contain annotations of the location of regions of interest on each image. Because pathology…

    Submitted 5 February, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 23 Pages with 6 figures and 3 tables. The file also has 6 pages of supplemental material. Improved figure resolution, edited metadata

    Journal ref: Nature Partner Journal Digital Medicine (2019)

  21. arXiv:1901.10634  [pdf, other]

    stat.ML cs.AI cs.LG

    Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces

    Authors: Baoxiang Wang, Nidhi Hegde

    Abstract: We consider differentially private algorithms for reinforcement learning in continuous spaces, such that neighboring reward functions are indistinguishable. This protects the reward information from being exploited by methods such as inverse reinforcement learning. Existing studies that guarantee differential privacy are not extendable to infinite state spaces, as the noise level to ensure privacy…

    Submitted 11 November, 2019; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: Advances in Neural Information Processing Systems (NeurIPS) 2019

  22. arXiv:1801.02889  [pdf, ps, other]

    cs.PF

    Optimal Content Replication and Request Matching in Large Caching Systems

    Authors: Arpan Mukhopadhyay, Nidhi Hegde, Marc Lelarge

    Abstract: We consider models of content delivery networks in which the servers are constrained by two main resources: memory and bandwidth. In such systems, the throughput crucially depends on how contents are replicated across servers and how the requests of specific contents are matched to servers storing those contents. In this paper, we first formulate the problem of computing the optimal replication po…

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: INFOCOM 2018

  23. arXiv:1212.0952  [pdf, ps, other]

    cs.SI cs.GT cs.NI physics.soc-ph

    Self-Organizing Flows in Social Networks

    Authors: Nidhi Hegde, Laurent Massoulié, Laurent Viennot

    Abstract: Social networks offer users new means of accessing information, essentially relying on "social filtering", i.e. propagation and filtering of information by social contacts. The sheer amount of data flowing in these networks, combined with the limited budget of attention of each user, makes it difficult to ensure that social filtering brings relevant content to the interested users. Our motivation…

    Submitted 28 February, 2015; v1 submitted 5 December, 2012; originally announced December 2012.

    Journal ref: Theoretical Computer Science, Elsevier, 2015, pp.16

  24. arXiv:1207.3269  [pdf, ps, other]

    cs.LG cs.IT

    The Price of Privacy in Untrusted Recommendation Engines

    Authors: Siddhartha Banerjee, Nidhi Hegde, Laurent Massoulié

    Abstract: Recent increase in online privacy concerns prompts the following question: can a recommender system be accurate if users do not entrust it with their private data? To answer this, we study the problem of learning item-clusters under local differential privacy, a powerful, formal notion of data privacy. We develop bounds on the sample-complexity of learning item-clusters from privatized user inputs…

    Submitted 27 October, 2014; v1 submitted 13 July, 2012; originally announced July 2012.

    Comments: Preliminary version presented at the 50th Allerton Conference, 2012

  25. arXiv:1203.1891  [pdf, ps, other]

    cs.NI

    Optimal control of end-user energy storage

    Authors: Peter M. van de Ven, Nidhi Hegde, Laurent Massoulie, Theodoros Salonidis

    Abstract: An increasing number of retail energy markets show price fluctuations, providing users with the opportunity to buy energy at lower than average prices. We propose to temporarily store this inexpensive energy in a battery, and use it to satisfy demand when energy prices are high, thus allowing users to exploit the price variations without having to shift their demand to the low-price periods. We st…

    Submitted 5 December, 2012; v1 submitted 8 March, 2012; originally announced March 2012.

  26. arXiv:0909.1713  [pdf, ps, other]

    cs.NI

    Size Does Matter (in P2P Live Streaming)

    Authors: Nidhi Hegde, Fabien Mathieu, Diego Perino

    Abstract: Optimal dissemination schemes have previously been studied for peer-to-peer live streaming applications. Live streaming being a delay-sensitive application, fine tuning of dissemination parameters is crucial. In this report, we investigate optimal sizing of chunks, the units of data exchange, and probe sets, the number of peers a given node probes before transmitting chunks. Chunk size can have sig…

    Submitted 9 September, 2009; originally announced September 2009.

    Report number: RR-7032