Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 100 results for author: Nair, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17001  [pdf, other

    cs.LG nlin.CD

    Deep Learning for Prediction and Classifying the Dynamical behaviour of Piecewise Smooth Maps

    Authors: Vismaya V S, Bharath V Nair, Sishu Shankar Muni

    Abstract: This paper explores the prediction of the dynamics of piecewise smooth maps using various deep learning models. We have shown various novel ways of predicting the dynamics of piecewise smooth maps using deep learning models. Moreover, we have used machine learning models such as Decision Tree Classifier, Logistic Regression, K-Nearest Neighbor, Random Forest, and Support Vector Machine for predict… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 32 pages, 22 figures

  2. arXiv:2406.16792  [pdf, other

    cs.CR nlin.CD

    Deep Learning and Chaos: A combined Approach To Image Encryption and Decryption

    Authors: Bharath V Nair, Vismaya V S, Sishu Shankar Muni, Ali Durdu

    Abstract: In this paper, we introduce a novel image encryption and decryption algorithm using hyperchaotic signals from the novel 3D hyperchaotic map, 2D memristor map, Convolutional Neural Network (CNN), and key sensitivity analysis to achieve robust security and high efficiency. The encryption starts with the scrambling of gray images by using a 3D hyperchaotic map to yield complex sequences under disrupt… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.14861  [pdf, other

    eess.SY cs.ET

    Resilience of the Electric Grid through Trustable IoT-Coordinated Assets

    Authors: Vineet J. Nair, Venkatesh Venkataramanan, Priyank Srivastava, Partha S. Sarker, Anurag Srivastava, Laurentiu D. Marinovici, Jun Zha, Christopher Irwin, Prateek Mittal, John Williams, H. Vincent Poor, Anuradha M. Annaswamy

    Abstract: The electricity grid has evolved from a physical system to a cyber-physical system with digital devices that perform measurement, control, communication, computation, and actuation. The increased penetration of distributed energy resources (DERs) that include renewable generation, flexible loads, and storage provides extraordinary opportunities for improvements in efficiency and sustainability. Ho… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Submitted to the Proceedings of the National Academy of Sciences (PNAS), under review

  4. arXiv:2405.11511  [pdf, other

    cs.CV

    Online Action Representation using Change Detection and Symbolic Programming

    Authors: Vishnu S Nair, Sneha Sree, Jayaraj Joseph, Mohanasankar Sivaprakasam

    Abstract: This paper addresses the critical need for online action representation, which is essential for various applications like rehabilitation, surveillance, etc. The task can be defined as representation of actions as soon as they happen in a streaming video without access to video frames in the future. Most of the existing methods use predefined window sizes for video segments, which is a restrictive… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  5. arXiv:2405.10190  [pdf, other

    cs.LG nlin.CD

    Comparative Analysis of Predicting Subsequent Steps in Hénon Map

    Authors: Vismaya V S, Alok Hareendran, Bharath V Nair, Sishu Shankar Muni, Martin Lellep

    Abstract: This paper explores the prediction of subsequent steps in Hénon Map using various machine learning techniques. The Hénon map, well known for its chaotic behaviour, finds applications in various fields including cryptography, image encryption, and pattern recognition. Machine learning methods, particularly deep learning, are increasingly essential for understanding and predicting chaotic phenomena.… ▽ More

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: 19 pages, 9 figures

  6. Surveyor: Facilitating Discovery Within Video Games for Blind and Low Vision Players

    Authors: Vishnu Nair, Hanxiu 'Hazel' Zhu, Peize Song, Jizhong Wang, Brian A. Smith

    Abstract: Video games are increasingly accessible to blind and low vision (BLV) players, yet many aspects remain inaccessible. One aspect is the joy players feel when they explore environments and make new discoveries, which is integral to many games. Sighted players experience discovery by surveying environments and identifying unexplored areas. Current accessibility tools, however, guide BLV players direc… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Journal ref: Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems (CHI '24), May 2024

  7. arXiv:2311.08303  [pdf, other

    cs.CL cs.AI

    Extrinsically-Focused Evaluation of Omissions in Medical Summarization

    Authors: Elliot Schumacher, Daniel Rosenthal, Varun Nair, Luladay Price, Geoffrey Tso, Anitha Kannan

    Abstract: The goal of automated summarization techniques (Paice, 1990; Kupiec et al, 1995) is to condense text by focusing on the most critical information. Generative large language models (LLMs) have shown to be robust summarizers, yet traditional metrics struggle to capture resulting performance (Goyal et al, 2022) in more powerful LLMs. In safety-critical domains such as medicine, more rigorous evaluati… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  8. arXiv:2311.05090  [pdf, other

    cs.HC cs.CR

    Deep Motion Masking for Secure, Usable, and Scalable Real-Time Anonymization of Virtual Reality Motion Data

    Authors: Vivek Nair, Wenbo Guo, James F. O'Brien, Louis Rosenberg, Dawn Song

    Abstract: Virtual reality (VR) and "metaverse" systems have recently seen a resurgence in interest and investment as major technology companies continue to enter the space. However, recent studies have demonstrated that the motion tracking "telemetry" data used by nearly all VR applications is as uniquely identifiable as a fingerprint scan, raising significant privacy concerns surrounding metaverse technolo… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  9. arXiv:2311.02101  [pdf, other

    cs.AI cs.LG cs.LO

    Solving MaxSAT with Matrix Multiplication

    Authors: David Warde-Farley, Vinod Nair, Yujia Li, Ivan Lobov, Felix Gimeno, Simon Osindero

    Abstract: We propose an incomplete algorithm for Maximum Satisfiability (MaxSAT) specifically designed to run on neural network accelerators such as GPUs and TPUs. Given a MaxSAT problem instance in conjunctive normal form, our procedure constructs a Restricted Boltzmann Machine (RBM) with an equilibrium distribution wherein the probability of a Boolean assignment is exponential in the number of clauses it… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  10. Berkeley Open Extended Reality Recordings 2023 (BOXRR-23): 4.7 Million Motion Capture Recordings from 105,852 Extended Reality Device Users

    Authors: Vivek Nair, Wenbo Guo, Rui Wang, James F. O'Brien, Louis Rosenberg, Dawn Song

    Abstract: Extended reality (XR) devices such as the Meta Quest and Apple Vision Pro have seen a recent surge in attention, with motion tracking "telemetry" data lying at the core of nearly all XR and metaverse experiences. Researchers are just beginning to understand the implications of this data for security, privacy, usability, and more, but currently lack large-scale human motion datasets to study. The B… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: Learn more at https://rdi.berkeley.edu/metaverse/boxrr-23

    Journal ref: IEEE Transactions on Visualization and Computer Graphics, pages 1-8, March 2024. IEEE VR 2024, Orlando, FL March 16-21, 2024. Best Paper Honorable Mention

  11. arXiv:2309.02426  [pdf

    stat.ML cs.LG

    Monotone Tree-Based GAMI Models by Adapting XGBoost

    Authors: Linwei Hu, Soroush Aramideh, Jie Chen, Vijayan N. Nair

    Abstract: Recent papers have used machine learning architecture to fit low-order functional ANOVA models with main effects and second-order interactions. These GAMI (GAM + Interaction) models are directly interpretable as the functional main effects and interactions can be easily plotted and visualized. Unfortunately, it is not easy to incorporate the monotonicity requirement into the existing GAMI models b… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 12 pages

  12. arXiv:2308.09341  [pdf

    cs.CL cs.LG

    Document Automation Architectures: Updated Survey in Light of Large Language Models

    Authors: Mohammad Ahmadi Achachlouei, Omkar Patil, Tarun Joshi, Vijayan N. Nair

    Abstract: This paper surveys the current state of the art in document automation (DA). The objective of DA is to reduce the manual effort during the generation of documents by automatically creating and integrating input from different sources and assembling documents conforming to defined templates. There have been reviews of commercial solutions of DA, particularly in the legal domain, but to date there h… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: The current paper is the updated version of an earlier survey on document automation [Ahmadi Achachlouei et al. 2021]. Updates in the current paper are as follows: We shortened almost all sections to reduce the size of the main paper (without references) from 28 pages to 10 pages, added a review of selected papers on large language models, removed certain sections and most of diagrams. arXiv admin note: substantial text overlap with arXiv:2109.11603

    MSC Class: 68T50 ACM Class: I.7.0; I.2.7; I.2.4

  13. arXiv:2306.14746  [pdf, other

    cs.CR

    MFDPG: Multi-Factor Authenticated Password Management With Zero Stored Secrets

    Authors: Vivek Nair, Dawn Song

    Abstract: While password managers are a vital tool for internet security, they can also create a massive central point of failure, as evidenced by several major recent data breaches. For over 20 years, deterministic password generators (DPGs) have been proposed, and largely rejected, as a viable alternative to password management tools. In this paper, we survey 45 existing DPGs to asses the main security, p… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  14. Multi-Factor Credential Hashing for Asymmetric Brute-Force Attack Resistance

    Authors: Vivek Nair, Dawn Song

    Abstract: Since the introduction of bcrypt in 1999, adaptive password hashing functions, whereby brute-force resistance increases symmetrically with computational difficulty for legitimate users, have been our most powerful post-breach countermeasure against credential disclosure. Unfortunately, the relatively low tolerance of users to added latency places an upper bound on the deployment of this technique… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Journal ref: 8th IEEE European Symposium on Security and Privacy (2023) 56-72

  15. Decentralizing Custodial Wallets with MFKDF

    Authors: Vivek Nair, Dawn Song

    Abstract: The average cryptocurrency user today faces a difficult choice between centralized custodial wallets, which are notoriously prone to spontaneous collapse, or cumbersome self-custody solutions, which if not managed properly can cause a total loss of funds. In this paper, we present a "best of both worlds" cryptocurrency wallet design that looks like, and inherits the user experience of, a centraliz… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Journal ref: 5th IEEE International Conference on Blockchain and Cryptocurrency (2023) 1-9

  16. Truth in Motion: The Unprecedented Risks and Opportunities of Extended Reality Motion Data

    Authors: Vivek Nair, Louis Rosenberg, James F. O'Brien, Dawn Song

    Abstract: Motion tracking "telemetry" data lies at the core of nearly all modern extended reality (XR) and metaverse experiences. While generally presumed innocuous, recent studies have demonstrated that motion data actually has the potential to profile and deanonymize XR users, posing a significant threat to security and privacy in the metaverse.

    Submitted 10 June, 2023; originally announced June 2023.

    Journal ref: IEEE Security & Privacy (2024)

  17. arXiv:2305.19198  [pdf, other

    cs.HC cs.CR

    Inferring Private Personal Attributes of Virtual Reality Users from Head and Hand Motion Data

    Authors: Vivek Nair, Christian Rack, Wenbo Guo, Rui Wang, Shuixian Li, Brandon Huang, Atticus Cull, James F. O'Brien, Marc Latoschik, Louis Rosenberg, Dawn Song

    Abstract: Motion tracking "telemetry" data lies at the core of nearly all modern virtual reality (VR) and metaverse experiences. While generally presumed innocuous, recent studies have demonstrated that motion data actually has the potential to uniquely identify VR users. In this study, we go a step further, showing that a variety of private user information can be inferred just by analyzing motion data rec… ▽ More

    Submitted 10 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  18. arXiv:2305.15670  [pdf

    stat.ML cs.LG

    Interpretable Machine Learning based on Functional ANOVA Framework: Algorithms and Comparisons

    Authors: Linwei Hu, Vijayan N. Nair, Agus Sudjianto, Aijun Zhang, Jie Chen

    Abstract: In the early days of machine learning (ML), the emphasis was on developing complex algorithms to achieve best predictive performance. To understand and explain the model results, one had to rely on post hoc explainability techniques, which are known to have limitations. Recently, with the recognition that interpretability is just as important, researchers are compromising on small increases in pre… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 24 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2207.06950

  19. arXiv:2305.14320  [pdf, other

    cs.HC

    Results of the 2023 Census of Beat Saber Users: Virtual Reality Gaming Population Insights and Factors Affecting Virtual Reality E-Sports Performance

    Authors: Vivek Nair, Viktor Radulov, James F. O'Brien

    Abstract: The emergence of affordable standalone virtual reality (VR) devices has allowed VR technology to reach mass-market adoption in recent years, driven primarily by the popularity of VR gaming applications such as Beat Saber. However, despite being the top-grossing VR application to date and the most popular VR e-sport, the population of over 6 million Beat Saber users has not yet been widely studied.… ▽ More

    Submitted 30 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: for interactive version, see https://www.beatleader.xyz/census2023

  20. arXiv:2305.05982  [pdf, other

    cs.CL cs.AI cs.LG

    Generating medically-accurate summaries of patient-provider dialogue: A multi-stage approach using large language models

    Authors: Varun Nair, Elliot Schumacher, Anitha Kannan

    Abstract: A medical provider's summary of a patient visit serves several critical purposes, including clinical decision-making, facilitating hand-offs between providers, and as a reference for the patient. An effective summary is required to be coherent and accurately capture all the medically relevant information in the dialogue, despite the complexity of patient-generated language. Even minor inaccuracies… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

  21. arXiv:2304.14364  [pdf, other

    cs.CL cs.AI cs.LG

    CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants

    Authors: Albert Yu Sun, Varun Nair, Elliot Schumacher, Anitha Kannan

    Abstract: A wave of new task-based virtual assistants has been fueled by increasingly powerful large language models (LLMs), such as GPT-4 (OpenAI, 2023). A major challenge in deploying LLM-based virtual conversational assistants in real world settings is ensuring they operate within what is admissible for the task. To overcome this challenge, the designers of these virtual assistants rely on an independent… ▽ More

    Submitted 3 April, 2024; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: To appear in NAACL 2024

  22. arXiv:2303.17071  [pdf, other

    cs.CL cs.AI cs.LG

    DERA: Enhancing Large Language Model Completions with Dialog-Enabled Resolving Agents

    Authors: Varun Nair, Elliot Schumacher, Geoffrey Tso, Anitha Kannan

    Abstract: Large language models (LLMs) have emerged as valuable tools for many natural language understanding tasks. In safety-critical applications such as healthcare, the utility of these models is governed by their ability to generate outputs that are factually accurate and complete. In this work, we present dialog-enabled resolving agents (DERA). DERA is a paradigm made possible by the increased convers… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  23. arXiv:2303.16205  [pdf

    eess.IV cs.LG physics.optics

    mHealth hyperspectral learning for instantaneous spatiospectral imaging of hemodynamics

    Authors: Yuhyun Ji, Sang Mok Park, Semin Kwon, Jung Woo Leem, Vidhya Vijayakrishnan Nair, Yunjie Tong, Young L. Kim

    Abstract: Hyperspectral imaging acquires data in both the spatial and frequency domains to offer abundant physical or biological information. However, conventional hyperspectral imaging has intrinsic limitations of bulky instruments, slow data acquisition rate, and spatiospectral tradeoff. Here we introduce hyperspectral learning for snapshot hyperspectral imaging in which sampled hyperspectral data in a sm… ▽ More

    Submitted 5 April, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Journal ref: PNAS Nexus, pgad111, 2023

  24. ImageAssist: Tools for Enhancing Touchscreen-Based Image Exploration Systems for Blind and Low Vision Users

    Authors: Vishnu Nair, Hanxiu 'Hazel' Zhu, Brian A. Smith

    Abstract: Blind and low vision (BLV) users often rely on alt text to understand what a digital image is showing. However, recent research has investigated how touch-based image exploration on touchscreens can supplement alt text. Touchscreen-based image exploration systems allow BLV users to deeply understand images while granting a strong sense of agency. Yet, prior work has found that these systems requir… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23), April 2023

  25. arXiv:2302.08927  [pdf, other

    cs.CR cs.LG

    Unique Identification of 50,000+ Virtual Reality Users from Head & Hand Motion Data

    Authors: Vivek Nair, Wenbo Guo, Justus Mattern, Rui Wang, James F. O'Brien, Louis Rosenberg, Dawn Song

    Abstract: With the recent explosive growth of interest and investment in virtual reality (VR) and the so-called "metaverse," public attention has rightly shifted toward the unique security and privacy threats that these platforms may pose. While it has long been known that people reveal information about themselves via their motion, the extent to which this makes an individual globally identifiable within v… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Journal ref: 32nd USENIX Security Symposium (2023) 895-910

  26. Local retail electricity markets for distribution grid services

    Authors: Vineet Jagadeesan Nair, Anuradha Annaswamy

    Abstract: We propose a hierarchical local electricity market (LEM) at the primary and secondary feeder levels in a distribution grid, to optimally coordinate and schedule distributed energy resources (DER) and provide valuable grid services like voltage control. At the primary level, we use a current injection-based model that is valid for both radial and meshed, balanced and unbalanced, multi-phase systems… ▽ More

    Submitted 11 July, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 9 pages, 13 figures, Accepted to the 7th IEEE Conference on Control Technology and Applications (CCTA) 2023

  27. SoK: Data Privacy in Virtual Reality

    Authors: Gonzalo Munilla Garrido, Vivek Nair, Dawn Song

    Abstract: The adoption of virtual reality (VR) technologies has rapidly gained momentum in recent years as companies around the world begin to position the so-called "metaverse" as the next major medium for accessing and interacting with the internet. While consumers have become accustomed to a degree of data harvesting on the web, the real-time nature of data sharing in the metaverse indicates that privacy… ▽ More

    Submitted 18 May, 2023; v1 submitted 14 January, 2023; originally announced January 2023.

    Journal ref: 24th Privacy Enhancing Technologies Symposium (2024) 21-40

  28. arXiv:2211.12796  [pdf, other

    cs.SD cs.CL eess.AS

    IMaSC -- ICFOSS Malayalam Speech Corpus

    Authors: Deepa P Gopinath, Thennal D K, Vrinda V Nair, Swaraj K S, Sachin G

    Abstract: Modern text-to-speech (TTS) systems use deep learning to synthesize speech increasingly approaching human quality, but they require a database of high quality audio-text sentence pairs for training. Malayalam, the official language of the Indian state of Kerala and spoken by 35+ million people, is a low resource language in terms of available corpora for TTS systems. In this paper, we present IMaS… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: 18 pages, 8 figures

  29. arXiv:2211.08536  [pdf

    cs.LG

    Behavior of Hyper-Parameters for Selected Machine Learning Algorithms: An Empirical Investigation

    Authors: Anwesha Bhattacharyya, Joel Vaughan, Vijayan N. Nair

    Abstract: Hyper-parameters (HPs) are an important part of machine learning (ML) model development and can greatly influence performance. This paper studies their behavior for three algorithms: Extreme Gradient Boosting (XGB), Random Forest (RF), and Feedforward Neural Network (FFNN) with structured data. Our empirical investigation examines the qualitative behavior of model performance as the HPs vary, quan… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  30. Uncovering Visually Impaired Gamers' Preferences for Spatial Awareness Tools Within Video Games

    Authors: Vishnu Nair, Shao-en Ma, Ricardo E. Gonzalez Penuela, Yicheng He, Karen Lin, Mason Hayes, Hannah Huddleston, Matthew Donnelly, Brian A. Smith

    Abstract: Sighted players gain spatial awareness within video games through sight and spatial awareness tools (SATs) such as minimaps. Visually impaired players (VIPs), however, must often rely heavily on SATs to gain spatial awareness, especially in complex environments where using rich ambient sound design alone may be insufficient. Researchers have developed many SATs for facilitating spatial awareness w… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Journal ref: Proceedings of the 24th International ACM SIGACCESS Conference on Computers and Accessibility (ASSETS '22), October 2022

  31. arXiv:2208.09254  [pdf, other

    cs.LG cs.AI

    Mitigating Disparity while Maximizing Reward: Tight Anytime Guarantee for Improving Bandits

    Authors: Vishakha Patil, Vineet Nair, Ganesh Ghalme, Arindam Khan

    Abstract: We study the Improving Multi-Armed Bandit (IMAB) problem, where the reward obtained from an arm increases with the number of pulls it receives. This model provides an elegant abstraction for many real-world problems in domains such as education and employment, where decisions about the distribution of opportunities can affect the future capabilities of communities and the disparity between them. A… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

  32. arXiv:2208.06096  [pdf

    cs.LG stat.ML

    Comparing Baseline Shapley and Integrated Gradients for Local Explanation: Some Additional Insights

    Authors: Tianshu Feng, Zhipu Zhou, Joshi Tarun, Vijayan N. Nair

    Abstract: There are many different methods in the literature for local explanation of machine learning results. However, the methods differ in their approaches and often do not provide same explanations. In this paper, we consider two recent methods: Integrated Gradients (Sundararajan, Taly, & Yan, 2017) and Baseline Shapley (Sundararajan and Najmi, 2020). The original authors have already studied the axiom… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  33. Going Incognito in the Metaverse: Achieving Theoretically Optimal Privacy-Usability Tradeoffs in VR

    Authors: Vivek Nair, Gonzalo Munilla Garrido, Dawn Song

    Abstract: Virtual reality (VR) telepresence applications and the so-called "metaverse" promise to be the next major medium of human-computer interaction. However, with recent studies demonstrating the ease at which VR users can be profiled and deanonymized, metaverse platforms carry many of the privacy risks of the conventional internet (and more) while at present offering few of the defensive utilities tha… ▽ More

    Submitted 23 October, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: Learn more at https://rdi.berkeley.edu/metaverse/metaguard/

    Journal ref: 36th Annual ACM Symposium on User Interface Software and Technology (2023)

  34. arXiv:2208.05586  [pdf, other

    cs.CR

    Multi-Factor Key Derivation Function (MFKDF) for Fast, Flexible, Secure, & Practical Key Management

    Authors: Vivek Nair, Dawn Song

    Abstract: We present the first general construction of a Multi-Factor Key Derivation Function (MFKDF). Our function expands upon password-based key derivation functions (PBKDFs) with support for using other popular authentication factors like TOTP, HOTP, and hardware tokens in the key derivation process. In doing so, it provides an exponential security improvement over PBKDFs with less than 12 ms of additio… ▽ More

    Submitted 16 February, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: To appear in USENIX Security '23

    Journal ref: 32nd USENIX Security Symposium (2023) 2097-2114

  35. arXiv:2208.04351  [pdf, other

    cs.SE cs.LG

    Learning to Learn to Predict Performance Regressions in Production at Meta

    Authors: Moritz Beller, Hongyu Li, Vivek Nair, Vijayaraghavan Murali, Imad Ahmad, Jürgen Cito, Drew Carlson, Ari Aye, Wes Dyer

    Abstract: Catching and attributing code change-induced performance regressions in production is hard; predicting them beforehand, even harder. A primer on automatically learning to predict performance regressions in software, this article gives an account of the experiences we gained when researching and deploying an ML-based regression prediction pipeline at Meta. In this paper, we report on a comparative… ▽ More

    Submitted 22 May, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

  36. Exploring the Privacy Risks of Adversarial VR Game Design

    Authors: Vivek Nair, Gonzalo Munilla Garrido, Dawn Song, James F. O'Brien

    Abstract: Fifty study participants playtested an innocent-looking "escape room" game in virtual reality (VR). Within just a few minutes, an adversarial program had accurately inferred over 25 of their personal data attributes, from anthropometrics like height and wingspan to demographics like age and gender. As notoriously data-hungry companies become increasingly involved in VR development, this experiment… ▽ More

    Submitted 13 December, 2023; v1 submitted 26 July, 2022; originally announced July 2022.

    Comments: Learn more at https://rdi.berkeley.edu/metaverse/metadata

    Journal ref: 23rd Privacy Enhancing Technologies Symposium (2023) 238-256

  37. arXiv:2207.06950  [pdf

    stat.ML cs.LG

    Using Model-Based Trees with Boosting to Fit Low-Order Functional ANOVA Models

    Authors: Linwei Hu, Jie Chen, Vijayan N. Nair

    Abstract: Low-order functional ANOVA (fANOVA) models have been rediscovered in the machine learning (ML) community under the guise of inherently interpretable machine learning. Explainable Boosting Machines or EBM (Lou et al. 2013) and GAMI-Net (Yang et al. 2021) are two recently proposed ML algorithms for fitting functional main effects and second-order interactions. We propose a new algorithm, called GAMI… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: 25 pages plus appendix

  38. arXiv:2206.12353  [pdf

    stat.ML cs.LG

    Quantifying Inherent Randomness in Machine Learning Algorithms

    Authors: Soham Raste, Rahul Singh, Joel Vaughan, Vijayan N. Nair

    Abstract: Most machine learning (ML) algorithms have several stochastic elements, and their performances are affected by these sources of randomness. This paper uses an empirical study to systematically examine the effects of two sources: randomness in model training and randomness in the partitioning of a dataset into training and test subsets. We quantify and compare the magnitude of the variation in pred… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 14 pages, 4 Figures, 5 tables

  39. arXiv:2206.08542  [pdf, other

    cs.LG cs.GT

    Strategic Representation

    Authors: Vineet Nair, Ganesh Ghalme, Inbal Talgam-Cohen, Nir Rosenfeld

    Abstract: Humans have come to rely on machines for reducing excessive information to manageable representations. But this reliance can be abused -- strategic machines might craft representations that manipulate their users. How can a user make good choices based on strategic representations? We formalize this as a learning problem, and pursue algorithms for decision-making that are robust to manipulation. I… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  40. arXiv:2205.12723  [pdf

    cs.LG

    Interpretable Feature Engineering for Time Series Predictors using Attention Networks

    Authors: Tianjie Wang, Jie Chen, Joel Vaughan, Vijayan N. Nair

    Abstract: Regression problems with time-series predictors are common in banking and many other areas of application. In this paper, we use multi-head attention networks to develop interpretable features and use them to achieve good predictive performance. The customized attention layer explicitly uses multiplicative interactions and builds feature-engineering heads that capture temporal dynamics in a parsim… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  41. arXiv:2204.13663  [pdf, other

    cs.AI cs.CY

    ADVISER: AI-Driven Vaccination Intervention Optimiser for Increasing Vaccine Uptake in Nigeria

    Authors: Vineet Nair, Kritika Prakash, Michael Wilbur, Aparna Taneja, Corinne Namblard, Oyindamola Adeyemo, Abhishek Dubey, Abiodun Adereni, Milind Tambe, Ayan Mukhopadhyay

    Abstract: More than 5 million children under five years die from largely preventable or treatable medical conditions every year, with an overwhelmingly large proportion of deaths occurring in under-developed countries with low vaccination uptake. One of the United Nations' sustainable development goals (SDG 3) aims to end preventable deaths of newborns and children under five years of age. We focus on Niger… ▽ More

    Submitted 5 July, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

    Comments: Accepted for publication at International Joint Conference on Artificial Intelligence 2022, AI for Good Track (IJCAI-22)

  42. arXiv:2204.12868  [pdf

    stat.ML cs.LG

    Performance and Interpretability Comparisons of Supervised Machine Learning Algorithms: An Empirical Study

    Authors: Alice J. Liu, Arpita Mukherjee, Linwei Hu, Jie Chen, Vijayan N. Nair

    Abstract: This paper compares the performances of three supervised machine learning algorithms in terms of predictive ability and model interpretation on structured or tabular data. The algorithms considered were scikit-learn implementations of extreme gradient boosting machines (XGB) and random forests (RFs), and feedforward neural networks (FFNNs) from TensorFlow. The paper is organized in a findings-base… ▽ More

    Submitted 5 May, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

  43. arXiv:2204.12365  [pdf

    stat.ML cs.LG

    Explaining Adverse Actions in Credit Decisions Using Shapley Decomposition

    Authors: Vijayan N. Nair, Tianshu Feng, Linwei Hu, Zach Zhang, Jie Chen, Agus Sudjianto

    Abstract: When a financial institution declines an application for credit, an adverse action (AA) is said to occur. The applicant is then entitled to an explanation for the negative decision. This paper focuses on credit decisions based on a predictive model for probability of default and proposes a methodology for AA explanation. The problem involves identifying the important predictors responsible for the… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 20 pages, 8 figures

  44. arXiv:2112.02958  [pdf, other

    cs.LG cs.DC

    Automap: Towards Ergonomic Automated Parallelism for ML Models

    Authors: Michael Schaarschmidt, Dominik Grewe, Dimitrios Vytiniotis, Adam Paszke, Georg Stefan Schmid, Tamara Norman, James Molloy, Jonathan Godwin, Norman Alexander Rink, Vinod Nair, Dan Belov

    Abstract: The rapid rise in demand for training large neural network architectures has brought into focus the need for partitioning strategies, for example by using data, model, or pipeline parallelism. Implementing these methods is increasingly supported through program primitives, but identifying efficient partitioning strategies requires expensive experimentation and expertise. We present the prototype o… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Workshop on ML for Systems at NeurIPS 2021

  45. arXiv:2111.09839  [pdf, other

    cs.LG

    Training Neural Networks with Fixed Sparse Masks

    Authors: Yi-Lin Sung, Varun Nair, Colin Raffel

    Abstract: During typical gradient-based training of deep neural networks, all of the model's parameters are updated at each iteration. Recent work has shown that it is possible to update only a small subset of the model's parameters during training, which can alleviate storage and communication requirements. In this paper, we show that it is possible to induce a fixed sparse mask on the model's parameters t… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

  46. arXiv:2111.07564  [pdf, other

    cs.CL cs.AI cs.LG

    Adding more data does not always help: A study in medical conversation summarization with PEGASUS

    Authors: Varun Nair, Namit Katariya, Xavier Amatriain, Ilya Valmianski, Anitha Kannan

    Abstract: Medical conversation summarization is integral in capturing information gathered during interactions between patients and physicians. Summarized conversations are used to facilitate patient hand-offs between physicians, and as part of providing care in the future. Summaries, however, can be time-consuming to produce and require domain expertise. Modern pre-trained NLP models such as PEGASUS have e… ▽ More

    Submitted 28 November, 2021; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted to Machine Learning for Healthcare Workshop, NeurIPS 2021

  47. arXiv:2109.11603  [pdf

    cs.CL cs.LG

    Document Automation Architectures and Technologies: A Survey

    Authors: Mohammad Ahmadi Achachlouei, Omkar Patil, Tarun Joshi, Vijayan N. Nair

    Abstract: This paper surveys the current state of the art in document automation (DA). The objective of DA is to reduce the manual effort during the generation of documents by automatically integrating input from different sources and assembling documents conforming to defined templates. There have been reviews of commercial solutions of DA, particularly in the legal domain, but to date there has been no co… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 34 pages, 11 figures, 5 tables

    MSC Class: 68T50 ACM Class: I.7.0; I.2.7; I.2.4

  48. arXiv:2109.04244  [pdf, other

    stat.ML cs.LG

    Supervised Linear Dimension-Reduction Methods: Review, Extensions, and Comparisons

    Authors: Shaojie Xu, Joel Vaughan, Jie Chen, Agus Sudjianto, Vijayan Nair

    Abstract: Principal component analysis (PCA) is a well-known linear dimension-reduction method that has been widely used in data analysis and modeling. It is an unsupervised learning technique that identifies a suitable linear subspace for the input variable that contains maximal variation and preserves as much information as possible. PCA has also been used in prediction models where the original, high-dim… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

  49. NavStick: Making Video Games Blind-Accessible via the Ability to Look Around

    Authors: Vishnu Nair, Jay L. Karp, Samuel Silverman, Mohar Kalra, Hollis Lehv, Faizan Jamil, Brian A. Smith

    Abstract: Video games remain largely inaccessible to visually impaired people (VIPs). Today's blind-accessible games are highly simplified renditions of what sighted players enjoy, and they do not give VIPs the same freedom to look around and explore game worlds on their own terms. In this work, we introduce NavStick, an audio-based tool for looking around within virtual environments, with the aim of making… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: 14 pages with 13 figures and 2 tables

    Journal ref: Proceedings of the 34th Annual ACM Symposium on User Interface Software and Technology (UIST '21), October 2021

  50. arXiv:2107.10201  [pdf, other

    math.OC cs.LG

    Learning a Large Neighborhood Search Algorithm for Mixed Integer Programs

    Authors: Nicolas Sonnerat, Pengming Wang, Ira Ktena, Sergey Bartunov, Vinod Nair

    Abstract: Large Neighborhood Search (LNS) is a combinatorial optimization heuristic that starts with an assignment of values for the variables to be optimized, and iteratively improves it by searching a large neighborhood around the current assignment. In this paper we consider a learning-based LNS approach for mixed integer programs (MIPs). We train a Neural Diving model to represent a probability distribu… ▽ More

    Submitted 20 May, 2022; v1 submitted 21 July, 2021; originally announced July 2021.