Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 77 results for author: Beirami, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16738  [pdf, other

    cs.LG cs.AI cs.CY

    Inducing Group Fairness in LLM-Based Decisions

    Authors: James Atwood, Preethi Lahoti, Ananth Balashankar, Flavien Prost, Ahmad Beirami

    Abstract: Prompting Large Language Models (LLMs) has created new and interesting means for classifying textual data. While evaluating and remediating group fairness is a well-studied problem in classifier fairness literature, some classical approaches (e.g., regularization) do not carry over, and some new opportunities arise (e.g., prompt-based remediation). We measure fairness of LLM-based classifiers on a… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.05946  [pdf, other

    cs.CR cs.AI

    Safety Alignment Should Be Made More Than Just a Few Tokens Deep

    Authors: Xiangyu Qi, Ashwinee Panda, Kaifeng Lyu, Xiao Ma, Subhrajit Roy, Ahmad Beirami, Prateek Mittal, Peter Henderson

    Abstract: The safety alignment of current Large Language Models (LLMs) is vulnerable. Relatively simple attacks, or even benign fine-tuning, can jailbreak aligned models. We argue that many of these vulnerabilities are related to a shared underlying issue: safety alignment can take shortcuts, wherein the alignment adapts a model's generative distribution primarily over only its very first few output tokens.… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2405.19316  [pdf, other

    cs.LG cs.CL

    Robust Preference Optimization through Reward Model Distillation

    Authors: Adam Fisch, Jacob Eisenstein, Vicky Zayats, Alekh Agarwal, Ahmad Beirami, Chirag Nagpal, Pete Shaw, Jonathan Berant

    Abstract: Language model (LM) post-training (or alignment) involves maximizing a reward function that is derived from preference annotations. Direct Preference Optimization (DPO) is a popular offline alignment method that trains a policy directly on preference data without the need to train a reward model or apply reinforcement learning. However, typical preference datasets have only a single, or at most a… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  4. arXiv:2405.18654  [pdf, other

    cs.CV

    Mitigating Object Hallucination via Data Augmented Contrastive Tuning

    Authors: Pritam Sarkar, Sayna Ebrahimi, Ali Etemad, Ahmad Beirami, Sercan Ö. Arık, Tomas Pfister

    Abstract: Despite their remarkable progress, Multimodal Large Language Models (MLLMs) tend to hallucinate factually inaccurate information. In this work, we address object hallucinations in MLLMs, where information is offered about an object that is not present in the model input. We introduce a contrastive tuning method that can be applied to a pretrained off-the-shelf MLLM for mitigating hallucinations wh… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  5. arXiv:2404.12318  [pdf, other

    cs.CL

    Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment

    Authors: Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami

    Abstract: Aligning language models (LMs) based on human-annotated preference data is a crucial step in obtaining practical and performant LM-based systems. However, multilingual human preference data are difficult to obtain at scale, making it challenging to extend this framework to diverse languages. In this work, we evaluate a simple approach for zero-shot cross-lingual alignment, where a reward model is… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  6. arXiv:2404.01730  [pdf, other

    cs.LG cs.IT stat.ML

    Asymptotics of Language Model Alignment

    Authors: Joy Qiping Yang, Salman Salamatian, Ziteng Sun, Ananda Theertha Suresh, Ahmad Beirami

    Abstract: Let $p$ denote a generative language model. Let $r$ denote a reward model that returns a scalar that captures the degree at which a draw from $p$ is preferred. The goal of language model alignment is to alter $p$ to a new distribution $φ$ that results in a higher expected reward while keeping $φ$ close to $p.$ A popular alignment method is the KL-constrained reinforcement learning (RL), which choo… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  7. arXiv:2403.10444  [pdf, other

    cs.LG cs.CL cs.DS cs.IT

    Block Verification Accelerates Speculative Decoding

    Authors: Ziteng Sun, Uri Mendlovic, Yaniv Leviathan, Asaf Aharoni, Ahmad Beirami, Jae Hun Ro, Ananda Theertha Suresh

    Abstract: Speculative decoding is an effective method for lossless acceleration of large language models during inference. It uses a fast model to draft a block of tokens which are then verified in parallel by the target model, and provides a guarantee that the output is distributed identically to a sample from the target model. In prior works, draft verification is performed independently token-by-token. S… ▽ More

    Submitted 25 July, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  8. arXiv:2401.16656  [pdf, other

    cs.CL

    Gradient-Based Language Model Red Teaming

    Authors: Nevan Wichers, Carson Denison, Ahmad Beirami

    Abstract: Red teaming is a common strategy for identifying weaknesses in generative language models (LMs), where adversarial prompts are produced that trigger an LM to generate unsafe responses. Red teaming is instrumental for both model alignment and evaluation, but is labor-intensive and difficult to scale when done by humans. In this paper, we present Gradient-Based Red Teaming (GBRT), a red teaming meth… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: EACL 2024 main conference

  9. arXiv:2401.01879  [pdf, other

    cs.LG cs.CL cs.IT

    Theoretical guarantees on the best-of-n alignment policy

    Authors: Ahmad Beirami, Alekh Agarwal, Jonathan Berant, Alexander D'Amour, Jacob Eisenstein, Chirag Nagpal, Ananda Theertha Suresh

    Abstract: A simple and effective method for the alignment of generative models is the best-of-$n$ policy, where $n$ samples are drawn from a base policy, and ranked based on a reward function, and the highest ranking one is selected. A commonly used analytical expression in the literature claims that the KL divergence between the best-of-$n$ policy and the base policy is equal to $\log (n) - (n-1)/n.$ We di… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  10. arXiv:2312.09244  [pdf, other

    cs.LG

    Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

    Authors: Jacob Eisenstein, Chirag Nagpal, Alekh Agarwal, Ahmad Beirami, Alex D'Amour, DJ Dvijotham, Adam Fisch, Katherine Heller, Stephen Pfohl, Deepak Ramachandran, Peter Shaw, Jonathan Berant

    Abstract: Reward models play a key role in aligning language model applications towards human preferences. However, this setup creates an incentive for the language model to exploit errors in the reward model to achieve high estimated reward, a phenomenon often termed \emph{reward hacking}. A natural mitigation is to train an ensemble of reward models, aggregating over model outputs to obtain a more robust… ▽ More

    Submitted 16 August, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Published at the 2024 Conference on Language Modeling (CoLM)

  11. arXiv:2312.03867  [pdf, other

    cs.LG cs.CY cs.IT stat.ML

    Multi-Group Fairness Evaluation via Conditional Value-at-Risk Testing

    Authors: Lucas Monteiro Paes, Ananda Theertha Suresh, Alex Beutel, Flavio P. Calmon, Ahmad Beirami

    Abstract: Machine learning (ML) models used in prediction and classification tasks may display performance disparities across population groups determined by sensitive attributes (e.g., race, sex, age). We consider the problem of evaluating the performance of a fixed ML model across population groups defined by multiple sensitive attributes (e.g., race and sex and age). Here, the sample complexity for estim… ▽ More

    Submitted 25 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the IEEE Journal on Selected Areas in Information Theory (JSAIT)

  12. arXiv:2312.02592  [pdf, other

    cs.LG cs.CY

    FRAPPE: A Group Fairness Framework for Post-Processing Everything

    Authors: Alexandru Tifrea, Preethi Lahoti, Ben Packer, Yoni Halpern, Ahmad Beirami, Flavien Prost

    Abstract: Despite achieving promising fairness-error trade-offs, in-processing mitigation techniques for group fairness cannot be employed in numerous practical applications with limited computation resources or no access to the training pipeline of the prediction model. In these situations, post-processing is a viable alternative. However, current methods are tailored to specific problem settings and fairn… ▽ More

    Submitted 20 June, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Conference paper at ICML 2024

  13. arXiv:2311.01047  [pdf, other

    cs.LG cs.IT eess.SP

    Improving Robustness via Tilted Exponential Layer: A Communication-Theoretic Perspective

    Authors: Bhagyashree Puranik, Ahmad Beirami, Yao Qin, Upamanyu Madhow

    Abstract: State-of-the-art techniques for enhancing robustness of deep networks mostly rely on empirical risk minimization with suitable data augmentation. In this paper, we propose a complementary approach motivated by communication theory, aimed at enhancing the signal-to-noise ratio at the output of a neural network layer via neural competition during learning and inference. In addition to standard empir… ▽ More

    Submitted 3 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 27th International Conference on Artificial Intelligence and Statistics (AISTATS 2024)

  14. arXiv:2310.17022  [pdf, other

    cs.LG cs.AI cs.CL

    Controlled Decoding from Language Models

    Authors: Sidharth Mudgal, Jong Lee, Harish Ganapathy, YaGuang Li, Tao Wang, Yanping Huang, Zhifeng Chen, Heng-Tze Cheng, Michael Collins, Trevor Strohman, Jilin Chen, Alex Beutel, Ahmad Beirami

    Abstract: KL-regularized reinforcement learning (RL) is a popular alignment framework to control the language model responses towards high reward outcomes. We pose a tokenwise RL objective and propose a modular solver for it, called controlled decoding (CD). CD exerts control through a separate prefix scorer module, which is trained to learn a value function for the reward. The prefix scorer is used at infe… ▽ More

    Submitted 3 June, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: ICML 2024

  15. arXiv:2310.16959  [pdf, other

    cs.LG

    Improving Few-shot Generalization of Safety Classifiers via Data Augmented Parameter-Efficient Fine-Tuning

    Authors: Ananth Balashankar, Xiao Ma, Aradhana Sinha, Ahmad Beirami, Yao Qin, Jilin Chen, Alex Beutel

    Abstract: As large language models (LLMs) are widely adopted, new safety issues and policies emerge, to which existing safety classifiers do not generalize well. If we have only observed a few examples of violations of a new safety rule, how can we build a classifier to detect violations? In this paper, we study the novel setting of domain-generalized few-shot learning for LLM-based text safety classifiers.… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  16. arXiv:2310.16955  [pdf, other

    cs.LG

    Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks

    Authors: Aradhana Sinha, Ananth Balashankar, Ahmad Beirami, Thi Avrahami, Jilin Chen, Alex Beutel

    Abstract: Real-world natural language processing systems need to be robust to human adversaries. Collecting examples of human adversaries for training is an effective but expensive solution. On the other hand, training on synthetic attacks with small perturbations - such as word-substitution - does not actually improve robustness to human adversaries. In this paper, we propose an adversarial training framew… ▽ More

    Submitted 14 February, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Journal ref: Transactions on Machine Learning Research (2024)

  17. arXiv:2310.16523  [pdf, other

    cs.CL cs.AI

    Improving Diversity of Demographic Representation in Large Language Models via Collective-Critiques and Self-Voting

    Authors: Preethi Lahoti, Nicholas Blumm, Xiao Ma, Raghavendra Kotikalapudi, Sahitya Potluri, Qijun Tan, Hansa Srinivasan, Ben Packer, Ahmad Beirami, Alex Beutel, Jilin Chen

    Abstract: A crucial challenge for generative large language models (LLMs) is diversity: when a user's prompt is under-specified, models may follow implicit assumptions while generating a response, which may result in homogenization of the responses, as well as certain demographic groups being under-represented or even erased from the generated responses. In this paper, we formalize diversity of representati… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: To appear at EMNLP 2023 main conference

  18. arXiv:2310.15141  [pdf, other

    cs.LG cs.CL cs.DS cs.IT

    SpecTr: Fast Speculative Decoding via Optimal Transport

    Authors: Ziteng Sun, Ananda Theertha Suresh, Jae Hun Ro, Ahmad Beirami, Himanshu Jain, Felix Yu

    Abstract: Autoregressive sampling from large language models has led to state-of-the-art results in several natural language tasks. However, autoregressive sampling generates tokens one at a time making it slow, and even prohibitive in certain tasks. One way to speed up sampling is $\textit{speculative decoding}$: use a small model to sample a $\textit{draft}$ (block or sequence of tokens), and then score a… ▽ More

    Submitted 17 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  19. arXiv:2310.11401  [pdf, other

    cs.LG

    Enhancing Group Fairness in Online Settings Using Oblique Decision Forests

    Authors: Somnath Basu Roy Chowdhury, Nicholas Monath, Ahmad Beirami, Rahul Kidambi, Avinava Dubey, Amr Ahmed, Snigdha Chaturvedi

    Abstract: Fairness, especially group fairness, is an important consideration in the context of machine learning systems. The most commonly adopted group fairness-enhancing techniques are in-processing methods that rely on a mixture of a fairness objective (e.g., demographic parity) and a task-specific objective (e.g., cross-entropy) during the training process. However, when data arrives in an online fashio… ▽ More

    Submitted 27 April, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (Spotlight)

  20. arXiv:2307.12980  [pdf, other

    cs.CV

    A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

    Authors: Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr

    Abstract: Prompt engineering is a technique that involves augmenting a large pre-trained model with task-specific hints, known as prompts, to adapt the model to new tasks. Prompts can be created manually as natural language instructions or generated automatically as either natural language instructions or vector representations. Prompt engineering enables the ability to perform predictions based solely on p… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  21. arXiv:2307.05728  [pdf, other

    cs.LG cs.AI cs.CY

    Towards A Scalable Solution for Improving Multi-Group Fairness in Compositional Classification

    Authors: James Atwood, Tina Tian, Ben Packer, Meghana Deodhar, Jilin Chen, Alex Beutel, Flavien Prost, Ahmad Beirami

    Abstract: Despite the rich literature on machine learning fairness, relatively little attention has been paid to remediating complex systems, where the final prediction is the combination of multiple classifiers and where multiple groups are present. In this paper, we first show that natural baseline approaches for improving equal opportunity fairness scale linearly with the product of the number of remedia… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  22. arXiv:2306.14308  [pdf, ps, other

    cs.CL cs.AI

    Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning

    Authors: Xiao Ma, Swaroop Mishra, Ahmad Beirami, Alex Beutel, Jilin Chen

    Abstract: Language models still struggle on moral reasoning, despite their impressive performance in many other tasks. In particular, the Moral Scenarios task in MMLU (Multi-task Language Understanding) is among the worst performing tasks for many language models, including GPT-3. In this work, we propose a new prompting framework, Thought Experiments, to teach language models to do better moral reasoning u… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: 8 pages, ICML Neural Conversational AI workshop, thought experiments, moral reasoning

  23. arXiv:2306.02014  [pdf, other

    cs.CV cs.LG

    Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

    Authors: Pritam Sarkar, Ahmad Beirami, Ali Etemad

    Abstract: Video self-supervised learning (VSSL) has made significant progress in recent years. However, the exact behavior and dynamics of these models under different forms of distribution shift are not yet known. In this paper, we comprehensively study the behavior of six popular self-supervised methods (v-SimCLR, v-MoCo, v-BYOL, v-SimSiam, v-DINO, v-MAE) in response to various forms of natural distributi… ▽ More

    Submitted 30 October, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 Spotlight

  24. arXiv:2304.08479  [pdf, other

    cs.CV

    Towards Robust Prompts on Vision-Language Models

    Authors: Jindong Gu, Ahmad Beirami, Xuezhi Wang, Alex Beutel, Philip Torr, Yao Qin

    Abstract: With the advent of vision-language models (VLMs) that can perform in-context and prompt-based learning, how can we design prompting approaches that robustly generalize to distribution shift and can be used on novel classes outside the support set of the prompts? In this work, we first define two types of robustness to distribution shift on VLMs, namely, robustness on base classes (the classes incl… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  25. arXiv:2205.02392  [pdf, other

    cs.CL cs.AI

    Robust Conversational Agents against Imperceptible Toxicity Triggers

    Authors: Ninareh Mehrabi, Ahmad Beirami, Fred Morstatter, Aram Galstyan

    Abstract: Warning: this paper contains content that maybe offensive or upsetting. Recent research in Natural Language Processing (NLP) has advanced the development of various toxicity detection models with the intention of identifying and mitigating toxic language from existing systems. Despite the abundance of research in this area, less attention has been given to adversarial attacks that force the system… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

  26. arXiv:2112.08351  [pdf, other

    cs.CL

    Database Search Results Disambiguation for Task-Oriented Dialog Systems

    Authors: Kun Qian, Ahmad Beirami, Satwik Kottur, Shahin Shayandeh, Paul Crook, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar

    Abstract: As task-oriented dialog systems are becoming increasingly popular in our lives, more realistic tasks have been proposed and explored. However, new practical challenges arise. For instance, current dialog systems cannot effectively handle multiple search results when querying a database, due to the lack of such scenarios in existing public datasets. In this paper, we propose Database Search Result… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  27. arXiv:2112.08321  [pdf, other

    cs.CL

    Know Thy Strengths: Comprehensive Dialogue State Tracking Diagnostics

    Authors: Hyundong Cho, Chinnadhurai Sankar, Christopher Lin, Kaushik Ram Sadagopan, Shahin Shayandeh, Asli Celikyilmaz, Jonathan May, Ahmad Beirami

    Abstract: Recent works that revealed the vulnerability of dialogue state tracking (DST) models to distributional shifts have made holistic comparisons on robustness and qualitative analyses increasingly important for understanding their relative performance. We present our findings from standardized and comprehensive DST diagnoses, which have previously been sparse and uncoordinated, using our toolkit, Chec… ▽ More

    Submitted 4 November, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: EMNLP2022

  28. arXiv:2111.04579  [pdf, ps, other

    cs.LG cs.IT stat.ML

    Information-Theoretic Bayes Risk Lower Bounds for Realizable Models

    Authors: Matthew Nokleby, Ahmad Beirami

    Abstract: We derive information-theoretic lower bounds on the Bayes risk and generalization error of realizable machine learning models. In particular, we employ an analysis in which the rate-distortion function of the model parameters bounds the required mutual information between the training samples and the model parameters in order to learn a model up to a Bayes risk constraint. For realizable models, w… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  29. arXiv:2110.15252  [pdf, other

    cs.LG cs.AI cs.CR

    Federated Learning with Heterogeneous Differential Privacy

    Authors: Nasser Aldaghri, Hessam Mahdavifar, Ahmad Beirami

    Abstract: Federated learning (FL) takes a first step towards privacy-preserving machine learning by training models while keeping client data local. Models trained using FL may still leak private client information through model updates during training. Differential privacy (DP) may be employed on model updates to provide privacy guarantees within FL, typically at the cost of degraded performance of the fin… ▽ More

    Submitted 14 January, 2023; v1 submitted 28 October, 2021; originally announced October 2021.

  30. arXiv:2110.11205  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Robustness through Data Augmentation Loss Consistency

    Authors: Tianjian Huang, Shaunak Halbe, Chinnadhurai Sankar, Pooyan Amini, Satwik Kottur, Alborz Geramifard, Meisam Razaviyayn, Ahmad Beirami

    Abstract: While deep learning through empirical risk minimization (ERM) has succeeded at achieving human-level performance at a variety of complex tasks, ERM is not robust to distribution shifts or adversarial attacks. Synthetic data augmentation followed by empirical risk minimization (DA-ERM) is a simple and widely used solution to improve robustness in ERM. In addition, consistency regularization can be… ▽ More

    Submitted 24 January, 2023; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: 40 pages

  31. arXiv:2109.06141  [pdf, other

    cs.LG cs.IT math.OC stat.ML

    On Tilted Losses in Machine Learning: Theory and Applications

    Authors: Tian Li, Ahmad Beirami, Maziar Sanjabi, Virginia Smith

    Abstract: Exponential tilting is a technique commonly used in fields such as statistics, probability, information theory, and optimization to create parametric distribution shifts. Despite its prevalence in related fields, tilting has not seen widespread use in machine learning. In this work, we aim to bridge this gap by exploring the use of tilting in risk minimization. We study a simple extension to ERM -… ▽ More

    Submitted 1 June, 2023; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.01162

  32. arXiv:2105.14150  [pdf, other

    cs.CL

    Annotation Inconsistency and Entity Bias in MultiWOZ

    Authors: Kun Qian, Ahmad Beirami, Zhouhan Lin, Ankita De, Alborz Geramifard, Zhou Yu, Chinnadhurai Sankar

    Abstract: MultiWOZ is one of the most popular multi-domain task-oriented dialog datasets, containing 10K+ annotated dialogs covering eight domains. It has been widely accepted as a benchmark for various dialog tasks, e.g., dialog state tracking (DST), natural language generation (NLG), and end-to-end (E2E) dialog modeling. In this work, we identify an overlooked issue with dialog state annotation inconsiste… ▽ More

    Submitted 15 February, 2022; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: Accepted by SIGDIAL 2021

  33. arXiv:2102.12586  [pdf, other

    cs.LG cs.IT

    A Stochastic Optimization Framework for Fair Risk Minimization

    Authors: Andrew Lowy, Sina Baharlouei, Rakesh Pavan, Meisam Razaviyayn, Ahmad Beirami

    Abstract: Despite the success of large-scale empirical risk minimization (ERM) at achieving high accuracy across a variety of machine learning tasks, fair ERM is hindered by the incompatibility of fairness constraints with stochastic optimization. We consider the problem of fair classification with discrete sensitive attributes and potentially large models and data sets, requiring stochastic solvers. Existi… ▽ More

    Submitted 11 January, 2023; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: 44 pages

    Journal ref: Transactions on Machine Learning Research, 2022

  34. arXiv:2101.00151  [pdf, other

    cs.AI cs.CL cs.LG

    DVD: A Diagnostic Dataset for Multi-step Reasoning in Video Grounded Dialogue

    Authors: Hung Le, Chinnadhurai Sankar, Seungwhan Moon, Ahmad Beirami, Alborz Geramifard, Satwik Kottur

    Abstract: A video-grounded dialogue system is required to understand both dialogue, which contains semantic dependencies from turn to turn, and video, which contains visual cues of spatial and temporal scene variations. Building such dialogue systems is a challenging problem, involving various reasoning types on both visual and language inputs. Existing benchmarks do not have enough annotations to thoroughl… ▽ More

    Submitted 14 June, 2021; v1 submitted 31 December, 2020; originally announced January 2021.

    Comments: 20 pages, 14 figures, 8 tables

    Journal ref: Association for Computational Linguistics (2021)

  35. arXiv:2012.15721  [pdf, other

    cs.LG cs.AI cs.CR cs.IT

    Coded Machine Unlearning

    Authors: Nasser Aldaghri, Hessam Mahdavifar, Ahmad Beirami

    Abstract: There are applications that may require removing the trace of a sample from the system, e.g., a user requests their data to be deleted, or corrupted data is discovered. Simply removing a sample from storage units does not necessarily remove its entire trace since downstream machine learning models may store some information about the samples used to train them. A sample can be perfectly unlearned… ▽ More

    Submitted 15 June, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

    Comments: Accepted for publication in IEEE Access

  36. arXiv:2012.04221  [pdf, other

    cs.LG stat.ML

    Ditto: Fair and Robust Federated Learning Through Personalization

    Authors: Tian Li, Shengyuan Hu, Ahmad Beirami, Virginia Smith

    Abstract: Fairness and robustness are two important concerns for federated learning systems. In this work, we identify that robustness to data and model poisoning attacks and fairness, measured as the uniformity of performance across devices, are competing constraints in statistically heterogeneous networks. To address these constraints, we propose employing a simple, general framework for personalized fede… ▽ More

    Submitted 15 June, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted by ICML 2021

  37. arXiv:2011.06486  [pdf, ps, other

    cs.CL

    Overview of the Ninth Dialog System Technology Challenge: DSTC9

    Authors: Chulaka Gunasekara, Seokhwan Kim, Luis Fernando D'Haro, Abhinav Rastogi, Yun-Nung Chen, Mihail Eric, Behnam Hedayatnia, Karthik Gopalakrishnan, Yang Liu, Chao-Wei Huang, Dilek Hakkani-Tür, Jinchao Li, Qi Zhu, Lingxiao Luo, Lars Liden, Kaili Huang, Shahin Shayandeh, Runze Liang, Baolin Peng, Zheng Zhang, Swadheen Shukla, Minlie Huang, Jianfeng Gao, Shikib Mehri, Yulan Feng , et al. (14 additional authors not shown)

    Abstract: This paper introduces the Ninth Dialog System Technology Challenge (DSTC-9). This edition of the DSTC focuses on applying end-to-end dialog technologies for four distinct tasks in dialog systems, namely, 1. Task-oriented dialog Modeling with unstructured knowledge access, 2. Multi-domain task-oriented dialog, 3. Interactive evaluation of dialog, and 4. Situated interactive multi-modal dialog. This… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  38. arXiv:2011.05457  [pdf, other

    cs.CL cs.AI

    Resource Constrained Dialog Policy Learning via Differentiable Inductive Logic Programming

    Authors: Zhenpeng Zhou, Ahmad Beirami, Paul Crook, Pararth Shah, Rajen Subba, Alborz Geramifard

    Abstract: Motivated by the needs of resource constrained dialog policy learning, we introduce dialog policy via differentiable inductive logic (DILOG). We explore the tasks of one-shot learning and zero-shot domain transfer with DILOG on SimDial and MultiWoZ. Using a single representative dialog from the restaurant domain, we train DILOG on the SimDial dataset and obtain 99+% in-domain test accuracy. We als… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

  39. arXiv:2008.12823  [pdf, other

    cs.IT

    Centralized vs Decentralized Targeted Brute-Force Attacks: Guessing with Side-Information

    Authors: Salman Salamatian, Wasim Huleihel, Ahmad Beirami, Asaf Cohen, Muriel Médard

    Abstract: According to recent empirical studies, a majority of users have the same, or very similar, passwords across multiple password-secured online services. This practice can have disastrous consequences, as one password being compromised puts all the other accounts at much higher risk. Generally, an adversary may use any side-information he/she possesses about the user, be it demographic information, p… ▽ More

    Submitted 28 August, 2020; originally announced August 2020.

    Journal ref: IEEE Transactions on Information Forensics and Security, 15, 3749-3759 (2020)

  40. arXiv:2007.01162  [pdf, other

    cs.LG cs.IT stat.ML

    Tilted Empirical Risk Minimization

    Authors: Tian Li, Ahmad Beirami, Maziar Sanjabi, Virginia Smith

    Abstract: Empirical risk minimization (ERM) is typically designed to perform well on the average loss, which can result in estimators that are sensitive to outliers, generalize poorly, or treat subgroups unfairly. While many methods aim to address these problems individually, in this work, we explore them through a unified framework -- tilted empirical risk minimization (TERM). In particular, we show that i… ▽ More

    Submitted 17 March, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: Accepted by ICLR 2021

  41. arXiv:2006.13763  [pdf, ps, other

    cs.SI cs.AI cs.HC cs.MA

    Competitive Balance in Team Sports Games

    Authors: Sofia M Nikolakaki, Ogheneovo Dibie, Ahmad Beirami, Nicholas Peterson, Navid Aghdaie, Kazi Zaman

    Abstract: Competition is a primary driver of player satisfaction and engagement in multiplayer online games. Traditional matchmaking systems aim at creating matches involving teams of similar aggregated individual skill levels, such as Elo score or TrueSkill. However, team dynamics cannot be solely captured using such linear predictors. Recently, it has been shown that nonlinear predictors that target to le… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: 2020 IEEE Conference in Games (COG 2020), 8 pages

  42. arXiv:2006.01460  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Situated and Interactive Multimodal Conversations

    Authors: Seungwhan Moon, Satwik Kottur, Paul A. Crook, Ankita De, Shivani Poddar, Theodore Levin, David Whitney, Daniel Difranco, Ahmad Beirami, Eunjoon Cho, Rajen Subba, Alborz Geramifard

    Abstract: Next generation virtual assistants are envisioned to handle multimodal inputs (e.g., vision, memories of previous interactions, in addition to the user's utterances), and perform multimodal actions (e.g., displaying a route in addition to generating the system's utterance). We introduce Situated Interactive MultiModal Conversations (SIMMC) as a new direction aimed at training agents that take mult… ▽ More

    Submitted 10 November, 2020; v1 submitted 2 June, 2020; originally announced June 2020.

    Comments: 20 pages, 5 figures, 11 tables, accepted to COLING 2020

  43. arXiv:1907.02482  [pdf, ps, other

    cs.IT

    Nonlinear Function Estimation with Empirical Bayes and Approximate Message Passing

    Authors: Hangjin Liu, You, Zhou, Ahmad Beirami, Dror Baron

    Abstract: Nonlinear function estimation is core to modern machine learning applications. In this paper, to perform nonlinear function estimation, we reduce a nonlinear inverse problem to a linear one using a polynomial kernel expansion. These kernels increase the feature set, and may result in poorly conditioned matrices. Nonetheless, we show several examples where the matrix in our linear inverse problem c… ▽ More

    Submitted 30 September, 2019; v1 submitted 4 July, 2019; originally announced July 2019.

    Comments: in Proc. of the 57th Annual Allerton Conference on Communication, Control, and Computing (8 pages, 2 figures)

  44. arXiv:1907.00531  [pdf, other

    cs.IT

    Mismatched Guesswork

    Authors: Salman Salamatian, Litian Liu, Ahmad Beirami, Muriel Médard

    Abstract: We study the problem of mismatched guesswork, where we evaluate the number of symbols $y \in \mathcal{Y}$ which have higher likelihood than $X \sim μ$ according to a mismatched distribution $ν$. We discuss the role of the tilted/exponential families of the source distribution $μ$ and of the mismatched distribution $ν$. We show that the value of guesswork can be characterized using the tilted famil… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

    Comments: Accepted to ITW 2019

  45. arXiv:1906.12005  [pdf, other

    cs.LG stat.ML

    Rényi Fair Inference

    Authors: Sina Baharlouei, Maher Nouiehed, Ahmad Beirami, Meisam Razaviyayn

    Abstract: Machine learning algorithms have been increasingly deployed in critical automated decision-making systems that directly affect human lives. When these algorithms are only trained to minimize the training/test error, they could suffer from systematic discrimination against individuals based on their sensitive attributes such as gender or race. Recently, there has been a surge in machine learning so… ▽ More

    Submitted 13 January, 2020; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: 11 pages, 1 figure

    Journal ref: International Conference on Learning Representation, 2020

  46. arXiv:1906.10124  [pdf, other

    cs.MA cs.AI cs.HC cs.LG

    On Multi-Agent Learning in Team Sports Games

    Authors: Yunqi Zhao, Igor Borovikov, Jason Rupert, Caedmon Somers, Ahmad Beirami

    Abstract: In recent years, reinforcement learning has been successful in solving video games from Atari to Star Craft II. However, the end-to-end model-free reinforcement learning (RL) is not sample efficient and requires a significant amount of computational resources to achieve superhuman level performance. Model-free RL is also unlikely to produce human-like agents for playtesting and gameplaying AI in t… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: Presented at ICML 2019 Workshop on Imitation, Intent, and Interaction (I3). arXiv admin note: substantial text overlap with arXiv:1903.10545

  47. arXiv:1906.00535  [pdf, other

    cs.LG cs.AI

    Towards Interactive Training of Non-Player Characters in Video Games

    Authors: Igor Borovikov, Jesse Harder, Michael Sadovsky, Ahmad Beirami

    Abstract: There is a high demand for high-quality Non-Player Characters (NPCs) in video games. Hand-crafting their behavior is a labor intensive and error prone engineering process with limited controls exposed to the game designers. We propose to create such NPC behaviors interactively by training an agent in the target environment using imitation learning with a human in the loop. While traditional behavi… ▽ More

    Submitted 2 June, 2019; originally announced June 2019.

    Comments: presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA

  48. arXiv:1905.10497  [pdf, other

    cs.LG stat.ML

    Fair Resource Allocation in Federated Learning

    Authors: Tian Li, Maziar Sanjabi, Ahmad Beirami, Virginia Smith

    Abstract: Federated learning involves training statistical models in massive, heterogeneous networks. Naively minimizing an aggregate loss function in such a network may disproportionately advantage or disadvantage some of the devices. In this work, we propose q-Fair Federated Learning (q-FFL), a novel optimization objective inspired by fair resource allocation in wireless networks that encourages a more fa… ▽ More

    Submitted 14 February, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: ICLR 2020

  49. arXiv:1903.10545  [pdf, other

    cs.AI cs.LG cs.MA cs.NE

    Winning Isn't Everything: Enhancing Game Development with Intelligent Agents

    Authors: Yunqi Zhao, Igor Borovikov, Fernando de Mesentier Silva, Ahmad Beirami, Jason Rupert, Caedmon Somers, Jesse Harder, John Kolen, Jervis Pinto, Reza Pourabolghasem, James Pestrak, Harold Chaput, Mohsen Sardari, Long Lin, Sundeep Narravula, Navid Aghdaie, Kazi Zaman

    Abstract: Recently, there have been several high-profile achievements of agents learning to play games against humans and beat them. In this paper, we study the problem of training intelligent agents in service of game development. Unlike the agents built to "beat the game", our agents aim to produce human-like behavior to help with game evaluation and balancing. We discuss two fundamental metrics based on… ▽ More

    Submitted 27 April, 2020; v1 submitted 25 March, 2019; originally announced March 2019.

    Comments: Accepted to IEEE Trans. Games

  50. arXiv:1901.03625  [pdf, other

    cs.IT

    Universal Compression with Side Information from a Correlated Source

    Authors: Ahmad Beirami, Faramarz Fekri

    Abstract: Packets originated from an information source in the network can be highly correlated. These packets are often routed through different paths, and compressing them requires to process them individually. Traditional universal compression solutions would not perform well over a single packet because of the limited data available for learning the unknown source parameters. In this paper, we define a… ▽ More

    Submitted 11 January, 2019; originally announced January 2019.

    Comments: submitted to IEEE Trans. Communications