Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 106 results for author: Tarokh, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.10075  [pdf, other

    cs.LG cs.NE

    Steinmetz Neural Networks for Complex-Valued Data

    Authors: Shyam Venkatasubramanian, Ali Pezeshki, Vahid Tarokh

    Abstract: In this work, we introduce a new approach to processing complex-valued data using DNNs consisting of parallel real-valued subnetworks with coupled outputs. Our proposed class of architectures, referred to as Steinmetz Neural Networks, leverages multi-view learning to construct more interpretable representations within the latent space. Subsequently, we present the Analytic Neural Network, which im… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  2. arXiv:2409.04986  [pdf, other

    cs.LG

    DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

    Authors: Qi Le, Enmao Diao, Xinran Wang, Vahid Tarokh, Jie Ding, Ali Anwar

    Abstract: Federated Learning (FL) is a collaborative machine learning framework that allows multiple users to train models utilizing their local data in a distributed manner. However, considerable statistical heterogeneity in local data across devices often leads to suboptimal model performance compared with independently and identically distributed (IID) data scenarios. In this paper, we introduce DynamicF… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

  3. arXiv:2408.00131  [pdf, other

    stat.ML cs.AI cs.LG q-fin.RM

    Distributionally Robust Optimization as a Scalable Framework to Characterize Extreme Value Distributions

    Authors: Patrick Kuiper, Ali Hasan, Wenhao Yang, Yuting Ng, Hoda Bidkhori, Jose Blanchet, Vahid Tarokh

    Abstract: The goal of this paper is to develop distributionally robust optimization (DRO) estimators, specifically for multidimensional Extreme Value Theory (EVT) statistics. EVT supports using semi-parametric models called max-stable distributions built from spatial Poisson point processes. While powerful, these models are only asymptotically valid for large samples. However, since extreme data is by defin… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

  4. arXiv:2407.17654  [pdf, other

    cs.LG stat.ML

    Generative Learning for Simulation of Vehicle Faults

    Authors: Patrick Kuiper, Sirui Lin, Jose Blanchet, Vahid Tarokh

    Abstract: We develop a novel generative model to simulate vehicle health and forecast faults, conditioned on practical operational considerations. The model, trained on data from the US Army's Predictive Logistics program, aims to support predictive maintenance. It forecasts faults far enough in advance to execute a maintenance intervention before a breakdown occurs. The model incorporates real-world factor… ▽ More

    Submitted 30 July, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

  5. arXiv:2407.12234  [pdf, other

    cs.LG cs.CE math.OC stat.ML

    Base Models for Parabolic Partial Differential Equations

    Authors: Xingzi Xu, Ali Hasan, Jie Ding, Vahid Tarokh

    Abstract: Parabolic partial differential equations (PDEs) appear in many disciplines to model the evolution of various mathematical objects, such as probability flows, value functions in control theory, and derivative prices in finance. It is often necessary to compute the solutions or a function of the solutions to a parametric PDE in multiple scenarios corresponding to different parameters of this PDE. Th… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Appears in UAI 2024

  6. arXiv:2406.09638  [pdf, other

    cs.LG eess.SP

    RASPNet: A Benchmark Dataset for Radar Adaptive Signal Processing Applications

    Authors: Shyam Venkatasubramanian, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: This work presents a large-scale dataset for radar adaptive signal processing (RASP) applications, aimed at supporting the development of data-driven models within the radar community. The dataset, called RASPNet, consists of 100 realistic scenarios compiled over a variety of topographies and land types from across the contiguous United States, designed to reflect a diverse array of real-world env… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2404.13844  [pdf, other

    cs.LG cs.AI

    ColA: Collaborative Adaptation with Gradient Learning

    Authors: Enmao Diao, Qi Le, Suya Wu, Xinran Wang, Ali Anwar, Jie Ding, Vahid Tarokh

    Abstract: A primary function of back-propagation is to compute both the gradient of hidden representations and parameters for optimization with gradient descent. Training large models requires high computational costs due to their vast parameter sizes. While Parameter-Efficient Fine-Tuning (PEFT) methods aim to train smaller auxiliary models to save computational space, they still present computational over… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  8. arXiv:2404.09402  [pdf, other

    cs.LG cs.AI stat.ML

    Neural McKean-Vlasov Processes: Distributional Dependence in Diffusion Processes

    Authors: Haoming Yang, Ali Hasan, Yuting Ng, Vahid Tarokh

    Abstract: McKean-Vlasov stochastic differential equations (MV-SDEs) provide a mathematical description of the behavior of an infinite number of interacting particles by imposing a dependence on the particle density. As such, we study the influence of explicitly including distributional information in the parameterization of the SDE. We propose a series of semi-parametric methods for representing MV-SDEs, an… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Appears in AISTATS 2024

  9. arXiv:2311.12356  [pdf, other

    cs.LG

    Random Linear Projections Loss for Hyperplane-Based Optimization in Neural Networks

    Authors: Shyam Venkatasubramanian, Ahmed Aloui, Vahid Tarokh

    Abstract: Advancing loss function design is pivotal for optimizing neural network training and performance. This work introduces Random Linear Projections (RLP) loss, a novel approach that enhances training efficiency by leveraging geometric relationships within the data. Distinct from traditional loss functions that target minimizing pointwise errors, RLP loss operates by minimizing the distance between se… ▽ More

    Submitted 30 May, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  10. arXiv:2311.03630  [pdf, other

    cs.LG stat.ME stat.ML

    Counterfactual Data Augmentation with Contrastive Learning

    Authors: Ahmed Aloui, Juncheng Dong, Cat P. Le, Vahid Tarokh

    Abstract: Statistical disparity between distinct treatment groups is one of the most significant challenges for estimating Conditional Average Treatment Effects (CATE). To address this, we introduce a model-agnostic data augmentation method that imputes the counterfactual outcomes for a selected subset of individuals. Specifically, we utilize contrastive learning to learn a representation space and a simila… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  11. arXiv:2310.07123  [pdf, other

    cs.LG cs.AI

    Off-Policy Evaluation for Human Feedback

    Authors: Qitong Gao, Ge Gao, Juncheng Dong, Vahid Tarokh, Min Chi, Miroslav Pajic

    Abstract: Off-policy evaluation (OPE) is important for closing the gap between offline training and evaluation of reinforcement learning (RL), by estimating performance and/or rank of target (evaluation) policies using offline trajectories only. It can improve the safety and efficiency of data collection and policy testing procedures in situations where online deployments are expensive, such as healthcare.… ▽ More

    Submitted 14 October, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  12. arXiv:2310.01720  [pdf, other

    cs.LG cs.AI

    Perceiver-based CDF Modeling for Time Series Forecasting

    Authors: Cat P. Le, Chris Cannella, Ali Hasan, Yuting Ng, Vahid Tarokh

    Abstract: Transformers have demonstrated remarkable efficacy in forecasting time series data. However, their extensive dependence on self-attention mechanisms demands significant computational resources, thereby limiting their practical applicability across diverse tasks, especially in multimodal problems. In this work, we propose a new architecture, called perceiver-CDF, for modeling cumulative distributio… ▽ More

    Submitted 24 June, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted in Winter Simulation Conference 2024

  13. arXiv:2306.11697  [pdf, other

    stat.ME cs.LG stat.ML

    Treatment Effects in Extreme Regimes

    Authors: Ahmed Aloui, Ali Hasan, Yuting Ng, Miroslav Pajic, Vahid Tarokh

    Abstract: Understanding treatment effects in extreme regimes is important for characterizing risks associated with different interventions. This is hindered by the unavailability of counterfactual outcomes and the rarity and difficulty of collecting extreme data in practice. To address this issue, we propose a new framework based on extreme value theory for estimating treatment effects in extreme regimes. W… ▽ More

    Submitted 22 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  14. arXiv:2306.07918  [pdf, other

    cs.LG stat.ML

    Causal Mediation Analysis with Multi-dimensional and Indirectly Observed Mediators

    Authors: Ziyang Jiang, Yiling Liu, Michael H. Klein, Ahmed Aloui, Yiman Ren, Keyu Li, Vahid Tarokh, David Carlson

    Abstract: Causal mediation analysis (CMA) is a powerful method to dissect the total effect of a treatment into direct and mediated effects within the potential outcome framework. This is important in many scientific applications to identify the underlying mechanisms of a treatment effect. However, in many scientific applications the mediator is unobserved, but there may exist related measurements. For examp… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: 16 pages, 4 figures, 5 tables

  15. arXiv:2306.07408  [pdf, other

    cs.LG cs.AI cs.RO

    Robust Reinforcement Learning through Efficient Adversarial Herding

    Authors: Juncheng Dong, Hao-Lun Hsu, Qitong Gao, Vahid Tarokh, Miroslav Pajic

    Abstract: Although reinforcement learning (RL) is considered the gold standard for policy design, it may not always provide a robust solution in various scenarios. This can result in severe performance degradation when the environment is exposed to potential disturbances. Adversarial training using a two-player max-min game has been proven effective in enhancing the robustness of RL agents. In this work, we… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  16. arXiv:2306.02925  [pdf, other

    cs.CE physics.comp-ph

    Deep Generalized Green's Functions

    Authors: Rixi Peng, Juncheng Dong, Jordan Malof, Willie J. Padilla, Vahid Tarokh

    Abstract: In this study, we address the challenge of obtaining a Green's function operator for linear partial differential equations (PDEs). The Green's function is well-sought after due to its ability to directly map inputs to solutions, bypassing the need for common numerical methods such as finite difference and finite elements methods. However, obtaining an explicit form of the Green's function kernel f… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  17. arXiv:2305.11400  [pdf, other

    cs.LG stat.ML

    Mode-Aware Continual Learning for Conditional Generative Adversarial Networks

    Authors: Cat P. Le, Juncheng Dong, Ahmed Aloui, Vahid Tarokh

    Abstract: The main challenge in continual learning for generative models is to effectively learn new target modes with limited samples while preserving previously learned ones. To this end, we introduce a new continual learning approach for conditional generative adversarial networks by leveraging a mode-affinity score specifically designed for generative modeling. First, the generator produces samples of e… ▽ More

    Submitted 23 September, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

  18. arXiv:2305.00003  [pdf, other

    cs.CE cond-mat.mtrl-sci cs.LG

    Neural Network Accelerated Process Design of Polycrystalline Microstructures

    Authors: Junrong Lin, Mahmudul Hasan, Pinar Acar, Jose Blanchet, Vahid Tarokh

    Abstract: Computational experiments are exploited in finding a well-designed processing path to optimize material structures for desired properties. This requires understanding the interplay between the processing-(micro)structure-property linkages using a multi-scale approach that connects the macro-scale (process parameters) to meso (homogenized properties) and micro (crystallographic texture) scales. Due… ▽ More

    Submitted 3 May, 2023; v1 submitted 11 April, 2023; originally announced May 2023.

  19. arXiv:2303.08241  [pdf, other

    cs.CV eess.SP

    Subspace Perturbation Analysis for Data-Driven Radar Target Localization

    Authors: Shyam Venkatasubramanian, Sandeep Gogineni, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: Recent works exploring data-driven approaches to classical problems in adaptive radar have demonstrated promising results pertaining to the task of radar target localization. Via the use of space-time adaptive processing (STAP) techniques and convolutional neural networks, these data-driven approaches to target localization have helped benchmark the performance of neural networks for matched scena… ▽ More

    Submitted 21 March, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 6 pages, 3 figures. Submitted to 2023 IEEE Radar Conference (RadarConf). Extension of arXiv:2209.02890

  20. arXiv:2302.05601  [pdf, other

    cs.LG

    Pruning Deep Neural Networks from a Sparsity Perspective

    Authors: Enmao Diao, Ganghua Wang, Jiawei Zhan, Yuhong Yang, Jie Ding, Vahid Tarokh

    Abstract: In recent years, deep network pruning has attracted significant attention in order to enable the rapid deployment of AI into small devices with computation and memory constraints. Pruning is often achieved by dropping redundant weights, neurons, or layers of a deep network while attempting to retain a comparable test performance. Many deep pruning algorithms have been proposed with impressive empi… ▽ More

    Submitted 23 August, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: ICLR 2023

  21. arXiv:2302.03821  [pdf, other

    cs.LG math.OC stat.ME stat.ML

    PASTA: Pessimistic Assortment Optimization

    Authors: Juncheng Dong, Weibin Mo, Zhengling Qi, Cong Shi, Ethan X. Fang, Vahid Tarokh

    Abstract: We consider a class of assortment optimization problems in an offline data-driven setting. A firm does not know the underlying customer choice model but has access to an offline dataset consisting of the historically offered assortment set, customer choice, and revenue. The objective is to use the offline dataset to find an optimal assortment. Due to the combinatorial nature of assortment optimiza… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  22. arXiv:2302.02009  [pdf, other

    cs.LG stat.ML

    Domain Adaptation via Rebalanced Sub-domain Alignment

    Authors: Yiling Liu, Juncheng Dong, Ziyang Jiang, Ahmed Aloui, Keyu Li, Hunter Klein, Vahid Tarokh, David Carlson

    Abstract: Unsupervised domain adaptation (UDA) is a technique used to transfer knowledge from a labeled source domain to a different but related unlabeled target domain. While many UDA methods have shown success in the past, they often assume that the source and target domains must have identical class label distributions, which can limit their effectiveness in real-world scenarios. To address this limitati… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 20 pages, 6 figures, 4 tables

  23. arXiv:2302.00250  [pdf, other

    stat.ML cs.LG

    Quickest Change Detection for Unnormalized Statistical Models

    Authors: Suya Wu, Enmao Diao, Taposh Banerjee, Jie Ding, Vahid Tarokh

    Abstract: Classical quickest change detection algorithms require modeling pre-change and post-change distributions. Such an approach may not be feasible for various machine learning models because of the complexity of computing the explicit distributions. Additionally, these methods may suffer from a lack of robustness to model mismatch and noise. This paper develops a new variant of the classical Cumulativ… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

    Comments: A version of this paper has been accepted by the 26th International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

  24. arXiv:2212.08779  [pdf, other

    cs.IR

    Personalized Federated Recommender Systems with Private and Partially Federated AutoEncoders

    Authors: Qi Le, Enmao Diao, Xinran Wang, Ali Anwar, Vahid Tarokh, Jie Ding

    Abstract: Recommender Systems (RSs) have become increasingly important in many application domains, such as digital marketing. Conventional RSs often need to collect users' data, centralize them on the server-side, and form a global model to generate reliable recommendations. However, they suffer from two critical limitations: the personalization problem that the RSs trained traditionally may not be customi… ▽ More

    Submitted 16 December, 2022; originally announced December 2022.

  25. arXiv:2210.00380  [pdf, other

    cs.LG stat.ME stat.ML

    Transfer Learning for Individual Treatment Effect Estimation

    Authors: Ahmed Aloui, Juncheng Dong, Cat P. Le, Vahid Tarokh

    Abstract: This work considers the problem of transferring causal knowledge between tasks for Individual Treatment Effect (ITE) estimation. To this end, we theoretically assess the feasibility of transferring ITE knowledge and present a practical framework for efficient transfer. A lower bound is introduced on the ITE error of the target task to demonstrate that ITE knowledge transfer is challenging due to t… ▽ More

    Submitted 5 June, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

  26. arXiv:2209.02890  [pdf, other

    cs.CV eess.SP

    Data-Driven Target Localization Using Adaptive Radar Processing and Convolutional Neural Networks

    Authors: Shyam Venkatasubramanian, Sandeep Gogineni, Bosung Kang, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: Leveraging the advanced functionalities of modern radio frequency (RF) modeling and simulation tools, specifically designed for adaptive radar processing applications, this paper presents a data-driven approach to improve accuracy in radar target localization post adaptive radar detection. To this end, we generate a large number of radar returns by randomly placing targets of variable strengths in… ▽ More

    Submitted 9 July, 2024; v1 submitted 6 September, 2022; originally announced September 2022.

  27. arXiv:2205.14025  [pdf, other

    stat.ME cs.LG stat.ML

    Inference and Sampling for Archimax Copulas

    Authors: Yuting Ng, Ali Hasan, Vahid Tarokh

    Abstract: Understanding multivariate dependencies in both the bulk and the tails of a distribution is an important problem for many applications, such as ensuring algorithms are robust to observations that are infrequent but have devastating effects. Archimax copulas are a family of distributions endowed with a precise representation that allows simultaneous modeling of the bulk and the tails of a distribut… ▽ More

    Submitted 20 September, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Yuting Ng and Ali Hasan contributed equally to this work. This work has been accepted at NeurIPS 2022

  28. arXiv:2201.11209  [pdf, other

    cs.LG eess.IV

    On The Energy Statistics of Feature Maps in Pruning of Neural Networks with Skip-Connections

    Authors: Mohammadreza Soltani, Suya Wu, Yuerong Li, Jie Ding, Vahid Tarokh

    Abstract: We propose a new structured pruning framework for compressing Deep Neural Networks (DNNs) with skip connections, based on measuring the statistical dependency of hidden layers and predicted outputs. The dependence measure defined by the energy statistics of hidden layers serves as a model-free measure of information between the feature maps and the output of the network. The estimated dependence m… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  29. arXiv:2201.10712  [pdf, other

    cs.CV eess.SP

    Toward Data-Driven STAP Radar

    Authors: Shyam Venkatasubramanian, Chayut Wongkamthong, Mohammadreza Soltani, Bosung Kang, Sandeep Gogineni, Ali Pezeshki, Muralidhar Rangaswamy, Vahid Tarokh

    Abstract: Using an amalgamation of techniques from classical radar, computer vision, and deep learning, we characterize our ongoing data-driven approach to space-time adaptive processing (STAP) radar. We generate a rich example dataset of received radar signals by randomly placing targets of variable strengths in a predetermined region using RFView, a site-specific radio frequency modeling and simulation to… ▽ More

    Submitted 9 March, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: 5 pages, 4 figures. Submitted to 2022 IEEE Radar Conference (RadarConf)

  30. arXiv:2201.09149  [pdf, other

    cs.MA cs.IT cs.LG

    Multi-Agent Adversarial Attacks for Multi-Channel Communications

    Authors: Juncheng Dong, Suya Wu, Mohammadreza Sultani, Vahid Tarokh

    Abstract: Recently Reinforcement Learning (RL) has been applied as an anti-adversarial remedy in wireless communication networks. However, studying the RL-based approaches from the adversary's perspective has received little attention. Additionally, RL-based approaches in an anti-adversary or adversarial paradigm mostly consider single-channel communication (either channel selection or single channel power… ▽ More

    Submitted 27 January, 2022; v1 submitted 22 January, 2022; originally announced January 2022.

  31. arXiv:2201.03617  [pdf, other

    physics.flu-dyn cs.LG

    A Physics-Informed Vector Quantized Autoencoder for Data Compression of Turbulent Flow

    Authors: Mohammadreza Momenifar, Enmao Diao, Vahid Tarokh, Andrew D. Bragg

    Abstract: Analyzing large-scale data from simulations of turbulent flows is memory intensive, requiring significant resources. This major challenge highlights the need for data compression techniques. In this study, we apply a physics-informed Deep Learning technique based on vector quantization to generate a discrete, low-dimensional representation of data from simulations of three-dimensional turbulent fl… ▽ More

    Submitted 11 January, 2022; v1 submitted 10 January, 2022; originally announced January 2022.

    Comments: this article is a conference version of arXiv:2103.01074

  32. arXiv:2112.03469  [pdf, other

    physics.flu-dyn cs.LG

    Emulating Spatio-Temporal Realizations of Three-Dimensional Isotropic Turbulence via Deep Sequence Learning Models

    Authors: Mohammadreza Momenifar, Enmao Diao, Vahid Tarokh, Andrew D. Bragg

    Abstract: We use a data-driven approach to model a three-dimensional turbulent flow using cutting-edge Deep Learning techniques. The deep learning framework incorporates physical constraints on the flow, such as preserving incompressibility and global statistical invariants of velocity gradient tensor. The accuracy of the model is assessed using statistical and physics-based metrics. The data set comes from… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: AI2ASE: AAAI Workshop on AI to Accelerate Science and Engineering, 2022

  33. arXiv:2111.13311  [pdf, other

    cs.LG cs.CE

    Blaschke Product Neural Networks (BPNN): A Physics-Infused Neural Network for Phase Retrieval of Meromorphic Functions

    Authors: Juncheng Dong, Simiao Ren, Yang Deng, Omar Khatib, Jordan Malof, Mohammadreza Soltani, Willie Padilla, Vahid Tarokh

    Abstract: Numerous physical systems are described by ordinary or partial differential equations whose solutions are given by holomorphic or meromorphic functions in the complex domain. In many cases, only the magnitude of these functions are observed on various points on the purely imaginary jw-axis since coherent measurement of their phases is often expensive. However, it is desirable to retrieve the lost… ▽ More

    Submitted 25 November, 2021; originally announced November 2021.

  34. arXiv:2111.13207  [pdf, other

    cs.LG

    Characteristic Neural Ordinary Differential Equations

    Authors: Xingzi Xu, Ali Hasan, Khalil Elkhalil, Jie Ding, Vahid Tarokh

    Abstract: We propose Characteristic-Neural Ordinary Differential Equations (C-NODEs), a framework for extending Neural Ordinary Differential Equations (NODEs) beyond ODEs. While NODEs model the evolution of a latent variables as the solution to an ODE, C-NODE models the evolution of the latent variables as the solution of a family of first-order quasi-linear partial differential equations (PDEs) along curve… ▽ More

    Submitted 9 November, 2022; v1 submitted 25 November, 2021; originally announced November 2021.

  35. arXiv:2110.13340  [pdf, other

    cs.IR cs.LG

    Decentralized Multi-Target Cross-Domain Recommendation for Multi-Organization Collaborations

    Authors: Enmao Diao, Vahid Tarokh, Jie Ding

    Abstract: Recommender Systems (RSs) are operated locally by different organizations in many realistic scenarios. If various organizations can fully share their data and perform computation in a centralized manner, they may significantly improve the accuracy of recommendations. However, collaborations among multiple organizations in enhancing the performance of recommendations are primarily limited due to th… ▽ More

    Submitted 6 November, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

  36. arXiv:2110.02399  [pdf, other

    cs.LG cs.CV

    Task Affinity with Maximum Bipartite Matching in Few-Shot Learning

    Authors: Cat P. Le, Juncheng Dong, Mohammadreza Soltani, Vahid Tarokh

    Abstract: We propose an asymmetric affinity score for representing the complexity of utilizing the knowledge of one task for learning another one. Our method is based on the maximum bipartite matching algorithm and utilizes the Fisher Information matrix. We provide theoretical analyses demonstrating that the proposed score is mathematically well-defined, and subsequently use the affinity score to propose a… ▽ More

    Submitted 21 January, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper at ICLR 2022

  37. arXiv:2106.02104  [pdf, other

    cs.LG

    Semi-Empirical Objective Functions for MCMC Proposal Optimization

    Authors: Chris Cannella, Vahid Tarokh

    Abstract: Current objective functions used for training neural MCMC proposal distributions implicitly rely on architectural restrictions to yield sensible optimization results, which hampers the development of highly expressive neural MCMC proposal architectures. In this work, we introduce and demonstrate a semi-empirical procedure for determining approximate objective functions suitable for optimizing arbi… ▽ More

    Submitted 9 April, 2022; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: 41 pages, 21 tables, 22 figures

  38. arXiv:2106.01432  [pdf, other

    cs.LG

    SemiFL: Semi-Supervised Federated Learning for Unlabeled Clients with Alternate Training

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Federated Learning allows the training of machine learning models by using the computation and private data resources of many distributed clients. Most existing results on Federated Learning (FL) assume the clients have ground-truth labels. However, in many practical scenarios, clients may be unable to label task-specific data due to a lack of expertise or resource. We propose SemiFL to address th… ▽ More

    Submitted 11 October, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  39. arXiv:2106.01425  [pdf, other

    cs.LG

    GAL: Gradient Assisted Learning for Decentralized Multi-Organization Collaborations

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Collaborations among multiple organizations, such as financial institutions, medical centers, and retail markets in decentralized settings are crucial to providing improved service and performance. However, the underlying organizations may have little interest in sharing their local data, models, and objective functions. These requirements have created new challenges for multi-organization collabo… ▽ More

    Submitted 11 October, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  40. arXiv:2106.00110  [pdf, other

    cs.SD cs.LG eess.AS

    A Methodology for Exploring Deep Convolutional Features in Relation to Hand-Crafted Features with an Application to Music Audio Modeling

    Authors: Anna K. Yanchenko, Mohammadreza Soltani, Robert J. Ravier, Sayan Mukherjee, Vahid Tarokh

    Abstract: Understanding the features learned by deep models is important from a model trust perspective, especially as deep systems are deployed in the real world. Most recent approaches for deep feature understanding or model explanation focus on highlighting input data features that are relevant for classification decisions. In this work, we instead take the perspective of relating deep features to well-s… ▽ More

    Submitted 9 October, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Code available at https://github.com/aky4wn/convolutions-for-music-audio

  41. arXiv:2103.12827  [pdf, other

    cs.LG eess.IV stat.ML

    Fisher Task Distance and Its Application in Neural Architecture Search

    Authors: Cat P. Le, Mohammadreza Soltani, Juncheng Dong, Vahid Tarokh

    Abstract: We formulate an asymmetric (or non-commutative) distance between tasks based on Fisher Information Matrices, called Fisher task distance. This distance represents the complexity of transferring the knowledge from one task to another. We provide a proof of consistency for our distance through theorems and experiments on various classification tasks from MNIST, CIFAR-10, CIFAR-100, ImageNet, and Tas… ▽ More

    Submitted 30 April, 2022; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Published in IEEE Access, Volume 10, 2022

  42. arXiv:2103.02260  [pdf, other

    cs.CR cs.CY cs.DC cs.NI

    Talaria: A Framework for Simulation of Permissioned Blockchains for Logistics and Beyond

    Authors: Jiali Xing, David Fischer, Nitya Labh, Ryan Piersma, Benjamin C. Lee, Yu Amy Xia, Tuhin Sahai, Vahid Tarokh

    Abstract: In this paper, we present Talaria, a novel permissioned blockchain simulator that supports numerous protocols and use cases, most notably in supply chain management. Talaria extends the capability of BlockSim, an existing blockchain simulator, to include permissioned blockchains and serves as a foundation for further private blockchain assessment. Talaria is designed with both practical Byzantine… ▽ More

    Submitted 30 March, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

  43. arXiv:2103.00241  [pdf, other

    cs.LG cs.CV

    Improved Automated Machine Learning from Transfer Learning

    Authors: Cat P. Le, Mohammadreza Soltani, Robert Ravier, Vahid Tarokh

    Abstract: In this paper, we propose a neural architecture search framework based on a similarity measure between some baseline tasks and a target task. We first define the notion of the task similarity based on the log-determinant of the Fisher Information matrix. Next, we compute the task similarity from each of the baseline tasks to the target task. By utilizing the relation between a target and a set of… ▽ More

    Submitted 29 January, 2022; v1 submitted 27 February, 2021; originally announced March 2021.

  44. arXiv:2102.11351  [pdf, other

    cs.LG stat.ML

    Generative Archimedean Copulas

    Authors: Yuting Ng, Ali Hasan, Khalil Elkhalil, Vahid Tarokh

    Abstract: We propose a new generative modeling technique for learning multidimensional cumulative distribution functions (CDFs) in the form of copulas. Specifically, we consider certain classes of copulas known as Archimedean and hierarchical Archimedean copulas, popular for their parsimonious representation and ability to model different tail dependencies. We consider their representation as mixture models… ▽ More

    Submitted 10 June, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: UAI 2021

  45. arXiv:2102.09042  [pdf, other

    stat.ML cs.LG stat.CO

    Modeling Extremes with d-max-decreasing Neural Networks

    Authors: Ali Hasan, Khalil Elkhalil, Yuting Ng, Joao M. Pereira, Sina Farsiu, Jose H. Blanchet, Vahid Tarokh

    Abstract: We propose a novel neural network architecture that enables non-parametric calibration and generation of multivariate extreme value distributions (MEVs). MEVs arise from Extreme Value Theory (EVT) as the necessary class of models when extrapolating a distributional fit over large spatial and temporal scales based on data observed in intermediate scales. In turn, EVT dictates that $d$-max-decreasin… ▽ More

    Submitted 1 March, 2022; v1 submitted 17 February, 2021; originally announced February 2021.

  46. arXiv:2012.13307  [pdf, other

    math.ST cs.IT cs.LG eess.SP physics.data-an

    On Statistical Efficiency in Learning

    Authors: Jie Ding, Enmao Diao, Jiawei Zhou, Vahid Tarokh

    Abstract: A central issue of many statistical learning problems is to select an appropriate model from a set of candidate models. Large models tend to inflate the variance (or overfitting), while small models tend to cause biases (or underfitting) for a given fixed dataset. In this work, we address the critical challenge of model selection to strike a balance between model fitting and model complexity, thus… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

    Comments: to be published by the IEEE Transactions on Information Theory

  47. arXiv:2010.13962  [pdf, ps, other

    cs.LG cs.AI

    Task-Aware Neural Architecture Search

    Authors: Cat P. Le, Mohammadreza Soltani, Robert Ravier, Vahid Tarokh

    Abstract: The design of handcrafted neural networks requires a lot of time and resources. Recent techniques in Neural Architecture Search (NAS) have proven to be competitive or better than traditional handcrafted design, although they require domain knowledge and have generally used limited search spaces. In this paper, we propose a novel framework for neural architecture search, utilizing a dictionary of m… ▽ More

    Submitted 15 March, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

  48. arXiv:2010.01264  [pdf, other

    cs.LG stat.ML

    HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients

    Authors: Enmao Diao, Jie Ding, Vahid Tarokh

    Abstract: Federated Learning (FL) is a method of training machine learning models on private data distributed over a large number of possibly heterogeneous clients such as mobile phones and IoT devices. In this work, we propose a new federated learning framework named HeteroFL to address heterogeneous clients equipped with very different computation and communication capabilities. Our solution can enable th… ▽ More

    Submitted 13 December, 2021; v1 submitted 2 October, 2020; originally announced October 2020.

    Comments: ICLR 2021

  49. arXiv:2007.06682  [pdf, other

    cs.LG cs.CV stat.ML

    GeoStat Representations of Time Series for Fast Classification

    Authors: Robert J. Ravier, Mohammadreza Soltani, Miguel Simões, Denis Garagic, Vahid Tarokh

    Abstract: Recent advances in time series classification have largely focused on methods that either employ deep learning or utilize other machine learning models for feature extraction. Though successful, their power often comes at the requirement of computational complexity. In this paper, we introduce GeoStat representations for time series. GeoStat representations are based off of a generalization of rec… ▽ More

    Submitted 11 January, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 28 pages, 8 tables, 5 figures

  50. arXiv:2007.06407  [pdf, other

    cs.NE q-bio.NC

    Deep Cross-Subject Mapping of Neural Activity

    Authors: Marko Angjelichinoski, Bijan Pesaran, Vahid Tarokh

    Abstract: Objective. In this paper, we consider the problem of cross-subject decoding, where neural activity data collected from the prefrontal cortex of a given subject (destination) is used to decode motor intentions from the neural activity of a different subject (source). Approach. We cast the problem of neural activity mapping in a probabilistic framework where we adopt deep generative modelling. Our p… ▽ More

    Submitted 21 February, 2022; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: More details on hyper-parameter tuning and some additional results