Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–27 of 27 results for author: Dey, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2312.11283  [pdf, other

    stat.AP cs.CR econ.EM

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    Authors: John M. Abowd, Tamara Adams, Robert Ashmead, David Darais, Sourya Dey, Simson L. Garfinkel, Nathan Goldschlag, Daniel Kifer, Philip Leclerc, Ethan Lew, Scott Moore, Rolando A. Rodríguez, Ramy N. Tadros, Lars Vilhuber

    Abstract: Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can veri… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2302.07415  [pdf, other

    stat.ML stat.AP

    Variable Selection for Kernel Two-Sample Tests

    Authors: Jie Wang, Santanu S. Dey, Yao Xie

    Abstract: We consider the variable selection problem for two-sample tests, aiming to select the most informative variables to distinguish samples from two groups. To solve this problem, we propose a framework based on the kernel maximum mean discrepancy (MMD). Our approach seeks a group of variables with a pre-specified size that maximizes the variance-regularized MMD statistics. This formulation also corre… ▽ More

    Submitted 12 October, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 41 pages, 6 figures

  3. arXiv:2205.06320  [pdf, other

    stat.AP

    Modelling spatially autocorrelated detection probabilities in spatial capture-recapture using random effects

    Authors: Soumen Dey, Ehsan M. Moqanaki, Cyril Milleret, Pierre Dupont, Mahdieh Tourani, Richard Bischof

    Abstract: Spatial capture-recapture (SCR) models are now widely used for estimating density from repeated individual spatial encounters. SCR accounts for the inherent spatial autocorrelation in individual detections by modelling detection probabilities as a function of distance between the detectors and individual activity centres. However, additional spatial heterogeneity in detection probability may still… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

  4. arXiv:2203.02818  [pdf, other

    cs.LG stat.ML

    Fuzzy Forests For Feature Selection in High-Dimensional Survey Data: An Application to the 2020 U.S. Presidential Election

    Authors: Sreemanti Dey, R. Michael Alvarez

    Abstract: An increasingly common methodological issue in the field of social science is high-dimensional and highly correlated datasets that are unamenable to the traditional deductive framework of study. Analysis of candidate choice in the 2020 Presidential Election is one area in which this issue presents itself: in order to test the many theories explaining the outcome of the election, it is necessary to… ▽ More

    Submitted 5 March, 2022; originally announced March 2022.

    Comments: Paper presented at The 3rd International Conference on Applied Machine Learning and Data Analytics, December 16-17 2021, where it was named the Best Paper of the conference

  5. arXiv:2202.08107  [pdf, other

    stat.AP

    Estimating Software Reliability Using Size-biased Modelling

    Authors: Soumen Dey, Ashis Kumar Chakraborty

    Abstract: Software reliability estimation is one of the most active areas of research in software testing. Since time between failures (TBF) has often been challenging to record, software testing data are commonly recorded as test-case-wise in a discrete set up. We have developed a Bayesian generalised linear mixed model (GLMM) based on software testing detection data and a size-biased strategy which not… ▽ More

    Submitted 20 April, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 14 pages. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  6. arXiv:2111.07419  [pdf, other

    cs.RO cs.LG stat.AP

    Learning a Shared Model for Motorized Prosthetic Joints to Predict Ankle-Joint Motion

    Authors: Sharmita Dey, Sabri Boughorbel, Arndt F. Schilling

    Abstract: Control strategies for active prostheses or orthoses use sensor inputs to recognize the user's locomotive intention and generate corresponding control commands for producing the desired locomotion. In this paper, we propose a learning-based shared model for predicting ankle-joint motion for different locomotion modes like level-ground walking, stair ascent, stair descent, slope ascent, and slope d… ▽ More

    Submitted 14 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 Workshop Spotlight presentation, Machine Learning for Health (ML4H) 2021 - Extended Abstract

  7. arXiv:2006.11440  [pdf, other

    stat.ML cs.LG

    Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples

    Authors: Josue Ortega Caro, Yilong Ju, Ryan Pyle, Sourav Dey, Wieland Brendel, Fabio Anselmi, Ankit Patel

    Abstract: Adversarial Attacks are still a significant challenge for neural networks. Recent work has shown that adversarial perturbations typically contain high-frequency features, but the root cause of this phenomenon remains unknown. Inspired by theoretical work on linear full-width convolutional models, we hypothesize that the local (i.e. bounded-width) convolutional operations commonly used in current n… ▽ More

    Submitted 8 March, 2023; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 23 pages, 11 figures, 12 Tables

  8. arXiv:2004.00974  [pdf, other

    cs.LG cs.CV stat.ML

    Deep-n-Cheap: An Automated Search Framework for Low Complexity Deep Learning

    Authors: Sourya Dey, Saikrishna C. Kanala, Keith M. Chugg, Peter A. Beerel

    Abstract: We present Deep-n-Cheap -- an open-source AutoML framework to search for deep learning models. This search includes both architecture and training hyperparameters, and supports convolutional neural networks and multi-layer perceptrons. Our framework is targeted for deployment on both benchmark and custom datasets, and as a result, offers a greater degree of search space customizability as compared… ▽ More

    Submitted 5 September, 2020; v1 submitted 27 March, 2020; originally announced April 2020.

    Comments: Accepted as a conference paper at ACML 2020

  9. arXiv:2003.09311  [pdf

    cs.LG cs.AI stat.ML

    Drift-Adjusted And Arbitrated Ensemble Framework For Time Series Forecasting

    Authors: Anirban Chatterjee, Subhadip Paul, Uddipto Dutta, Smaranya Dey

    Abstract: Time Series Forecasting is at the core of many practical applications such as sales forecasting for business, rainfall forecasting for agriculture and many others. Though this problem has been extensively studied for years, it is still considered a challenging problem due to complex and evolving nature of time series data. Typical methods proposed for time series forecasting modeled linear or non-… ▽ More

    Submitted 16 March, 2020; originally announced March 2020.

  10. arXiv:2001.04752  [pdf, ps, other

    stat.AP eess.SP

    Asymptotic Performance Analysis of Non-Bayesian Quickest Change Detection with an Energy Harvesting Sensor

    Authors: Subhrakanti Dey

    Abstract: In this paper, we consider a non-Bayesian sequential change detection based on the Cumulative Sum (CUSUM) algorithm employed by an energy harvesting sensor where the distributions before and after the change are assumed to be known. In a slotted discrete-time model, the sensor, exclusively powered by randomly available harvested energy, obtains a sample and computes the log-likelihood ratio of the… ▽ More

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: 7 pages, 1 figure

    MSC Class: 62L12 (primary)

  11. arXiv:1912.00846  [pdf, other

    cs.LG cs.CL cs.SD stat.ML

    Attentive Modality Hopping Mechanism for Speech Emotion Recognition

    Authors: Seunghyun Yoon, Subhadeep Dey, Hwanhee Lee, Kyomin Jung

    Abstract: In this work, we explore the impact of visual modality in addition to speech and text for improving the accuracy of the emotion detection system. The traditional approaches tackle this task by fusing the knowledge from the various modalities independently for performing emotion classification. In contrast to these approaches, we tackle the problem by introducing an attention mechanism to combine t… ▽ More

    Submitted 22 April, 2020; v1 submitted 29 November, 2019; originally announced December 2019.

    Comments: 5 pages, Accepted as a conference paper at ICASSP 2020

  12. arXiv:1911.09818  [pdf

    cs.LG cs.CL cs.IR stat.ML

    Order Matters at Fanatics Recommending Sequentially Ordered Products by LSTM Embedded with Word2Vec

    Authors: Jing Pan, Weian Sheng, Santanu Dey

    Abstract: A unique challenge for e-commerce recommendation is that customers are often interested in products that are more advanced than their already purchased products, but not reversed. The few existing recommender systems modeling unidirectional sequence output a limited number of categories or continuous variables. To model the ordered sequence, we design the first recommendation system that both embe… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: 5 pages, 2 figures, KDD 2019 Workshop, Deep Learning on Graphics,

  13. arXiv:1908.05227  [pdf, other

    eess.AS cs.LG cs.SD stat.ML

    Exploiting semi-supervised training through a dropout regularization in end-to-end speech recognition

    Authors: Subhadeep Dey, Petr Motlicek, Trung Bui, Franck Dernoncourt

    Abstract: In this paper, we explore various approaches for semi supervised learning in an end to end automatic speech recognition (ASR) framework. The first step in our approach involves training a seed model on the limited amount of labelled data. Additional unlabelled speech data is employed through a data selection mechanism to obtain the best hypothesized output, further used to retrain the seed model.… ▽ More

    Submitted 8 August, 2019; originally announced August 2019.

    Comments: Interspeech 2019

    MSC Class: 62H30

  14. Deep Residual Autoencoders for Expectation Maximization-inspired Dictionary Learning

    Authors: Bahareh Tolooshams, Sourav Dey, Demba Ba

    Abstract: We introduce a neural-network architecture, termed the constrained recurrent sparse autoencoder (CRsAE), that solves convolutional dictionary learning problems, thus establishing a link between dictionary learning and neural networks. Specifically, we leverage the interpretation of the alternating-minimization algorithm for dictionary learning as an approximate Expectation-Maximization algorithm t… ▽ More

    Submitted 18 October, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

    Journal ref: in IEEE Transactions on Neural Networks and Learning Systems, pp. 1-15, 2020

  15. Pre-Defined Sparse Neural Networks with Hardware Acceleration

    Authors: Sourya Dey, Kuan-Wen Huang, Peter A. Beerel, Keith M. Chugg

    Abstract: Neural networks have proven to be extremely powerful tools for modern artificial intelligence applications, but computational and storage complexity remain limiting factors. This paper presents two compatible contributions towards reducing the time, energy, computational, and storage complexities associated with multilayer perceptrons. Pre-defined sparsity is proposed to reduce the complexity duri… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: This work has been submitted to the IEEE Journal on Emerging and Selected Topics in Circuits and Systems for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:1810.02397  [pdf, ps, other

    stat.AP

    Bayesian Model Selection for a Class of Spatially-Explicit Capture Recapture Models

    Authors: Soumen Dey, Mohan Delampady, Arjun M. Gopalaswamy

    Abstract: A vast amount of ecological knowledge generated recently has hinged upon the ability of model selection methods to discriminate among various ecological hypotheses. The last decade has seen the rise of Bayesian hierarchical models in ecology. Consequently, popular tools, such as the AIC, become largely inapplicable and other tools are not universally applicable. We focus on a class of competing Ba… ▽ More

    Submitted 4 October, 2018; originally announced October 2018.

  17. arXiv:1809.06405  [pdf, other

    stat.ME stat.AP stat.CO

    Bayesian analysis of absolute continuous Marshall-Olkin bivariate Pareto distribution with location and scale parameters

    Authors: Biplab Paul, Arabin Kumar Dey, Sanku Dey

    Abstract: This paper provides two different novel approaches of slice sampling to estimate the parameters of absolute continuous Marshall-Olkin bivariate Pareto distribution with location and scale parameters. We carry out the bayesian analysis taking gamma prior for shape and scale parameters and truncated normal for location parameters. Credible intervals and coverage probabilities are also provided for a… ▽ More

    Submitted 17 September, 2018; originally announced September 2018.

  18. arXiv:1807.04734  [pdf, other

    cs.LG stat.ML

    Scalable Convolutional Dictionary Learning with Constrained Recurrent Sparse Auto-encoders

    Authors: Bahareh Tolooshams, Sourav Dey, Demba Ba

    Abstract: Given a convolutional dictionary underlying a set of observed signals, can a carefully designed auto-encoder recover the dictionary in the presence of noise? We introduce an auto-encoder architecture, termed constrained recurrent sparse auto-encoder (CRsAE), that answers this question in the affirmative. Given an input signal and an approximate dictionary, the encoder finds a sparse approximation… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

  19. arXiv:1807.04239  [pdf, other

    cs.LG cs.IT stat.ML

    Morse Code Datasets for Machine Learning

    Authors: Sourya Dey, Keith M. Chugg, Peter A. Beerel

    Abstract: We present an algorithm to generate synthetic datasets of tunable difficulty on classification of Morse code symbols for supervised machine learning problems, in particular, neural networks. The datasets are spatially one-dimensional and have a small number of input features, leading to high density of input information content. This makes them particularly challenging when implementing network co… ▽ More

    Submitted 30 November, 2018; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: Presented at the 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT)

    Journal ref: in 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), pp. 1-7, Jul 2018

  20. arXiv:1804.03565  [pdf

    cs.CY stat.ML

    Predicting Gross Movie Revenue

    Authors: Sharmistha Dey

    Abstract: 'There is no terror in the bang, only is the anticipation of it' - Alfred Hitchcock. Yet there is everything in correctly anticipating the bang a movie would make in the box-office. Movies make a high profile, billion dollar industry and prediction of movie revenue can be very lucrative. Predicted revenues can be used for planning both the production and distribution stages. For example, project… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

  21. arXiv:1801.05327  [pdf, other

    stat.AP

    The Frechet distribution: Estimation and Application an Overview

    Authors: Pedro Luiz Ramos, Francisco Louzada, Eduardo Ramos, Sanku Dey

    Abstract: In this article, we consider the problem of estimating the parameters of the Fréchet distribution from both frequentist and Bayesian points of view. First we briefly describe different frequentist approaches, namely, maximum likelihood, method of moments, percentile estimators, L-moments, ordinary and weighted least squares, maximum product of spacings, maximum goodness-of-fit estimators and compa… ▽ More

    Submitted 16 January, 2018; originally announced January 2018.

  22. arXiv:1712.10035  [pdf, other

    stat.AP

    A spatially explicit capture recapture model for partially identified individuals when trap detection rate is less than one

    Authors: Soumen Dey, Mohan Delampady, K. Ullas Karanth, Arjun M. Gopalaswamy

    Abstract: Spatially explicit capture recapture (SECR) models have gained enormous popularity to solve abundance estimation problems in ecology. In this study, we develop a novel Bayesian SECR model that disentangles the process of animal movement through a detector from the process of recording data by a detector in the face of imperfect detection. We integrate this complexity into an advanced version of a… ▽ More

    Submitted 28 December, 2017; originally announced December 2017.

    Comments: This draft has been submitted to a journal for review

  23. arXiv:1711.08413  [pdf

    cs.CV stat.AP stat.ML

    SolarisNet: A Deep Regression Network for Solar Radiation Prediction

    Authors: Subhadip Dey, Sawon Pratiher, Saon Banerjee, Chanchal Kumar Mukherjee

    Abstract: Effective utilization of photovoltaic (PV) plants requires weather variability robust global solar radiation (GSR) forecasting models. Random weather turbulence phenomena coupled with assumptions of clear sky model as suggested by Hottel pose significant challenges to parametric & non-parametric models in GSR conversion rate estimation. Also, a decent GSR estimate requires costly high-tech radiome… ▽ More

    Submitted 10 December, 2017; v1 submitted 22 November, 2017; originally announced November 2017.

  24. arXiv:1709.05906  [pdf, ps, other

    stat.ME stat.AP stat.CO

    Bayesian analysis of three parameter singular Marshall-Olkin bivariate Pareto distribution

    Authors: Biplab Paul, Arabin Kumar Dey, Sanku Dey, Debasis Kundu

    Abstract: This paper provides bayesian analysis of singular Marshall-Olkin bivariate Pareto distribution. We consider three parameter singular Marshall-Olkin bivariate Pareto distribution. We consider two types of prior - reference prior and gamma prior. Bayes estimate of the parameters are calculated based on slice cum gibbs sampler and Lindley approximation. Credible interval is also provided for all meth… ▽ More

    Submitted 29 September, 2017; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: 23 pages, 3 tables

  25. arXiv:1703.09124  [pdf, other

    stat.ME cs.IT

    Multi-sensor Transmission Management for Remote State Estimation under Coordination

    Authors: Kemi Ding, Yuzhe Li, Subhrakanti Dey, Ling Shi

    Abstract: This paper considers the remote state estimation in a cyber-physical system (CPS) using multiple sensors. The measurements of each sensor are transmitted to a remote estimator over a shared channel, where simultaneous transmissions from other sensors are regarded as interference signals. In such a competitive environment, each sensor needs to choose its transmission power for sending data packets… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

  26. arXiv:1605.06585  [pdf, other

    stat.CO stat.ME

    Bayesian Analysis of Modified Weibull distribution under progressively censored competing risk model

    Authors: Arabin Kumar Dey, Abhilash Jha, Sanku Dey

    Abstract: In this paper we study bayesian analysis of Modified Weibull distribution under progressively censored competing risk model. This study is made for progressively censored data. We use deterministic scan Gibbs sampling combined with slice sampling to generate from the posterior distribution. Posterior distribution is formed by taking prior distribution as reference prior. A real life data analysis… ▽ More

    Submitted 21 May, 2016; originally announced May 2016.

    Comments: 17 pages, 6 figures, 5 tables

  27. arXiv:1504.02835  [pdf

    stat.AP

    A multilevel multinomial logistic regression model for identifying risk factors of anemia in children aged 6-59 months in northeastern states of India

    Authors: Sanku Dey, Enayetur Raheem

    Abstract: In this article, we use multilevel multinomial logistic regression model to identify the risk factors of anemia in children of northeastern States of India. The data consisted of 10,136 children of age group 6-59 months. We considered the level of anemia as the outcome variable with four ordinal categories (severe, moderate, mild, and non-anemic) based on hemoglobin concentration in blood as per W… ▽ More

    Submitted 11 April, 2015; originally announced April 2015.

    Comments: 11 pages, 8 tables