Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–22 of 22 results for author: Nair, V N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.01300  [pdf

    stat.ML cs.LG

    Assessing Robustness of Machine Learning Models using Covariate Perturbations

    Authors: Arun Prakash R, Anwesha Bhattacharyya, Joel Vaughan, Vijayan N. Nair

    Abstract: As machine learning models become increasingly prevalent in critical decision-making models and systems in fields like finance, healthcare, etc., ensuring their robustness against adversarial attacks and changes in the input data is paramount, especially in cases where models potentially overfit. This paper proposes a comprehensive framework for assessing the robustness of machine learning models… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 31 pages, 11 figures, 14 tables

  2. arXiv:2309.02426  [pdf

    stat.ML cs.LG

    Monotone Tree-Based GAMI Models by Adapting XGBoost

    Authors: Linwei Hu, Soroush Aramideh, Jie Chen, Vijayan N. Nair

    Abstract: Recent papers have used machine learning architecture to fit low-order functional ANOVA models with main effects and second-order interactions. These GAMI (GAM + Interaction) models are directly interpretable as the functional main effects and interactions can be easily plotted and visualized. Unfortunately, it is not easy to incorporate the monotonicity requirement into the existing GAMI models b… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 12 pages

  3. arXiv:2308.09341  [pdf

    cs.CL cs.LG

    Document Automation Architectures: Updated Survey in Light of Large Language Models

    Authors: Mohammad Ahmadi Achachlouei, Omkar Patil, Tarun Joshi, Vijayan N. Nair

    Abstract: This paper surveys the current state of the art in document automation (DA). The objective of DA is to reduce the manual effort during the generation of documents by automatically creating and integrating input from different sources and assembling documents conforming to defined templates. There have been reviews of commercial solutions of DA, particularly in the legal domain, but to date there h… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: The current paper is the updated version of an earlier survey on document automation [Ahmadi Achachlouei et al. 2021]. Updates in the current paper are as follows: We shortened almost all sections to reduce the size of the main paper (without references) from 28 pages to 10 pages, added a review of selected papers on large language models, removed certain sections and most of diagrams. arXiv admin note: substantial text overlap with arXiv:2109.11603

    MSC Class: 68T50 ACM Class: I.7.0; I.2.7; I.2.4

  4. arXiv:2305.15670  [pdf

    stat.ML cs.LG

    Interpretable Machine Learning based on Functional ANOVA Framework: Algorithms and Comparisons

    Authors: Linwei Hu, Vijayan N. Nair, Agus Sudjianto, Aijun Zhang, Jie Chen

    Abstract: In the early days of machine learning (ML), the emphasis was on developing complex algorithms to achieve best predictive performance. To understand and explain the model results, one had to rely on post hoc explainability techniques, which are known to have limitations. Recently, with the recognition that interpretability is just as important, researchers are compromising on small increases in pre… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 24 pages, 15 figures. arXiv admin note: substantial text overlap with arXiv:2207.06950

  5. arXiv:2211.08536  [pdf

    cs.LG

    Behavior of Hyper-Parameters for Selected Machine Learning Algorithms: An Empirical Investigation

    Authors: Anwesha Bhattacharyya, Joel Vaughan, Vijayan N. Nair

    Abstract: Hyper-parameters (HPs) are an important part of machine learning (ML) model development and can greatly influence performance. This paper studies their behavior for three algorithms: Extreme Gradient Boosting (XGB), Random Forest (RF), and Feedforward Neural Network (FFNN) with structured data. Our empirical investigation examines the qualitative behavior of model performance as the HPs vary, quan… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

  6. arXiv:2208.06096  [pdf

    cs.LG stat.ML

    Comparing Baseline Shapley and Integrated Gradients for Local Explanation: Some Additional Insights

    Authors: Tianshu Feng, Zhipu Zhou, Joshi Tarun, Vijayan N. Nair

    Abstract: There are many different methods in the literature for local explanation of machine learning results. However, the methods differ in their approaches and often do not provide same explanations. In this paper, we consider two recent methods: Integrated Gradients (Sundararajan, Taly, & Yan, 2017) and Baseline Shapley (Sundararajan and Najmi, 2020). The original authors have already studied the axiom… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  7. arXiv:2207.06950  [pdf

    stat.ML cs.LG

    Using Model-Based Trees with Boosting to Fit Low-Order Functional ANOVA Models

    Authors: Linwei Hu, Jie Chen, Vijayan N. Nair

    Abstract: Low-order functional ANOVA (fANOVA) models have been rediscovered in the machine learning (ML) community under the guise of inherently interpretable machine learning. Explainable Boosting Machines or EBM (Lou et al. 2013) and GAMI-Net (Yang et al. 2021) are two recently proposed ML algorithms for fitting functional main effects and second-order interactions. We propose a new algorithm, called GAMI… ▽ More

    Submitted 15 December, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: 25 pages plus appendix

  8. arXiv:2206.12353  [pdf

    stat.ML cs.LG

    Quantifying Inherent Randomness in Machine Learning Algorithms

    Authors: Soham Raste, Rahul Singh, Joel Vaughan, Vijayan N. Nair

    Abstract: Most machine learning (ML) algorithms have several stochastic elements, and their performances are affected by these sources of randomness. This paper uses an empirical study to systematically examine the effects of two sources: randomness in model training and randomness in the partitioning of a dataset into training and test subsets. We quantify and compare the magnitude of the variation in pred… ▽ More

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: 14 pages, 4 Figures, 5 tables

  9. arXiv:2205.12723  [pdf

    cs.LG

    Interpretable Feature Engineering for Time Series Predictors using Attention Networks

    Authors: Tianjie Wang, Jie Chen, Joel Vaughan, Vijayan N. Nair

    Abstract: Regression problems with time-series predictors are common in banking and many other areas of application. In this paper, we use multi-head attention networks to develop interpretable features and use them to achieve good predictive performance. The customized attention layer explicitly uses multiplicative interactions and builds feature-engineering heads that capture temporal dynamics in a parsim… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  10. arXiv:2204.12868  [pdf

    stat.ML cs.LG

    Performance and Interpretability Comparisons of Supervised Machine Learning Algorithms: An Empirical Study

    Authors: Alice J. Liu, Arpita Mukherjee, Linwei Hu, Jie Chen, Vijayan N. Nair

    Abstract: This paper compares the performances of three supervised machine learning algorithms in terms of predictive ability and model interpretation on structured or tabular data. The algorithms considered were scikit-learn implementations of extreme gradient boosting machines (XGB) and random forests (RFs), and feedforward neural networks (FFNNs) from TensorFlow. The paper is organized in a findings-base… ▽ More

    Submitted 5 May, 2022; v1 submitted 27 April, 2022; originally announced April 2022.

  11. arXiv:2204.12365  [pdf

    stat.ML cs.LG

    Explaining Adverse Actions in Credit Decisions Using Shapley Decomposition

    Authors: Vijayan N. Nair, Tianshu Feng, Linwei Hu, Zach Zhang, Jie Chen, Agus Sudjianto

    Abstract: When a financial institution declines an application for credit, an adverse action (AA) is said to occur. The applicant is then entitled to an explanation for the negative decision. This paper focuses on credit decisions based on a predictive model for probability of default and proposes a methodology for AA explanation. The problem involves identifying the important predictors responsible for the… ▽ More

    Submitted 26 April, 2022; originally announced April 2022.

    Comments: 20 pages, 8 figures

  12. arXiv:2109.11603  [pdf

    cs.CL cs.LG

    Document Automation Architectures and Technologies: A Survey

    Authors: Mohammad Ahmadi Achachlouei, Omkar Patil, Tarun Joshi, Vijayan N. Nair

    Abstract: This paper surveys the current state of the art in document automation (DA). The objective of DA is to reduce the manual effort during the generation of documents by automatically integrating input from different sources and assembling documents conforming to defined templates. There have been reviews of commercial solutions of DA, particularly in the legal domain, but to date there has been no co… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: 34 pages, 11 figures, 5 tables

    MSC Class: 68T50 ACM Class: I.7.0; I.2.7; I.2.4

  13. arXiv:2105.08589  [pdf

    cs.CL

    Self-interpretable Convolutional Neural Networks for Text Classification

    Authors: Wei Zhao, Rahul Singh, Tarun Joshi, Agus Sudjianto, Vijayan N. Nair

    Abstract: Deep learning models for natural language processing (NLP) are inherently complex and often viewed as black box in nature. This paper develops an approach for interpreting convolutional neural networks for text classification problems by exploiting the local-linear models inherent in ReLU-DNNs. The CNN model combines the word embedding through convolutional layers, filters them using max-pooling,… ▽ More

    Submitted 8 July, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

  14. arXiv:2105.06558  [pdf

    stat.ML cs.LG

    Bias, Fairness, and Accountability with AI and ML Algorithms

    Authors: Nengfeng Zhou, Zach Zhang, Vijayan N. Nair, Harsh Singhal, Jie Chen, Agus Sudjianto

    Abstract: The advent of AI and ML algorithms has led to opportunities as well as challenges. In this paper, we provide an overview of bias and fairness issues that arise with the use of ML algorithms. We describe the types and sources of data bias, and discuss the nature of algorithmic unfairness. This is followed by a review of fairness metrics in the literature, discussion of their limitations, and a desc… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

    Comments: 18 pages, 5 figures

    MSC Class: 00-02

  15. arXiv:2009.02358  [pdf

    cs.CL cs.AI

    Recent Trends in the Use of Deep Learning Models for Grammar Error Handling

    Authors: Mina Naghshnejad, Tarun Joshi, Vijayan N. Nair

    Abstract: Grammar error handling (GEH) is an important topic in natural language processing (NLP). GEH includes both grammar error detection and grammar error correction. Recent advances in computation systems have promoted the use of deep learning (DL) models for NLP problems such as GEH. In this survey we focus on two main DL approaches for GEH: neural machine translation models and editor models. We desc… ▽ More

    Submitted 4 September, 2020; originally announced September 2020.

  16. arXiv:2008.11825  [pdf

    cs.CL cs.LG

    SHAP values for Explaining CNN-based Text Classification Models

    Authors: Wei Zhao, Tarun Joshi, Vijayan N. Nair, Agus Sudjianto

    Abstract: Deep neural networks are increasingly used in natural language processing (NLP) models. However, the need to interpret and explain the results from complex algorithms are limiting their widespread adoption in regulated industries such as banking. There has been recent work on interpretability of machine learning algorithms with structured data. But there are only limited techniques for NLP applica… ▽ More

    Submitted 8 July, 2021; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: 17 pages, 5 figures

  17. arXiv:2008.05536  [pdf

    cs.CL cs.LG

    Model Robustness with Text Classification: Semantic-preserving adversarial attacks

    Authors: Rahul Singh, Tarun Joshi, Vijayan N. Nair, Agus Sudjianto

    Abstract: We propose algorithms to create adversarial attacks to assess model robustness in text classification problems. They can be used to create white box attacks and black box attacks while at the same time preserving the semantics and syntax of the original text. The attacks cause significant number of flips in white-box setting and same rule based can be used in black-box setting. In a black-box sett… ▽ More

    Submitted 13 August, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

    Comments: 12 Pages, 3 Figures, 10 Tables

  18. arXiv:2008.04059  [pdf

    q-fin.GN cs.LG stat.ML

    Supervised Machine Learning Techniques: An Overview with Applications to Banking

    Authors: Linwei Hu, Jie Chen, Joel Vaughan, Hanyu Yang, Kelly Wang, Agus Sudjianto, Vijayan N. Nair

    Abstract: This article provides an overview of Supervised Machine Learning (SML) with a focus on applications to banking. The SML techniques covered include Bagging (Random Forest or RF), Boosting (Gradient Boosting Machine or GBM) and Neural Networks (NNs). We begin with an introduction to ML tasks and techniques. This is followed by a description of: i) tree-based ensemble algorithms including Bagging wit… ▽ More

    Submitted 28 July, 2020; originally announced August 2020.

  19. arXiv:2007.14528  [pdf

    stat.ML cs.LG

    Surrogate Locally-Interpretable Models with Supervised Machine Learning Algorithms

    Authors: Linwei Hu, Jie Chen, Vijayan N. Nair, Agus Sudjianto

    Abstract: Supervised Machine Learning (SML) algorithms, such as Gradient Boosting, Random Forest, and Neural Networks, have become popular in recent years due to their superior predictive performance over traditional statistical methods. However, their complexity makes the results hard to interpret without additional tools. There has been a lot of recent work in developing global and local diagnostics for i… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

  20. arXiv:2004.02353  [pdf

    stat.ML cs.AI cs.LG

    Adaptive Explainable Neural Networks (AxNNs)

    Authors: Jie Chen, Joel Vaughan, Vijayan N. Nair, Agus Sudjianto

    Abstract: While machine learning techniques have been successfully applied in several fields, the black-box nature of the models presents challenges for interpreting and explaining the results. We develop a new framework called Adaptive Explainable Neural Networks (AxNN) for achieving the dual goals of good predictive performance and model interpretability. For predictive performance, we build a structured… ▽ More

    Submitted 2 June, 2020; v1 submitted 5 April, 2020; originally announced April 2020.

  21. arXiv:1806.01933  [pdf, other

    stat.ML cs.LG

    Explainable Neural Networks based on Additive Index Models

    Authors: Joel Vaughan, Agus Sudjianto, Erind Brahimi, Jie Chen, Vijayan N. Nair

    Abstract: Machine Learning algorithms are increasingly being used in recent years due to their flexibility in model fitting and increased predictive performance. However, the complexity of the models makes them hard for the data analyst to interpret the results and explain them without additional tools. This has led to much research in developing various approaches to understand the model behavior. In this… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

    Comments: 10 pages, 8 figures

  22. arXiv:1806.00663  [pdf

    stat.ML cs.LG

    Locally Interpretable Models and Effects based on Supervised Partitioning (LIME-SUP)

    Authors: Linwei Hu, Jie Chen, Vijayan N. Nair, Agus Sudjianto

    Abstract: Supervised Machine Learning (SML) algorithms such as Gradient Boosting, Random Forest, and Neural Networks have become popular in recent years due to their increased predictive performance over traditional statistical methods. This is especially true with large data sets (millions or more observations and hundreds to thousands of predictors). However, the complexity of the SML models makes them op… ▽ More

    Submitted 2 June, 2018; originally announced June 2018.

    Comments: 15 pages, 10 figures