
Showing 1–49 of 49 results for author: Paul, D

Searching in archive cs. Search in all archives.
  1. arXiv:2406.12655  [pdf, ps, other]

    cs.AI cs.SE

    Benchmarks and Metrics for Evaluations of Code Generation: A Critical Review

    Authors: Debalina Ghosh Paul, Hong Zhu, Ian Bayley

    Abstract: With the rapid development of Large Language Models (LLMs), a large number of machine learning models have been developed to assist programming tasks including the generation of program code from natural language input. However, how to evaluate such LLMs for this task is still an open problem despite of the great amount of research efforts that have been made and reported to evaluate and compare t…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by the First IEEE International Workshop on Testing and Evaluation of Large Language Models (TELLMe 2024) and will be published in the proceedings of the IEEE AITest 2024 conference

  2. arXiv:2406.12635  [pdf, other]

    cs.SE cs.AI

    ScenEval: A Benchmark for Scenario-Based Evaluation of Code Generation

    Authors: Debalina Ghosh Paul, Hong Zhu, Ian Bayley

    Abstract: In the scenario-based evaluation of machine learning models, a key problem is how to construct test datasets that represent various scenarios. The methodology proposed in this paper is to construct a benchmark and attach metadata to each test case. Then a test system can be constructed with test morphisms that filter the test cases based on metadata to form a dataset. The paper demonstrates this…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in the conference proceedings of IEEE AITest 2024

  3. arXiv:2403.19720  [pdf, other]

    math.ST cs.LG stat.ML

    Meta-Learning with Generalized Ridge Regression: High-dimensional Asymptotics, Optimality and Hyper-covariance Estimation

    Authors: Yanhao Jin, Krishnakumar Balasubramanian, Debashis Paul

    Abstract: Meta-learning involves training models on a variety of training tasks in a way that enables them to generalize well on new, unseen test tasks. In this work, we consider meta-learning within the framework of high-dimensional multivariate random-effects linear models and study generalized ridge-regression based predictions. The statistical intuition of using generalized ridge regression in this sett…

    Submitted 27 March, 2024; originally announced March 2024.

  4. PromptSet: A Programmer's Prompting Dataset

    Authors: Kaiser Pister, Dhruba Jyoti Paul, Patrick Brophy, Ishan Joshi

    Abstract: The rise of capabilities expressed by large language models has been quickly followed by the integration of the same complex systems into application level logic. Algorithms, programs, systems, and companies are built around structured prompting to black box models where the majority of the design and implementation lies in capturing and quantifying the `agent mode'. The standard way to shape a cl…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 8 pages, ICSE '24 LLM4Code Workshop

  5. arXiv:2402.13950  [pdf, other]

    cs.CL

    Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning

    Authors: Debjit Paul, Robert West, Antoine Bosselut, Boi Faltings

    Abstract: Large language models (LLMs) have been shown to perform better when asked to reason step-by-step before answering a question. However, it is unclear to what degree the model's final answer is faithful to the stated reasoning steps. In this paper, we perform a causal mediation analysis on twelve LLMs to examine how intermediate reasoning steps generated by the LLM influence the final outcome and fi…

    Submitted 23 February, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  6. arXiv:2401.14135  [pdf, other]

    cs.CL cs.CY cs.LG

    Convolutional Neural Networks can achieve binary bail judgement classification

    Authors: Amit Barman, Devangan Roy, Debapriya Paul, Indranil Dutta, Shouvik Kumar Guha, Samir Karmakar, Sudip Kumar Naskar

    Abstract: There is an evident lack of implementation of Machine Learning (ML) in the legal domain in India, and any research that does take place in this domain is usually based on data from the higher courts of law and works with English data. The lower courts and data from the different regional languages of India are often overlooked. In this paper, we deploy a Convolutional Neural Network (CNN) architec…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted on 20th International Conference on Natural Language Processing (ICON)

  7. arXiv:2401.03183  [pdf, other]

    cs.CL

    Exploring Defeasibility in Causal Reasoning

    Authors: Shaobo Cui, Lazar Milikic, Yiyang Feng, Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Boi Faltings

    Abstract: Defeasibility in causal reasoning implies that the causal relationship between cause and effect can be strengthened or weakened. Namely, the causal strength between cause and effect should increase or decrease with the incorporation of strengthening arguments (supporters) or weakening arguments (defeaters), respectively. However, existing works ignore defeasibility in causal reasoning and fail to…

    Submitted 27 June, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Accepted by ACL 2024 (Findings)

  8. arXiv:2311.15384  [pdf, other]

    stat.ML cs.LG stat.ME

    Robust and Automatic Data Clustering: Dirichlet Process meets Median-of-Means

    Authors: Supratik Basu, Jyotishka Ray Choudhury, Debolina Paul, Swagatam Das

    Abstract: Clustering stands as one of the most prominent challenges within the realm of unsupervised machine learning. Among the array of centroid-based clustering algorithms, the classic $k$-means algorithm, rooted in Lloyd's heuristic, takes center stage as one of the extensively employed techniques in the literature. Nonetheless, both $k$-means and its variants grapple with noteworthy limitations. These…

    Submitted 26 November, 2023; originally announced November 2023.

  9. arXiv:2311.04284  [pdf, other]

    cs.CL cs.AI

    CRAB: Assessing the Strength of Causal Relationships Between Real-world Events

    Authors: Angelika Romanou, Syrielle Montariol, Debjit Paul, Leo Laugier, Karl Aberer, Antoine Bosselut

    Abstract: Understanding narratives requires reasoning about the cause-and-effect relationships between events mentioned in the text. While existing foundation models yield impressive results in many NLP tasks requiring reasoning, it is unclear whether they understand the complexity of the underlying network of causal relationships of events in narratives. In this work, we present CRAB, a new Causal Reasonin…

    Submitted 7 November, 2023; originally announced November 2023.

  10. arXiv:2311.04157  [pdf, other]

    cs.CV cs.AI

    A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis

    Authors: Dipanjyoti Paul, Arpita Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel Stevens, Kaiya L. Provost, Anuj Karpatne, Bryan Carstens, Daniel Rubenstein, Charles Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

    Abstract: We present a novel usage of Transformers to make image classification interpretable. Unlike mainstream classifiers that wait until the last fully connected layer to incorporate class information to make predictions, we investigate a proactive approach, asking each class to search for itself in an image. We realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR)…

    Submitted 14 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted to International Conference on Learning Representations 2024 (ICLR 2024)

  11. arXiv:2311.03374  [pdf, other]

    cs.SE cs.AI cs.IR

    Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023

    Authors: Srijoni Majumdar, Soumen Paul, Debjyoti Paul, Ayan Bandyopadhyay, Samiran Chattopadhyay, Partha Pratim Das, Paul D Clough, Prasenjit Majumder

    Abstract: The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs e…

    Submitted 27 October, 2023; originally announced November 2023.

    Comments: Overview Paper of the Information Retrieval of Software Engineering Track at the Forum for Information Retrieval, 2023

  12. arXiv:2310.15239  [pdf, other]

    cs.CL cs.AI

    CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks

    Authors: Mete Ismayilzada, Debjit Paul, Syrielle Montariol, Mor Geva, Antoine Bosselut

    Abstract: Recent efforts in natural language processing (NLP) commonsense reasoning research have yielded a considerable number of new datasets and benchmarks. However, most of these datasets formulate commonsense reasoning challenges in artificial scenarios that are not reflective of the tasks which real-world NLP systems are designed to solve. In this work, we present CRoW, a manually-curated, multi-task…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 37 pages, camera-ready for EMNLP 2023

  13. arXiv:2309.17339  [pdf, other]

    cs.LG

    Scaling Experiments in Self-Supervised Cross-Table Representation Learning

    Authors: Maximilian Schambach, Dominique Paul, Johannes S. Otterbach

    Abstract: To analyze the scaling potential of deep tabular representation learning models, we introduce a novel Transformer-based architecture specifically tailored to tabular data and cross-table representation learning by utilizing table-specific tokenizers and a shared Transformer backbone. Our training approach encompasses both single-table and cross-table models, trained via missing value imputation th…

    Submitted 29 September, 2023; originally announced September 2023.

  14. arXiv:2309.08628  [pdf, other]

    cs.CL cs.CR cs.LG

    Recovering from Privacy-Preserving Masking with Large Language Models

    Authors: Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli

    Abstract: Model adaptation is crucial to handle the discrepancy between proxy training data and actual users data received. To effectively perform adaptation, textual data of users is typically stored on servers or their local devices, where downstream natural language processing (NLP) models can be directly trained using such in-domain data. However, this might raise privacy and security concerns due to th…

    Submitted 13 December, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP

  15. arXiv:2308.01285  [pdf, other]

    cs.AI cs.HC

    Flows: Building Blocks of Reasoning and Collaborating AI

    Authors: Martin Josifoski, Lars Klein, Maxime Peyrard, Nicolas Baldwin, Yifei Li, Saibo Geng, Julian Paul Schnitzler, Yuxing Yao, Jiheng Wei, Debjit Paul, Robert West

    Abstract: Recent advances in artificial intelligence (AI) have produced highly capable and controllable systems. This creates unprecedented opportunities for structured reasoning as well as collaboration among multiple AI systems and humans. To fully realize this potential, it is essential to develop a principled way of designing and studying such structured interactions. For this purpose, we introduce the…

    Submitted 7 February, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  16. arXiv:2305.09359  [pdf, other]

    cs.CL

    Constructing and Interpreting Causal Knowledge Graphs from News

    Authors: Fiona Anting Tan, Debdeep Paul, Sahim Yamaura, Miura Koji, See-Kiong Ng

    Abstract: Many financial jobs rely on news to learn about causal events in the past and present, to make informed decisions and predictions about the future. With the ever-increasing amount of news available online, there is a need to automate the extraction of causal events from unstructured texts. In this work, we propose a methodology to construct causal knowledge graphs (KGs) from news using two steps:…

    Submitted 30 July, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to AAAI Summer Symposium 2023 (AI4FinTech)

  17. arXiv:2304.01904  [pdf, other]

    cs.CL

    REFINER: Reasoning Feedback on Intermediate Representations

    Authors: Debjit Paul, Mete Ismayilzada, Maxime Peyrard, Beatriz Borges, Antoine Bosselut, Robert West, Boi Faltings

    Abstract: Language models (LMs) have recently shown remarkable performance on reasoning tasks by explicitly generating intermediate inferences, e.g., chain-of-thought prompting. However, these intermediate inference steps may be inappropriate deductions from the initial context and lead to incorrect final predictions. Here we introduce REFINER, a framework for finetuning LMs to explicitly generate intermedi…

    Submitted 4 February, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted at EACL 2024

  18. arXiv:2301.08506  [pdf, other]

    cs.CL cs.LG

    Language Agnostic Data-Driven Inverse Text Normalization

    Authors: Szu-Jui Chen, Debjyoti Paul, Yutong Pang, Peng Su, Xuedong Zhang

    Abstract: With the emergence of automatic speech recognition (ASR) models, converting the spoken form text (from ASR) to the written form is in urgent need. This inverse text normalization (ITN) problem attracts the attention of researchers from various fields. Recently, several works show that data-driven ITN methods can output high-quality written form text. Due to the scarcity of labeled spoken-written d…

    Submitted 23 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  19. arXiv:2210.07228  [pdf, other]

    cs.CL cs.LG

    Language Model Decoding as Likelihood-Utility Alignment

    Authors: Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kıcıman, Boi Faltings, Robert West

    Abstract: A critical component of a successful language generation pipeline is the decoding algorithm. However, the general principles that should guide the choice of a decoding algorithm remain unclear. Previous works only compare decoding algorithms in narrow scenarios, and their findings do not generalize across tasks. We argue that the misalignment between the model's likelihood and the task-specific no…

    Submitted 16 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at EACL (Findings) 2023

  20. arXiv:2207.09674  [pdf, other]

    cs.CL cs.LG cs.SD eess.AS

    Improving Data Driven Inverse Text Normalization using Data Augmentation

    Authors: Laxmi Pandey, Debjyoti Paul, Pooja Chitkara, Yutong Pang, Xuedong Zhang, Kjell Schubert, Mark Chou, Shu Liu, Yatharth Saraf

    Abstract: Inverse text normalization (ITN) is used to convert the spoken form output of an automatic speech recognition (ASR) system to a written form. Traditional handcrafted ITN rules can be complex to transcribe and maintain. Meanwhile neural modeling approaches require quality large-scale spoken-written pair examples in the same or similar domain as the ASR system (in-domain data), to train. Both these…

    Submitted 20 July, 2022; originally announced July 2022.

  21. arXiv:2201.01973  [pdf, other]

    stat.ML cs.LG math.ST

    Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das

    Abstract: The problem of linear predictions has been extensively studied for the past century under pretty generalized frameworks. Recent advances in the robust statistics literature allow us to analyze robust versions of classical linear models through the prism of Median of Means (MoM). Combining these approaches in a piecemeal way might lead to ad-hoc procedures, and the restricted theoretical conclusion…

    Submitted 11 March, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

  22. arXiv:2110.14148  [pdf, other]

    stat.ML cs.LG math.ST stat.ME

    Uniform Concentration Bounds toward a Unified Framework for Robust Clustering

    Authors: Debolina Paul, Saptarshi Chakraborty, Swagatam Das, Jason Xu

    Abstract: Recent advances in center-based clustering continue to improve upon the drawbacks of Lloyd's celebrated $k$-means algorithm over $60$ years after its introduction. Various methods seek to address poor local minima, sensitivity to outliers, and data that are not well-suited to Euclidean measures of fit, but many are supported largely empirically. Moreover, combining such approaches in a piecemeal m…

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: To appear (spotlight) in the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021

  23. arXiv:2108.04187  [pdf, other]

    cs.MM cs.LG cs.MA

    Scaling New Peaks: A Viewership-centric Approach to Automated Content Curation

    Authors: Subhabrata Majumdar, Deirdre Paul, Eric Zavesky

    Abstract: Summarizing video content is important for video streaming services to engage the user in a limited time span. To this end, current methods involve manual curation or using passive interest cues to annotate potential high-interest segments to form the basis of summarized videos, and are costly and unreliable. We propose a viewership-driven, automated method that accommodates a range of segment ide…

    Submitted 9 August, 2021; originally announced August 2021.

  24. arXiv:2106.03973  [pdf, other]

    cs.CL cs.AI

    Generating Hypothetical Events for Abductive Inference

    Authors: Debjit Paul, Anette Frank

    Abstract: Abductive reasoning starts from some observations and aims at finding the most plausible explanation for these observations. To perform abduction, humans often make use of temporal and causal inferences, and knowledge about how some hypothetical situation can result in different outcomes. This work offers the first study of how such knowledge impacts the Abductive NLI task -- which consists in cho…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Proceedings of The Tenth Joint Conference on Lexical and Computational Semantics (STARSEM 2021)

  25. arXiv:2106.02497  [pdf, other]

    cs.CL cs.AI

    COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion

    Authors: Debjit Paul, Anette Frank

    Abstract: Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific textual outputs. In this paper we present COINS, a recursive inference framework that i) iteratively re…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  26. arXiv:2105.12287  [pdf, other]

    cs.DB cs.AI

    Database Workload Characterization with Query Plan Encoders

    Authors: Debjyoti Paul, Jie Cao, Feifei Li, Vivek Srikumar

    Abstract: Smart databases are adopting artificial intelligence (AI) technologies to achieve {\em instance optimality}, and in the future, databases will come with prepackaged AI models within their core components. The reason is that every database runs on different workloads, demands specific resources, and settings to achieve optimal performance. It prompts the necessity to understand workloads running in…

    Submitted 25 May, 2021; originally announced May 2021.

  27. arXiv:2105.03157  [pdf, other]

    cs.CL

    CO-NNECT: A Framework for Revealing Commonsense Knowledge Paths as Explicitations of Implicit Knowledge in Texts

    Authors: Maria Becker, Katharina Korfhage, Debjit Paul, Anette Frank

    Abstract: In this work we leverage commonsense knowledge in form of knowledge paths to establish connections between sentences, as a form of explicitation of implicit knowledge. Such connections can be direct (singlehop paths) or require intermediate concepts (multihop paths). To construct such paths we combine two model types in a joint framework we call Co-nnect: a relation classifier that predicts direct…

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted at IWCS 2021

  28. arXiv:2105.00316  [pdf, other]

    cs.IT

    t-Entropy: A New Measure of Uncertainty with Some Applications

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das

    Abstract: The concept of Entropy plays a key role in Information Theory, Statistics, and Machine Learning. This paper introduces a new entropy measure, called the t-entropy, which exploits the concavity of the inverse-tan function. We analytically show that the proposed t-entropy satisfies the prominent axiomatic properties of an entropy measure. We demonstrate an application of the proposed entropy measure…

    Submitted 5 May, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

  29. arXiv:2103.16285  [pdf]

    cs.CV cs.AI

    Single Test Image-Based Automated Machine Learning System for Distinguishing between Trait and Diseased Blood Samples

    Authors: Sahar A. Nasser, Debjani Paul, Suyash P. Awate

    Abstract: We introduce a machine learning-based method for fully automated diagnosis of sickle cell disease of poor-quality unstained images of a mobile microscope. Our method is capable of distinguishing between diseased, trait (carrier), and normal samples unlike the previous methods that are limited to distinguishing the normal from the abnormal samples only. The novelty of this method comes from disting…

    Submitted 30 March, 2021; originally announced March 2021.

  30. arXiv:2102.03403  [pdf, other]

    stat.ML cs.LG math.ST

    Robust Principal Component Analysis: A Median of Means Approach

    Authors: Debolina Paul, Saptarshi Chakraborty, Swagatam Das

    Abstract: Principal Component Analysis (PCA) is a fundamental tool for data visualization, denoising, and dimensionality reduction. It is widely popular in Statistics, Machine Learning, Computer Vision, and related fields. However, PCA is well-known to fall prey to outliers and often fails to detect the true underlying low-dimensional structure within the dataset. Following the Median of Means (MoM) philoso…

    Submitted 20 July, 2023; v1 submitted 5 February, 2021; originally announced February 2021.

  31. arXiv:2012.10929  [pdf, other]

    cs.LG stat.ML

    Automated Clustering of High-dimensional Data with a Feature Weighted Mean Shift Algorithm

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das

    Abstract: Mean shift is a simple interactive procedure that gradually shifts data points towards the mode which denotes the highest density of data points in the region. Mean shift algorithms have been effectively used for data denoising, mode seeking, and finding the number of clusters in a dataset in an automated fashion. However, the merits of mean shift quickly fade away as the data dimensions increase…

    Submitted 10 May, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: To appear at the 35-th AAAI Conference on Artificial Intelligence, February 2-9, 2021

  32. Scheduling Beyond CPUs for HPC

    Authors: Yuping Fan, Zhiling Lan, Paul Rich, William E. Allcock, Michael E. Papka, Brian Austin, David Paul

    Abstract: High performance computing (HPC) is undergoing significant changes. The emerging HPC applications comprise both compute- and data-intensive applications. To meet the intense I/O demand from emerging data-intensive applications, burst buffers are deployed in production systems. Existing HPC schedulers are mainly CPU-centric. The extreme heterogeneity of hardware devices, combined with workload chan…

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: Accepted by HPDC 2019

    Journal ref: Proceedings of the 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'19), 2019

  33. arXiv:2012.03162  [pdf, other]

    cs.CR

    MeLPUF: Memory-in-Logic PUF Structures for Low-Overhead IC Authentication

    Authors: Christopher Vega, Shubhra Deb Paul, Patanjali SLPSK, Swarup Bhunia

    Abstract: Physically Unclonable Functions (PUFs) are used for securing electronic devices across the implementation spectrum ranging from Field Programmable Gate Array (FPGA) to system on chips (SoCs). However, existing PUF implementations often suffer from one or more significant deficiencies: (1) significant design overhead; (2) difficulty to configure and integrate based on application-specific requireme…

    Submitted 29 March, 2023; v1 submitted 5 December, 2020; originally announced December 2020.

  34. arXiv:2011.06461  [pdf, other]

    stat.ML cs.LG

    Kernel k-Means, By All Means: Algorithms and Strong Consistency

    Authors: Debolina Paul, Saptarshi Chakraborty, Swagatam Das, Jason Xu

    Abstract: Kernel $k$-means clustering is a powerful tool for unsupervised learning of non-linearly separable data. Since the earliest attempts, researchers have noted that such algorithms often become trapped by local minima arising from non-convexity of the underlying objective function. In this paper, we generalize recent results leveraging a general family of means to combat sub-optimal local solutions t…

    Submitted 12 November, 2020; originally announced November 2020.

  35. arXiv:2010.05587  [pdf, other]

    cs.CL

    Social Commonsense Reasoning with Multi-Head Knowledge Attention

    Authors: Debjit Paul, Anette Frank

    Abstract: Social Commonsense Reasoning requires understanding of text, knowledge about social events and their pragmatic implications, as well as commonsense reasoning skills. In this work we propose a novel multi-head knowledge attention model that encodes semi-structured commonsense inference rules and learns to incorporate them in a transformer-based reasoning cell. We assess the model's performance on t…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  36. arXiv:2008.05809  [pdf, other]

    cs.SD cs.LG eess.AS

    Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion

    Authors: Dipjyoti Paul, Muhammed PV Shifas, Yannis Pantazis, Yannis Stylianou

    Abstract: The increased adoption of digital assistants makes text-to-speech (TTS) synthesis systems an indispensable feature of modern mobile devices. It is hence desirable to build a system capable of generating highly intelligible speech in the presence of noise. Past studies have investigated style conversion in TTS synthesis, yet degraded synthesized quality often leads to worse intelligibility. To over…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Accepted in INTERSPEECH 2020

  37. arXiv:2008.05289  [pdf, other]

    eess.AS cs.LG cs.SD

    Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions

    Authors: Dipjyoti Paul, Yannis Pantazis, Yannis Stylianou

    Abstract: Recent advancements in deep learning led to human-level performance in single-speaker speech synthesis. However, there are still limitations in terms of speech quality when generalizing those systems into multiple-speaker models especially for unseen speakers and unseen recording qualities. For instance, conventional neural vocoders are adjusted to the training speaker and have poor generalization…

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: Accepted in INTERSPEECH 2020

  38. arXiv:2006.06625  [pdf, other]

    cs.LG cs.IT stat.ML

    Cumulant GAN

    Authors: Yannis Pantazis, Dipjyoti Paul, Michail Fasoulakis, Yannis Stylianou, Markos Katsoulakis

    Abstract: In this paper, we propose a novel loss function for training Generative Adversarial Networks (GANs) aiming towards deeper theoretical understanding as well as improved stability and performance for the underlying optimization problem. The new loss function is based on cumulant generating functions giving rise to \emph{Cumulant GAN}. Relying on a recently-derived variational formula, we show that t…

    Submitted 24 August, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 37 pages, 27 figures

  39. A Review of Computational Approaches for Evaluation of Rehabilitation Exercises

    Authors: Yalin Liao, Aleksandar Vakanski, Min Xian, David Paul, Russell Baker

    Abstract: Recent advances in data analytics and computer-aided diagnostics stimulate the vision of patient-centric precision healthcare, where treatment plans are customized based on the health records and needs of every patient. In physical rehabilitation, the progress in machine learning and the advent of affordable and reliable motion capture sensors have been conducive to the development of approaches f…

    Submitted 19 March, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

    Comments: 29 pages, 1 figure

    ACM Class: J.3; A.1

    Journal ref: Computers in Biology and Medicine, vol. 119, 2020

  40. arXiv:2001.03452  [pdf, other]

    stat.ML cs.LG

    Entropy Regularized Power k-Means Clustering

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das, Jason Xu

    Abstract: Despite its well-known shortcomings, $k$-means remains one of the most widely used approaches to data clustering. Current research continues to tackle its flaws while attempting to preserve its simplicity. Recently, the \textit{power $k$-means} algorithm was proposed to avoid trapping in local minima by annealing through a family of smoother surfaces. However, the approach lacks theoretical justif…

    Submitted 10 January, 2020; originally announced January 2020.

    Comments: Accepted (in updated form) for presentation in the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020), Palermo, Italy, June 03, 2020 - June 05, 2020

  41. arXiv:1908.08674  [pdf, other]

    cs.CV cs.CL cs.LG

    A BLSTM Network for Printed Bengali OCR System with High Accuracy

    Authors: Debabrata Paul, Bidyut Baran Chaudhuri

    Abstract: This paper presents a printed Bengali and English text OCR system developed by us using a single hidden BLSTM-CTC architecture having 128 units. Here, we did not use any peephole connection and dropout in the BLSTM, which helped us in getting better accuracy. This architecture was trained by 47,720 text lines that include English words also. When tested over 20 different Bengali fonts, it has prod…

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: 6 pages, 6 figures, This OCR system is available online at https://banglaocr.nltr.org

  42. arXiv:1905.09369  [pdf, other]

    math.ST cs.LG eess.SP

    Sparse Equisigned PCA: Algorithms and Performance Bounds in the Noisy Rank-1 Setting

    Authors: Arvind Prasadan, Raj Rao Nadakuditi, Debashis Paul

    Abstract: Singular value decomposition (SVD) based principal component analysis (PCA) breaks down in the high-dimensional and limited sample size regime below a certain critical eigen-SNR that depends on the dimensionality of the system and the number of samples. Below this critical eigen-SNR, the estimates returned by the SVD are asymptotically uncorrelated with the latent principal components. We consider…

    Submitted 16 December, 2019; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: To appear, Electronic Journal of Statistics, 2020

  43. arXiv:1904.00676  [pdf, other]

    cs.CL

    Ranking and Selecting Multi-Hop Knowledge Paths to Better Predict Human Needs

    Authors: Debjit Paul, Anette Frank

    Abstract: To make machines better understand sentiments, research needs to move from polarity identification to understanding the reasons that underlie the expression of sentiment. Categorizing the goals or needs of humans is one way to explain the expression of sentiment in text. Humans are good at understanding situations described in natural language and can easily connect them to the character's psychol…

    Submitted 1 April, 2019; originally announced April 2019.

  44. arXiv:1903.12008  [pdf, other]

    cs.CL cs.LG

    Handling Noisy Labels for Robustly Learning from Self-Training Data for Low-Resource Sequence Labeling

    Authors: Debjit Paul, Mittul Singh, Michael A. Hedderich, Dietrich Klakow

    Abstract: In this paper, we address the problem of effectively self-training neural networks in a low-resource setting. Self-training is frequently used to automatically increase the amount of training data. However, in a low-resource scenario, it is less effective due to unreliable annotations created using self-labeling of unlabeled data. We propose to combine self-training with noise handling on the self…

    Submitted 28 March, 2019; originally announced March 2019.

  45. arXiv:1901.08025  [pdf, ps, other]

    cs.MM eess.AS

    Generalization of Spoofing Countermeasures: a Case Study with ASVspoof 2015 and BTAS 2016 Corpora

    Authors: Dipjyoti Paul, Md Sahidullah, Goutam Saha

    Abstract: Voice-based biometric systems are highly prone to spoofing attacks. Recently, various countermeasures have been developed for detecting different kinds of attacks such as replay, speech synthesis (SS) and voice conversion (VC). Most of the existing studies are conducted with a specific training set defined by the evaluation protocol. However, for realistic scenarios, selecting appropriate training…

    Submitted 23 January, 2019; originally announced January 2019.

    Journal ref: Published in 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), New Orleans, LA, USA

  46. arXiv:1811.02598  [pdf, ps, other]

    cs.LG stat.ML

    Training Generative Adversarial Networks with Weights

    Authors: Yannis Pantazis, Dipjyoti Paul, Michail Fasoulakis, Yannis Stylianou

    Abstract: The impressive success of Generative Adversarial Networks (GANs) is often overshadowed by the difficulties in their training. Despite the continuous efforts and improvements, there are still open issues regarding their convergence properties. In this paper, we propose a simple training variation where suitable weights are defined and assist the training of the Generator. We provide theoretical arg…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: 6 pages, 3 figures, submitted to Icassp2019

  47. arXiv:1612.07523  [pdf, ps, other]

    cs.SD cs.LG stat.ML

    Robustness of Voice Conversion Techniques Under Mismatched Conditions

    Authors: Monisankha Pal, Dipjyoti Paul, Md Sahidullah, Goutam Saha

    Abstract: Most of the existing studies on voice conversion (VC) are conducted in acoustically matched conditions between source and target signal. However, the robustness of VC methods in presence of mismatch remains unknown. In this paper, we report a comparative analysis of different VC techniques under mismatched conditions. The extensive experiments with five different VC techniques on CMU ARCTIC corpus…

    Submitted 22 December, 2016; originally announced December 2016.

    Comments: 5 pages, 2 figures

  48. arXiv:1606.07608  [pdf, other]

    cs.IR

    Using Word Embeddings for Automatic Query Expansion

    Authors: Dwaipayan Roy, Debjyoti Paul, Mandar Mitra, Utpal Garain

    Abstract: In this paper a framework for Automatic Query Expansion (AQE) is proposed using distributed neural language model word2vec. Using semantic and contextual relation in a distributed and unsupervised framework, word2vec learns a low dimensional embedding for each vocabulary entry. Using such a framework, we devise a query expansion technique, where related terms to a query are obtained by K-nearest n…

    Submitted 24 June, 2016; originally announced June 2016.

    Comments: 5 pages, 3 tables, 1 figure. Neu-IR '16 SIGIR Workshop on Neural Information Retrieval July 21, 2016, Pisa, Italy

  49. Novel Speech Features for Improved Detection of Spoofing Attacks

    Authors: Dipjyoti Paul, Monisankha Pal, Goutam Saha

    Abstract: Now-a-days, speech-based biometric systems such as automatic speaker verification (ASV) are highly prone to spoofing attacks by an imposture. With recent development in various voice conversion (VC) and speech synthesis (SS) algorithms, these spoofing attacks can pose a serious potential threat to the current state-of-the-art ASV systems. To impede such attacks and enhance the security of the ASV…

    Submitted 14 March, 2016; originally announced March 2016.

    Comments: Presented in IEEE 2015 Annual IEEE India Conference (INDICON)