Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 59 results for author: Srinivasan, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.02429  [pdf, other

    cs.CV

    Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis

    Authors: Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

    Abstract: We consider the problem of independently, in a disentangled fashion, controlling the outputs of text-to-image diffusion models with color and style attributes of a user-supplied reference image. We present the first training-free, test-time-only method to disentangle and condition text-to-image models on color and style attributes from reference image. To realize this, we propose two key innovatio… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 16 pages, 17 figures

  2. arXiv:2408.16749  [pdf

    cs.CL cs.AI

    Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge

    Authors: Beidi Dong, Jin R. Lee, Ziwei Zhu, Balassubramanian Srinivasan

    Abstract: The United States has experienced a significant increase in violent extremism, prompting the need for automated tools to detect and limit the spread of extremist ideology online. This study evaluates the performance of Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Trained Transformers (GPT) in detecting and classifying online domestic extremist posts. We collect… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  3. arXiv:2406.18893  [pdf, other

    cs.CV

    AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models

    Authors: Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

    Abstract: We consider the problem of customizing text-to-image diffusion models with user-supplied reference images. Given new prompts, the existing methods can capture the key concept from the reference images but fail to align the generated image with the prompt. In this work, we seek to address this key issue by proposing new methods that can easily be used in conjunction with existing customization meth… ▽ More

    Submitted 27 June, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: 10 pages, 9 figures

  4. arXiv:2406.06938  [pdf, other

    cs.CL

    Post-Hoc Answer Attribution for Grounded and Trustworthy Long Document Comprehension: Task, Insights, and Challenges

    Authors: Abhilasha Sancheti, Koustava Goswami, Balaji Vasan Srinivasan

    Abstract: Attributing answer text to its source document for information-seeking questions is crucial for building trustworthy, reliable, and accountable systems. We formulate a new task of post-hoc answer attribution for long document comprehension (LDC). Owing to the lack of long-form abstractive and information-seeking LDC datasets, we refactor existing datasets to assess the strengths and weaknesses of… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to *SEM 2024

  5. arXiv:2406.04673  [pdf, other

    cs.CV cs.AI cs.MM eess.AS

    MeLFusion: Synthesizing Music from Image and Language Cues using Diffusion Models

    Authors: Sanjoy Chowdhury, Sayan Nag, K J Joseph, Balaji Vasan Srinivasan, Dinesh Manocha

    Abstract: Music is a universal language that can communicate emotions and feelings. It forms an essential part of the whole spectrum of creative media, ranging from movies to social media posts. Machine learning models that can synthesize music are predominantly conditioned on textual descriptions of it. Inspired by how musicians compose music not just from a movie script, but also through visualizations, w… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted at CVPR 2024 as Highlight paper. Webpage: https://schowdhury671.github.io/melfusion_cvpr2024/

  6. arXiv:2405.17980  [pdf, other

    cs.CL

    Peering into the Mind of Language Models: An Approach for Attribution in Contextual Question Answering

    Authors: Anirudh Phukan, Shwetha Somasundaram, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan

    Abstract: With the enhancement in the field of generative artificial intelligence (AI), contextual question answering has become extremely relevant. Attributing model generations to the input source document is essential to ensure trustworthiness and reliability. We observe that when large language models (LLMs) are used for contextual question answering, the output answer often consists of text copied verb… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  7. arXiv:2404.04521  [pdf, other

    cs.SE cs.PL

    Automated Computer Program Evaluation and Projects -- Our Experiences

    Authors: Bama Srinivasan, Mala Nehru, Ranjani Parthasarathi, Saswati Mukherjee, Jeena A Thankachan

    Abstract: This paper provides a few approaches to automating computer programming and project submission tasks, that we have been following for the last six years and have found to be successful. The approaches include using CodeRunner with Learning Management System (LMS) integration for programming practice and evaluation, and Git (GitHub) for project submissions and automatic code evaluation. In this pap… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 14 pages, 15 figures

    ACM Class: D.3

    Journal ref: https://www.sxcejournal.com/spe-apr-2023/17.pdf

  8. arXiv:2402.14361  [pdf, other

    cs.LG

    OpenTab: Advancing Large Language Models as Open-domain Table Reasoners

    Authors: Kezhi Kong, Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Chuan Lei, Christos Faloutsos, Huzefa Rangwala, George Karypis

    Abstract: Large Language Models (LLMs) trained on large volumes of data excel at various natural language tasks, but they cannot handle tasks requiring knowledge that has not been trained on previously. One solution is to use a retriever that fetches relevant information to expand LLM's knowledge scope. However, existing textual-oriented retrieval-based LLMs are not ideal on structured table data due to div… ▽ More

    Submitted 12 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted by ICLR 2024

  9. arXiv:2401.01637  [pdf, other

    cs.CL

    Social Media Ready Caption Generation for Brands

    Authors: Himanshu Maheshwari, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan

    Abstract: Social media advertisements are key for brand marketing, aiming to attract consumers with captivating captions and pictures or logos. While previous research has focused on generating captions for general images, incorporating brand personalities into social media captioning remains unexplored. Brand personalities are shown to be affecting consumers' behaviours and social interactions and thus are… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

  10. arXiv:2311.15516  [pdf, other

    eess.SY cs.AI cs.LG

    Active Foundational Models for Fault Diagnosis of Electrical Motors

    Authors: Sriram Anbalagan, Sai Shashank GP, Deepesh Agarwal, Balasubramaniam Natarajan, Babji Srinivasan

    Abstract: Fault detection and diagnosis of electrical motors are of utmost importance in ensuring the safe and reliable operation of several industrial systems. Detection and diagnosis of faults at the incipient stage allows corrective actions to be taken in order to reduce the severity of faults. The existing data-driven deep learning approaches for machine fault diagnosis rely extensively on huge amounts… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

    Comments: 30 pages, 2 figures, 7 tables

  11. arXiv:2311.11919  [pdf, other

    cs.CV

    An Image is Worth Multiple Words: Multi-attribute Inversion for Constrained Text-to-Image Synthesis

    Authors: Aishwarya Agarwal, Srikrishna Karanam, Tripti Shukla, Balaji Vasan Srinivasan

    Abstract: We consider the problem of constraining diffusion model outputs with a user-supplied reference image. Our key objective is to extract multiple attributes (e.g., color, object, layout, style) from this single reference image, and then generate new samples with them. One line of existing work proposes to invert the reference images into a single textual conditioning vector, enabling generation of ne… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  12. arXiv:2311.11471  [pdf

    cs.CV cs.AI

    Towards AI enabled automated tracking of multiple boxers

    Authors: A. S. Karthikeyan, Vipul Baghel, Anish Monsley Kirupakaran, John Warburton, Ranganathan Srinivasan, Babji Srinivasan, Ravi Sadananda Hegde

    Abstract: Continuous tracking of boxers across multiple training sessions helps quantify traits required for the well-known ten-point-must system. However, continuous tracking of multiple athletes across multiple training sessions remains a challenge, because it is difficult to precisely segment bout boundaries in a recorded video stream. Furthermore, re-identification of the same athlete over different per… ▽ More

    Submitted 9 August, 2023; originally announced November 2023.

  13. PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation

    Authors: Vikas Dwivedi, Balaji Srinivasan, Ganapathy Krishnamurthi

    Abstract: Effective training of deep image segmentation models is challenging due to the need for abundant, high-quality annotations. Generating annotations is laborious and time-consuming for human experts, especially in medical image segmentation. To facilitate image annotation, we introduce Physics Informed Contour Selection (PICS) - an interpretable, physics-informed algorithm for rapid image segmentati… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  14. arXiv:2310.20046  [pdf, other

    cs.CL

    Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection

    Authors: Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis

    Abstract: Large Language Models (LLMs) can adapt to new tasks via in-context learning (ICL). ICL is efficient as it does not require any parameter updates to the trained LLM, but only few annotated examples as input for the LLM. In this work, we investigate an active learning approach for ICL, where there is a limited budget for annotating examples. We propose a model-adaptive optimization-free algorithm, t… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  15. arXiv:2310.13196  [pdf, other

    cs.CL cs.DB cs.LG

    NameGuess: Column Name Expansion for Tabular Data

    Authors: Jiani Zhang, Zhengyuan Shen, Balasubramaniam Srinivasan, Shen Wang, Huzefa Rangwala, George Karypis

    Abstract: Recent advances in large language models have revolutionized many sectors, including the database industry. One common challenge when dealing with large volumes of tabular data is the pervasive use of abbreviated column names, which can negatively impact performance on various data search, access, and understanding tasks. To address this issue, we introduce a new task, called NameGuess, to expand… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: This work has been accepted to EMNLP'23

  16. arXiv:2310.09656  [pdf, other

    cs.LG

    Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space

    Authors: Hengrui Zhang, Jiani Zhang, Balasubramaniam Srinivasan, Zhengyuan Shen, Xiao Qin, Christos Faloutsos, Huzefa Rangwala, George Karypis

    Abstract: Recent advances in tabular data generation have greatly enhanced synthetic data quality. However, extending diffusion models to tabular data is challenging due to the intricately varied distributions and a blend of data types of tabular data. This paper introduces Tabsyn, a methodology that synthesizes tabular data by leveraging a diffusion model within a variational autoencoder (VAE) crafted late… ▽ More

    Submitted 11 May, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

    Comments: Accepted by ICLR 2024 (Oral Presentation). Code is available at: https://github.com/amazon-science/tabsyn

  17. arXiv:2310.03320  [pdf, other

    cs.LG

    BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs

    Authors: Zifeng Wang, Zichen Wang, Balasubramaniam Srinivasan, Vassilis N. Ioannidis, Huzefa Rangwala, Rishita Anubhai

    Abstract: Foundation models (FMs) are able to leverage large volumes of unlabeled data to demonstrate superior performance across a wide range of tasks. However, FMs developed for biomedical domains have largely remained unimodal, i.e., independently trained and used for tasks on protein sequences alone, small molecule structures alone, or clinical data alone. To overcome this limitation of biomedical FMs,… ▽ More

    Submitted 18 January, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  18. arXiv:2309.00613  [pdf, other

    cs.CV cs.AI cs.LG

    Iterative Multi-granular Image Editing using Diffusion Models

    Authors: K J Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan

    Abstract: Recent advances in text-guided image synthesis has dramatically changed how creative professionals generate artistic and aesthetically pleasing visual assets. To fully support such creative endeavors, the process should possess the ability to: 1) iteratively edit the generations and 2) control the spatial reach of desired changes (global, local or anything in between). We formalize this pragmatic… ▽ More

    Submitted 28 October, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024

  19. arXiv:2308.16649  [pdf, other

    cs.CV

    Learning with Multi-modal Gradient Attention for Explainable Composed Image Retrieval

    Authors: Prateksha Udhayanan, Srikrishna Karanam, Balaji Vasan Srinivasan

    Abstract: We consider the problem of composed image retrieval that takes an input query consisting of an image and a modification text indicating the desired changes to be made on the image and retrieves images that match these changes. Current state-of-the-art techniques that address this problem use global features for the retrieval, resulting in incorrect localization of the regions of interest to be mod… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

  20. arXiv:2307.16891  [pdf, other

    eess.SY cs.AI cs.LG

    Foundational Models for Fault Diagnosis of Electrical Motors

    Authors: Sriram Anbalagan, Deepesh Agarwal, Balasubramaniam Natarajan, Babji Srinivasan

    Abstract: A majority of recent advancements related to the fault diagnosis of electrical motors are based on the assumption that training and testing data are drawn from the same distribution. However, the data distribution can vary across different operating conditions during real-world operating scenarios of electrical motors. Consequently, this assumption limits the practical implementation of existing s… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: 7 pages, 1 figure, 5 tables, submitted to IEEE PESGRE 2023

  21. arXiv:2307.08623  [pdf, other

    cs.LG cs.AI cs.CL

    HYTREL: Hypergraph-enhanced Tabular Data Representation Learning

    Authors: Pei Chen, Soumajyoti Sarkar, Leonard Lausen, Balasubramaniam Srinivasan, Sheng Zha, Ruihong Huang, George Karypis

    Abstract: Language models pretrained on large collections of tabular data have demonstrated their effectiveness in several downstream tasks. However, many of these models do not take into account the row/column permutation invariances, hierarchical structure, etc. that exist in tabular data. To alleviate these limitations, we propose HYTREL, a tabular language model, that captures the permutation invariance… ▽ More

    Submitted 26 October, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023 (spotlight)

  22. arXiv:2307.00910  [pdf, other

    cs.CV cs.AI

    CoPL: Contextual Prompt Learning for Vision-Language Understanding

    Authors: Koustava Goswami, Srikrishna Karanam, Prateksha Udhayanan, K J Joseph, Balaji Vasan Srinivasan

    Abstract: Recent advances in multimodal learning has resulted in powerful vision-language models, whose representations are generalizable across a variety of downstream tasks. Recently, their generalization ability has been further extended by incorporating trainable prompts, borrowed from the natural language processing literature. While such prompt learning techniques have shown impressive results, we ide… ▽ More

    Submitted 12 December, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

    Comments: Accepted at AAAI 2024

  23. arXiv:2306.14603  [pdf, other

    cs.CV

    Learning with Difference Attention for Visually Grounded Self-supervised Representations

    Authors: Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

    Abstract: Recent works in self-supervised learning have shown impressive results on single-object images, but they struggle to perform well on complex multi-object images as evidenced by their poor visual grounding. To demonstrate this concretely, we propose visual difference attention (VDA) to compute visual attention maps in an unsupervised fashion by comparing an image with its salient-regions-masked-out… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 15 pages, 14 figures

  24. arXiv:2306.14544  [pdf, other

    cs.CV

    A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis

    Authors: Aishwarya Agarwal, Srikrishna Karanam, K J Joseph, Apoorv Saxena, Koustava Goswami, Balaji Vasan Srinivasan

    Abstract: While recent developments in text-to-image generative models have led to a suite of high-performing methods capable of producing creative imagery from free-form text, there are several limitations. By analyzing the cross-attention representations of these models, we notice two key issues. First, for text prompts that contain multiple concepts, there is a significant amount of pixel-space overlap (… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: 15 pages, 16 figures

  25. arXiv:2212.09825  [pdf, other

    cs.CL

    What to Read in a Contract? Party-Specific Summarization of Legal Obligations, Entitlements, and Prohibitions

    Authors: Abhilasha Sancheti, Aparna Garimella, Balaji Vasan Srinivasan, Rachel Rudinger

    Abstract: Reviewing and comprehending key obligations, entitlements, and prohibitions in legal contracts can be a tedious task due to their length and domain-specificity. Furthermore, the key rights and duties requiring review vary for each contracting party. In this work, we propose a new task of party-specific extractive summarization for legal contracts to facilitate faster reviewing and improved compreh… ▽ More

    Submitted 24 October, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: EMNLP 2023

  26. arXiv:2211.12752  [pdf, other

    cs.CL

    Agent-Specific Deontic Modality Detection in Legal Language

    Authors: Abhilasha Sancheti, Aparna Garimella, Balaji Vasan Srinivasan, Rachel Rudinger

    Abstract: Legal documents are typically long and written in legalese, which makes it particularly difficult for laypeople to understand their rights and duties. While natural language understanding technologies can be valuable in supporting such understanding in the legal domain, the limited availability of datasets annotated for deontic modalities in the legal domain, due to the cost of hiring experts and… ▽ More

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022

  27. arXiv:2203.10483  [pdf, other

    cs.CL

    Entailment Relation Aware Paraphrase Generation

    Authors: Abhilasha Sancheti, Balaji Vasan Srinivasan, Rachel Rudinger

    Abstract: We introduce a new task of entailment relation aware paraphrase generation which aims at generating a paraphrase conforming to a given entailment relation (e.g. equivalent, forward entailing, or reverse entailing) with respect to a given input. We propose a reinforcement learning-based weakly-supervised paraphrasing system, ERAP, that can be trained using existing paraphrase and natural language i… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: 11 pages, 10 tables, 2 figures

  28. arXiv:2111.02987  [pdf

    cs.LG

    Numerical Approximation in CFD Problems Using Physics Informed Machine Learning

    Authors: Siddharth Rout, Vikas Dwivedi, Balaji Srinivasan

    Abstract: The thesis focuses on various techniques to find an alternate approximation method that could be universally used for a wide range of CFD problems but with low computational cost and low runtime. Various techniques have been explored within the field of machine learning to gauge the utility in fulfilling the core ambition. Steady advection diffusion problem has been used as the test case to unders… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

  29. arXiv:2110.15794  [pdf, other

    cs.CL cs.AI

    CLAUSEREC: A Clause Recommendation Framework for AI-aided Contract Authoring

    Authors: Vinay Aggarwal, Aparna Garimella, Balaji Vasan Srinivasan, Anandhavelu N, Rajiv Jain

    Abstract: Contracts are a common type of legal document that frequent in several day-to-day business workflows. However, there has been very limited NLP research in processing such documents, and even lesser in generating them. These contracts are made up of clauses, and the unique nature of these clauses calls for specific methods to understand and generate such documents. In this paper, we introduce the t… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

  30. arXiv:2110.03785  [pdf, other

    cs.LG

    Addressing practical challenges in Active Learning via a hybrid query strategy

    Authors: Deepesh Agarwal, Pravesh Srivastava, Sergio Martin-del-Campo, Balasubramaniam Natarajan, Babji Srinivasan

    Abstract: Active Learning (AL) is a powerful tool to address modern machine learning problems with significantly fewer labeled training instances. However, implementation of traditional AL methodologies in practical scenarios is accompanied by multiple challenges due to the inherent assumptions. There are several hindrances, such as unavailability of labels for the AL algorithm at the beginning; unreliable… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: 15 pages, 4 figures, 6 tables

  31. arXiv:2110.02910  [pdf, other

    cs.LG stat.ML

    Equivariant Subgraph Aggregation Networks

    Authors: Beatrice Bevilacqua, Fabrizio Frasca, Derek Lim, Balasubramaniam Srinivasan, Chen Cai, Gopinath Balamurugan, Michael M. Bronstein, Haggai Maron

    Abstract: Message-passing neural networks (MPNNs) are the leading architecture for deep learning on graph-structured data, in large part due to their simplicity and scalability. Unfortunately, it was shown that these architectures are limited in their expressive power. This paper proposes a novel framework called Equivariant Subgraph Aggregation Networks (ESAN) to address this issue. Our main observation is… ▽ More

    Submitted 16 March, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: Published at ICLR 2022, Spotlight. 46 pages

  32. arXiv:2104.07000  [pdf, other

    cs.CL

    IGA : An Intent-Guided Authoring Assistant

    Authors: Simeng Sun, Wenlong Zhao, Varun Manjunatha, Rajiv Jain, Vlad Morariu, Franck Dernoncourt, Balaji Vasan Srinivasan, Mohit Iyyer

    Abstract: While large-scale pretrained language models have significantly improved writing assistance functionalities such as autocomplete, more complex and controllable writing assistants have yet to be explored. We leverage advances in language modeling to build an interactive writing assistant that generates and rephrases text according to fine-grained author specifications. Users provide input to our In… ▽ More

    Submitted 19 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: EMNLP2021

  33. arXiv:2101.11836  [pdf, other

    cs.CL cs.AI cs.LG

    DRAG: Director-Generator Language Modelling Framework for Non-Parallel Author Stylized Rewriting

    Authors: Hrituraj Singh, Gaurav Verma, Aparna Garimella, Balaji Vasan Srinivasan

    Abstract: Author stylized rewriting is the task of rewriting an input text in a particular author's style. Recent works in this area have leveraged Transformer-based language models in a denoising autoencoder setup to generate author stylized text without relying on a parallel corpus of data. However, these approaches are limited by the lack of explicit control of target attributes and being entirely data-d… ▽ More

    Submitted 28 January, 2021; originally announced January 2021.

    Comments: Accepted as Long Paper to EACL 2021

  34. arXiv:2101.07773  [pdf, other

    cs.LG stat.ML

    Learning over Families of Sets -- Hypergraph Representation Learning for Higher Order Tasks

    Authors: Balasubramaniam Srinivasan, Da Zheng, George Karypis

    Abstract: Graph representation learning has made major strides over the past decade. However, in many relational domains, the input data are not suited for simple graph representations as the relationships between entities go beyond pairwise interactions. In such cases, the relationships in the data are better represented as hyperedges (set of entities) of a non-uniform hypergraph. While there have been wor… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: Published as a conference paper at SIAM International Conference on Data Mining(SDM 2021)

  35. arXiv:2010.11578  [pdf, other

    cs.CL

    Multi-Style Transfer with Discriminative Feedback on Disjoint Corpus

    Authors: Navita Goyal, Balaji Vasan Srinivasan, Anandhavelu Natarajan, Abhilasha Sancheti

    Abstract: Style transfer has been widely explored in natural language generation with non-parallel corpus by directly or indirectly extracting a notion of style from source and target domain corpus. A common shortcoming of existing approaches is the prerequisite of joint annotations across all the stylistic dimensions under consideration. Availability of such dataset across a combination of styles limits th… ▽ More

    Submitted 12 April, 2021; v1 submitted 22 October, 2020; originally announced October 2020.

    Report number: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3500–3510

  36. arXiv:2010.11553  [pdf, other

    cs.CL

    Incorporating Stylistic Lexical Preferences in Generative Language Models

    Authors: Hrituraj Singh, Gaurav Verma, Balaji Vasan Srinivasan

    Abstract: While recent advances in language modeling have resulted in powerful generation models, their generation style remains implicitly dependent on the training data and can not emulate a specific target style. Leveraging the generative capabilities of a transformer-based language models, we present an approach to induce certain target-author attributes by incorporating continuous multi-dimensional lex… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: To Appear in Findings of EMNLP 2020

  37. Abstracting Deep Neural Networks into Concept Graphs for Concept Level Interpretability

    Authors: Avinash Kori, Parth Natekar, Ganapathy Krishnamurthi, Balaji Srinivasan

    Abstract: The black-box nature of deep learning models prevents them from being completely trusted in domains like biomedicine. Most explainability techniques do not capture the concept-based reasoning that human beings follow. In this work, we attempt to understand the behavior of trained models that perform image processing tasks in the medical domain by building a graphical representation of the concepts… ▽ More

    Submitted 17 November, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

  38. arXiv:2005.05256  [pdf, other

    cs.CL cs.AI cs.LG

    Reinforced Rewards Framework for Text Style Transfer

    Authors: Abhilasha Sancheti, Kundan Krishna, Balaji Vasan Srinivasan, Anandhavelu Natarajan

    Abstract: Style transfer deals with the algorithms to transfer the stylistic properties of a piece of text into that of another while ensuring that the core content is preserved. There has been a lot of interest in the field of text style transfer due to its wide application to tailored text generation. Existing works evaluate the style transfer models based on content preservation and transfer strength. In… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: ECIR 2020

  39. arXiv:2004.14243  [pdf, other

    cs.CL

    Towards Transparent and Explainable Attention Models

    Authors: Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran

    Abstract: Recent studies on interpretability of attention distributions have led to notions of faithful and plausible explanations for a model's predictions. Attention distributions can be considered a faithful explanation if a higher attention weight implies a greater impact on the model's prediction. They can be considered a plausible explanation if they provide a human-understandable justification for th… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: Accepted at ACL 2020

  40. arXiv:2001.00258  [pdf, other

    eess.IV cs.CV cs.LG q-bio.TO

    A Generalized Deep Learning Framework for Whole-Slide Image Segmentation and Analysis

    Authors: Mahendra Khened, Avinash Kori, Haran Rajkumar, Balaji Srinivasan, Ganapathy Krishnamurthi

    Abstract: Histopathology tissue analysis is considered the gold standard in cancer diagnosis and prognosis. Given the large size of these images and the increase in the number of potential cancer cases, an automated solution as an aid to histopathologists is highly desirable. In the recent past, deep learning-based techniques have provided state of the art results in a wide variety of image analysis tasks,… ▽ More

    Submitted 18 November, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

  41. arXiv:1912.08492  [pdf, other

    cs.CL

    Generating summaries tailored to target characteristics

    Authors: Kushal Chawla, Hrituraj Singh, Arijit Pramanik, Mithlesh Kumar, Balaji Vasan Srinivasan

    Abstract: Recently, research efforts have gained pace to cater to varied user preferences while generating text summaries. While there have been attempts to incorporate a few handpicked characteristics such as length or entities, a holistic view around these preferences is missing and crucial insights on why certain characteristics should be incorporated in a specific manner are absent. With this objective,… ▽ More

    Submitted 18 December, 2019; originally announced December 2019.

    Comments: Appeared in CiCLing 2019

  42. arXiv:1911.08056  [pdf, other

    cs.LG stat.ML

    Modelling pressure-Hessian from local velocity gradients information in an incompressible turbulent flow field using deep neural networks

    Authors: Nishant Parashar, Sawan S. Sinha, Balaji Srinivasan

    Abstract: The understanding of the dynamics of the velocity gradients in turbulent flows is critical to understanding various non-linear turbulent processes. The pressure-Hessian and the viscous-Laplacian govern the evolution of the velocity-gradients and are known to be non-local in nature. Over the years, several simplified dynamical models have been proposed that models the viscous-Laplacian and the pres… ▽ More

    Submitted 18 November, 2019; originally announced November 2019.

  43. arXiv:1911.00523  [pdf, other

    cs.CL cs.AI cs.CY cs.HC cs.LG

    What Gets Echoed? Understanding the "Pointers" in Explanations of Persuasive Arguments

    Authors: David Atkinson, Kumar Bhargav Srinivasan, Chenhao Tan

    Abstract: Explanations are central to everyday life, and are a topic of growing interest in the AI community. To investigate the process of providing natural language explanations, we leverage the dynamics of the /r/ChangeMyView subreddit to build a dataset with 36K naturally occurring explanations of why an argument is persuasive. We propose a novel word-level prediction task to investigate how explanation… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: 19 pages, 3 figures, EMNLP 2019, the code and dataset are available at https://chenhaot.com/papers/explanation-pointers.html

  44. arXiv:1910.09563  [pdf, other

    cs.CY cs.CL cs.SI

    Content Removal as a Moderation Strategy: Compliance and Other Outcomes in the ChangeMyView Community

    Authors: Kumar Bhargav Srinivasan, Cristian Danescu-Niculescu-Mizil, Lillian Lee, Chenhao Tan

    Abstract: Moderators of online communities often employ comment deletion as a tool. We ask here whether, beyond the positive effects of shielding a community from undesirable content, does comment removal actually cause the behavior of the comment's author to improve? We examine this question in a particularly well-moderated community, the ChangeMyView subreddit. The standard analytic approach of interrup… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: 21 pages, 8 figures, accepted at CSCW 2019, the dataset is available at https://chenhaot.com/papers/content-removal.html

  45. arXiv:1910.00452  [pdf, other

    cs.LG stat.ML

    On the Equivalence between Positional Node Embeddings and Structural Graph Representations

    Authors: Balasubramaniam Srinivasan, Bruno Ribeiro

    Abstract: This work provides the first unifying theoretical framework for node (positional) embeddings and structural graph representations, bridging methods like matrix factorization and graph neural networks. Using invariant theory, we show that the relationship between structural representations and node embeddings is analogous to that of a distribution and its samples. We prove that all tasks that can b… ▽ More

    Submitted 21 September, 2020; v1 submitted 1 October, 2019; originally announced October 2019.

    Comments: This version corrects some typos in the definition of Σ, it should be Σ_n. Code available at https://github.com/PurdueMINDS/Equivalence

    Journal ref: Published as a conference paper at the Eighth International Conference on Learning Representations (ICLR 2020)

  46. arXiv:1909.09962  [pdf, other

    cs.CL cs.LG

    Adapting Language Models for Non-Parallel Author-Stylized Rewriting

    Authors: Bakhtiyar Syed, Gaurav Verma, Balaji Vasan Srinivasan, Anandhavelu Natarajan, Vasudeva Varma

    Abstract: Given the recent progress in language modeling using Transformer-based neural models and an active interest in generating stylized text, we present an approach to leverage the generalization capabilities of a language model to rewrite an input text in a target author's style. Our proposed approach adapts a pre-trained language model to generate author-stylized text by fine-tuning on the author-spe… ▽ More

    Submitted 31 October, 2020; v1 submitted 22 September, 2019; originally announced September 2019.

    Comments: Accepted for publication in Main Technical Track at AAAI 20

  47. arXiv:1909.08349  [pdf, other

    cs.CL cs.LG

    A Lexical, Syntactic, and Semantic Perspective for Understanding Style in Text

    Authors: Gaurav Verma, Balaji Vasan Srinivasan

    Abstract: With a growing interest in modeling inherent subjectivity in natural language, we present a linguistically-motivated process to understand and analyze the writing style of individuals from three perspectives: lexical, syntactic, and semantic. We discuss the stylistically expressive elements within each of these levels and use existing methods to quantify the linguistic intuitions related to some o… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

  48. arXiv:1909.05355  [pdf, other

    cs.CL cs.AI

    Let's Ask Again: Refine Network for Automatic Question Generation

    Authors: Preksha Nema, Akash Kumar Mohankumar, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran

    Abstract: In this work, we focus on the task of Automatic Question Generation (AQG) where given a passage and an answer the task is to generate the corresponding question. It is desired that the generated question should be (i) grammatically correct (ii) answerable from the passage and (iii) specific to the given answer. An analysis of existing AQG models shows that they produce questions which do not adher… ▽ More

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: accepted in EMNLP 2019 in Main Conference, (10 pages)

  49. arXiv:1907.08967  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Distributed physics informed neural network for data-efficient solution to partial differential equations

    Authors: Vikas Dwivedi, Nishant Parashar, Balaji Srinivasan

    Abstract: The physics informed neural network (PINN) is evolving as a viable method to solve partial differential equations. In the recent past PINNs have been successfully tested and validated to find solutions to both linear and non-linear partial differential equations (PDEs). However, the literature lacks detailed investigation of PINNs in terms of their representation capability. In this work, we first… ▽ More

    Submitted 21 July, 2019; originally announced July 2019.

    Comments: 16 pages, 8 figures

    Journal ref: Neurocomputing, 420, 299-316

  50. arXiv:1907.03507  [pdf, other

    cs.LG physics.comp-ph stat.ML

    Physics Informed Extreme Learning Machine (PIELM) -- A rapid method for the numerical solution of partial differential equations

    Authors: Vikas Dwivedi, Balaji Srinivasan

    Abstract: There has been rapid progress recently on the application of deep networks to the solution of partial differential equations, collectively labelled as Physics Informed Neural Networks (PINNs). In this paper, we develop Physics Informed Extreme Learning Machine (PIELM), a rapid version of PINNs which can be applied to stationary and time dependent linear partial differential equations. We demonstra… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 29 pages, 30 figures