Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–12 of 12 results for author: Samanta, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.02412  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    LLM Augmented LLMs: Expanding Capabilities through Composition

    Authors: Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar

    Abstract: Foundational models with billions of parameters which have been trained on large corpora of data have demonstrated non-trivial skills in a variety of domains. However, due to their monolithic structure, it is challenging and expensive to augment them or impart new skills. On the other hand, due to their adaptation abilities, several new instances of these models are being trained towards new domai… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 17 pages, 2 figures, 8 tables

  2. XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

    Authors: Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson , et al. (2 additional authors not shown)

    Abstract: Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot;… ▽ More

    Submitted 24 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  3. arXiv:2301.05852  [pdf, other

    cs.LG cond-mat.mtrl-sci

    CrysGNN : Distilling pre-trained knowledge to enhance property prediction for crystalline materials

    Authors: Kishalay Das, Bidisha Samanta, Pawan Goyal, Seung-Cheol Lee, Satadeep Bhattacharjee, Niloy Ganguly

    Abstract: In recent years, graph neural network (GNN) based approaches have emerged as a powerful technique to encode complex topological structure of crystal materials in an enriched representation space. These models are often supervised in nature and using the property-specific training data, learn relationship between crystal structure and different properties like formation energy, bandgap, bulk modulu… ▽ More

    Submitted 14 January, 2023; originally announced January 2023.

    Comments: 16 Pages,5 figures

  4. arXiv:2210.07313  [pdf, other

    cs.CL cs.LG

    Bootstrapping Multilingual Semantic Parsers using Large Language Models

    Authors: Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi, Partha Talukdar

    Abstract: Despite cross-lingual generalization demonstrated by pre-trained multilingual models, the translate-train paradigm of transferring English datasets across multiple languages remains to be a key mechanism for training task-specific multilingual models. However, for many low-resource languages, the availability of a reliable translation service entails significant amounts of costly human-annotated t… ▽ More

    Submitted 11 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL-23

  5. arXiv:2110.09570  [pdf

    cs.CL

    A Data Bootstrapping Recipe for Low Resource Multilingual Relation Classification

    Authors: Arijit Nag, Bidisha Samanta, Animesh Mukherjee, Niloy Ganguly, Soumen Chakrabarti

    Abstract: Relation classification (sometimes called 'extraction') requires trustworthy datasets for fine-tuning large language models, as well as for evaluation. Data collection is challenging for Indian languages, because they are syntactically and morphologically diverse, as well as different from resource-rich languages like English. Despite recent interest in deep generative models for Indian languages,… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  6. arXiv:2110.07385  [pdf, other

    cs.CL cs.LG

    Few-shot Controllable Style Transfer for Low-Resource Multilingual Settings

    Authors: Kalpesh Krishna, Deepak Nathani, Xavier Garcia, Bidisha Samanta, Partha Talukdar

    Abstract: Style transfer is the task of rewriting a sentence into a target style while approximately preserving content. While most prior literature assumes access to a large style-labelled corpus, recent work (Riley et al. 2021) has attempted "few-shot" style transfer using only 3-10 sentences at inference for style extraction. In this work we study a relevant low-resource setting: style transfer for langu… ▽ More

    Submitted 11 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: ACL 2022 camera ready, 30 pages

  7. arXiv:2006.09891  [pdf, other

    cs.CL cs.LG

    Fine-grained Sentiment Controlled Text Generation

    Authors: Bidisha Samanta, Mohit Agarwal, Niloy Ganguly

    Abstract: Controlled text generation techniques aim to regulate specific attributes (e.g. sentiment) while preserving the attribute independent content. The state-of-the-art approaches model the specified attribute as a structured or discrete representation while making the content representation independent of it to achieve a better control. However, disentangling the text representation into separate late… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  8. arXiv:1906.08972  [pdf, other

    cs.CL

    A Deep Generative Model for Code-Switched Text

    Authors: Bidisha Samanta, Sharmila Reddy, Hussain Jagirdar, Niloy Ganguly, Soumen Chakrabarti

    Abstract: Code-switching, the interleaving of two or more languages within a sentence or discourse is pervasive in multilingual societies. Accurate language models for code-switched text are critical for NLP tasks. State-of-the-art data-intensive neural language models are difficult to train well from scarce language-labeled code-switched text. A potential solution is to use deep generative models to synthe… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.

  9. arXiv:1906.05725  [pdf, other

    cs.CL

    Improved Sentiment Detection via Label Transfer from Monolingual to Synthetic Code-Switched Text

    Authors: Bidisha Samanta, Niloy Ganguly, Soumen Chakrabarti

    Abstract: Multilingual writers and speakers often alternate between two languages in a single discourse, a practice called "code-switching". Existing sentiment detection methods are usually trained on sentiment-labeled monolingual text. Manually labeled code-switched text, especially involving minority languages, is extremely rare. Consequently, the best monolingual methods perform relatively poorly on code… ▽ More

    Submitted 13 June, 2019; originally announced June 2019.

  10. arXiv:1802.05283  [pdf, other

    cs.LG physics.soc-ph stat.ML

    NeVAE: A Deep Generative Model for Molecular Graphs

    Authors: Bidisha Samanta, Abir De, Gourhari Jana, Pratim Kumar Chattaraj, Niloy Ganguly, Manuel Gomez-Rodriguez

    Abstract: Deep generative models have been praised for their ability to learn smooth latent representation of images, text, and audio, which can then be used to generate new, plausible data. However, current generative models are unable to work with molecular graphs due to their unique characteristics-their underlying structure is not Euclidean or grid-like, they remain isomorphic under permutation of the n… ▽ More

    Submitted 6 September, 2019; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: Accepted in AAAI 2019

  11. All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media

    Authors: Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee

    Abstract: In this paper, we present a set of computational methods to identify the likeliness of a word being borrowed, based on the signals from social media. In terms of Spearman correlation coefficient values, our methods perform more than two times better (nearly 0.62) in predicting the borrowing likeliness compared to the best performing baseline (nearly 0.26) reported in literature. Based on this like… ▽ More

    Submitted 29 July, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

    Comments: 11 pages, accepted in the 2017 conference on Empirical Methods on Natural Language Processing(EMNLP 2017) arXiv admin note: substantial text overlap with arXiv:1703.05122

  12. arXiv:1703.05122  [pdf, other

    cs.CL

    Is this word borrowed? An automatic approach to quantify the likeliness of borrowing in social media

    Authors: Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee

    Abstract: Code-mixing or code-switching are the effortless phenomena of natural switching between two or more languages in a single conversation. Use of a foreign word in a language; however, does not necessarily mean that the speaker is code-switching because often languages borrow lexical items from other languages. If a word is borrowed, it becomes a part of the lexicon of a language; whereas, during cod… ▽ More

    Submitted 15 March, 2017; originally announced March 2017.

    Comments: 11 pages, 3 Figures