Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–44 of 44 results for author: Bhatia, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16412  [pdf, other

    cs.CL cs.LG

    KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge

    Authors: Pengcheng Jiang, Lang Cao, Cao Xiao, Parminder Bhatia, Jimeng Sun, Jiawei Han

    Abstract: Knowledge Graph Embedding (KGE) techniques are crucial in learning compact representations of entities and relations within a knowledge graph, facilitating efficient reasoning and knowledge discovery. While existing methods typically focus either on training KGE models solely based on graph structure or fine-tuning pre-trained language models with classification data in KG, KG-FIT leverages LLM-gu… ▽ More

    Submitted 4 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2405.15973  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

    Authors: Xiyao Wang, Jiuhai Chen, Zhaoyang Wang, Yuhang Zhou, Yiyang Zhou, Huaxiu Yao, Tianyi Zhou, Tom Goldstein, Parminder Bhatia, Furong Huang, Cao Xiao

    Abstract: Large vision-language models (LVLMs) have achieved impressive results in various visual question-answering and reasoning tasks through vision instruction tuning on specific datasets. However, there is still significant room for improvement in the alignment between visual and language modalities. Previous methods to enhance this alignment typically require external models or data, heavily depending… ▽ More

    Submitted 7 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 15 pages, 8 figures

  3. arXiv:2403.10351  [pdf, other

    cs.CL

    TriSum: Learning Summarization Ability from Large Language Models with Structured Rationale

    Authors: Pengcheng Jiang, Cao Xiao, Zifeng Wang, Parminder Bhatia, Jimeng Sun, Jiawei Han

    Abstract: The advent of large language models (LLMs) has significantly advanced natural language processing tasks like text summarization. However, their large size and computational demands, coupled with privacy concerns in data transmission, limit their use in resource-constrained and privacy-centric settings. To overcome this, we introduce TriSum, a framework for distilling LLMs' text summarization abili… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: NAACL'24

  4. arXiv:2403.08845  [pdf, other

    cs.LG cs.AI

    Bifurcated Attention for Single-Context Large-Batch Sampling

    Authors: Ben Athiwaratkun, Sujan Kumar Gonugondla, Sanjay Krishna Gouda, Haifeng Qian, Hantian Ding, Qing Sun, Jun Wang, Jiacheng Guo, Liangfu Chen, Parminder Bhatia, Ramesh Nallapati, Sudipta Sengupta, Bing Xiang

    Abstract: In our study, we present bifurcated attention, a method developed for language model inference in single-context batch sampling contexts. This approach aims to reduce redundant memory IO costs, a significant factor in latency for high batch sizes and long context lengths. Bifurcated attention achieves this by dividing the attention mechanism during incremental decoding into two distinct GEMM opera… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2310.18642  [pdf

    cs.CV cs.AI

    One-shot Localization and Segmentation of Medical Images with Foundation Models

    Authors: Deepa Anand, Gurunath Reddy M, Vanika Singhal, Dattesh D. Shanbhag, Shriram KS, Uday Patil, Chitresh Bhushan, Kavitha Manickam, Dawei Gui, Rakesh Mullick, Avinash Gopal, Parminder Bhatia, Taha Kass-Hout

    Abstract: Recent advances in Vision Transformers (ViT) and Stable Diffusion (SD) models with their ability to capture rich semantic features of the image have been used for image correspondence tasks on natural images. In this paper, we examine the ability of a variety of pre-trained ViT (DINO, DINOv2, SAM, CLIP) and SD models, trained exclusively on natural images, for solving the correspondence problems o… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023 R0-FoMo Workshop

  6. arXiv:2310.16872  [pdf, other

    eess.IV cs.CV

    SonoSAMTrack -- Segment and Track Anything on Ultrasound Images

    Authors: Hariharan Ravishankar, Rohan Patil, Vikram Melapudi, Harsh Suthar, Stephan Anzengruber, Parminder Bhatia, Kass-Hout Taha, Pavan Annangi

    Abstract: In this paper, we present SonoSAMTrack - that combines a promptable foundational model for segmenting objects of interest on ultrasound images called SonoSAM, with a state-of-the art contour tracking model to propagate segmentations on 2D+t and 3D ultrasound datasets. Fine-tuned and tested exclusively on a rich, diverse set of objects from $\approx200$k ultrasound image-mask pairs, SonoSAM demonst… ▽ More

    Submitted 16 November, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  7. arXiv:2310.11248  [pdf, other

    cs.LG cs.CL cs.SE

    CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion

    Authors: Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth, Bing Xiang

    Abstract: Code completion models have made significant progress in recent years, yet current popular evaluation datasets, such as HumanEval and MBPP, predominantly focus on code completion tasks within a single file. This over-simplified setting falls short of representing the real-world software development scenario where repositories span multiple files with numerous cross-file dependencies, and accessing… ▽ More

    Submitted 16 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: To appear at NeurIPS 2023 (Datasets and Benchmarks Track)

  8. arXiv:2307.02435  [pdf, other

    cs.LG cs.CL cs.SE

    Exploring Continual Learning for Code Generation Models

    Authors: Prateek Yadav, Qing Sun, Hantian Ding, Xiaopeng Li, Dejiao Zhang, Ming Tan, Xiaofei Ma, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Mohit Bansal, Bing Xiang

    Abstract: Large-scale code generation models such as Codex and CodeT5 have achieved impressive performance. However, libraries are upgraded or deprecated very frequently and re-training large-scale language models is computationally expensive. Therefore, Continual Learning (CL) is an important aspect that remains underexplored in the code domain. In this paper, we introduce a benchmark called CodeTask-CL th… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: ACL 2023

  9. arXiv:2306.03203  [pdf, other

    cs.CL cs.SE

    A Static Evaluation of Code Completion by Large Language Models

    Authors: Hantian Ding, Varun Kumar, Yuchen Tian, Zijian Wang, Rob Kwiatkowski, Xiaopeng Li, Murali Krishna Ramanathan, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang

    Abstract: Large language models trained on code have shown great potential to increase productivity of software developers. Several execution-based benchmarks have been proposed to evaluate functional correctness of model-generated code on simple programming problems. Nevertheless, it is expensive to perform the same evaluation on complex real-world projects considering the execution cost. On the contrary,… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted by ACL 2023 industry track

  10. arXiv:2303.05378  [pdf, other

    cs.LG cs.SE

    Greener yet Powerful: Taming Large Code Generation Models with Quantization

    Authors: Xiaokai Wei, Sujan Gonugondla, Wasi Ahmad, Shiqi Wang, Baishakhi Ray, Haifeng Qian, Xiaopeng Li, Varun Kumar, Zijian Wang, Yuchen Tian, Qing Sun, Ben Athiwaratkun, Mingyue Shang, Murali Krishna Ramanathan, Parminder Bhatia, Bing Xiang

    Abstract: ML-powered code generation aims to assist developers to write code in a more productive manner, by intelligently generating code blocks based on natural language prompts. Recently, large pretrained deep learning models have substantially pushed the boundary of code generation and achieved impressive performance. Despite their great power, the huge number of model parameters poses a significant thr… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 10 pages, 7 figures, 10 tables

  11. arXiv:2302.14383  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

    Authors: Matthew Trager, Pramuditha Perera, Luca Zancato, Alessandro Achille, Parminder Bhatia, Stefano Soatto

    Abstract: We investigate compositional structures in data embeddings from pre-trained vision-language models (VLMs). Traditionally, compositionality has been associated with algebraic operations on embeddings of words from a pre-existing vocabulary. In contrast, we seek to approximate representations from an encoder as combinations of a smaller set of vectors in the embedding space. These vectors can be see… ▽ More

    Submitted 11 January, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 18 pages, 9 figures, 7 tables

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision 2023 (pp. 15395-15404)

  12. arXiv:2212.10264  [pdf, other

    cs.LG cs.CL cs.SE

    ReCode: Robustness Evaluation of Code Generation Models

    Authors: Shiqi Wang, Zheng Li, Haifeng Qian, Chenghao Yang, Zijian Wang, Mingyue Shang, Varun Kumar, Samson Tan, Baishakhi Ray, Parminder Bhatia, Ramesh Nallapati, Murali Krishna Ramanathan, Dan Roth, Bing Xiang

    Abstract: Code generation models have achieved impressive performance. However, they tend to be brittle as slight edits to a prompt could lead to very different generations; these robustness properties, critical for user experience when deployed in real-life applications, are not well understood. Most existing works on robustness in text or code tasks have focused on classification, while robustness in gene… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: Code and data available at https://github.com/amazon-science/recode

  13. arXiv:2212.10007  [pdf, other

    cs.CL cs.SE

    CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context

    Authors: Yangruibo Ding, Zijian Wang, Wasi Uddin Ahmad, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth, Bing Xiang

    Abstract: While pre-trained language models (LM) for code have achieved great success in code completion, they generate code conditioned only on the contents within the file, i.e., in-file context, but ignore the rich semantics in other files within the same project, i.e., cross-file context, a critical source of information that is especially useful in modern modular software development. Such overlooking… ▽ More

    Submitted 24 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

  14. arXiv:2210.14868  [pdf, other

    cs.LG cs.CL

    Multi-lingual Evaluation of Code Generation Models

    Authors: Ben Athiwaratkun, Sanjay Krishna Gouda, Zijian Wang, Xiaopeng Li, Yuchen Tian, Ming Tan, Wasi Uddin Ahmad, Shiqi Wang, Qing Sun, Mingyue Shang, Sujan Kumar Gonugondla, Hantian Ding, Varun Kumar, Nathan Fulton, Arash Farahani, Siddhartha Jain, Robert Giaquinto, Haifeng Qian, Murali Krishna Ramanathan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Sudipta Sengupta, Dan Roth, Bing Xiang

    Abstract: We present new benchmarks on evaluation code generation models: MBXP and Multilingual HumanEval, and MathQA-X. These datasets cover over 10 programming languages and are generated using a scalable conversion framework that transpiles prompts and test cases from the original Python datasets into the corresponding data in the target language. Using these benchmarks, we are able to assess the perform… ▽ More

    Submitted 28 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: Code and data release: https://github.com/amazon-research/mxeval

  15. arXiv:2210.01185  [pdf, other

    cs.CL

    ContraCLM: Contrastive Learning For Causal Language Model

    Authors: Nihal Jain, Dejiao Zhang, Wasi Uddin Ahmad, Zijian Wang, Feng Nan, Xiaopeng Li, Ming Tan, Ramesh Nallapati, Baishakhi Ray, Parminder Bhatia, Xiaofei Ma, Bing Xiang

    Abstract: Despite exciting progress in causal language models, the expressiveness of the representations is largely limited due to poor discrimination ability. To remedy this issue, we present ContraCLM, a novel contrastive learning framework at both token-level and sequence-level. We assess ContraCLM on a variety of downstream tasks. We show that ContraCLM enhances discrimination of the representations and… ▽ More

    Submitted 2 May, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 10 pages

    Journal ref: ACL 2023

  16. arXiv:2205.09240  [pdf, other

    cs.IR cs.AI cs.CY

    Debiasing Neural Retrieval via In-batch Balancing Regularization

    Authors: Yuantong Li, Xiaokai Wei, Zijian Wang, Shen Wang, Parminder Bhatia, Xiaofei Ma, Andrew Arnold

    Abstract: People frequently interact with information retrieval (IR) systems, however, IR models exhibit biases and discrimination towards various demographics. The in-processing fair ranking methods provide a trade-offs between accuracy and fairness through adding a fairness-related regularization term in the loss function. However, there haven't been intuitive objective functions that depend on the click… ▽ More

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: 9 pages, 1 figure, and 3 tables. A version appears in the Proceedings of the 4th Workshop on Gender Bias in Natural Language Processing (GeBNLP), 2022

  17. arXiv:2203.11239  [pdf, other

    cs.CL

    DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization

    Authors: Zheng Li, Zijian Wang, Ming Tan, Ramesh Nallapati, Parminder Bhatia, Andrew Arnold, Bing Xiang, Dan Roth

    Abstract: Large-scale pre-trained sequence-to-sequence models like BART and T5 achieve state-of-the-art performance on many generative NLP tasks. However, such models pose a great challenge in resource-constrained scenarios owing to their large memory requirements and high latency. To alleviate this issue, we propose to jointly distill and quantize the model, where knowledge is transferred from the full-pre… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

    Comments: ACL 2022

  18. arXiv:2202.01340  [pdf, other

    cs.LG

    An Artificial Intelligence Dataset for Solar Energy Locations in India

    Authors: Anthony Ortiz, Dhaval Negandhi, Sagar R Mysorekar, Joseph Kiesecker, Shivaprakash K Nagaraju, Caleb Robinson, Priyal Bhatia, Aditi Khurana, Jane Wang, Felipe Oviedo, Juan Lavista Ferres

    Abstract: Rapid development of renewable energy sources, particularly solar photovoltaics (PV), is critical to mitigate climate change. As a result, India has set ambitious goals to install 500 gigawatts of solar energy capacity by 2030. Given the large footprint projected to meet renewables energy targets, the potential for land use conflicts over environmental values is high. To expedite development of so… ▽ More

    Submitted 30 June, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: Accepted for publication in Nature Scientific Data

  19. arXiv:2111.06012  [pdf, other

    cs.CL cs.LG

    Kronecker Factorization for Preventing Catastrophic Forgetting in Large-scale Medical Entity Linking

    Authors: Denis Jered McInerney, Luyang Kong, Kristjan Arumae, Byron Wallace, Parminder Bhatia

    Abstract: Multi-task learning is useful in NLP because it is often practically desirable to have a single model that works across a range of tasks. In the medical domain, sequential training on tasks may sometimes be the only way to train models, either because access to the original (potentially sensitive) data is no longer available, or simply owing to the computational costs inherent to joint retraining.… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

  20. arXiv:2110.08455  [pdf, other

    cs.CL

    Knowledge Enhanced Pretrained Language Models: A Compreshensive Survey

    Authors: Xiaokai Wei, Shen Wang, Dejiao Zhang, Parminder Bhatia, Andrew Arnold

    Abstract: Pretrained Language Models (PLM) have established a new paradigm through learning informative contextualized representations on large-scale text corpus. This new paradigm has revolutionized the entire field of natural language processing, and set the new state-of-the-art performance for a wide variety of NLP tasks. However, though PLMs could store certain knowledge/facts from training corpus, thei… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

  21. arXiv:2107.11094  [pdf, other

    cs.CL

    Improving Early Sepsis Prediction with Multi Modal Learning

    Authors: Fred Qin, Vivek Madan, Ujjwal Ratan, Zohar Karnin, Vishaal Kapoor, Parminder Bhatia, Taha Kass-Hout

    Abstract: Sepsis is a life-threatening disease with high morbidity, mortality and healthcare costs. The early prediction and administration of antibiotics and intravenous fluids is considered crucial for the treatment of sepsis and can save potentially millions of lives and billions in health care costs. Professional clinical care practitioners have proposed clinical criterion which aid in early detection o… ▽ More

    Submitted 23 July, 2021; originally announced July 2021.

  22. arXiv:2105.13225  [pdf, other

    cs.CL cs.AI

    Neural Entity Recognition with Gazetteer based Fusion

    Authors: Qing Sun, Parminder Bhatia

    Abstract: Incorporating external knowledge into Named Entity Recognition (NER) systems has been widely studied in the generic domain. In this paper, we focus on clinical domain where only limited data is accessible and interpretability is important. Recent advancement in technology and the acceleration of clinical trials has resulted in the discovery of new drugs, procedures as well as medical conditions. T… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

    Journal ref: the Association for Computational Linguistics (ACL) 2021

  23. arXiv:2105.12682  [pdf, other

    cs.CL cs.AI

    Zero-shot Medical Entity Retrieval without Annotation: Learning From Rich Knowledge Graph Semantics

    Authors: Luyang Kong, Christopher Winestock, Parminder Bhatia

    Abstract: Medical entity retrieval is an integral component for understanding and communicating information across various health systems. Current approaches tend to work well on specific medical domains but generalize poorly to unseen sub-specialties. This is of increasing concern under a public health crisis as new medical conditions and drug treatments come to light frequently. Zero-shot retrieval is cha… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  24. arXiv:2104.13498  [pdf, other

    cs.CL cs.LG

    Towards Clinical Encounter Summarization: Learning to Compose Discharge Summaries from Prior Notes

    Authors: Han-Chin Shing, Chaitanya Shivade, Nima Pourdamghani, Feng Nan, Philip Resnik, Douglas Oard, Parminder Bhatia

    Abstract: The records of a clinical encounter can be extensive and complex, thus placing a premium on tools that can extract and summarize relevant information. This paper introduces the task of generating discharge summaries for a clinical encounter. Summaries in this setting need to be faithful, traceable, and scale to multiple long documents, motivating the use of extract-then-abstract summarization casc… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  25. arXiv:2010.00784  [pdf, other

    cs.CL

    An Empirical Investigation Towards Efficient Multi-Domain Language Model Pre-training

    Authors: Kristjan Arumae, Qing Sun, Parminder Bhatia

    Abstract: Pre-training large language models has become a standard in the natural language processing community. Such models are pre-trained on generic data (e.g. BookCorpus and English Wikipedia) and often fine-tuned on tasks in the same domain. However, in order to achieve state-of-the-art performance on out of domain tasks such as clinical named entity recognition and relation extraction, additional in d… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: text overlap with arXiv:2004.03794

  26. arXiv:2009.07241  [pdf, other

    stat.ML cs.LG

    Improve black-box sequential anomaly detector relevancy with limited user feedback

    Authors: Luyang Kong, Lifan Chen, Ming Chen, Parminder Bhatia, Laurent Callot

    Abstract: Anomaly detectors are often designed to catch statistical anomalies. End-users typically do not have interest in all of the detected outliers, but only those relevant to their application. Given an existing black-box sequential anomaly detector, this paper proposes a method to improve its user relevancy using a small number of human feedback. As our first contribution, the method is agnostic to th… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  27. arXiv:2007.12731  [pdf, other

    cs.IR cs.AI cs.CL

    COVID-19 Knowledge Graph: Accelerating Information Retrieval and Discovery for Scientific Literature

    Authors: Colby Wise, Vassilis N. Ioannidis, Miguel Romero Calvo, Xiang Song, George Price, Ninad Kulkarni, Ryan Brand, Parminder Bhatia, George Karypis

    Abstract: The coronavirus disease (COVID-19) has claimed the lives of over 350,000 people and infected more than 6 million people worldwide. Several search engines have surfaced to provide researchers with additional tools to find and retrieve information from the rapidly growing corpora on COVID-19. These engines lack extraction and visualization tools necessary to retrieve and interpret complex relations… ▽ More

    Submitted 24 July, 2020; originally announced July 2020.

  28. arXiv:2007.09186  [pdf, other

    cs.IR

    AWS CORD-19 Search: A Neural Search Engine for COVID-19 Literature

    Authors: Parminder Bhatia, Lan Liu, Kristjan Arumae, Nima Pourdamghani, Suyog Deshpande, Ben Snively, Mona Mona, Colby Wise, George Price, Shyam Ramaswamy, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang, Taha Kass-Hout

    Abstract: Coronavirus disease (COVID-19) has been declared as a pandemic by WHO with thousands of cases being reported each day. Numerous scientific articles are being published on the disease raising the need for a service which can organize, and query them in a reliable fashion. To support this cause we present AWS CORD-19 Search (ACS), a public, COVID-19 specific, neural search engine that is powered by… ▽ More

    Submitted 7 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

  29. arXiv:2007.00492  [pdf, other

    cs.CL cs.CY cs.LG stat.ML

    Towards User Friendly Medication Mapping Using Entity-Boosted Two-Tower Neural Network

    Authors: Shaoqing Yuan, Parminder Bhatia, Busra Celikkaya, Haiyang Liu, Kyunghwan Choi

    Abstract: Recent advancements in medical entity linking have been applied in the area of scientific literature and social media data. However, with the adoption of telemedicine and conversational agents such as Alexa in healthcare settings, medical name inference has become an important task. Medication name inference is the task of mapping user friendly medication names from a free-form text to a concept i… ▽ More

    Submitted 9 October, 2020; v1 submitted 17 June, 2020; originally announced July 2020.

  30. arXiv:2006.13299  [pdf, other

    cs.CL cs.LG

    Supervised Understanding of Word Embeddings

    Authors: Halid Ziya Yerebakan, Parmeet Bhatia, Yoshihisa Shinagawa

    Abstract: Pre-trained word embeddings are widely used for transfer learning in natural language processing. The embeddings are continuous and distributed representations of the words that preserve their similarities in compact Euclidean spaces. However, the dimensions of these spaces do not provide any clear interpretation. In this study, we have obtained supervised projections in the form of the linear key… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

  31. arXiv:2004.04295  [pdf, ps, other

    cs.CL

    Severing the Edge Between Before and After: Neural Architectures for Temporal Ordering of Events

    Authors: Miguel Ballesteros, Rishita Anubhai, Shuai Wang, Nima Pourdamghani, Yogarshi Vyas, Jie Ma, Parminder Bhatia, Kathleen McKeown, Yaser Al-Onaizan

    Abstract: In this paper, we propose a neural architecture and a set of training methods for ordering events by predicting temporal relations. Our proposed models receive a pair of events within a span of text as input and they identify temporal relations (Before, After, Equal, Vague) between them. Given that a key challenge with this task is the scarcity of annotated data, our models rely on either pretrain… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  32. arXiv:2004.03794  [pdf, other

    cs.CL

    CALM: Continuous Adaptive Learning for Language Modeling

    Authors: Kristjan Arumae, Parminder Bhatia

    Abstract: Training large language representation models has become a standard in the natural language processing community. This allows for fine tuning on any number of specific tasks, however, these large high capacity models can continue to train on domain specific unlabeled data to make initialization even more robust for supervised tasks. We demonstrate that in practice these pre-trained models present… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  33. arXiv:1911.09787  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    LATTE: Latent Type Modeling for Biomedical Entity Linking

    Authors: Ming Zhu, Busra Celikkaya, Parminder Bhatia, Chandan K. Reddy

    Abstract: Entity linking is the task of linking mentions of named entities in natural language text, to entities in a curated knowledge-base. This is of significant importance in the biomedical domain, where it could be used to semantically annotate a large volume of clinical records and biomedical literature, to standardized concepts described in an ontology such as Unified Medical Language System (UMLS).… ▽ More

    Submitted 20 January, 2020; v1 submitted 21 November, 2019; originally announced November 2019.

    Comments: AAAI 2020 Conference

  34. arXiv:1910.07659  [pdf, ps, other

    cs.CL

    Towards Annotating and Creating Sub-Sentence Summary Highlights

    Authors: Kristjan Arumae, Parminder Bhatia, Fei Liu

    Abstract: Highlighting is a powerful tool to pick out important content and emphasize. Creating summary highlights at the sub-sentence level is particularly desirable, because sub-sentences are more concise than whole sentences. They are also better suited than individual words and phrases that can potentially lead to disfluent, fragmented summaries. In this paper we seek to generate summary highlights by a… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

  35. arXiv:1910.07419  [pdf, other

    cs.CL

    Comprehend Medical: a Named Entity Recognition and Relationship Extraction Web Service

    Authors: Parminder Bhatia, Busra Celikkaya, Mohammed Khalilia, Selvan Senthivel

    Abstract: Comprehend Medical is a stateless and Health Insurance Portability and Accountability Act (HIPAA) eligible Named Entity Recognition (NER) and Relationship Extraction (RE) service launched under Amazon Web Services (AWS) trained using state-of-the-art deep learning models. Contrary to many existing open source tools, Comprehend Medical is scalable and does not require steep learning curve, dependen… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: ICMLA 2019. $©$ 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses in any current or future media including reprinting/republishing this material for advertising, promotional purposes, creating new collective works, for resale or redistribution to servers or lists, reuse of any copyrighted component of this work in other works. arXiv admin note: text overlap with arXiv:1812.05270

  36. arXiv:1902.09271  [pdf, other

    cs.CL

    Relation Extraction using Explicit Context Conditioning

    Authors: Gaurav Singh, Parminder Bhatia

    Abstract: Relation Extraction (RE) aims to label relations between groups of marked entities in raw text. Most current RE models learn context-aware representations of the target entities that are then used to establish relation between them. This works well for intra-sentence RE and we call them first-order relations. However, this methodology can sometimes fail to capture complex and long dependencies. To… ▽ More

    Submitted 25 February, 2019; originally announced February 2019.

    Comments: Accepted for Publication at NAACL 2019

  37. arXiv:1812.05288  [pdf, other

    cs.LG cs.CL stat.ML

    Dynamic Transfer Learning for Named Entity Recognition

    Authors: Parminder Bhatia, Kristjan Arumae, Busra Celikkaya

    Abstract: State-of-the-art named entity recognition (NER) systems have been improving continuously using neural architectures over the past several years. However, many tasks including NER require large sets of annotated data to achieve such performance. In particular, we focus on NER from clinical notes, which is one of the most fundamental and critical problems for medical text analysis. Our work centers… ▽ More

    Submitted 20 January, 2020; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: AAAI 2019 Workshop on Health Intelligence

  38. Joint Entity Extraction and Assertion Detection for Clinical Text

    Authors: Parminder Bhatia, Busra Celikkaya, Mohammed Khalilia

    Abstract: Negative medical findings are prevalent in clinical reports, yet discriminating them from positive findings remains a challenging task for information extraction. Most of the existing systems treat this task as a pipeline of two separate tasks, i.e., named entity recognition (NER) and rule-based negation detection. We consider this as a multi-task problem and present a novel end-to-end neural mode… ▽ More

    Submitted 22 January, 2020; v1 submitted 13 December, 2018; originally announced December 2018.

    Comments: Accepted at the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019)

    Journal ref: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence, Italy, 2019, pp. 954-959

  39. arXiv:1811.12276  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Hospital Mortality Prediction with Medical Named Entities and Multimodal Learning

    Authors: Mengqi Jin, Mohammad Taha Bahadori, Aaron Colak, Parminder Bhatia, Busra Celikkaya, Ram Bhakta, Selvan Senthivel, Mohammed Khalilia, Daniel Navarro, Borui Zhang, Tiberiu Doman, Arun Ravi, Matthieu Liger, Taha Kass-hout

    Abstract: Clinical text provides essential information to estimate the acuity of a patient during hospital stays in addition to structured clinical data. In this study, we explore how clinical text can complement a clinical predictive learning task. We leverage an internal medical natural language processing service to perform named entity extraction and negation detection on clinical notes and compose sele… ▽ More

    Submitted 3 December, 2018; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

  40. arXiv:1702.05512  [pdf, other

    cs.SI cs.CL

    soc2seq: Social Embedding meets Conversation Model

    Authors: Parminder Bhatia, Marsal Gavalda, Arash Einolghozati

    Abstract: While liking or upvoting a post on a mobile app is easy to do, replying with a written note is much more difficult, due to both the cognitive load of coming up with a meaningful response as well as the mechanics of entering the text. Here we present a novel textual reply generation model that goes beyond the current auto-reply and predictive text entry models by taking into account the content pre… ▽ More

    Submitted 27 November, 2017; v1 submitted 17 February, 2017; originally announced February 2017.

  41. arXiv:1608.01056  [pdf, other

    cs.CL

    Morphological Priors for Probabilistic Neural Word Embeddings

    Authors: Parminder Bhatia, Robert Guthrie, Jacob Eisenstein

    Abstract: Word embeddings allow natural language processing systems to share statistical information across related words. These embeddings are typically based on distributional statistics, making it difficult for them to generalize to rare or unseen words. We propose to improve word embeddings by incorporating morphological information, capturing shared sub-word features. Unlike previous work that construc… ▽ More

    Submitted 23 September, 2016; v1 submitted 2 August, 2016; originally announced August 2016.

    Comments: Appeared at the Conference on Empirical Methods in Natural Language Processing (EMNLP 2016, Austin)

  42. arXiv:1509.01599  [pdf, other

    cs.CL cs.AI

    Better Document-level Sentiment Analysis from RST Discourse Parsing

    Authors: Parminder Bhatia, Yangfeng Ji, Jacob Eisenstein

    Abstract: Discourse structure is the hidden link between surface features and document-level properties, such as sentiment polarity. We show that the discourse analyses produced by Rhetorical Structure Theory (RST) parsers can improve document-level sentiment analysis, via composition of local information up the discourse tree. First, we show that reweighting discourse units according to their position in a… ▽ More

    Submitted 11 September, 2015; v1 submitted 4 September, 2015; originally announced September 2015.

    Comments: Published at Empirical Methods in Natural Language Processing (EMNLP 2015)

  43. Framework for Wireless Network Security using Quantum Cryptography

    Authors: Priyanka Bhatia, Ronak Sumbaly

    Abstract: Data that is transient over an unsecured wireless network is always susceptible to being intercepted by anyone within the range of the wireless signal. Hence providing secure communication to keep the user information and devices safe when connected wirelessly has become one of the major concerns. Quantum cryptography provides a solution towards absolute communication security over the network by… ▽ More

    Submitted 8 December, 2014; originally announced December 2014.

    Comments: 17 pages, 11 figures

    Journal ref: International Journal of Computer Networks & Communications (IJCNC) Vol.6, No.6, November 2014

  44. arXiv:1002.4003  [pdf

    cs.DC

    A Cluster-based Approach for Outlier Detection in Dynamic Data Streams (KORM: k-median OutlieR Miner)

    Authors: Parneeta Dhaliwal, M. P. S. Bhatia, Priti Bansal

    Abstract: Outlier detection in data streams has gained wide importance presently due to the increasing cases of fraud in various applications of data streams. The techniques for outlier detection have been divided into either statistics based, distance based, density based or deviation based. Till now, most of the work in the field of fraud detection was distance based but it is incompetent from computati… ▽ More

    Submitted 21 February, 2010; originally announced February 2010.

    Journal ref: Journal of Computing, Volume 2, Issue 2, February 2010, https://sites.google.com/site/journalofcomputing/