
Showing 1–23 of 23 results for author: Tiwary, S

Searching in archive cs.
  1. arXiv:2403.12388  [pdf, other]

    cs.IR cs.AI

    Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models

    Authors: Ying-Chun Lin, Jennifer Neville, Jack W. Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan

    Abstract: Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featur…

    Submitted 8 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  2. arXiv:2402.14301   

    cs.IR cs.LG

    GenSERP: Large Language Models for Whole Page Presentation

    Authors: Zhenning Zhang, Yunan Zhang, Suyu Ge, Guangwei Weng, Mridu Narang, Xia Song, Saurabh Tiwary

    Abstract: The advent of large language models (LLMs) brings an opportunity to minimize the effort in search engine result page (SERP) organization. In this paper, we propose GenSERP, a framework that leverages LLMs with vision in a few-shot setting to dynamically organize intermediate search results, including generated chat answers, website snippets, multimedia data, knowledge panels into a coherent SERP l…

    Submitted 16 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Microsoft corp policy

  3. arXiv:2401.13722  [pdf, other]

    cs.HC cs.AI

    Proactive Emotion Tracker: AI-Driven Continuous Mood and Emotion Monitoring

    Authors: Mohammad Asif, Sudhakar Mishra, Ankush Sonker, Sanidhya Gupta, Somesh Kumar Maurya, Uma Shanker Tiwary

    Abstract: This research project aims to tackle the growing mental health challenges in today's digital age. It employs a modified pre-trained BERT model to detect depressive text within social media and users' web browsing data, achieving an impressive 93% test accuracy. Simultaneously, the project aims to incorporate physiological signals from wearable devices, such as smartwatches and EEG sensors, to prov…

    Submitted 24 January, 2024; originally announced January 2024.

  4. arXiv:2401.07892  [pdf, other]

    cs.HC

    Deep Fuzzy Framework for Emotion Recognition using EEG Signals and Emotion Representation in Type-2 Fuzzy VAD Space

    Authors: Mohammad Asif, Noman Ali, Sudhakar Mishra, Anushka Dandawate, Uma Shanker Tiwary

    Abstract: Recently, the representation of emotions in the Valence, Arousal and Dominance (VAD) space has drawn enough attention. However, the complex nature of emotions and the subjective biases in self-reported values of VAD make the emotion model too specific to a particular experiment. This study aims to develop a generic model representing emotions using a fuzzy VAD space and improve emotion recognition…

    Submitted 15 January, 2024; originally announced January 2024.

  5. Inter Subject Emotion Recognition Using Spatio-Temporal Features From EEG Signal

    Authors: Mohammad Asif, Diya Srivastava, Aditya Gupta, Uma Shanker Tiwary

    Abstract: Inter-subject or subject-independent emotion recognition has been a challenging task in affective computing. This work is about an easy-to-implement emotion recognition model that classifies emotions from EEG signals subject independently. It is based on the famous EEGNet architecture, which is used in EEG-related BCIs. We used the Dataset on Emotion using Naturalistic Stimuli (DENS) dataset. The…

    Submitted 27 May, 2023; originally announced May 2023.

    Report number: 2023 27th International Computer Science and Engineering Conference (ICSEC)

  6. arXiv:2305.14218  [pdf, other]

    cs.CV cs.AI

    DUBLIN -- Document Understanding By Language-Image Network

    Authors: Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Mohammed Khan, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary

    Abstract: Visual document understanding is a complex task that involves analyzing both the text and the visual elements in document images. Existing models often rely on manual feature engineering or domain-specific pipelines, which limit their generalization ability across different document types and languages. In this paper, we propose DUBLIN, which is pretrained on web pages using three novel objectives…

    Submitted 27 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    ACM Class: F.2.2; I.2.7

  7. arXiv:2211.02637  [pdf, other]

    eess.SP cs.AI cs.HC cs.LG

    Emotion Recognition With Temporarily Localized 'Emotional Events' in Naturalistic Context

    Authors: Mohammad Asif, Sudhakar Mishra, Majithia Tejas Vinodbhai, Uma Shanker Tiwary

    Abstract: Emotion recognition using EEG signals is an emerging area of research due to its broad applicability in BCI. Emotional feelings are hard to stimulate in the lab. Emotions do not last long, yet they need enough context to be perceived and felt. However, most EEG-related emotion databases either suffer from emotionally irrelevant details (due to prolonged duration stimulus) or have minimal context d…

    Submitted 25 October, 2022; originally announced November 2022.

  8. arXiv:2204.06644  [pdf, other]

    cs.LG cs.AI cs.CL

    METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals

    Authors: Payal Bajaj, Chenyan Xiong, Guolin Ke, Xiaodong Liu, Di He, Saurabh Tiwary, Tie-Yan Liu, Paul Bennett, Xia Song, Jianfeng Gao

    Abstract: We present an efficient method of pretraining large-scale autoencoding language models using training signals generated by an auxiliary model. Originated in ELECTRA, this training strategy has demonstrated sample-efficiency to pretrain models at the scale of hundreds of millions of parameters. In this work, we conduct a comprehensive empirical study, and propose a recipe, namely "Model generated d…

    Submitted 16 April, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Update details in scaled initialization and add acknowledgement

  9. arXiv:2204.03243  [pdf, other]

    cs.CL cs.LG

    Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

    Authors: Yu Meng, Chenyan Xiong, Payal Bajaj, Saurabh Tiwary, Paul Bennett, Jiawei Han, Xia Song

    Abstract: We present a new framework AMOS that pretrains text encoders with an Adversarial learning curriculum via a Mixture Of Signals from multiple auxiliary generators. Following ELECTRA-style pretraining, the main encoder is trained as a discriminator to detect replaced tokens generated by auxiliary masked language models (MLMs). Different from ELECTRA which trains one MLM as the generator, we jointly t…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: ICLR 2022. (Code and Models: https://github.com/microsoft/AMOS)

  10. arXiv:2201.11990  [pdf, other]

    cs.CL

    Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model

    Authors: Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zhang, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro

    Abstract: Pretrained general-purpose language models can achieve state-of-the-art accuracies in various natural language processing domains by adapting to downstream tasks via zero-shot, few-shot and fine-tuning techniques. Because of their success, the size of these models has increased rapidly, requiring high-performance hardware, software, and algorithmic techniques to enable training such large models.…

    Submitted 4 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Shaden Smith and Mostofa Patwary contributed equally

  11. arXiv:2111.07746  [pdf, other]

    cs.CV

    Real-time Emotion and Gender Classification using Ensemble CNN

    Authors: Abhinav Lahariya, Varsha Singh, Uma Shanker Tiwary

    Abstract: Analysing expressions on the person's face plays a very vital role in identifying emotions and behavior of a person. Recognizing these expressions automatically results in a crucial component of natural human-machine interfaces. Therefore research in this field has a wide range of applications in bio-metric authentication, surveillance systems, emotion to emoticons in various social media platfor…

    Submitted 15 November, 2021; originally announced November 2021.

  12. arXiv:2102.08473  [pdf, other]

    cs.CL cs.LG

    COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining

    Authors: Yu Meng, Chenyan Xiong, Payal Bajaj, Saurabh Tiwary, Paul Bennett, Jiawei Han, Xia Song

    Abstract: We present a self-supervised learning framework, COCO-LM, that pretrains Language Models by COrrecting and COntrasting corrupted text sequences. Following ELECTRA-style pretraining, COCO-LM employs an auxiliary language model to corrupt text sequences, upon which it constructs two new tasks for pretraining the main model. The first token-level task, Corrective Language Modeling, is to detect and c…

    Submitted 26 October, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

    Comments: NeurIPS 2021. (Code and Models: https://github.com/microsoft/COCO-LM)

  13. arXiv:2007.00655  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Knowledge-Aware Language Model Pretraining

    Authors: Corby Rosset, Chenyan Xiong, Minh Phan, Xia Song, Paul Bennett, Saurabh Tiwary

    Abstract: How much knowledge do pretrained language models hold? Recent research observed that pretrained transformers are adept at modeling semantics but it is unclear to what degree they grasp human knowledge, or how to ensure they do so. In this paper we incorporate knowledge-awareness in language model pretraining without changing the transformer architecture, inserting explicit knowledge layers, or add…

    Submitted 4 February, 2021; v1 submitted 29 June, 2020; originally announced July 2020.

  14. arXiv:2006.14320  [pdf, other]

    cs.CL

    Analyzing Effect of Repeated Reading on Oral Fluency and Narrative Production for Computer-Assisted Language Learning

    Authors: Santosh Kumar Barnwal, Uma Shanker Tiwary

    Abstract: Repeated reading (RR) helps learners, who have little to no experience with reading fluently to gain confidence, speed and process words automatically. The benefits of repeated readings include helping all learners with fact recall, aiding identification of learners' main ideas and vocabulary, increasing comprehension, leading to faster reading as well as increasing word recognition accuracy, and…

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: 5 pages, 1 figure

  15. Generic Intent Representation in Web Search

    Authors: Hongfei Zhang, Xia Song, Chenyan Xiong, Corby Rosset, Paul N. Bennett, Nick Craswell, Saurabh Tiwary

    Abstract: This paper presents GEneric iNtent Encoder (GEN Encoder) which learns a distributed representation space for user intent in search. Leveraging large scale user clicks from Bing search logs as weak supervision of user intent, GEN Encoder learns to map queries with shared clicks into similar embeddings end-to-end and then finetunes on multiple paraphrase tasks. Experimental results on an intrinsic e…

    Submitted 24 July, 2019; originally announced July 2019.

    Journal ref: SIGIR 2019: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

  16. arXiv:1904.06808  [pdf, other]

    cs.IR

    An Axiomatic Approach to Regularizing Neural Ranking Models

    Authors: Corby Rosset, Bhaskar Mitra, Chenyan Xiong, Nick Craswell, Xia Song, Saurabh Tiwary

    Abstract: Axiomatic information retrieval (IR) seeks a set of principle properties desirable in IR models. These properties when formally expressed provide guidance in the search for better relevance estimation functions. Neural ranking models typically contain a large number of parameters. The training of these models involves a search for appropriate parameter values based on large quantities of labeled ex…

    Submitted 14 April, 2019; originally announced April 2019.

  17. arXiv:1809.08510  [pdf, other]

    cs.CL cs.LG stat.ML

    Towards Language Agnostic Universal Representations

    Authors: Armen Aghajanyan, Xia Song, Saurabh Tiwary

    Abstract: When a bilingual student learns to solve word problems in math, we expect the student to be able to solve these problems in both languages the student is fluent in, even if the math lessons were only taught in one language. However, current representations in machine learning are language dependent. In this work, we present a method to decouple the language from the problem by learning language agno…

    Submitted 22 September, 2018; originally announced September 2018.

  18. arXiv:1804.04410  [pdf, other]

    cs.IR

    Optimizing Query Evaluations using Reinforcement Learning for Web Search

    Authors: Corby Rosset, Damien Jose, Gargi Ghosh, Bhaskar Mitra, Saurabh Tiwary

    Abstract: In web search, typically a candidate generation step selects a small set of documents---from collections containing as many as billions of web pages---that are subsequently ranked and pruned before being presented to the user. In Bing, the candidate generation involves scanning the index using statically designed match plans that prescribe sequences of different match criteria and stopping conditi…

    Submitted 18 August, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

    Comments: ACM SIGIR 2018 short paper (pre-print)

  19. arXiv:1711.09174  [pdf, other]

    cs.IR

    Neural Ranking Models with Multiple Document Fields

    Authors: Hamed Zamani, Bhaskar Mitra, Xia Song, Nick Craswell, Saurabh Tiwary

    Abstract: Deep neural networks have recently shown promise in the ad-hoc retrieval task. However, such models have often been based on one field of the document, for example considering document title only or document body only. Since in practice documents typically have multiple fields, and given that non-neural ranking models such as BM25F have been developed to take advantage of document structure, this…

    Submitted 24 November, 2017; originally announced November 2017.

    Comments: To Appear in Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM '17)

  20. A Hybrid Approach For Hindi-English Machine Translation

    Authors: Omkar Dhariya, Shrikant Malviya, Uma Shanker Tiwary

    Abstract: In this paper, an extended combined approach of phrase based statistical machine translation (SMT), example based MT (EBMT) and rule based MT (RBMT) is proposed to develop a novel hybrid data driven MT system capable of outperforming the baseline SMT, EBMT and RBMT systems from which it is derived. In short, the proposed hybrid MT process is guided by the rule based MT after getting a set of parti…

    Submitted 6 February, 2017; originally announced February 2017.

    Comments: 31st International Conference on Information Networking (ICOIN-2017)

  21. arXiv:1701.08655  [pdf]

    cs.CL

    Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus

    Authors: Shrikant Malviya, Rohit Mishra, Uma Shanker Tiwary

    Abstract: Automatic speech recognition (ASR) and Text to speech (TTS) are two prominent areas of research in human computer interaction nowadays. A set of phonetically rich sentences is a matter of importance in order to develop these two interactive modules of HCI. Essentially, the set of phonetically rich sentences has to cover all possible phone units distributed uniformly. Selecting such a set from a…

    Submitted 6 February, 2017; v1 submitted 30 January, 2017; originally announced January 2017.

    Comments: 19th Coordination and Standardization of Speech Databases and Assessment Technique (O-COCOSDA) at Bali, Indonesia

  22. arXiv:1611.09268  [pdf, other]

    cs.CL cs.IR

    MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

    Authors: Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Andrew McNamara, Bhaskar Mitra, Tri Nguyen, Mir Rosenberg, Xia Song, Alina Stoica, Saurabh Tiwary, Tong Wang

    Abstract: We introduce a large scale MAchine Reading COmprehension dataset, which we name MS MARCO. The dataset comprises 1,010,916 anonymized questions---sampled from Bing's search query logs---each with a human generated answer and 182,669 completely human rewritten generated answers. In addition, the dataset contains 8,841,823 passages---extracted from 3,563,535 web documents retrieved by Bing---that…

    Submitted 31 October, 2018; v1 submitted 28 November, 2016; originally announced November 2016.

  23. arXiv:1507.05398  [pdf, other]

    cs.IT cs.DC

    Generating Binary Optimal Codes Using Heterogeneous Parallel Computing

    Authors: Srajan Paliwal, Saurabh Tiwary, Bhaskar Chaudhury, Manish K. Gupta

    Abstract: Generation of optimal codes is a well known problem in coding theory. Many computational approaches exist in the literature for finding record breaking codes. However generating codes with long lengths $n$ using serial algorithms is computationally very expensive, for example the worst case time complexity of a Greedy algorithm is $\mathcal{O}(n\; 4^n)$. In order to improve the efficiency of gener…

    Submitted 20 July, 2015; originally announced July 2015.

    Comments: 8 pages, draft