default search action
Monojit Choudhury
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j15]Ishan Tarunesh, Somak Aditya, Monojit Choudhury:
LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI. Lang. Resour. Evaluation 58(2): 427-458 (2024) - [c110]Navreet Kaur, Monojit Choudhury, Danish Pruthi:
Evaluating Large Language Models for Health-related Queries with Presuppositions. ACL (Findings) 2024: 14308-14331 - [c109]Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury:
Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language We Prompt Them in. LREC/COLING 2024: 6330-6340 - [c108]Harshita Diddee, Anurag Shukla, Tanuja Ganu, Vivek Seshadri, Sandipan Dandapat, Monojit Choudhury, Kalika Bali:
INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation. LREC/COLING 2024: 9097-9109 - [c107]Abhinav Rao, Atharva Naik, Sachin Vashistha, Somak Aditya, Monojit Choudhury:
Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks. LREC/COLING 2024: 16802-16830 - [c106]Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, Sunayana Sitaram:
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? EACL (Findings) 2024: 1051-1070 - [c105]Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury:
Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test. EACL (1) 2024: 2882-2894 - [c104]Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury:
Towards Measuring and Modeling "Culture" in LLMs: A Survey. EMNLP 2024: 15763-15784 - [c103]Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury:
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting. EMNLP 2024: 15811-15837 - [c102]Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury:
The Zeno's Paradox of 'Low-Resource' Languages. EMNLP 2024: 17753-17774 - [c101]Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra:
"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations. EMNLP 2024: 20339-20369 - [i51]Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury:
Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test. CoRR abs/2402.02135 (2024) - [i50]Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Ashutosh Dwivedi, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury:
Towards Measuring and Modeling "Culture" in LLMs: A Survey. CoRR abs/2403.15412 (2024) - [i49]Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury:
Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in. CoRR abs/2404.18460 (2024) - [i48]Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra:
"They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations. CoRR abs/2405.05378 (2024) - [i47]Prashant Kodali, Anmol Goel, Likhith Asapu, Vamshi Krishna Bonagiri, Anirudh Govil, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru:
From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences. CoRR abs/2405.05572 (2024) - [i46]Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gaël de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam:
Benchmark Underestimates the Readiness of Multi-lingual Dialogue Agents. CoRR abs/2405.17840 (2024) - [i45]Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury:
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting. CoRR abs/2406.11661 (2024) - [i44]Abhinav Rao, Monojit Choudhury, Somak Aditya:
[WIP] Jailbreak Paradox: The Achilles' Heel of LLMs. CoRR abs/2406.12702 (2024) - [i43]Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury:
The Zeno's Paradox of 'Low-Resource' Languages. CoRR abs/2410.20817 (2024) - 2023
- [c100]Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam:
X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents. ACL (Findings) 2023: 2773-2794 - [c99]Sunayana Sitaram, Monojit Choudhury, Barun Patra, Vishrav Chaudhary, Kabir Ahuja, Kalika Bali:
Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world. ACL (tutorial) 2023: 21-26 - [c98]Dipto Das, Parboti Roy, Carlos Toxtli, Kagonya Awori, Morgan Vigil-Hayes, Monojit Choudhury, Neha Kumar, Syed Ishtiaque Ahmed, Bryan C. Semaan:
Conceptualizing Indigeneity in Social Computing. CSCW Companion 2023: 501-505 - [c97]Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer. EACL 2023: 385-406 - [c96]Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury:
Fairness in Language Models Beyond English: Gaps and Challenges. EACL (Findings) 2023: 2061-2074 - [c95]Aniket Vashishtha, S. Sai Prasad, Payal Bajaj, Vishrav Chaudhary, Kate Cook, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Performance and Risk Trade-offs for Multi-word Text Prediction at Scale. EACL (Findings) 2023: 2181-2197 - [c94]Chenxi Whitehouse, Monojit Choudhury, Alham Fikri Aji:
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance. EMNLP 2023: 671-686 - [c93]Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN: Visual Document Understanding By Language-Image Network. EMNLP (Industry Track) 2023: 693-706 - [c92]Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury:
Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs. EMNLP (Findings) 2023: 13370-13388 - [c91]Deepanway Ghosal, Somak Aditya, Monojit Choudhury:
Prover: Generating Intermediate Steps for NLI with Commonsense Knowledge Retrieval and Next-Step Prediction. IJCNLP (1) 2023: 872-884 - [e2]Melissa Densmore, Monojit Choudhury, Josiah Chavula:
Proceedings of the 6th ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies, COMPASS 2023, Cape Town, South Africa, August 16-19, 2023. ACM 2023 [contents] - [i42]Krithika Ramesh, Sunayana Sitaram, Monojit Choudhury:
Fairness in Language Models Beyond English: Gaps and Challenges. CoRR abs/2302.12578 (2023) - [i41]Shanu Kumar, Abbaraju Soujanya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
DiTTO: A Feature Representation Imitation Approach for Improving Cross-Lingual Transfer. CoRR abs/2303.02357 (2023) - [i40]Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:
DUBLIN - Document Understanding By Language-Image Network. CoRR abs/2305.14218 (2023) - [i39]Chenxi Whitehouse, Monojit Choudhury, Alham Fikri Aji:
LLM-powered Data Augmentation for Enhanced Crosslingual Performance. CoRR abs/2305.14288 (2023) - [i38]Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury:
Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks. CoRR abs/2305.14965 (2023) - [i37]Mehrad Moradshahi, Tianhao Shen, Kalika Bali, Monojit Choudhury, Gaël de Chalendar, Anmol Goel, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Nasredine Semmar, Sina J. Semnani, Jiwon Seo, Vivek Seshadri, Manish Shrivastava, Michael Sun, Aditya Yadavalli, Chaobin You, Deyi Xiong, Monica S. Lam:
X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents. CoRR abs/2306.17674 (2023) - [i36]Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, Sunayana Sitaram:
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? CoRR abs/2309.07462 (2023) - [i35]Kumar Tanmay, Aditi Khandelwal, Utkarsh Agarwal, Monojit Choudhury:
Probing the Moral Development of Large Language Models through Defining Issues Test. CoRR abs/2309.13356 (2023) - [i34]Abhinav Rao, Aditi Khandelwal, Kumar Tanmay, Utkarsh Agarwal, Monojit Choudhury:
Ethical Reasoning over Moral Alignment: A Case and Framework for In-Context Ethical Policies in LLMs. CoRR abs/2310.07251 (2023) - [i33]Navreet Kaur, Monojit Choudhury, Danish Pruthi:
Evaluating Large Language Models for Health-related Queries with Presuppositions. CoRR abs/2312.08800 (2023) - 2022
- [c90]Anirudh Srinivasan, Gauri Kholkar, Rahul Kejriwal, Tanuja Ganu, Sandipan Dandapat, Sunayana Sitaram, Balakrishnan Santhanam, Somak Aditya, Kalika Bali, Monojit Choudhury:
LITMUS Predictor: An AI Assistant for Building Reliable, High-Performing and Fair Multilingual NLP Systems. AAAI 2022: 13227-13229 - [c89]Prashant Kodali, Anmol Goel, Monojit Choudhury, Manish Shrivastava, Ponnurangam Kumaraguru:
SyMCoM - Syntactic Measure of Code Mixing A Study Of English-Hindi Code-Mixing. ACL (Findings) 2022: 472-480 - [c88]Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models. ACL (1) 2022: 5454-5467 - [c87]Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O'Neill, Kalika Bali, Monojit Choudhury:
Global Readiness of Language Technology for Healthcare: What Would It Take to Combat the Next Pandemic? COLING 2022: 4320-4335 - [c86]Harshita Diddee, Kalika Bali, Monojit Choudhury, Namrata Mukhija:
The Six Conundrums of Building and Deploying Language Technologies for Social Good. COMPASS 2022: 12-19 - [c85]Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury:
On the Calibration of Massively Multilingual Language Models. EMNLP 2022: 4310-4323 - [c84]Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Multilingual CheckList: Generation and Evaluation. AACL/IJCNLP (Findings) 2022: 282-295 - [c83]Deepanway Ghosal, Somak Aditya, Sandipan Dandapat, Monojit Choudhury:
Vector Space Interpolation for Query Expansion. AACL/IJCNLP (2) 2022: 405-410 - [c82]Ishani Mondal, Kalika Bali, Mohit Jain, Monojit Choudhury, Jacki O'Neill, Millicent Ochieng, Kagonya Awori, Keshet Ronen:
Language Patterns and Behaviour of the Peer Supporters in Multilingual Healthcare Conversational Forums. LREC 2022: 963-975 - [c81]Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer. NAACL-HLT (Findings) 2022: 1042-1055 - [c80]Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat:
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. NAACL-HLT 2022: 1369-1384 - [c79]Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali:
Too Brittle to Touch: Comparing the Stability of Quantization and Distillation towards Developing Low-Resource MT Models. WMT 2022: 870-885 - [i32]Shamsuddeen Hassan Muhammad, David Ifeoluwa Adelani, Sebastian Ruder, Ibrahim Said Ahmad, Idris Abdulmumin, Bello Shehu Bello, Monojit Choudhury, Chris Chinenye Emezue, Saheed Abdullahi Salahudeen, Aremu Anuoluwapo, Alípio Jeorge, Pavel Brazdil:
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis. CoRR abs/2201.08277 (2022) - [i31]Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Multilingual CheckList: Generation and Evaluation. CoRR abs/2203.12865 (2022) - [i30]Ishani Mondal, Kabir Ahuja, Mohit Jain, Jacki O. Neil, Kalika Bali, Monojit Choudhury:
Global Readiness of Language Technology for Healthcare: What would it Take to Combat the Next Pandemic? CoRR abs/2204.02790 (2022) - [i29]Kabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
Multi Task Learning For Zero Shot Performance Prediction of Multilingual Models. CoRR abs/2205.06130 (2022) - [i28]Kabir Ahuja, Monojit Choudhury, Sandipan Dandapat:
On the Economics of Multilingual Few-shot Learning: Modeling the Cost-Performance Trade-offs of Machine Translated and Manual Data. CoRR abs/2205.06350 (2022) - [i27]Kabir Ahuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
Beyond Static Models and Test Sets: Benchmarking the Potential of Pre-trained Models Across Tasks and Languages. CoRR abs/2205.06356 (2022) - [i26]Shanu Kumar, Sandipan Dandapat, Monojit Choudhury:
"Diversity and Uncertainty in Moderation" are the Key to Data Selection for Multilingual Few-shot Transfer. CoRR abs/2206.15010 (2022) - [i25]Deepanway Ghosal, Somak Aditya, Monojit Choudhury:
Generating Intermediate Steps for NLI with Next-Step Supervision. CoRR abs/2208.14641 (2022) - [i24]Kabir Ahuja, Sunayana Sitaram, Sandipan Dandapat, Monojit Choudhury:
On the Calibration of Massively Multilingual Language Models. CoRR abs/2210.12265 (2022) - [i23]Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, Tanuja Ganu, Kalika Bali:
Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models. CoRR abs/2210.15184 (2022) - 2021
- [c78]Monojit Choudhury, Amit Deshpande:
How Linguistically Fair Are Multilingual Pre-Trained Language Models? AAAI 2021: 12710-12718 - [c77]Sebastin Santy, Anku Rani, Monojit Choudhury:
Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices. ACL/IJCNLP (Findings) 2021: 4704-4710 - [c76]Adithya Pratapa, Monojit Choudhury:
Comparing Grammatical Theories of Code-Mixing. W-NUT 2021: 158-167 - [c75]Sebastin Santy, Kalika Bali, Monojit Choudhury, Sandipan Dandapat, Tanuja Ganu, Anurag Shukla, Jahanvi Shah, Vivek Seshadri:
Language Translation as a Socio-Technical System: Case-Studies of Mixed-Initiative Interactions. COMPASS 2021: 156-172 - [c74]Mohd Sanad Zaki Rizvi, Anirudh Srinivasan, Tanuja Ganu, Monojit Choudhury, Sunayana Sitaram:
GCM: A Toolkit for Generating Synthetic Code-mixed Text. EACL (System Demonstrations) 2021: 205-211 - [c73]Shaily Bhatt, Poonam Goyal, Sandipan Dandapat, Monojit Choudhury, Sunayana Sitaram:
On the Universality of Deep Contextual Language Models. ICON 2021: 106-119 - [c72]Saujas Vaduguru, Partho Sarthi, Monojit Choudhury, Dipti Sharma:
Stress Rules from Surface Forms: Experiments with Program Synthesis. ICON 2021: 619-628 - [c71]Amar Budhiraja, Ankur Sharma, Rahul Agrawal, Monojit Choudhury, Joyojeet Pal:
American Politicians Diverge Systematically, Indian Politicians do so Chaotically: Text Embeddings as a Window into Party Polarization. ICWSM 2021: 1054-1058 - [i22]Sebastin Santy, Anku Rani, Monojit Choudhury:
Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices. CoRR abs/2106.01105 (2021) - [i21]Saujas Vaduguru, Aalok Sathe, Monojit Choudhury, Dipti Misra Sharma:
Sample-efficient Linguistic Generalizations through Program Synthesis: Experiments with Phonology Problems. CoRR abs/2106.06566 (2021) - [i20]Ishan Tarunesh, Somak Aditya, Monojit Choudhury:
Trusting RoBERTa over BERT: Insights from CheckListing the Natural Language Inference Task. CoRR abs/2107.07229 (2021) - [i19]Shaily Bhatt, Poonam Goyal, Sandipan Dandapat, Monojit Choudhury, Sunayana Sitaram:
On the Universality of Deep COntextual Language Models. CoRR abs/2109.07140 (2021) - [i18]Karthikeyan K, Aalok Sathe, Somak Aditya, Monojit Choudhury:
Analyzing the Effects of Reasoning Types on Cross-Lingual Transfer Performance. CoRR abs/2110.02386 (2021) - [i17]Namrata Mukhija, Monojit Choudhury, Kalika Bali:
Designing Language Technologies for Social Good: The Road not Taken. CoRR abs/2110.07444 (2021) - [i16]Anirudh Srinivasan, Sunayana Sitaram, Tanuja Ganu, Sandipan Dandapat, Kalika Bali, Monojit Choudhury:
Predicting the Performance of Multilingual NLP Models. CoRR abs/2110.08875 (2021) - [i15]Ishan Tarunesh, Somak Aditya, Monojit Choudhury:
LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI. CoRR abs/2112.02333 (2021) - 2020
- [j14]Anshul Bawa, Pranav Khadpe, Pratik Joshi, Kalika Bali, Monojit Choudhury:
Do Multilingual Users Prefer Chat-bots that Code-mix? Let's Nudge and Find Out! Proc. ACM Hum. Comput. Interact. 4(CSCW): 041:1-041:23 (2020) - [j13]Anmol Panda, Ramaravind Kommiya Mothilal, Monojit Choudhury, Kalika Bali, Joyojeet Pal:
Topical Focus of Political Campaigns and its Impact: Findings from Politicians' Hashtag Use during the 2019 Indian Elections. Proc. ACM Hum. Comput. Interact. 4(CSCW): 053:1-053:14 (2020) - [j12]Somnath Banerjee, Monojit Choudhury, Kunal Chakma, Sudip Kumar Naskar, Amitava Das, Sivaji Bandyopadhyay, Paolo Rosso:
MSIR@FIRE: A Comprehensive Report from 2013 to 2016. SN Comput. Sci. 1(1): 55 (2020) - [c70]Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury:
GLUECoS: An Evaluation Benchmark for Code-Switched NLP. ACL 2020: 3575-3585 - [c69]Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali, Monojit Choudhury:
The State and Fate of Linguistic Diversity and Inclusion in the NLP World. ACL 2020: 6282-6293 - [c68]Simran Khanuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
A New Dataset for Natural Language Inference from Code-mixed Conversations. CodeSwitch@LREC 2020: 9-16 - [c67]Abhishek Srivastava, Kalika Bali, Monojit Choudhury:
Understanding Script-Mixing: A Case Study of Hindi-English Bilingual Twitter Users. CodeSwitch@LREC 2020: 36-44 - [c66]Anirudh Srinivasan, Sandipan Dandapat, Monojit Choudhury:
Code-mixed parse trees and how to find them. CodeSwitch@LREC 2020: 57-64 - [c65]Pratik Joshi, Somak Aditya, Aalok Sathe, Monojit Choudhury:
TaxiNLI: Taking a Ride up the NLU Hill. CoNLL 2020: 41-55 - [c64]Ashish Sharma, Monojit Choudhury, Tim Althoff, Amit Sharma:
Engagement Patterns of Peer-to-Peer Interactions on Mental Health Platforms. ICWSM 2020: 614-625 - [c63]Basil Abraham, Danish Goel, Divya Siddarth, Kalika Bali, Manu Chopra, Monojit Choudhury, Pratik Joshi, Preethi Jyothi, Sunayana Sitaram, Vivek Seshadri:
Crowdsourcing Speech Data for Low-Resource Languages from Low-Income Workers. LREC 2020: 2819-2826 - [e1]Thamar Solorio, Monojit Choudhury, Kalika Bali, Sunayana Sitaram, Amitava Das, Mona T. Diab:
Proceedings of the The 4th Workshop on Computational Approaches to Code Switching, CodeSwitch@LREC 2020, Marseille, France, May, 2020. European Language Resources Association 2020, ISBN 979-10-95546-66-5 [contents] - [i14]Ashish Sharma, Monojit Choudhury, Tim Althoff, Amit Sharma:
Engagement Patterns of Peer-to-Peer Interactions on Mental Health Platforms. CoRR abs/2004.04999 (2020) - [i13]Simran Khanuja, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury:
A New Dataset for Natural Language Inference from Code-mixed Conversations. CoRR abs/2004.05051 (2020) - [i12]Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali, Monojit Choudhury:
The State and Fate of Linguistic Diversity and Inclusion in the NLP World. CoRR abs/2004.09095 (2020) - [i11]Simran Khanuja, Sandipan Dandapat, Anirudh Srinivasan, Sunayana Sitaram, Monojit Choudhury:
GLUECoS : An Evaluation Benchmark for Code-Switched NLP. CoRR abs/2004.12376 (2020) - [i10]Pratik Joshi, Somak Aditya, Aalok Sathe, Monojit Choudhury:
TaxiNLI: Taking a Ride up the NLU Hill. CoRR abs/2009.14505 (2020)
2010 – 2019
- 2019
- [j11]Koustav Rudra, Ashish Sharma, Kalika Bali, Monojit Choudhury, Niloy Ganguly:
Identifying and Analyzing Different Aspects of English-Hindi Code-Switching in Twitter. ACM Trans. Asian Low Resour. Lang. Inf. Process. 18(3): 29:1-29:28 (2019) - [c62]Monojit Choudhury, Anirudh Srinivasan, Sandipan Dandapat:
Processing and Understanding Mixed Language Data. EMNLP/IJCNLP (2) 2019 - [c61]Sebastin Santy, Sandipan Dandapat, Monojit Choudhury, Kalika Bali:
INMT: Interactive Neural Machine Translation Prediction. EMNLP/IJCNLP (3) 2019: 103-108 - [c60]Jasabanta Patro, Sabyasachee Baruah, Vivek Gupta, Monojit Choudhury, Pawan Goyal, Animesh Mukherjee:
Characterizing the Spread of Exaggerated Health News Content over Social Media. HT 2019: 279-280 - [i9]Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan, Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury, Kalika Bali:
Unsung Challenges of Building and Deploying Language Technologies for Low Resource Language Communities. CoRR abs/1912.03457 (2019) - 2018
- [c59]Adithya Pratapa, Gayatri Bhat, Monojit Choudhury, Sunayana Sitaram, Sandipan Dandapat, Kalika Bali:
Language Modeling for Code-Mixing: The Role of Linguistic Theory based Synthetic Data. ACL (1) 2018: 1543-1553 - [c58]Sunit Sivasankaran, Brij Mohan Lal Srivastava, Sunayana Sitaram, Kalika Bali, Monojit Choudhury:
Phone Merging For Code-Switched Speech Recognition. CodeSwitch@ACL 2018: 11-19 - [c57]Anshul Bawa, Monojit Choudhury, Kalika Bali:
Accommodation of Conversational Code-Choice. CodeSwitch@ACL 2018: 82-91 - [c56]Adithya Pratapa, Monojit Choudhury, Sunayana Sitaram:
Word Embeddings for Code-Mixed Language Processing. EMNLP 2018: 3067-3072 - [c55]Anshul Bawa, Monojit Choudhury, Kalika Bali:
User Perception of Code-Switching Dialog Systems. ICON 2018: 166-174 - [c54]Silvana Hartmann, Monojit Choudhury, Kalika Bali:
An Integrated Representation of Linguistic and Social Functions of Code-Switching. LREC 2018 - [c53]Sunayana Sitaram, Varun Manjunath, Varun Bharadwaj, Monojit Choudhury, Kalika Bali, Michael Tjalve:
Discovering Canonical Indian English Accents: A Crowdsourcing-based Approach. LREC 2018 - [i8]Jasabanta Patro, Sabyasachee Baruah, Vivek Gupta, Monojit Choudhury, Pawan Goyal, Animesh Mukherjee:
Characterizing the spread of exaggerated news content over social media. CoRR abs/1811.07853 (2018) - 2017
- [c52]Shruti Rijhwani, Royal Sequiera, Monojit Choudhury, Kalika Bali, Chandra Shekhar Maddila:
Estimating Code-Switching on Twitter with a Novel Generalized Word-Level Language Detection Technique. ACL (1) 2017: 1971-1982 - [c51]Prabhat Agarwal, Ashish Sharma, Jeenu Grover, Mayank Sikka, Koustav Rudra, Monojit Choudhury:
I may talk in English but gaali toh Hindi mein hi denge : A study of English-Hindi code-switching and swearing pattern on social networks. COMSNETS 2017: 554-557 - [c50]Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee:
All that is English may be Hindi: Enhancing language identification through automatic ranking of the likeliness of word borrowing in social media. EMNLP 2017: 2264-2274 - [c49]Moumita Basu, Saptarshi Ghosh, Kripabandhu Ghosh, Monojit Choudhury:
Overview of the FIRE 2017 track: Information Retrieval from Microblogs during Disasters (IRMiDis). FIRE (Working Notes) 2017: 28-33 - [c48]Monojit Choudhury, Kalika Bali, Sunayana Sitaram, Ashutosh Baheti:
Curriculum Design for Code-switching: Experiments with Language Identification and Language Modeling with Deep Neural Networks. ICON 2017: 65-74 - [c47]Adithya Pratapa, Monojit Choudhury:
Quantitative Characterization of Code Switching Patterns in Complex Multi-Party Conversations: A Case Study on Hindi Movie Scripts. ICON 2017: 75-84 - [i7]Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee:
Is this word borrowed? An automatic approach to quantify the likeliness of borrowing in social media. CoRR abs/1703.05122 (2017) - [i6]Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee:
All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media. CoRR abs/1707.08446 (2017) - 2016
- [j10]Rishiraj Saha Roy, Smith Agarwal, Niloy Ganguly, Monojit Choudhury:
Syntactic complexity of Web search queries through the lenses of language models, networks and users. Inf. Process. Manag. 52(5): 923-948 (2016) - [c46]Rishiraj Saha Roy, Anusha Suresh, Niloy Ganguly, Monojit Choudhury:
Improving Document Ranking for Long Queries with Nested Query Segmentation. ECIR 2016: 775-781 - [c45]Koustav Rudra, Shruti Rijhwani, Rafiya Begum, Kalika Bali, Monojit Choudhury, Niloy Ganguly:
Understanding Language Preference for Expression of Opinion and Sentiment: What do Hindi-English Speakers do on Twitter? EMNLP 2016: 1131-1141 - [c44]Somnath Banerjee, Kunal Chakma, Sudip Kumar Naskar, Amitava Das, Paolo Rosso, Sivaji Bandyopadhyay, Monojit Choudhury:
Overview of the Mixed Script Information Retrieval (MSIR) at FIRE-2016. FIRE Workshop 2016: 39-49 - [c43]Somnath Banerjee, Kunal Chakma, Sudip Kumar Naskar, Amitava Das, Paolo Rosso, Sivaji Bandyopadhyay, Monojit Choudhury:
Overview of the Mixed Script Information Retrieval (MSIR) at FIRE-2016. FIRE (Working Notes) 2016: 94-99 - [c42]Rafiya Begum, Kalika Bali, Monojit Choudhury, Koustav Rudra, Niloy Ganguly:
Functions of Code-Switching in Tweets: An Annotation Framework and Some Initial Experiments. LREC 2016 - [i5]Gayatri Bhat, Monojit Choudhury, Kalika Bali:
Grammatical Constraints on Intra-sentential Code-Switching: From Theories to Working Models. CoRR abs/1612.04538 (2016) - 2015
- [j9]Rishiraj Saha Roy, Rahul Katare, Niloy Ganguly, Srivatsan Laxman, Monojit Choudhury:
Discovering and understanding word level user intent in Web search queries. J. Web Semant. 30: 22-38 (2015) - [c41]Royal Sequiera, Monojit Choudhury, Parth Gupta, Paolo Rosso, Shubham Kumar, Somnath Banerjee, Sudip Kumar Naskar, Sivaji Bandyopadhyay, Gokul Chittaranjan, Amitava Das, Kunal Chakma:
Overview of FIRE-2015 Shared Task on Mixed Script Information Retrieval. FIRE Workshops 2015: 19-25 - [c40]Royal Sequiera, Monojit Choudhury, Kalika Bali:
POS Tagging of Hindi-English Code Mixed Text from Social Media: Some Machine Learning Experiments. ICON 2015: 237-246 - 2014
- [c39]Gokul Chittaranjan, Yogarshi Vyas, Kalika Bali, Monojit Choudhury:
Word-level Language Identification using CRF: Code-switching Shared Task Report of MSR India System. CodeSwitch@EMNLP 2014: 73-79 - [c38]Kalika Bali, Jatin Sharma, Monojit Choudhury, Yogarshi Vyas:
"I am borrowing ya mixing ?" An Analysis of English-Hindi Code Mixing in Facebook. CodeSwitch@EMNLP 2014: 116-126 - [c37]Rishiraj Saha Roy, Rahul Katare, Niloy Ganguly, Monojit Choudhury:
Automatic Discovery of Adposition Typology. COLING 2014: 1037-1046 - [c36]Yogarshi Vyas, Spandana Gella, Jatin Sharma, Kalika Bali, Monojit Choudhury:
POS Tagging of English-Hindi Code-Mixed Social Media Content. EMNLP 2014: 974-979 - [c35]Sharath Reddy Gunamgari, Sandipan Dandapat, Monojit Choudhury:
Hierarchical Recursive Tagset for Annotating Cooking Recipes. ICON 2014: 353-361 - [c34]Spandana Gella, Kalika Bali, Monojit Choudhury:
"ye word kis lang ka hai bhai?" Testing the Limits of Word level Language Identification. ICON 2014: 368-377 - [c33]Parth Gupta, Kalika Bali, Rafael E. Banchs, Monojit Choudhury, Paolo Rosso:
Query expansion for mixed-script information retrieval. SIGIR 2014: 677-686 - [c32]Rishiraj Saha Roy, Yogarshi Vyas, Niloy Ganguly, Monojit Choudhury:
Improving unsupervised query segmentation using parts-of-speech sequence information. SIGIR 2014: 935-938 - 2013
- [c31]Rohan Ramanath, Monojit Choudhury, Kalika Bali, Rishiraj Saha Roy:
Crowd Prefers the Middle Path: A New IAA Metric for Crowdsourcing Reveals Turker Biases in Query Segmentation. ACL (1) 2013: 1713-1722 - [c30]Rohan Ramanath, Monojit Choudhury, Kalika Bali:
Entailment: An Effective Metric for Comparing and Evaluating Hierarchical and Non-hierarchical Annotation Schemes. LAW@ACL 2013: 42-50 - [c29]Rishiraj Saha Roy, Monojit Choudhury, Prasenjit Majumder, Komal Agarwal:
Overview of the FIRE 2013 Track on Transliterated Search. FIRE 2013: 4:1-4:7 - [c28]Monojit Choudhury, Ranjita Bhagwan, Kalika Bali:
The Use Of Melodic Scales In Bollywood Music: An Empirical Study. ISMIR 2013: 59-64 - [c27]Sai Sumanth Miryala, Kalika Bali, Ranjita Bhagwan, Monojit Choudhury:
Automatically Identifying Vocal Expressions for Music Transcription. ISMIR 2013: 239-244 - [c26]Rishiraj Saha Roy, Anusha Suresh, Niloy Ganguly, Monojit Choudhury:
Place value: word position shifts vital to search dynamics. WWW (Companion Volume) 2013: 153-154 - [p1]Animesh Mukherjee, Monojit Choudhury, Niloy Ganguly, Anupam Basu:
Language Dynamics in the Framework of Complex Networks: A Case Study on Self-Organization of the Consonant Inventories. Cognitive Aspects of Computational Language Acquisition 2013: 51-78 - 2012
- [c25]Umair Z. Ahmed, Arpit Kumar, Monojit Choudhury, Kalika Bali:
Can Modern Statistical Parsers Lead to Better Natural Language Understanding for Education? CICLing (1) 2012: 415-427 - [c24]Kanika Gupta, Monojit Choudhury, Kalika Bali:
Mining Hindi-English Transliteration Pairs from Online Hindi Lyrics. LREC 2012: 2459-2465 - [c23]K. Saravanan, Monojit Choudhury, Raghavendra Udupa, A. Kumaran:
An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora. LREC 2012: 3118-3125 - [c22]Rishiraj Saha Roy, Niloy Ganguly, Monojit Choudhury, Srivatsan Laxman:
An IR-based evaluation framework for web search query segmentation. SIGIR 2012: 881-890 - 2011
- [j8]Animesh Mukherjee, Monojit Choudhury, Samer Hassan, Smaranda Muresan:
Network based models of cognitive and social dynamics of human languages. Comput. Speech Lang. 25(3): 635-638 (2011) - [c21]Umair Z. Ahmed, Kalika Bali, Monojit Choudhury, Sowmya V. B.:
Challenges in Designing Input Method Editors for Indian Lan-guages: The Role of Word-Origin and Context. WTIM@IJCNLP 2011: 1-9 - [c20]Nitin Dua, Kanika Gupta, Monojit Choudhury, Kalika Bali:
Query completion without query logs for song search. WWW (Companion Volume) 2011: 31-32 - [c19]Nikita Mishra, Rishiraj Saha Roy, Niloy Ganguly, Srivatsan Laxman, Monojit Choudhury:
Unsupervised query segmentation using only query logs. WWW (Companion Volume) 2011: 91-92 - [i4]Rishiraj Saha Roy, Niloy Ganguly, Monojit Choudhury, Srivatsan Laxman:
An IR-based Evaluation Framework for Web Search Query Segmentation. CoRR abs/1111.1497 (2011) - 2010
- [j7]Animesh Mukherjee, Monojit Choudhury, Anupam Basu, Niloy Ganguly:
Modelling the Redundancy of Human Speech Sound Inventories: An Information Theoretic Approach. J. Quant. Linguistics 17(4): 317-343 (2010) - [c18]Monojit Choudhury, Diptesh Chatterjee, Animesh Mukherjee:
Global topology of word co-occurrence networks: Beyond the two-regime power-law. COLING (Posters) 2010: 162-170 - [c17]Sowmya V. B., Monojit Choudhury, Kalika Bali, Tirthankar Dasgupta, Anupam Basu:
Resource Creation for Training and Testing of Transliteration Systems for Indian Languages. LREC 2010
2000 – 2009
- 2009
- [j6]Animesh Mukherjee, Monojit Choudhury, Anupam Basu, Niloy Ganguly:
Self-organization of the Sound Inventories: Analysis and Synthesis of the Occurrence and Co-occurrence Networks of Consonants. J. Quant. Linguistics 16(2): 157-184 (2009) - [c16]Chris Biemann, Monojit Choudhury, Animesh Mukherjee:
Syntax is from Mars while Semantics from Venus! Insights from Spectral Analysis of Distributional Similarity Networks. ACL/IJCNLP (2) 2009: 245-248 - [c15]Sandipan Dandapat, Priyanka Biswas, Monojit Choudhury, Kalika Bali:
Complex Linguistic Annotation - No Easy Way Out! A Case from Bangla and Hindi POS Labeling Tasks. Linguistic Annotation Workshop 2009: 10-18 - [c14]Cohan Sujay Carlos, Monojit Choudhury, Sandipan Dandapat:
Large-Coverage Root Lexicon Extraction for Hindi. EACL 2009: 121-129 - [c13]Animesh Mukherjee, Monojit Choudhury, Ravi Kannan:
Discovering Global Patterns in Linguistic Networks through Spectral Analysis: A Case Study of the Consonant Inventories. EACL 2009: 585-593 - [i3]Animesh Mukherjee, Monojit Choudhury, Ravi Kannan:
Discovering Global Patterns in Linguistic Networks through Spectral Analysis: A Case Study of the Consonant Inventories. CoRR abs/0901.2216 (2009) - [i2]Monojit Choudhury, Animesh Mukherjee, Anupam Basu, Niloy Ganguly, Ashish Garg, Vaibhav Jalan:
Language Diversity across the Consonant Inventories: A Study in the Framework of Complex Networks. CoRR abs/0904.1289 (2009) - [i1]Chris Biemann, Monojit Choudhury, Animesh Mukherjee:
Syntax is from Mars while Semantics from Venus! Insights from Spectral Analysis of Distributional Similarity Networks. CoRR abs/0906.1467 (2009) - 2008
- [j5]Animesh Mukherjee, Monojit Choudhury, Anupam Basu, Niloy Ganguly, Shamik Roy Chowdhury:
Rediscovering the Co-Occurrence Principles of vowel inventories: a Complex Network Approach. Adv. Complex Syst. 11(3): 371-392 (2008) - [j4]Abhishek B. Sharma, Ranjita Bhagwan, Monojit Choudhury, Leana Golubchik, Ramesh Govindan, Geoffrey M. Voelker:
Automatic request categorization in internet services. SIGMETRICS Perform. Evaluation Rev. 36(2): 16-25 (2008) - [c12]Animesh Mukherjee, Monojit Choudhury, Anupam Basu, Niloy Ganguly:
Modeling the Structure and Dynamics of the Consonant Inventories: A Complex Network Approach. COLING 2008: 601-608 - [c11]Monojit Choudhury:
Invited Talk: Breaking the Zipfian Barrier of NLP. IJCNLP 2008: 5 - [c10]Monojit Choudhury, Animesh Mukherjee, Niloy Ganguly:
Social Network Inspired Models of NLP and Language Evolution. IJCNLP 2008: 937 - [c9]Joy Deep Nath, Monojit Choudhury, Animesh Mukherjee, Christian Biemann, Niloy Ganguly:
Unsupervised Parts-of-Speech Induction for Bengali. LREC 2008 - [c8]Baskaran Sankaran, Kalika Bali, Monojit Choudhury, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Girish Nath Jha, S. Rajendran, K. Saravanan, L. Sobha, Karumuri V. Subbarao:
A Common Parts-of-Speech Tagset Framework for Indian Languages. LREC 2008 - 2007
- [j3]Monojit Choudhury, Rahul Saraf, Vijit Jain, Animesh Mukherjee, Sudeshna Sarkar, Anupam Basu:
Investigation and modeling of the structure of texting language. Int. J. Document Anal. Recognit. 10(3-4): 157-174 (2007) - [c7]Animesh Mukherjee, Monojit Choudhury, Anupam Basu, Niloy Ganguly:
Redundancy Ratio: An Invariant Property of the Consonant Inventories of the World's Languages. ACL 2007 - [c6]Monojit Choudhury, Vaibhav Jalan, Sudeshna Sarkar, Anupam Basu:
Evolution, Optimization, and Language Change: The Case of Bengali Verb Inflections. SIGMORPHON 2007: 65-74 - [c5]Animesh Mukherjee, Monojit Choudhury, Anupam Basu, Niloy Ganguly:
Emergence of Community Structures in Vowel Inventories: An Analysis Based on Complex Networks. SIGMORPHON 2007: 101-108 - 2006
- [j2]Arijit Mukhopadhyay, Sunandan Chakraborty, Monojit Choudhury, Anirban Lahiri, Soumyajit Dey, Anupam Basu:
Shruti: an embedded text-to-speech system for Indian languages. IEE Proc. Softw. 153(2): 75-79 (2006) - [j1]Monojit Choudhury, Anupam Basu, Sudeshna Sarkar:
Multi-Agent Simulation of Emergence of Schwa Deletion Pattern in Hindi. J. Artif. Soc. Soc. Simul. 9(2) (2006) - [c4]Monojit Choudhury, Animesh Mukherjee, Anupam Basu, Niloy Ganguly:
Analysis and Synthesis of the Distribution of Consonants over Languages: A Complex Network Approach. ACL 2006 - [c3]Anirban Lahiri, Anupam Basu, Monojit Choudhury, Srobona Mitra:
Battery-aware code partitioning for a text to speech system. DATE 2006: 672-677 - 2004
- [c2]Monojit Choudhury, Anupam Basu, Sudeshna Sarkar:
A Diachronic Approach for Schwa Deletion in Indo Aryan Languages. SIGMORPHON@ACL 2004 - 2003
- [c1]Shireesh Reddy Annam, Monojit Choudhury, Sudeshna Sarkar, Anupam Basu:
ABHIDHA: An extended WordNet for Indo-Aryan Languages. RIDE 2003: 1-8
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-04 20:15 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint