default search action
Souradip Chakraborty
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c14]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Huazheng Wang, Dinesh Manocha, Mengdi Wang, Furong Huang:
PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback. ICLR 2024 - [c13]Xiangyu Liu, Souradip Chakraborty, Yanchao Sun, Furong Huang:
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL. ICLR 2024 - [c12]Souradip Chakraborty, Amrit S. Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang:
Position: On the Possibilities of AI-Generated Text Detection. ICML 2024 - [c11]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Dinesh Manocha, Furong Huang, Amrit S. Bedi, Mengdi Wang:
MaxMin-RLHF: Alignment with Diverse Human Preferences. ICML 2024 - [i27]Xingpeng Sun, Haoming Meng, Souradip Chakraborty, Amrit Singh Bedi, Aniket Bera:
Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues. CoRR abs/2402.03494 (2024) - [i26]Souradip Chakraborty, Jiahao Qiu, Hui Yuan, Alec Koppel, Furong Huang, Dinesh Manocha, Amrit Singh Bedi, Mengdi Wang:
MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences. CoRR abs/2402.08925 (2024) - [i25]Xiyang Wu, Ruiqi Xian, Tianrui Guan, Jing Liang, Souradip Chakraborty, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi:
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities. CoRR abs/2402.10340 (2024) - [i24]Nirjhar Das, Souradip Chakraborty, Aldo Pacchiano, Sayak Ray Chowdhury:
Provably Sample Efficient RLHF via Active Preference Optimization. CoRR abs/2402.10500 (2024) - [i23]Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang:
Transfer Q Star: Principled Decoding for LLM Alignment. CoRR abs/2405.20495 (2024) - [i22]Utsav Singh, Souradip Chakraborty, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Singh Bedi:
DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning. CoRR abs/2406.10892 (2024) - [i21]Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang:
Is poisoning a real threat to LLM alignment? Maybe more so than you think. CoRR abs/2406.12091 (2024) - [i20]Mucong Ding, Souradip Chakraborty, Vibhu Agrawal, Zora Che, Alec Koppel, Mengdi Wang, Amrit S. Bedi, Furong Huang:
SAIL: Self-Improving Efficient Online Alignment of Large Language Models. CoRR abs/2406.15567 (2024) - [i19]Michael-Andrei Panaitescu-Liess, Zora Che, Bang An, Yuancheng Xu, Pankayaraj Pathmanathan, Souradip Chakraborty, Sicheng Zhu, Tom Goldstein, Furong Huang:
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? CoRR abs/2407.17417 (2024) - [i18]Bhrij Patel, Souradip Chakraborty, Wesley A. Suttle, Mengdi Wang, Amrit Singh Bedi, Dinesh Manocha:
AIME: AI System Optimization via Multiple LLM Evaluators. CoRR abs/2410.03131 (2024) - [i17]Anas Barakat, Souradip Chakraborty, Peihong Yu, Pratap Tokekar, Amrit Singh Bedi:
On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning. CoRR abs/2410.04108 (2024) - [i16]Utsav Singh, Souradip Chakraborty, Wesley A. Suttle, Brian M. Sadler, Anit Kumar Sahu, Mubarak Shah, Vinay P. Namboodiri, Amrit Singh Bedi:
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction. CoRR abs/2411.00361 (2024) - 2023
- [j1]Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Geiping, Furong Huang, Dinesh Manocha, Amrit S. Bedi:
A Survey on the Possibilities & Impossibilities of AI-generated Text Detection. Trans. Mach. Learn. Res. 2023 (2023) - [c10]Souradip Chakraborty, Amrit Singh Bedi, Pratap Tokekar, Alec Koppel, Brian M. Sadler, Furong Huang, Dinesh Manocha:
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning. AAAI 2023: 6980-6988 - [c9]Souradip Chakraborty, Amrit S. Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha:
STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning. ICML 2023: 3949-3978 - [c8]Souradip Chakraborty, Amrit Singh Bedi, Kasun Weerakoon, Prithvi Poddar, Alec Koppel, Pratap Tokekar, Dinesh Manocha:
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policy Optimization. ICRA 2023: 989-995 - [i15]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Mengdi Wang, Furong Huang, Dinesh Manocha:
STEERING: Stein Information Directed Exploration for Model-Based Reinforcement Learning. CoRR abs/2301.12038 (2023) - [i14]Souradip Chakraborty, Kasun Weerakoon, Prithvi Poddar, Pratap Tokekar, Amrit Singh Bedi, Dinesh Manocha:
RE-MOVE: An Adaptive Policy Design Approach for Dynamic Environments via Language-Based Feedback. CoRR abs/2303.07622 (2023) - [i13]Souradip Chakraborty, Amrit Singh Bedi, Sicheng Zhu, Bang An, Dinesh Manocha, Furong Huang:
On the Possibilities of AI-Generated Text Detection. CoRR abs/2304.04736 (2023) - [i12]Xiangyu Liu, Souradip Chakraborty, Yanchao Sun, Furong Huang:
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in Multi-Agent RL. CoRR abs/2305.17342 (2023) - [i11]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Dinesh Manocha, Huazheng Wang, Furong Huang, Mengdi Wang:
Aligning Agent Policy with Externalities: Reward Design via Bilevel RL. CoRR abs/2308.02585 (2023) - [i10]Soumya Suvra Ghosal, Souradip Chakraborty, Jonas Geiping, Furong Huang, Dinesh Manocha, Amrit Singh Bedi:
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey. CoRR abs/2310.15264 (2023) - [i9]Souradip Chakraborty, Amisha Bhaskar, Anukriti Singh, Pratap Tokekar, Dinesh Manocha, Amrit Singh Bedi:
REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback. CoRR abs/2312.14436 (2023) - 2022
- [c7]Kasun Weerakoon, Souradip Chakraborty, Nare Karapetyan, Adarsh Jagan Sathyamoorthy, Amrit S. Bedi, Dinesh Manocha:
HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm. CoRL 2022: 1629-1639 - [c6]Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian M. Sadler, Pratap Tokekar, Alec Koppel:
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces. ICML 2022: 1716-1731 - [i8]Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian M. Sadler, Pratap Tokekar, Alec Koppel:
On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces. CoRR abs/2201.12332 (2022) - [i7]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Brian M. Sadler, Furong Huang, Pratap Tokekar, Dinesh Manocha:
Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning. CoRR abs/2206.01162 (2022) - [i6]Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Pratap Tokekar, Dinesh Manocha:
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies. CoRR abs/2206.05652 (2022) - [i5]Kasun Weerakoon, Souradip Chakraborty, Nare Karapetyan, Adarsh Jagan Sathyamoorthy, Amrit Singh Bedi, Dinesh Manocha:
HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm. CoRR abs/2207.03694 (2022) - 2020
- [c5]Souradip Chakraborty, O. Ekaba Bisong, Shweta Bhatt, Thomas Wagner, Riley Elliott, Francesco Mosconi:
BioMedBERT: A Pre-trained Biomedical Language Model for QA and IR. COLING 2020: 669-679 - [c4]Souradip Chakraborty, Ekansh Verma, Saswata Sahoo, Jyotishka Datta:
FairMixRep: Self-supervised Robust Representation Learning for Heterogeneous Data with Fairness constraints. ICDM (Workshops) 2020: 458-463 - [c3]Souradip Chakraborty, Aritra Roy Gosthipaty, Sayak Paul:
G-SimCLR: Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling. ICDM (Workshops) 2020: 912-916 - [c2]Saswata Sahoo, Souradip Chakraborty:
Graph Spectral Feature Learning for Mixed Data of Categorical and Numerical Type. ICPR 2020: 5712-5719 - [c1]Ekansh Verma, Vinodh Motupalli, Souradip Chakraborty:
Transformers at SemEval-2020 Task 11: Propaganda Fragment Detection Using Diversified BERT Architectures Based Ensemble Learning. SemEval@COLING 2020: 1823-1828 - [i4]Saswata Sahoo, Souradip Chakraborty:
Graph Spectral Feature Learning for Mixed Data of Categorical and Numerical Type. CoRR abs/2005.02817 (2020) - [i3]Saswata Sahoo, Souradip Chakraborty:
Learning Representation for Mixed Data Types with a Nonlinear Deep Encoder-Decoder Framework. CoRR abs/2009.09634 (2020) - [i2]Souradip Chakraborty, Aritra Roy Gosthipaty, Sayak Paul:
G-SimCLR : Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling. CoRR abs/2009.12007 (2020) - [i1]Souradip Chakraborty, Ekansh Verma, Saswata Sahoo, Jyotishka Datta:
FairMixRep : Self-supervised Robust Representation Learning for Heterogeneous Data with Fairness constraints. CoRR abs/2010.03228 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-11 20:39 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint