Jiongxiao Wang and Muhao Chen are qualified to endorse.
RLHFPoison: Reward Poisoning Attack for Reinforcement Learning with Human Feedback in Large Language Models
Jiongxiao Wang: Is registered as an author of this paper. Can endorse for cs.AI, cs.CL, cs.CR, cs.CV, cs.HC, cs.LG.
Muhao Chen: Is registered as an author of this paper. Can endorse for cs.AI, cs.CL, cs.CR, cs.CV, cs.DB, cs.HC, cs.IR, cs.LG, cs.SE, cs.SI, q-bio.BM, q-bio.GN, q-bio.MN, q-bio.QM.
Junlin Wu, Yevgeniy Vorobeychik and Chaowei Xiao are not registered as owners of this paper.