LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks.

AllImages Videos Books Maps News Shopping

Search tools

LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks

www.usenix.org › presentation › yu-jiahao

We introduce an automated solution for large-scale LLM jailbreak susceptibility assessment called LLM-Fuzzer.

Scholarly articles for LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks.

scholar.google.com › citations

… -Fuzzer}: Scaling assessment of large language model …
Yu · Cited by 6

[PDF] LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks

www.usenix.org › system › files

Aug 16, 2024 · This highlights that many open- source and commercial LLMs suffer from severe jailbreak issues, even after safety fine-tuning. 1 Introduction.

LLM-Fuzzer: scaling assessment of large language model jailbreaks

dl.acm.org › doi › abs

6 days ago · To address these scalability issues, we introduce an automated solution for large-scale LLM jailbreak susceptibility assessment called LLM- ...

LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks

pure.psu.edu › publications › llm-fuzzer-...

The jailbreak threat poses a significant concern for Large Language Models (LLMs), primarily due to their potential to generate content at scale.

LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks

openreview.net › forum

Sep 30, 2024 · LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks. Download PDF. Open Webpage. Jiahao Yu, Xingwei Lin, Zheng Yu, Xinyu Xing.

Understanding and Enhancing the Transferability of Jailbreaking Attacks

LLM Jailbreak Detection for (Almost) Free! - OpenReview

LLM Improvement for Jailbreak Defense: Analysis Through the ...

More results from openreview.net

LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks

scholar-chat.com › paper › web

We introduce an automated solution for large-scale LLM jailbreak susceptibility assessment called LLM-F UZZER.

People also search for

Don T Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models

Jailbreak and guard aligned Language Models with only few In-Context demonstrations

GPTFUZZER

FuzzLLM

publications | Jiahao Yu's Page

sherdencooper.github.io › publications

LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks. Jiahao Yu, Xingwei Lin, Zheng Yu, and 1 more author. In Proceedings of the 2024 USENIX ...

[PDF] arXiv:2309.05274v2 [cs.CR] 14 Apr 2024

arxiv.org › pdf

Apr 14, 2024 · In this work, we introduce FuzzLLM, a novel and universal framework that adeptly utilizes fuzzing techniques to proac- tively unearth jailbreak ...

[PDF] Comprehensive Assessment of Jailbreak Attacks Against LLMs - arXiv

arxiv.org › pdf

Dec 16, 2024 · Our study offers valuable insights for future research on jailbreak attacks and defenses and serves as a benchmark tool for re- searchers and ...

Official repo for GPTFUZZER : Red Teaming Large Language Models ...

github.com › sherdencooper › GPTFuzz

This is the official repository for GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts by Jiahao Yu, Xingwei Lin, Zheng Yu, ...