Feb 4, 2024 · This paper focuses on jailbreaking attacks against multi-modal large language models (MLLMs), seeking to elicit MLLMs to generate objectionable responses to ...
May 25, 2024 · We first introduce the concept of “Role-play” into MLLM jailbreak attacks and propose a novel and effective method called Visual Role-play (VRP).
Jailbreaking is an emerging adversarial attack that bypasses the safety alignment deployed in off-the-shelf large language models (LLMs) and has evolved into ...
Aug 26, 2024 · We introduce JailBreakV-28K, a comprehensive benchmark to evaluate the transferability of LLM jailbreak attacks to MLLMs and assess the robustness and safety ...
Jailbreaking Attack against Multimodal Large Language Model. Zhenxing Niu, Haodong Ren, Xinbo Gao, Gang Hua, Rong Jin; Xidian University | Wormpex AI ...
A benchmark for assessing the robustness of multimodal large language models against jailbreak attacks.
This tutorial offers a comprehensive overview of vulnerabilities in Large Language Models (LLMs) that are exposed by adversarial attacks.
Aug 22, 2024 · Our work is motivated by recent research on jailbreak backdoor attack and virtual prompt backdoor attack in generative language models.
Apr 3, 2024 · JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks. Published on Apr 3.