โ† Back

Overcoming the Retrieval Barrier: Indirect Prompt Injection in the Wild for LLM Systems

AI Agent Security arxiv arXiv:2601.07072 PDF โ†—
indirectbarrierwildinjectionpromptsystemsovercomingendretrieval
Proposes a black-box attack that decomposes indirect prompt injection into trigger and attack fragments to study end-to-end IPI exploits under natural queries across RAG and agentic systems.
5~10๋ถ„. ์ œ๋ชฉโ†’์ดˆ๋กโ†’์ธํŠธ๋กœโ†’์„น์…˜ํ—ค๋”โ†’๊ทธ๋ฆผโ†’๊ฒฐ๋ก ๋งŒ.
ํŒ๋‹จ: ์–ด๋–ค ๋ฌธ์ œ๋ฅผ ํ’€๊ณ  / ํ•ต์‹ฌ ์•„์ด๋””์–ด / ๋‚ด ์ž‘์—…๊ณผ ๊ด€๋ จ ์žˆ๋‚˜?
~1์‹œ๊ฐ„. ๊ทธ๋ฆผยทํ‘œ๋ฅผ ๊ผผ๊ผผํžˆ. ์ฆ๋ช…ยท์ˆ˜์‹ ๋””ํ…Œ์ผ์€ ๊ฑด๋„ˆ๋œ€.
์‚ฐ์ถœ๋ฌผ: "์ด๋“ค์ด ๋ญ˜ ํ–ˆ๊ณ  ์™œ ๊ทธ๊ฒŒ ํ†ตํ•˜๋Š”๊ฐ€" ํ•œ ๋ฌธ๋‹จ.
์žฌํ˜„ํ•˜๋“ฏ ์ฝ๊ธฐ. ๊ฐ€์ •์„ ์˜์‹ฌ. ์ง์ ‘ ์ธ์šฉ/๋ฐ˜๋ฐ•ํ•  ๋…ผ๋ฌธ๋งŒ.
๋ Œ์ฆˆ: "๋‚ด ํ”Œ๋ฆฟ์—์„œ ์ธก์ •ํ•˜๋ฉด ์ €์ž๊ฐ€ ๋ชป ํ•œ ๋ฌด์—‡์„ ๋ณด์—ฌ์ค„ ์ˆ˜ ์žˆ๋‚˜?"
View in Knowledge Graph โ†’