โ† Back

Hidden-in-Plain-Text: A Benchmark for Social-Web Indirect Prompt Injection in RAG

AI Agent Security arxiv arXiv:2601.10923 PDF โ†—
indirectwebplaininjectionpromptsocialtextbenchmark
Introduces a benchmark and harness for evaluating web-facing RAG systems under indirect prompt injection and retrieval poisoning attacks with standardized end-to-end evaluation from ingestion to generation.
5~10๋ถ„. ์ œ๋ชฉโ†’์ดˆ๋กโ†’์ธํŠธ๋กœโ†’์„น์…˜ํ—ค๋”โ†’๊ทธ๋ฆผโ†’๊ฒฐ๋ก ๋งŒ.
ํŒ๋‹จ: ์–ด๋–ค ๋ฌธ์ œ๋ฅผ ํ’€๊ณ  / ํ•ต์‹ฌ ์•„์ด๋””์–ด / ๋‚ด ์ž‘์—…๊ณผ ๊ด€๋ จ ์žˆ๋‚˜?
~1์‹œ๊ฐ„. ๊ทธ๋ฆผยทํ‘œ๋ฅผ ๊ผผ๊ผผํžˆ. ์ฆ๋ช…ยท์ˆ˜์‹ ๋””ํ…Œ์ผ์€ ๊ฑด๋„ˆ๋œ€.
์‚ฐ์ถœ๋ฌผ: "์ด๋“ค์ด ๋ญ˜ ํ–ˆ๊ณ  ์™œ ๊ทธ๊ฒŒ ํ†ตํ•˜๋Š”๊ฐ€" ํ•œ ๋ฌธ๋‹จ.
์žฌํ˜„ํ•˜๋“ฏ ์ฝ๊ธฐ. ๊ฐ€์ •์„ ์˜์‹ฌ. ์ง์ ‘ ์ธ์šฉ/๋ฐ˜๋ฐ•ํ•  ๋…ผ๋ฌธ๋งŒ.
๋ Œ์ฆˆ: "๋‚ด ํ”Œ๋ฆฟ์—์„œ ์ธก์ •ํ•˜๋ฉด ์ €์ž๊ฐ€ ๋ชป ํ•œ ๋ฌด์—‡์„ ๋ณด์—ฌ์ค„ ์ˆ˜ ์žˆ๋‚˜?"
View in Knowledge Graph โ†’