โ† Back

Early Diagnosis of Wasted Computation in Multi-Agent LLM Systems via Failure-Aware Observability

Eval & Observability arxiv arXiv:2606.01365 PDF โ†—
computationwastedfailurediagnosisobservabilityearlyanswerawarespend
Tool-using multi-agent large language model (LLM) systems spend computation through model tokens, tool calls, retries, and code execution before producing an answer. When a run fails, final-answer evaluation reveals the endpoint but usually not the point at which the trajectory stopped making recoverable progress. This paper introduces a failure-aw
5~10๋ถ„. ์ œ๋ชฉโ†’์ดˆ๋กโ†’์ธํŠธ๋กœโ†’์„น์…˜ํ—ค๋”โ†’๊ทธ๋ฆผโ†’๊ฒฐ๋ก ๋งŒ.
ํŒ๋‹จ: ์–ด๋–ค ๋ฌธ์ œ๋ฅผ ํ’€๊ณ  / ํ•ต์‹ฌ ์•„์ด๋””์–ด / ๋‚ด ์ž‘์—…๊ณผ ๊ด€๋ จ ์žˆ๋‚˜?
~1์‹œ๊ฐ„. ๊ทธ๋ฆผยทํ‘œ๋ฅผ ๊ผผ๊ผผํžˆ. ์ฆ๋ช…ยท์ˆ˜์‹ ๋””ํ…Œ์ผ์€ ๊ฑด๋„ˆ๋œ€.
์‚ฐ์ถœ๋ฌผ: "์ด๋“ค์ด ๋ญ˜ ํ–ˆ๊ณ  ์™œ ๊ทธ๊ฒŒ ํ†ตํ•˜๋Š”๊ฐ€" ํ•œ ๋ฌธ๋‹จ.
์žฌํ˜„ํ•˜๋“ฏ ์ฝ๊ธฐ. ๊ฐ€์ •์„ ์˜์‹ฌ. ์ง์ ‘ ์ธ์šฉ/๋ฐ˜๋ฐ•ํ•  ๋…ผ๋ฌธ๋งŒ.
๋ Œ์ฆˆ: "๋‚ด ํ”Œ๋ฆฟ์—์„œ ์ธก์ •ํ•˜๋ฉด ์ €์ž๊ฐ€ ๋ชป ํ•œ ๋ฌด์—‡์„ ๋ณด์—ฌ์ค„ ์ˆ˜ ์žˆ๋‚˜?"
View in Knowledge Graph โ†’