AI Agent Papers

TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging

Proposes a multi-agent observe-analyze-repair loop that uses runtime traces to find and fix bugs in LLM-generated code.

Agent Tooling 2602.06875 notes →

Generative Ontology: When Structured Knowledge Learns to Create

Explores constraining LLM generation with executable schemas and multi-agent roles to produce structurally valid yet creative outputs.

Agent Tooling 2602.05636 notes →

Structured Context Engineering for File-Native Agentic Systems

Tests how context format (YAML, JSON, Markdown) affects agent accuracy across 9,649 experiments in file-native agentic systems.

Agent Tooling 2602.05447 notes →

ProAct: Agentic Lookahead in Interactive Environments

Explores training agents to think ahead by distilling environment search into causal reasoning chains in interactive environments.

Agent Tooling 2602.05327 notes →

Autonomous Question Formation for Large Language Model-Driven AI Systems

Investigates teaching agents to ask themselves the right questions before acting to adapt to new situations autonomously.

Agent Tooling 2602.01556 notes →

From Perception to Action: Spatial AI Agents and World Models

Surveys the connection between agentic architectures and spatial tasks like robotics and navigation, covering memory, planning, and world models in embodied agents.

Agent Tooling 2602.01644 notes →

World Models as an Intermediary between Agents and the Real World

Argues for using world models as a bridge between agents and high-cost real-world environments to provide richer learning signals across domains like robotics and ML engineering.

Agent Tooling 2602.00785 notes →

Engineering AI Agents for Clinical Workflows: A Case Study in Architecture, MLOps, and Governance

Presents a reference architecture for production AI agents integrating Clean Architecture, event-driven design, per-agent MLOps lifecycles, and human-in-the-loop governance.

Agent Tooling 2602.00751 notes →

Autonomous Data Processing using Meta-Agents

Proposes a meta-agent framework that builds, runs, and keeps refining data processing pipelines through hierarchical agent orchestration.

Agent Tooling 2602.00307 notes →

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering

Proposes a multi-agent framework for automatically building executable test environments across ten programming languages using planning-execution-verification with environment reu…

Agent Tooling 2601.22859 notes →

Learning with Challenges: Adaptive Difficulty-Aware Data Generation for Mobile GUI Agent Training

Proposes an adaptive data generation framework for training mobile GUI agents that matches task difficulty to the agent's current capability level.

Agent Tooling 2601.22781 notes →

AutoRefine: From Trajectories to Reusable Expertise for Continual LLM Agent Refinement

Proposes extracting dual-form reusable expertise from agent execution histories — specialized subagents for procedural tasks and skill patterns for static knowledge — with continuo…

Agent Tooling 2601.22758 notes →

ToolTok: Tool Tokenization for Efficient and Generalizable GUI Agents

Proposes modeling GUI agent operations as sequences of learnable tool tokens with semantic anchoring and curriculum-based training instead of coordinate-based visual grounding.

Agent Tooling 2602.02548 notes →

From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents

Proposes a framework combining a self-evolving multi-agent data engine with verifier-based reinforcement learning to train multi-turn interactive tool-using agents.

Agent Tooling 2601.22607 notes →

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents

Investigates why step-wise reasoning struggles with long-horizon planning in LLM agents and proposes future-aware lookahead with reward estimation to let early actions account for …

Agent Tooling 2601.22311 notes →

SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents

Proposes a test-time scaling method for software engineering agents that recycles prior trajectories and branches at critical intermediate steps instead of resampling from scratch.

Agent Tooling 2601.22129 notes →

Optimizing Agentic Workflows using Meta-tools

Proposes bundling recurring sequences of agent tool calls into deterministic meta-tools to skip unnecessary intermediate LLM reasoning steps and cut failures.

Agent Tooling 2601.22037 notes →

astra-langchain4j: Experiences Combining LLMs and Agent Programming

Explores integrating LLM capabilities into the ASTRA agent programming language to study how traditional agent toolkits and modern LLM-based agentic platforms can inform each other…

Agent Tooling 2601.21879 notes →

Meta Context Engineering via Agentic Skill Evolution

Introduces a bi-level framework where a meta-agent evolves context engineering skills via agentic crossover while a base agent executes them to optimize context as files and code.

Agent Tooling 2601.21557 notes →

DataCross: A Unified Benchmark and Agent Framework for Cross-Modal Heterogeneous Data Analysis

Proposes a multi-agent framework and benchmark for cross-modal data analysis that coordinates specialized sub-agents via a divide-and-conquer workflow across structured and unstruc…

Agent Tooling 2601.21403 notes →

CovAgent: Overcoming the 30% Curse of Mobile Application Coverage with Agentic AI and Dynamic Instrumentation

Explores agentic AI for Android app testing that uses code inspection and dynamic instrumentation to reach activities that standard GUI fuzzers cannot access.

Agent Tooling 2601.21253 notes →

CUA-Skill: Develop Skills for Computer Using Agent

Introduces a large-scale computer-using agent skill library with parameterized execution, composition graphs, dynamic retrieval, and memory-aware failure recovery for desktop appli…

Agent Tooling 2601.21123 notes →

Textual Equilibrium Propagation for Deep Compound AI Systems

Explores local equilibrium propagation for optimizing deep compound AI systems that avoids signal degradation in long-horizon agentic workflows by replacing global textual backprop…

Agent Tooling 2601.21064 notes →

Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control

Investigates counterfactual reasoning in agentic LLM control scenarios using structural causal models and conformal prediction for formal reliability guarantees.

Agent Tooling 2601.20090 notes →

Insight Agents: An LLM-Based Multi-Agent System for Data Insights

Introduces a hierarchical multi-agent system with out-of-domain detection and BERT-based agent routing for delivering personalized data insights at production scale.

Agent Tooling 2601.20048 notes →

Agentic Design Patterns: A System-Theoretic Framework

Introduces a system-theoretic framework that decomposes agentic AI into five functional subsystems and derives 12 reusable design patterns for building robust agent architectures.

Agent Tooling 2601.19752 notes →

A Practical Guide to Agentic AI Transition in Organizations

Explores a pragmatic framework for transitioning organizational processes to agentic AI, covering domain-driven use case identification, task delegation, and human-in-the-loop oper…

Agent Tooling 2602.10122 notes →

JitRL: Just-In-Time Reinforcement Learning for Continual Learning in LLM Agents Without Gradient Updates

Proposes a training-free continual learning framework for LLM agents that retrieves relevant past experiences and modulates output logits at test time without gradient updates.

Agent Tooling 2601.18510 notes →

Think-Augmented Function Calling: Improving LLM Parameter Accuracy Through Embedded Reasoning

Proposes embedding explicit reasoning at both function and parameter levels during agent tool calls, with dynamic complexity scoring to trigger granular justification for critical …

Agent Tooling 2601.18282 notes →

Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents

Investigates which RL training environment properties and modeling choices most influence cross-domain generalization for LLM agents deployed beyond their training domains.

Agent Tooling 2601.18217 notes →

Think Locally, Explain Globally: Graph-Guided LLM Investigations via Local Reasoning and Belief Propagation

Proposes disaggregating LLM investigation into bounded local evidence mining with deterministic graph traversal and belief propagation for reliable open-ended agent reasoning.

Agent Tooling 2601.17915 notes →

AI Agent for Reverse-Engineering Legacy Finite-Difference Code

Presents a LangGraph-based AI agent framework combining GraphRAG, multi-stage retrieval, and RL-inspired adaptive feedback for reverse-engineering legacy scientific code.

Agent Tooling 2601.18381 notes →

PatchIsland: Orchestration of LLM Agents for Continuous Vulnerability Repair

Proposes a continuous vulnerability repair system that orchestrates a diverse LLM agent ensemble with two-phase deduplication for integration with continuous fuzzing pipelines.

Agent Tooling 2601.17471 notes →

DALIA: Towards a Declarative Agentic Layer for Intelligent Agents in MCP-Based Server Ecosystems

Introduces a declarative architectural layer for agentic workflows with formalized capabilities, declarative discovery protocol, and deterministic task graph construction.

Agent Tooling 2601.17435 notes →

SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents

Presents a task-aware context pruning framework for coding agents that trains a lightweight neural skimmer to selectively retain relevant code lines based on explicit goals.

Agent Tooling 2601.16746 notes →

REprompt: Prompt Generation for Intelligent Software Development Guided by Requirements Engineering

Proposes a multi-agent prompt optimization framework guided by requirements engineering principles for system and user prompts in agent-based software development.

Agent Tooling 2601.16507 notes →

EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration

Introduces a self-evolving multi-agent framework for automated environment configuration with expert diagnosis and dynamic error-fixing priority adjustment.

Agent Tooling 2601.16489 notes →

SemanticALLI: Caching Reasoning, Not Just Responses, in Agentic Systems

Proposes a pipeline-aware caching architecture for agentic systems that elevates structured intermediate reasoning representations to first-class cacheable artifacts to reduce redu…

Agent Tooling 2601.16286 notes →

Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics

Investigates imposing explicit dynamical structure on an external affective state to induce temporal coherence and controlled recovery in multi-turn dialogue agents.

Agent Tooling 2601.16087 notes →

Agentic Uncertainty Quantification

Proposes a Dual-Process framework that transforms verbalized uncertainty into bi-directional control signals for agent memory and reflection to prevent cascading hallucination erro…

Agent Tooling 2601.15703 notes →

Agentic AI Governance and Lifecycle Management in Healthcare

Presents a Unified Agent Lifecycle Management blueprint with five control-plane layers for governing agent fleets including identity registry, orchestration, and runtime policy enf…

Agent Tooling 2601.15630 notes →

Autonomous Business System via Neuro-symbolic AI

Introduces a neuro-symbolic architecture that integrates LLM agents with predicate-logic programming and knowledge graphs to orchestrate end-to-end business initiatives through tas…

Agent Tooling 2601.15599 notes →

How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework

Proposes a software engineering framework for capturing and embedding codified human domain knowledge into LLM-based agents through request classification, RAG, and expert rule int…

Agent Tooling 2601.15153 notes →

Agent Identity URI Scheme: Topology-Independent Naming and Capability-Based Discovery for Multi-Agent Systems

Defines the agent:// URI scheme that decouples agent identity from network location through trust roots, hierarchical capability paths, and cryptographic attestation for multi-agen…

Agent Tooling 2601.14567 notes →

Toward Efficient Agents: Memory, Tool learning, and Planning

Surveys efficiency in agent systems across memory, tool learning, and planning, comparing approaches under fixed cost budgets and analyzing the Pareto frontier between effectivenes…

Agent Tooling 2601.14192 notes →

Toward self-coding information systems

Proposes self-coding information systems that use agentic AI to dynamically generate, test, and redeploy their own source code at runtime to reduce feature delivery time.

Agent Tooling 2601.14132 notes →

A Lightweight Modular Framework for Constructing Autonomous Agents Driven by Large Language Models: Design, Implementation, and Applications in AgentForge

Presents a lightweight open-source Python framework for building LLM-driven agents with composable skill abstractions, a unified LLM backend interface, and declarative YAML-based c…

Agent Tooling 2601.13383 notes →

MagicGUI-RMS: A Multi-Agent Reward Model System for Self-Evolving GUI Agents via Automated Feedback Reflux

Introduces a multi-agent reward model system for GUI agents that combines domain-specific and general-purpose reward models with automated data reflux for self-evolving agent train…

Agent Tooling 2601.13060 notes →

Agentic AI Meets Edge Computing in Autonomous UAV Swarms

Investigates three deployment architectures for integrating LLM-based agentic AI with edge computing in UAV swarms, covering standalone, edge-enabled, and edge-cloud hybrid configu…

Agent Tooling 2601.14437 notes →

Agentic Artificial Intelligence (AI): Architectures, Taxonomies, and Evaluation of Large Language Model Agents

Proposes a unified taxonomy decomposing AI agents into Perception, Brain, Planning, Action, Tool Use, and Collaboration subsystems, covering MCP, native computer use, and evaluatio…

Agent Tooling 2601.12560 notes →

Agentic Reasoning for Large Language Models

Surveys agentic reasoning across foundational, self-evolving, and collective multi-agent dimensions, distinguishing in-context reasoning from post-training approaches across planni…

Agent Tooling 2601.12538 notes →

POLARIS: Typed Planning and Governed Execution for Agentic AI in Back-Office Automation

Introduces a governed orchestration framework that treats agentic automation as typed plan synthesis with DAG-based planning, rubric-guided selection, validator-gated execution, an…

Agent Tooling 2601.11816 notes →

From Everything-is-a-File to Files-Are-All-You-Need: How Unix Philosophy Informs the Design of Agentic AI Systems

Explores how the Unix 'everything is a file' principle informs agentic AI design through file-like abstractions and code-based specifications for composable, auditable agent interf…

Agent Tooling 2601.11672 notes →

Towards AGI A Pragmatic Approach Towards Self Evolving Agent

Introduces a hierarchical self-evolving multi-agent framework that integrates curriculum learning, reward-based learning, and genetic algorithm evolution for continuous autonomous …

Agent Tooling 2601.11658 notes →

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Proposes a self-evolving agent framework that evolves an explicit finite state machine instead of free-form code rewriting, constraining flow and skill optimization to a structured…

Agent Tooling 2601.09465 notes →

Investigating Tool-Memory Conflicts in Tool-Augmented LLMs

Identifies and studies a conflict type where a tool-augmented LLM's internal knowledge contradicts external tool outputs, evaluating whether existing resolution techniques like pro…

Agent Tooling 2601.09760 notes →

MAXS: Meta-Adaptive Exploration with LLM Agents

Uses lookahead planning to estimate the value of tool usage at each step and selects stable, high-value reasoning paths, with a convergence mechanism that halts rollouts once consi…

Agent Tooling 2601.09259 notes →

ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web

Trains history-aware routers for large-scale MCP tool ecosystems using dependency graphs and multi-turn trajectory synthesis to generalize across multi-agent collaboration and mass…

Agent Tooling 2601.08276 notes →

Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning

Proposes iterative query planning for tool retrieval that decomposes instructions into sub-tasks and dynamically generates queries, trained via synthetic trajectories and reinforce…

Agent Tooling 2601.07782 notes →

OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent

Introduces a Computer-Using Agent framework with milestone-driven long-term memory for trajectory-level self-correction and a multimodal searcher that synthesizes live, visually al…

Agent Tooling 2601.07779 notes →

SAGE: Tool-Augmented LLM Task Solving Strategies in Scalable Multi-Agent Environments

Presents a conversational AI interface for dynamic tool discovery and execution via the OPACA framework, comparing multiple task-solving strategies across different agent setups an…

Agent Tooling 2601.09750 notes →

Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning

Proposes test-time tool evolution where agents synthesize, verify, and evolve executable tools during inference instead of relying on static pre-defined tool libraries.

Agent Tooling 2601.07641 notes →

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Introduces a large-scale distributed orchestration system that decouples agent training into independent Model, Agent, and Environment services for scheduling tens of thousands of …

Agent Tooling 2601.07526 notes →

JudgeFlow: Agentic Workflow Optimization via Block Judge

Proposes an evaluation-judge-optimization pipeline that assigns block-level responsibility scores to failing logic blocks in agentic workflows, focusing modifications on the most p…

Agent Tooling 2601.07477 notes →

R-LAM: Reproducibility-Constrained Large Action Models for Scientific Workflow Automation

Introduces a reproducibility-constrained framework for Large Action Models with structured action schemas, deterministic execution policies, and provenance tracking to ensure audit…

Agent Tooling 2601.09749 notes →

OpenTinker: Separating Concerns in Agentic Reinforcement Learning

Proposes a composable RL infrastructure for LLM agents that separates algorithm design, execution, and agent-environment interaction with a centralized scheduler for managing share…

Agent Tooling 2601.07376 notes →

ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging

Introduces activation-guided, role-conditioned neuron transplantation for training-free merging of environment-specific LLM agent experts into a single generalist model.

Agent Tooling 2601.07309 notes →

PRISM: Disentangling SFT and RL Data via Gradient Concentration

Proposes a dynamics-aware framework grounded in Schema Theory that routes agent training data to SFT or RL based on gradient concentration, using cognitive conflict as the allocati…

Agent Tooling 2601.07224 notes →

ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration

Introduces a training framework for calibrating agent tool-use behavior through a self-evolving data flywheel and two-phase behavior calibration to reduce redundant and insufficien…

Agent Tooling 2601.06860 notes →

No More Stale Feedback: Co-Evolving Critics for Open-World Agent Learning

Proposes a co-evolutionary framework that jointly optimizes the agent policy and its natural-language critic through synchronized GRPO updates, preventing the critic from becoming …

Agent Tooling 2601.06794 notes →

CEDAR: Context Engineering for Agentic Data Science

Introduces context engineering techniques for agentic workflows including structured DS-specific prompting, separate plan and code agents, and smart history rendering for fault tol…

Agent Tooling 2601.06606 notes →

ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking

Proposes a reinforcement learning paradigm that replaces pointwise scalar scoring with intra-group relative ranking via tournament-based schemes to address discrimination collapse …

Agent Tooling 2601.06487 notes →

Architecting AgentOps Needs CHANGE

Introduces a conceptual framework with six capabilities (Contextualize, Harmonize, Anticipate, Negotiate, Generate, Evolve) for architecting AgentOps platforms that manage the life…

Agent Tooling 2601.06456 notes →

Can We Predict Before Executing Machine Learning Agents?

Proposes internalizing execution priors to predict agent outcomes before physical execution, using a Predict-then-Verify loop to accelerate ML agent workflows without running expen…

Agent Tooling 2601.05930 notes →

EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis

Proposes an automated framework for generating scalable tool-interaction environments via programmatic synthesis, constructing diverse environment skeletons and task scenarios for …

Agent Tooling 2601.05808 notes →

LIDL: LLM Integration Defect Localization via Knowledge Graph-Enhanced Multi-Agent Analysis

Proposes a multi-agent framework for localizing integration defects in LLM-integrated software using code knowledge graphs enriched with LLM-aware annotations and counterfactual re…

Agent Tooling 2601.05539 notes →

AT²PO: Agentic Turn-based Policy Optimization via Tree Search

Proposes a unified framework for multi-turn agentic RL that uses a turn-level tree structure for entropy-guided exploration, turn-wise credit assignment, and turn-based policy opti…

Agent Tooling 2601.04767 notes →

M-ASK: Multi-Agent Search and Knowledge Optimization Framework

Proposes a framework that decouples agentic search into Search Behavior Agents and Knowledge Management Agents with turn-level rewards for multi-hop QA.

Agent Tooling 2601.04703 notes →

AgentDevel: Reframing Self-Evolving LLM Agents as Release Engineering

Reframes agent self-improvement as a release engineering pipeline with implementation-blind quality signals, symptom-level diagnosis, and flip-centered regression gating.

Agent Tooling 2601.04620 notes →

4D-ARE: 4-Dimensional Attribution-Driven Agent Requirements Engineering

Proposes an attribution-driven requirements engineering methodology for specifying what domain knowledge LLM agents need at design time, organized along four causal dimensions.

Agent Tooling 2601.04556 notes →

XGrammar 2: Dynamic and Efficient Structured Generation Engine for Agentic LLMs

Proposes a structured generation engine for agentic LLMs with dynamic tag dispatching, JIT compilation, and cross-grammar caching for tool calling and conditional structured genera…

Agent Tooling 2601.04426 notes →

Transitive Expert Error and Routing Problems in Complex AI Systems

Formalizes transitive expert error in AI routing architectures including MoE, multi-model orchestration, and tool-using agents, proposing boundary-aware calibration and coverage ga…

Agent Tooling 2601.04416 notes →

O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL

Introduces a multi-agent workflow for synthesizing research-grade training data with a two-stage SFT plus agentic RL strategy for open-source deep research models.

Agent Tooling 2601.03743 notes →

Architecting Agentic Communities using Design Patterns

Proposes design patterns for architecting agentic communities derived from enterprise distributed systems standards, covering coordination, governance, and formal collaboration agr…

Agent Tooling 2601.03624 notes →

SCRIBE: Structured Mid-Level Supervision for Tool-Using Language Models

Proposes a skill-conditioned RL framework for tool-using agents that grounds reward modeling in a library of skill prototypes for mid-level credit assignment.

Agent Tooling 2601.03555 notes →

Enhancing Model Context Protocol (MCP) with Context-Aware Server Collaboration

Proposes a Context-Aware MCP architecture with a Shared Context Store that enables MCP servers to coordinate autonomously by reading from and writing to shared context memory.

Agent Tooling 2601.11595 notes →

Enhancing LLM Instruction Following: An Evaluation-Driven Multi-Agentic Workflow for Prompt Instructions Optimization

Proposes a multi-agentic workflow that decouples optimization of primary task descriptions from constraint optimization using quantitative feedback for iterative prompt refinement.

Agent Tooling 2601.03359 notes →

InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents

Proposes a general-purpose agent framework that keeps reasoning context bounded regardless of task duration by externalizing persistent state into a file-centric state abstraction.

Agent Tooling 2601.03204 notes →

The Path Ahead for Agentic AI: Challenges and Opportunities

Surveys agentic AI architectures covering planning, memory, tool use, and iterative reasoning with a critical assessment of safety, alignment, and reliability challenges.

Agent Tooling 2601.02749 notes →

AMER-RCL: Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices

Proposes an agentic memory enhanced recursive reasoning framework for root cause localization with cross-alert memory reuse and multi-agent recursive refinement.

Agent Tooling 2601.02732 notes →

Orchestral AI: A Framework for Agent Orchestration

Introduces a lightweight Python framework providing a unified, type-safe interface for building LLM agents across multiple providers with tool calling, memory management, and MCP i…

Agent Tooling 2601.02577 notes →

AI Agent Systems: Architectures, Applications, and Evaluation

Surveys AI agent architectures spanning reasoning, planning, tool calling, orchestration patterns, and deployment settings with a unified taxonomy of agent components and design tr…

Agent Tooling 2601.01743 notes →

CaveAgent: Transforming LLMs into Stateful Runtime Operators

Proposes a dual-stream architecture that elevates the persistent Python runtime as the central locus of agent state, with stateful runtime management and skill injection for long-h…

Agent Tooling 2601.01569 notes →

Actively Obtaining Environmental Feedback for Autonomous Action Evaluation Without Predefined Measurements

Proposes an active feedback model where AI agents proactively interact with the environment to discover and verify feedback without relying on predefined measurements.

Agent Tooling 2601.04235 notes →

Warp-Cortex: An Asynchronous, Memory-Efficient Architecture for Million-Agent Cognitive Scaling on Consumer Hardware

Proposes an asynchronous architecture for million-agent scaling that reduces memory complexity via singleton weight sharing and topological synapse-inspired KV-cache sparsification…

Agent Tooling 2601.01298 notes →

Building Effective Agents (Anthropic Engineering Blog)

Anthropic 엔지니어링 블로그. 프로덕션 실무자 관점에서 워크플로 vs 에이전트, 오케스트레이션 패턴 등 어휘와 직관을 잡아주는 워밍업 자료. 논문 읽기 전에 먼저 볼 것.

Agent Tooling blog/anthropic/build notes → 💬 Tier 0 — 논문 아님. 에이전트 관련 논문 읽기 전 워밍업. 짧고 그림 위주. 실무 …