AI Agent Papers

Corpus2Skill: Don't Retrieve, Navigate — Distilling Enterprise Knowledge into Navigable Agent Skills for QA and RAG

Compiles a corpus offline into a hierarchical tree of Agent Skills that the LLM agent navigates at query time, replacing retrieval with skill-tree traversal.

Memory & RAG 2604.14572 notes →

BudgetMem: Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory

Investigates routing agent memory queries to different processing tiers based on query difficulty to control the cost-accuracy trade-off at runtime.

Memory & RAG 2602.06025 notes →

Learning to Share: Selective Memory for Efficient Parallel Agentic Systems

Proposes a shared memory bank with a learned controller that decides what information is worth passing between parallel agent teams to reduce redundant work.

Memory & RAG 2602.05965 notes →

CompactRAG: Reducing LLM Calls and Token Overhead in Multi-Hop Question Answering

Explores converting a corpus into atomic QA pairs offline to resolve multi-hop questions with just two LLM calls regardless of hop count.

Memory & RAG 2602.05728 notes →

Mitigating Hallucination in Financial Retrieval-Augmented Generation via Fine-Grained Knowledge Verification

Examines breaking financial RAG answers into atomic facts and verifying each against retrieved documents using reinforcement learning rewards.

Memory & RAG 2602.05723 notes →

Graph-based Agent Memory: Taxonomy, Techniques, and Applications

Surveys graph-based memory architectures for agents, covering extraction, storage, retrieval, and how memory evolves over time.

Memory & RAG 2602.05665 notes →

AI Agent Systems for Supply Chains: Structured Decision Prompts and Memory Retrieval

Proposes a multi-agent system for inventory management that retrieves similar past decisions to adapt ordering across various supply chain scenarios.

Memory & RAG 2602.05524 notes →

SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures

Explores replacing flat chunk-based RAG with graph experts that understand entity relationships, causality, and process flows for structured documents like SOPs.

Memory & RAG 2602.01858 notes →

ProcMEM: Learning Reusable Procedural Memory from Experience via Non-Parametric PPO for LLM Agents

Investigates letting agents save step-by-step procedural skills from past runs and reuse them later without retraining to reduce repeated computation.

Memory & RAG 2602.01869 notes →

Aggregation Queries over Unstructured Text: Benchmark and Agentic Method

Proposes an agentic method for aggregation queries over unstructured text that tries to find all matching evidence, breaking the task into disambiguation, filtering, and aggregatio…

Memory & RAG 2602.01355 notes →

DIVERGE: Diversity-Enhanced RAG for Open-Ended Information Seeking

Proposes an agentic RAG framework that uses reflection and memory-based refinement to generate diverse answers for open-ended questions.

Memory & RAG 2602.00238 notes →

JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG

Proposes joint optimization of planning and execution in agentic RAG by modeling the system as a cooperative multi-agent team with shared backbone and outcome-based rewards.

Memory & RAG 2601.21916 notes →

ProRAG: Process-Supervised Reinforcement Learning for Retrieval-Augmented Generation

Proposes process-supervised reinforcement learning for RAG that uses MCTS-based step-level rewards to identify and fix flawed reasoning steps in multi-hop retrieval.

Memory & RAG 2601.21912 notes →

E-mem: Multi-agent based Episodic Context Reconstruction for LLM Agent Memory

Introduces an episodic memory framework where assistant agents maintain uncompressed memory contexts while a master agent orchestrates global planning, replacing destructive memory…

Memory & RAG 2601.21714 notes →

ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory

Proposes a tiered memory service for agentic LLM systems that uses masked mixture-of-experts routing to probe only eligible memory shards under a fixed budget.

Memory & RAG 2601.21545 notes →

When should I search more: Adaptive Complex Query Optimization with Reinforcement Learning

Explores adaptive query optimization in RAG using reinforcement learning to dynamically decide when to split complex queries into sub-queries and fuse the retrieved results.

Memory & RAG 2601.21208 notes →

A2RAG: Adaptive Agentic Graph Retrieval for Cost-Aware and Reliable Reasoning

Introduces an adaptive agentic Graph-RAG framework that verifies evidence sufficiency and progressively escalates retrieval effort, mapping graph signals back to source text to han…

Memory & RAG 2601.21162 notes →

MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents

Investigates augmenting multimodal LLMs with a trainable memory gate that decides which observations to retain, update, or discard during online embodied agent exploration.

Memory & RAG 2601.20831 notes →

AMA: Adaptive Memory via Multi-Agent Collaboration

Proposes a multi-agent memory framework with hierarchical granularity, adaptive query routing, consistency verification, and targeted memory refresh for long-term agent interaction…

Memory & RAG 2601.20352 notes →

When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering

Examines when iterative retrieval-reasoning loops outperform static gold-context RAG in scientific multi-hop QA, diagnosing failure modes across retrieval coverage, hypothesis drif…

Memory & RAG 2601.19827 notes →

Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory

Introduces a dependency-aware search framework that uses GRPO reinforcement learning to teach LLMs to decompose questions with dependency relationships and store intermediate resul…

Memory & RAG 2601.18771 notes →

FadeMem: Biologically-Inspired Forgetting for Efficient Agent Memory

Proposes a biologically-inspired agent memory architecture with adaptive exponential decay, LLM-guided conflict resolution, and intelligent memory fusion across a dual-layer hierar…

Memory & RAG 2601.18642 notes →

FastInsight: Fast and Insightful Retrieval via Fusion Operators for Graph RAG

Explores two fusion operators for Graph RAG that combine graph-aware reranking with semantic-topological expansion to improve retrieval accuracy and generation quality.

Memory & RAG 2601.18579 notes →

Less is More for RAG: Information Gain Pruning for Generator-Aligned Reranking and Evidence Selection

Proposes a generator-aligned reranking and pruning module for RAG that selects evidence using utility signals and filters weak or harmful passages before context truncation.

Memory & RAG 2601.17532 notes →

DeepEra: A Deep Evidence Reranking Agent for Scientific Retrieval-Augmented Generated Question Answering

Introduces a step-by-step reasoning reranking agent for RAG that distinguishes semantically similar but logically irrelevant passages in retrieval-augmented question answering.

Memory & RAG 2601.16478 notes →

SPARC-RAG: Adaptive Sequential-Parallel Scaling with Context Management for Retrieval-Augmented Generation

Introduces a multi-agent RAG framework that coordinates sequential and parallel inference-time scaling under unified context management to prevent contamination and improve multi-h…

Memory & RAG 2602.00083 notes →

Incorporating Q&A Nuggets into Retrieval-Augmented Generation

Proposes a nugget-augmented generation system that constructs a bank of Q&A nuggets from retrieved documents to guide extraction, selection, and report generation with citation pro…

Memory & RAG 2601.13222 notes →

Augmenting Question Answering with A Hybrid RAG Approach

Introduces a hybrid RAG architecture combining query augmentation, agentic routing, and structured retrieval that merges vector and graph-based techniques with context unification …

Memory & RAG 2601.12658 notes →

Utilizing Metadata for Better Retrieval-Augmented Generation

Presents a systematic study of metadata-aware retrieval strategies for RAG, comparing prefix, suffix, unified embedding, and late-fusion approaches with field-level ablations on em…

Memory & RAG 2601.11863 notes →

Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration

Proposes a hierarchical global-to-local retrieval strategy for GraphRAG with beam search-optimized re-ranking and a compact LLM integration module trained via dynamic-weighting rei…

Memory & RAG 2601.11144 notes →

Grounding Agent Memory in Contextual Intent

Introduces an agentic memory system that indexes trajectory steps with structured contextual intent cues and retrieves history by intent compatibility to reduce interference in lon…

Memory & RAG 2601.10702 notes →

Structure and Diversity Aware Context Bubble Construction for Enterprise Retrieval Augmented Systems

Proposes a structure-informed and diversity-constrained context bubble construction framework for RAG that preserves document structure and balances relevance, coverage, and redund…

Memory & RAG 2601.10681 notes →

Topo-RAG: Topology-aware retrieval for hybrid text-table documents

Introduces a dual-architecture RAG framework that routes narrative through dense retrievers and tabular data through a cell-aware late interaction mechanism to preserve spatial rel…

Memory & RAG 2601.10215 notes →

Continuum Memory Architectures for Long-Horizon LLM Agents

Defines a class of memory systems for long-horizon agents that maintain persistent, temporally chained internal state instead of stateless RAG lookups, specifying the architectural…

Memory & RAG 2601.09913 notes →

Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey

Surveys foundation agent memory organized by substrate (internal/external), cognitive mechanism (episodic, semantic, working, procedural), and subject (agent- vs user-centric).

Memory & RAG 2602.06052 notes →

The AI Hippocampus: How Far are We From Human Memory?

Surveys memory in LLMs and multimodal LLMs across implicit, explicit, and agentic paradigms, covering cross-modal integration and challenges like capacity, alignment, and factual c…

Memory & RAG 2601.09113 notes →

AtomMem: Learnable Dynamic Agentic Memory with Atomic Memory Operation

Decomposes memory management into atomic CRUD operations and learns an autonomous policy via SFT + RL to study whether learnable memory outperforms static-workflow methods on long-…

Memory & RAG 2601.08323 notes →

OpenDecoder: Open LLM Decoding to Incorporate Document Quality in RAG

Feeds explicit document quality signals (relevance score, ranking, QPP) into RAG generation to study whether exposing retrieval metadata makes the model more robust to noisy contex…

Memory & RAG 2601.09028 notes →

Reliable Graph-RAG for Codebases: AST-Derived Graphs vs LLM-Extracted Knowledge Graphs

Benchmarks vector-only, LLM-extracted KG, and AST-derived graph pipelines for code RAG, comparing correctness and indexing cost across deterministic and LLM-based graph constructio…

Memory & RAG 2601.08773 notes →

To Retrieve or To Think? An Agentic Approach for Context Evolution

Proposes an agentic RAG framework that dynamically decides whether to retrieve new evidence or reason over existing context at each step, aiming to eliminate redundant retrieval.

Memory & RAG 2601.08747 notes →

Parallel Context-of-Experts Decoding for Retrieval Augmented Generation

Proposes a training-free RAG decoding method that treats retrieved documents as isolated "experts" and aggregates their logits via retrieval-aware contrastive decoding to recover c…

Memory & RAG 2601.08670 notes →

SwiftMem: Fast Agentic Memory via Query-aware Indexing

Proposes a query-aware agentic memory system that achieves sub-linear retrieval through temporal and semantic DAG-Tag indexing with an embedding-tag co-consolidation mechanism for …

Memory & RAG 2601.08160 notes →

Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory

Proposes treating memory abstraction as a learnable cognitive skill, training a memory copilot via DPO to determine how memories should be structured, abstracted, and reused across…

Memory & RAG 2601.07470 notes →

Beyond Dialogue Time: Temporal Semantic Memory for Personalized LLM Agents

Introduces a temporal semantic memory framework that organizes memories by actual occurrence time rather than dialogue time and consolidates temporally continuous information into …

Memory & RAG 2601.07468 notes →

Active Context Compression: Autonomous Memory Management in LLM Agents

Proposes an agent-centric architecture inspired by Physarum polycephalum where the agent autonomously decides when to consolidate learnings and prune raw interaction history to man…

Memory & RAG 2601.07190 notes →

Relink: Constructing Query-Driven Evidence Graph On-the-Fly for GraphRAG

Proposes a reason-and-construct paradigm for GraphRAG that dynamically builds query-specific evidence graphs by instantiating facts from a latent relation pool and discarding distr…

Memory & RAG 2601.07192 notes →

Seeing through the Conflict: Transparent Knowledge Conflict Handling in RAG

Introduces a plug-and-play RAG framework that disentangles semantic match from factual consistency and estimates self-answerability to make the conflict-resolution decision process…

Memory & RAG 2601.06842 notes →

CIRAG: Construction-Integration Retrieval and Adaptive Generation for Multi-hop Question Answering

Proposes a construction-integration approach for multi-hop RAG that preserves multiple evidence chains via iterative triple construction and adaptively expands context granularity …

Memory & RAG 2601.06799 notes →

Amory: Building Coherent Narrative-Driven Agent Memory through Agentic Reasoning

Proposes a working memory framework that constructs structured episodic narratives from conversational fragments, consolidates memories with momentum, and semanticizes peripheral f…

Memory & RAG 2601.06282 notes →

L-RAG: Balancing Context and Retrieval with Entropy-Based Lazy Loading

Proposes an adaptive RAG framework that uses entropy-based gating to bypass vector database retrieval when model uncertainty is low, triggering expensive chunk retrieval only when …

Memory & RAG 2601.06551 notes →

PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop QA

Proposes a decoupled multi-agent RAG framework for multi-hop QA with a Plan-Retrieve-Inspect-Solve-Memoize architecture and two-stage GRPO optimization to address retrieval collaps…

Memory & RAG 2601.05465 notes →

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Proposes a framework for user-controllable memory reliance in long-term agent interactions, modeling memory dependence as an explicit and steerable dimension.

Memory & RAG 2601.05107 notes →

Beyond Static Summarization: Proactive Memory Extraction for LLM Agents

Proposes proactive memory extraction using self-questioning feedback loops instead of one-off static summarization to recover missing information and correct errors iteratively.

Memory & RAG 2601.04463 notes →

Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents

Proposes a hierarchical memory architecture with a Topic Loom that groups consecutive same-topic dialogue turns into coherent memory boxes and links them via long-range event-timel…

Memory & RAG 2601.03785 notes →

MAGMA: A Multi-Graph based Agentic Memory Architecture

Proposes a multi-graph agentic memory architecture that represents memories across orthogonal semantic, temporal, causal, and entity graphs with policy-guided traversal for retriev…

Memory & RAG 2601.03236 notes →

HiMeS: Hippocampus-inspired Memory System for Personalized AI Assistants

Proposes a hippocampus-inspired memory architecture for AI assistants that fuses RL-trained short-term memory extraction with partitioned long-term memory for personalization.

Memory & RAG 2601.06152 notes →

SimpleMem: Efficient Lifelong Memory for LLM Agents

Proposes a three-stage memory framework based on semantic lossless compression with structured compression, online semantic synthesis, and intent-aware retrieval planning.

Memory & RAG 2601.02553 notes →

A Survey on Long-Term Memory Security in LLM Agents: Attacks, Defenses, and Governance Across the Memory Lifecycle

The emergence of writable, cross-session persistent memory in LLM agents introduces a qualitatively different threat landscape from conventional input-centric security concerns, ch…

Memory & RAG 2604.16548 notes → 💬 Tier 2 필독 (loom 작업자). 메모리 거버넌스를 5개 primitive로 형식화:…

Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks

Long-context Large Language Models, despite their expanded capacity, require careful working memory management to mitigate attention dilution during long-horizon tasks. Yet existin…

Memory & RAG 2510.12635 notes → 💬 Tier 2. 메모리를 action으로 — 롱호라이즌 태스크에서 컨텍스트 자율 큐레이션. …