AI Agent Papers

AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing

A multi-agent pipeline that reads a PDE problem description in plain text and writes, debugs, and validates a classical numerical solver end-to-end. Generates spectral and finite-d…

Multi-Agent 2602.17607 notes →

Beyond Offline A/B Testing: Context-Aware Agent Simulation for Recommender System Evaluation

Evaluates recommender systems via agent-RS interactions.

Multi-Agent 2604.09549 notes →

CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery

Introduces long-running multi-agent systems that self-evolve via shared persistent memory, asynchronous execution, and heartbeat-based interventions; 3–10× higher improvement rates…

Multi-Agent 2604.01658 notes →

DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching

Investigates dynamically rewiring agent-to-agent connections at each reasoning round via semantic matching instead of fixed communication topologies.

Multi-Agent 2602.06039 notes →

RuleSmith: Multi-Agent LLMs for Automated Game Balancing

Explores automated game balancing by combining multi-agent LLM self-play with Bayesian optimization on a civ-style game.

Multi-Agent 2602.06232 notes →

CommCP: Efficient Multi-Agent Coordination via LLM-Based Communication with Conformal Prediction

Examines how conformal prediction can filter noisy inter-agent messages to improve multi-robot coordination.

Multi-Agent 2602.06038 notes →

AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions

Introduces a 110+ task benchmark to evaluate how well multi-agent LLM systems handle buyer-seller negotiation through natural language.

Multi-Agent 2602.06008 notes →

Gender Dynamics and Homophily in a Social Network of LLM Agents

Analyzes social network formation among 70K+ autonomous LLM agents on Chirper.ai to study emergent group behavior and bias.

Multi-Agent 2602.02606 notes →

ROMA: Recursive Open Meta-Agent Framework for Long-Horizon Multi-Agent Systems

Proposes breaking large tasks into subtask trees that run in parallel across multiple agents to handle long-horizon workflows without exceeding context windows.

Multi-Agent 2602.01848 notes →

ORCH: many analyses, one merge — a deterministic multi-agent orchestrator

Proposes a deterministic multi-agent orchestrator where multiple LLMs analyze a problem independently and a merge agent selects the best answer without any training.

Multi-Agent 2602.01797 notes →

H-AdminSim: A Multi-Agent Simulator for Realistic Hospital Administrative Workflows

Simulates end-to-end hospital administrative workflows with multi-agent LLMs and FHIR integration to test LLM-driven automation in healthcare settings.

Multi-Agent 2602.05407 notes →

Agyn: A Multi-Agent System for Team-Based Autonomous Software Engineering

Proposes a multi-agent system for autonomous software engineering that assigns specialized agents to roles like coordination, research, implementation, and review.

Multi-Agent 2602.01465 notes →

Multi-Agent Teams Hold Experts Back

Examines whether self-organizing LLM agent teams can match or beat their best member's performance across collaborative benchmarks.

Multi-Agent 2602.01011 notes →

Evolving Interpretable Constitutions for Multi-Agent Coordination

Explores using LLM-driven genetic programming to automatically discover behavioral norms for multi-agent coordination in a survival-pressure grid-world simulation.

Multi-Agent 2602.00755 notes →

Scaling Multiagent Systems with Process Rewards

Proposes per-action process rewards from AI feedback to improve credit assignment and sample efficiency when finetuning multi-agent LLM systems.

Multi-Agent 2601.23228 notes →

MonoScale: Scaling Multi-Agent System with Monotonic Improvement

Proposes a framework for safely growing multi-agent pools by generating familiarization tasks and building routing memory, with a guaranteed non-decreasing performance across onboa…

Multi-Agent 2601.23219 notes →

Task-Aware LLM Council with Adaptive Decision Pathways for Decision Support

Proposes a task-adaptive multi-agent framework that routes control to the most suitable LLM at each decision step using semantic matching against each model's success history.

Multi-Agent 2601.22662 notes →

SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly

Explores using a pool of different LLM agents within MCTS planning to increase rollout diversity and improve multi-step reasoning.

Multi-Agent 2601.22623 notes →

Learning to Recommend Multi-Agent Subgraphs from Calling Trees

Proposes a recommendation framework that uses historical calling trees to select the best agents or agent teams for each subtask in multi-agent orchestration.

Multi-Agent 2601.22209 notes →

Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic

Investigates actor-critic reinforcement learning methods for training decentralized LLM agent collaboration across writing, coding, and game-playing tasks.

Multi-Agent 2601.21972 notes →

AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making

Proposes a role-structured multi-agent courtroom debate framework with defined agent roles, interaction protocols, and private reasoning strategies for auditable high-stakes decisi…

Multi-Agent 2601.21936 notes →

Epistemic Context Learning: Building Trust the Right Way in LLM-Based Multi-Agent Systems

Introduces a reasoning framework that builds peer reliability profiles from interaction history so agents in multi-agent systems learn which peers to trust when uncertain.

Multi-Agent 2601.21742 notes →

Adaptive Confidence Gating in Multi-Agent Collaboration for Efficient and Optimized Code Generation

Explores structured multi-agent debate with three role-based agents and adaptive confidence gating to improve small language model code generation.

Multi-Agent 2601.21469 notes →

CASTER: Context-Aware Strategy for Task Efficient Routing in Multi-Agent Systems

Proposes a lightweight router for dynamic model selection in graph-based multi-agent systems that combines semantic embeddings with structural meta-features and self-optimizes thro…

Multi-Agent 2601.19793 notes →

Phase Transition for Budgeted Multi-Agent Synergy

Develops a theory for predicting when budgeted multi-agent LLM systems improve, saturate, or collapse based on context windows, communication fidelity, and shared-error correlation…

Multi-Agent 2601.17311 notes →

Dynamic Role Assignment for Multi-Agent Debate

Proposes a meta-debate framework that dynamically assigns roles in multi-agent systems by matching model capabilities to positions through proposal and peer review stages.

Multi-Agent 2601.17152 notes →

Learning to Collaborate: An Orchestrated-Decentralized Framework for Peer-to-Peer LLM Federation

Introduces orchestrated decentralized peer-to-peer LLM collaboration that uses contextual bandits to learn optimal matchmaking between heterogeneous agents via secure distillation.

Multi-Agent 2601.17133 notes →

Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation

Explores a runtime Mixture-of-Models architecture with a dynamic expertise broker and quadratic voting consensus that enables small model ensembles to match frontier performance.

Multi-Agent 2601.16863 notes →

Multi-Agent Constraint Factorization Reveals Latent Invariant Solution Structure

Formalizes through operator theory why multi-agent LLM systems access invariant solutions that a single agent applying all constraints simultaneously cannot reach.

Multi-Agent 2601.15077 notes →

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

Proposes a training-time framework that formulates multi-agent orchestration as function-calling reinforcement learning with holistic system-level reasoning and introduces MASBENCH…

Multi-Agent 2601.14652 notes →

MASCOT: Towards Multi-Agent Socio-Collaborative Companion Systems

Proposes a bi-level optimization framework for multi-agent companions that aligns individual personas via RLAIF and optimizes collaborative dialogue through group-level meta-policy…

Multi-Agent 2601.14230 notes →

If You Want Coherence, Orchestrate a Team of Rivals: Multi-Agent Models of Organizational Intelligence

Explores a team-of-rivals multi-agent architecture with specialized roles and a remote code executor that separates reasoning from data execution to maintain clean context windows.

Multi-Agent 2601.14351 notes →

The Orchestration of Multi-Agent Systems: Architectures, Protocols, and Enterprise Adoption

Formalizes a unified architectural framework for orchestrated multi-agent systems integrating MCP for tool access and Agent2Agent protocol for peer coordination, delegation, and po…

Multi-Agent 2601.13671 notes →

MARO: Learning Stronger Reasoning from Social Interaction

Proposes Multi-Agent Reward Optimization, a method that decomposes multi-agent social interaction outcomes into per-behavior learning signals to improve LLM reasoning through simul…

Multi-Agent 2601.12323 notes →

LSTM-MAS: A Long Short-Term Memory Inspired Multi-Agent System for Long-Context Understanding

Introduces an LSTM-inspired multi-agent architecture with worker, filter, judge, and manager agents that emulate gated memory mechanisms to control information flow for long-contex…

Multi-Agent 2601.11913 notes →

Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems

Examines whether query-level workflow generation is always necessary in multi-agent systems and proposes a low-cost task-level framework that uses self-prediction with few-shot cal…

Multi-Agent 2601.11147 notes →

Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems

Proposes a latency-aware multi-agent orchestration framework that explicitly optimizes the critical execution path under parallel execution to reduce end-to-end latency while maint…

Multi-Agent 2601.10560 notes →

TopoDIM: One-shot Topology Generation of Diverse Interaction Modes for Multi-Agent Systems

Proposes a one-shot topology generation framework with diverse interaction modes that enables decentralized agents to autonomously construct heterogeneous communication topologies …

Multi-Agent 2601.10120 notes →

Beyond Rule-Based Workflows: An Information-Flow-Orchestrated Multi-Agents Paradigm via A2A Communication from CORAL

Replaces predefined multi-agent workflows with a dynamic information-flow orchestrator that coordinates agents through natural-language A2A communication.

Multi-Agent 2601.09883 notes →

LLM-Based Agentic Systems for Software Engineering: Challenges and Opportunities

Reviews LLM-based multi-agent systems across the software development lifecycle, covering frameworks, communication protocols, and orchestration challenges from requirements to deb…

Multi-Agent 2601.09822 notes →

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Explores injecting structured textual experience into multi-agent deliberation at test time to improve reasoning accuracy without any model tuning.

Multi-Agent 2601.09667 notes →

The End of Reward Engineering: How LLMs Are Redefining Multi-Agent Coordination

Argues that LLMs can replace hand-crafted numerical reward functions with language-based objective specifications for multi-agent coordination, drawing on EUREKA and RLVR as eviden…

Multi-Agent 2601.08237 notes →

A Large-Scale Study on the Development and Issues of Multi-Agent AI Systems

Analyzes over 42K commits and 4.7K resolved issues across eight leading multi-agent AI systems (LangChain, CrewAI, AutoGen, etc.) to study development patterns, maintenance practic…

Multi-Agent 2601.07136 notes →

StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management

Proposes a hierarchical multi-agent framework that decouples high-level coordination from subtask execution with active task-level memory control and reinforcement-learning-driven …

Multi-Agent 2601.05890 notes →

CTHA: Constrained Temporal Hierarchical Architecture for Stable Multi-Agent LLM Systems

Proposes a constrained temporal hierarchical architecture for multi-agent LLM systems that projects inter-layer communication onto structured manifolds with typed message contracts…

Multi-Agent 2601.10738 notes →

DynaDebate: Breaking Homogeneity in Multi-Agent Debate with Dynamic Path Generation

Introduces dynamic path generation for multi-agent debate that allocates diverse solution paths to agents, shifts focus to step-by-step logic critique, and uses a trigger-based ver…

Multi-Agent 2601.05746 notes →

Demystifying Multi-Agent Debate: The Role of Confidence and Diversity

Investigates how diversity-aware initialization and confidence-modulated updates improve multi-agent debate, connecting findings from human deliberation research to LLM-based debat…

Multi-Agent 2601.19921 notes →

Orchestrating Intelligence: Confidence-Aware Routing for Multi-Agent Collaboration

Proposes a multi-agent framework with confidence-aware routing that dynamically selects agent roles and model scales across heterogeneous LLMs based on task complexity.

Multi-Agent 2601.04861 notes →

Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework

Analyzes role-based authority bias in multi-agent evaluation frameworks using French and Raven's power-based theory across legitimate, referent, and expert power types.

Multi-Agent 2601.04790 notes →

When Single-Agent with Skills Replace Multi-Agent Systems and When They Fail

Investigates when a single agent with a skill library can replace multi-agent systems, studying scaling limits and phase transitions in skill selection as libraries grow.

Multi-Agent 2601.04748 notes →

ResMAS: Resilience Optimization in LLM-based Multi-Agent Systems

Proposes a two-stage framework for enhancing multi-agent system resilience through RL-based topology generation and topology-aware prompt optimization under perturbations.

Multi-Agent 2601.04694 notes →

TCAndon-Router: Adaptive Reasoning Router for Multi-Agent Collaboration

Proposes an adaptive reasoning router for multi-agent systems that generates natural-language reasoning chains before predicting candidate agents, with a collaborative execution pi…

Multi-Agent 2601.04544 notes →

When Numbers Start Talking: Implicit Numerical Coordination Among LLM-Based Agents

Investigates covert communication in LLM multi-agent systems through game-theoretic analysis of implicit coordination signals across different communication regimes.

Multi-Agent 2601.03846 notes →

Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making

Proposes a Bayesian, cost-aware multi-LLM orchestration framework that treats LLMs as approximate likelihood models and aggregates across diverse models for sequential decision-mak…

Multi-Agent 2601.01522 notes →

OptimAI: Optimization from Natural Language Using LLM-Powered AI Agents

Turns natural-language optimization problems into working solver code with a four-agent pipeline (Formulator, Planner, Coder, Critic) and UCB bandit scheduling over candidate formu…

Multi-Agent 2504.16918 notes →

Why Do Multi-Agent LLM Systems Fail?

Despite enthusiasm for Multi-Agent LLM Systems (MAS), their performance gains on popular benchmarks are often minimal. This gap highlights a critical need for a principled understa…

Multi-Agent 2503.13657 notes → 💬 Tier 1 필독. MAST 실패 모드 택소노미 — 멀티에이전트 실패를 구조화한 논문. K…

Mathematical modelling of flow and adsorption in a gas chromatograph

In this paper, a mathematical model is developed to describe the evolution of the concentration of compounds through a gas chromatography column. The model couples mass balances an…

Multi-Agent 2501.00001 notes →

Agent-as-Judge for Factual Summarization of Long Narratives

Large Language Models (LLMs) have demonstrated near-human performance in summarization tasks based on traditional metrics such as ROUGE and BERTScore. However, these metrics do not…

Multi-Agent 2501.09993 notes →

A classification of restrictive polynomial correspondences

In this manuscript, we study a special class of correspondences on $\mathbb{P}^{1} \times \mathbb{P}^{1}$ given by a polynomial relation, say $P(z, w)$. We focus on what we call re…

Multi-Agent 2503.00001 notes →