cs.AI 2606.19911

Multi-Agent Transactive Memory

Proposes Multi-Agent Transactive Memory (MATM), which enhances heterogeneous agent populations by sharing trajectories, improving success rate by 8% and reducing steps by 0.59 in interactive tasks.

To Eun Kim, Xuhong He, Dishank Jain et al.

2026-06-18 20
cs.AI 2606.11173

The Role of Feedback Alignment in Self-Distillation

Introduces feedback alignment in self-distillation and compares three feedback types; structure-aligned critique outperforms the others, with a +16.11% accuracy gain.

Semih Kara, Oğuzhan Ersoy

2026-06-10 72
cs.AI 2606.11078

A History-Aware Visually Grounded Critic for Computer Use Agents

Proposes HiViG, a history-aware visually grounded test-time framework, boosting GUI task success rates by 5.8% (Qwen3-VL-32B) and 9% (Gemini-3-Flash) through macro-action history and visual error verification.

Jaewoo Lee, Zaid Khan, Archiki Prasad et al.

2026-06-10 95
cs.AI 2606.02484

Iteris: Agentic Research Loops for Computational Mathematics

Iteris employs an explore-plan-execute loop with multi-agent collaboration to generate numerical evidence and proof drafts, verified through expert review, advancing open problems in computational mathematics.

Leheng Chen, Zihao Liu, Wanyi He et al.

2026-06-02 192
cs.AI 2605.28807

Calibrating Conservatism for Scalable Oversight

Proposes Calibrated Collective Oversight (CCO), integrating multiple auxiliary signals with Conformal Decision Theory for online calibration, ensuring AI behavior aligns with safety targets.

William Overman, Mohsen Bayati

2026-05-28 122