EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents
The EEVEE framework employs a router-conditioned prompt set with co-evolution to enhance LLM robustness across heterogeneous task streams, improving scores by 10.38 to 24.32 points.
Weixian Xu, Shilong Liu, Mengdi Wang