每日 Harness 开源 · Source
返回本期 · Back to 2026-06-02

论文 · Papers2026-06-02 · Tuesday, June 2, 2026

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search

arxiv.org原文 ↗

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search
SAAS 聚焦 agentic search 的过度检索:模型在已有内部知识足够时仍继续调用搜索,增加成本和噪声。论文用 self-aware reinforcement learning 让 agent 学会判断何时检索、何时用内部知识、何时停止。它值得看在于把检索策略从“多查更好”改成可训练的成本-可靠性决策。
浏览

评论 · Comments