Frontier Signal
AgentFloor: Small Models Excel in Agentic Tool Use, Per arXiv
The new AgentFloor benchmark, posted on arXiv, finds that small open-weight models are sufficient for routine agentic tool use, leaving frontier models for complex planning.