Small AI Models Match GPT-5 on Routine Agent Tasks, Study Finds
New AgentFloor benchmark finds small open-weight AI models can match GPT-5 on routine agent tasks — suggesting enterprises could slash costs by routing most calls to smaller models.