Search-capable AI agents may cheat on benchmark tests

2025-08-23 at 18:49

By Thomas Claburn

Data contamination can make models seem more capable than they really are

Researchers with Scale AI have found that search-based AI models may cheat on benchmark tests by fetching the answers directly from online sources rather than deriving those answers through a “reasoning” process.…