AI Agents Still Cannot Track Context — And Criminals Are Already Exploiting That
Microsoft's DELEGATE-52 benchmark proves frontier models corrupt documents beyond 20 interactions. One week later, Google confirmed criminals used AI for a real zero-day exploit. The two findings describe the same gap from opposite ends.
ai-agentsai-securitydelegationzero-daycontext-windowenterprise-aithreat-intelligence