2 articles
New theory reframes agent execution as continuous dialogue where users can intervene mid-task, not transaction completed in isolation.
Internal testing reveals the model introduces novel attack surfaces and defense evasion capabilities.