9 articles

New benchmarks and frameworks measure how agentic systems handle constrained evidence, multimodal perception, and complex sequential reasoning.

ChatGPT's new agents plug directly into Slack, Salesforce and other business tools.

New research identifies coordination failures and specification mismatches as primary causes of multi-agent LLM system breakdowns.

Two major executives depart as OpenAI kills Sora and consolidates science efforts into core products.

New model-native harness and security features let enterprises build safer, longer-running autonomous agents.

New guides cover responsible AI use, writing, research, operations, and customer success workflows.

Slackbot gets its biggest update since the Salesforce acquisition with new AI-powered capabilities.

The industry is moving beyond either-or thinking. Diverse AI architectures will power every company, every country, and every app.

New standardized methodology addresses critical gap in measuring autonomous voice system performance across enterprise deployments.