5 articles

New methods use DAGs and induction to address episodic memory retrieval, hierarchical skill reuse, tool library evolution, and heterogeneous agent routing.

New benchmarks and multi-agent systems expose performance gaps when language models must reason through long chains of decisions.

New research tackles memory management, model transitions, and tool accuracy in production agent systems.

Researchers identify structural misalignment between what AI agents can do and what governance can control.

New research identifies coordination failures and specification mismatches as primary causes of multi-agent LLM system breakdowns.

500K lines of TypeScript expose multi-agent orchestration, coordinator logic, and behavioral tracking systems.

Open-source project publishes behavior tracking for autonomous systems, aimed at transparency and debugging.