After 21 benchmarks across 3 weeks, the cumulative standings: SynapseKit 45, LangChain 31, LlamaIndex 26. SynapseKit dominates agent ergonomics (wins 4 of 6 Week-3 benchmarks). LangChain wins the most production-critical single benchmark: per-tool error handling. LlamaIndex's agent story is exposed as incomplete — strong in retrieval week, weak in agent week. Week 4 (production: async, graph workflows, evaluation, cost tracking, guardrails, MCP) begins next.
The ergonomics leader is clear. Whether it holds when Week 4 tests production concerns is the open question.