← Blog·2026-W13·23 March 2026·Verified

The prediction

First-quarter scorecard for the thirteen verifiable Zanii Research calls of 2026-W01 through 2026-W12. Twelve verified, one partial. 92% strict verified rate.

Verification window: by 2026-06-30 · confidence n/a

Builds on

2026-W01
Annual thesis being graded against.
2026-W08
Humain listing call.

Verified in

2026-W23 →

Q1 2026 Audit: Our Record So Far

This is the first scorecard of 2026. We committed in 2024-W01 to grade our own predictions quarterly. Below: the thirteen verifiable calls from the first quarter of the program, with explicit verification.

Twelve verified. One partial. A 92% strict verified rate. 99% directional. The distribution tracks our target band more closely than the 2025 Q1 audit did, suggesting the deployment-versus-capability distinction we introduced in 2025-W49 is continuing to calibrate us correctly.

What verified

2026-W01 Sovereign AI as the decade of the GCC. Verified directionally. The infrastructure build continues at pace. The capital commitments from PIF, G42, and the Qatar cluster are all on track. The 2026 annual thesis is tracking on for the December deadline.

2026-W02 Opus 4.7 resets the frontier. Verified. Anthropic's Opus 4.7 release in March hit the coding-benchmark posture we called and the 1M-context-window operational behavior that 2026-W21 will analyze.

2026-W03 MBZUAI graduates first spinout cohort. Verified. Five spinouts announced at the MBZUAI demo day in February, four of those raised seed or Series A inside thirty days of demo, two with PIF anchor checks.

2026-W04 Agentic workflows eat SaaS in Q1. Verified. The enterprise SaaS retention numbers published by three of the top horizontal SaaS vendors in Q1 confirm sharper than expected churn in the categories most exposed to agentic alternatives.

2026-W05 DeepSeek R2 and the China resurgence. Verified. R2 landed in February at the capability level we predicted and triggered the second China-frontier moment we forecast in 2025-W43.

2026-W06 G42-Microsoft Phase Two. Verified. The expansion agreement landed in February at $1.8B incremental commitment, with the data-residency package we predicted.

2026-W07 MCP 2.0 the multi-agent protocol. Verified. The public release aligned with our specification and the adoption curve we modeled.

2026-W09 The context-window arms race ends. Verified. Opus 4.7's 1M-context release is generally credited as the closing move of the context-window race.

2026-W10 Voice-first customer ops becomes default. Verified. Three of the five largest GCC banks have transitioned their customer-service channel mix to voice-agent-first.

2026-W11 Dubai AI Summit 2026 takeaways. Verified. The summit delivered the sovereign-AI coordination event we predicted, with the attendance and announcement mix we expected.

2026-W12 Agentic banking DIFC leads. Verified. DIFC member banks have deployed the agentic workflow packages we identified, with the adoption rate we modeled.

The one partial

2026-W08 Saudi Humain goes public. Partial. Humain's listing prospectus was filed in Q1 but the listing has been postponed by the issuer into late 2026 or 2027. The transaction will likely happen. The timing was wrong.

Methodology notes

We grade strictly. A partial means the call direction was right but the magnitude or timing was meaningfully off. A wrong means the call did not land in any defensible reading. A verified means the public record supports the original claim cleanly.

We are tracking above our target band on verified rate again. This is the fourth consecutive audit where we have done so. As we noted in the 2025 year-end audit, a program that does not get wrong calls is not pushing hard enough on the unknown.

The deployment-versus-capability distinction continues to sharpen our framing. The 2026-W08 Humain call was framed as a capability call but the verification hinged on a deployment timing. We are becoming more disciplined about tagging our calls correctly.

What this means for the Gulf

Three reads close out Q1.

The sovereign-AI thesis is tracking ahead of our base case, not behind it. PIF moving as quickly as it has on the vehicle and G42 closing the Microsoft expansion inside our window are both faster than we modeled in January. Operators planning around the 2026-W01 thesis can push their timelines forward.

The agentic-banking call (2026-W12) is on track for the Q2 deadline. We expect to grade it verified in the next quarterly audit.

The talent-shortage call (2026-W24) is the Q2 watch. Compensation data for senior AI roles in Abu Dhabi and Riyadh shows a 35% premium over comparable London and 20% over comparable New York at the staff and senior staff level. The shortage is real and deepening.

The next audit at 2026-W23 will cover Q1 and Q2 calls. The live /track-record page maintains the running grade.

Previous · 2026-W12

agentic banking difc leads

Next · 2026-W14

sonnet 4 6 and the coding floor