Vendor QBR Promise Ledger for Multi-Vendor IT Operations & Vendor Management Leads
ranked [TRIANGULATED] filter 8.0/15 spread ±2.5 signals: 2 independent
What is this?
In-house vendor management / IT operations leads at 200-2000 person companies oversee 5-15 outsourced operations vendors (MSPs, managed SOCs, managed cloud, outsourced helpdesk, BPO, RPO). Each vendor delivers monthly or quarterly business reviews (QBRs) dense with narrative-heavy, hedged promises: 'P1 MTTR will drop 35% once the shift-left model lands in Q2', 'SOC alert fidelity will improve 40% after tuning, assuming log volume stabilises'.
The buyer enters each commitment into the ledger before accepting the QBR. AE's structured constraint language plus adversarial multi-model debate force vague hedges into testable specifics (metric + target + deadline + stated condition). 4-8 weeks later, the buyer pastes in actuals from ServiceNow, Jira, or another ticketing system. AE grades each prior commitment hit/miss/partial, and the 6-pattern autopsy categorises the miss narrative: Concession Laundering when 'learnings' replace numbers, Epistemological Shielding when blame shifts to 'scope creep', Cosmetic Confidence on next-quarter restatements.
After 2-3 cycles the buyer has per-vendor calibration scores that are defensible at contract renewal across a £500K-£5M annual outsourced-ops portfolio.
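A ledger entry and its later grading could be sketched roughly as follows. This is a minimal illustration, not AE's actual schema or grading logic: the `Commitment` class, the `grade` function, the 50% cut-off for 'partial', and the 10-hour baseline are all hypothetical. Only the four fields (metric, target, deadline, stated condition) and the hit/miss/partial outcomes come from the description above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Commitment:
    """One vendor promise, forced into a testable shape (hypothetical schema)."""
    vendor: str
    metric: str                # e.g. "P1 MTTR (hours)"
    baseline: float            # value at the time of the QBR
    target_change_pct: float   # promised change; negative means a reduction
    deadline: str              # e.g. "Q2"
    condition: Optional[str] = None  # the hedge the vendor attached, if any


def grade(c: Commitment, actual: float, partial_fraction: float = 0.5) -> str:
    """Grade a commitment against a pasted-in actual.

    'hit' if the full promised change was delivered, 'partial' if at least
    partial_fraction of it was (an assumed cut-off), otherwise 'miss'.
    """
    promised = c.baseline * c.target_change_pct / 100.0
    if promised == 0:
        raise ValueError("commitment promises no measurable change")
    progress = (actual - c.baseline) / promised
    if progress >= 1.0:
        return "hit"
    if progress >= partial_fraction:
        return "partial"
    return "miss"


# Hypothetical example based on the quoted promise "P1 MTTR will drop 35%".
mttr = Commitment(
    vendor="Example MSP",
    metric="P1 MTTR (hours)",
    baseline=10.0,
    target_change_pct=-35.0,
    deadline="Q2",
    condition="shift-left model lands",
)
print(grade(mttr, actual=6.0))   # full 35% drop delivered
print(grade(mttr, actual=8.0))   # roughly half of the promised drop
print(grade(mttr, actual=9.5))   # almost no movement
```

Storing the stated condition next to the numbers means a later miss can be checked against the hedge the vendor actually recorded, rather than one introduced after the fact.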
Why did we consider it?
AE weaponises its prediction-grading and autopsy stack against vendor QBR hedging — a real, budgeted, under-tooled pain point — and the unit economics map cleanly onto a UK solo commander's £100-300K target.
What breaks?
- Breaks AE's <24h fast feedback loop by relying on 4-12 week Quarterly Business Review cycles.
- Manual data entry ('pasting actuals') cannot compete with native, automated ServiceNow/Jira SLA tracking.
- Enterprise procurement and infosec reviews for vendor management tools will crush a part-time solo founder.
What did we learn?
Still in evaluation (phase: ranked). No verdict yet.
Filter scores
Five axes, each scored 0-3. Three independent runs by different model perspectives. Median shown.
| Axis | What it measures |
|---|---|
| data moat | Does this product accumulate proprietary data that compounds? |
| 10x model test | Does a better model make this more valuable, or redundant? |
| fast feedback loops | Can outputs be graded against reality in <30 days? |
| solo founder feasible | Can a solo operator build and run this without a team? |
| AI providers can't eat it | Do hyperscalers have structural reasons NOT to build this? |
Composite median: 8.0 / 15. Graduation threshold: 9.0. IQR across runs: 2.5.
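One way such a composite could be computed is sketched below. The aggregation rule is an assumption: each run's composite is taken as the sum of its five axis scores, the median and IQR are taken across the three run totals, and graduation is read as "median at or above the threshold". The per-axis scores are hypothetical, chosen only so the totals reproduce the figures reported here. The actual per-run scores do not appear on this page.

```python
from statistics import median, quantiles

AXES = (
    "data moat",
    "10x model test",
    "fast feedback loops",
    "solo founder feasible",
    "AI providers can't eat it",
)
GRADUATION_THRESHOLD = 9.0  # from the page; the ">=" comparison is an assumption


def composite(run: dict) -> int:
    """Sum one run's five axis scores, checking each is within 0-3."""
    for axis in AXES:
        if not 0 <= run[axis] <= 3:
            raise ValueError(f"{axis} score out of range: {run[axis]}")
    return sum(run[axis] for axis in AXES)


def summarise(runs: list) -> dict:
    """Median and interquartile range of composite scores across runs."""
    totals = [composite(r) for r in runs]
    q1, _, q3 = quantiles(totals, n=4, method="inclusive")
    med = median(totals)
    return {
        "median": float(med),
        "iqr": q3 - q1,
        "graduates": med >= GRADUATION_THRESHOLD,
    }


# Hypothetical scores for three runs (totals 6, 8 and 11).
runs = [
    dict(zip(AXES, (1, 1, 1, 1, 2))),
    dict(zip(AXES, (2, 2, 1, 1, 2))),
    dict(zip(AXES, (3, 2, 2, 2, 2))),
]
print(summarise(runs))
```

With only three runs the IQR is sensitive to the quantile method; the `inclusive` method used here interpolates linearly between the observed totals.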
Evidence
Signal B — Competitor with documented gap
OneIO automates multi-vendor service integration and replaces manual coordination, but the snippet describes operational orchestration between vendors — not extraction, structuring, or grading of specific QBR commitments against ticketing actuals. No promise ledger, no adversarial hedge-to-testable-spec conversion, no miss-pattern taxonomy.
Signal D — Demand proxy
Found: yes.
Summary: Multiple articles confirm pain around multi-vendor cost opacity, QBR effectiveness concerns, and procurement-reality gaps, indicating demand for structured vendor accountability tooling in the 200-2000 employee segment.
Sources:
- https://www.netfor.com/resource-center/blog/it-vendor-management/
- https://www.netsuite.com/portal/resource/articles/accounting/quarterly-annual-business-reviews.shtml
- https://www.linkedin.com/posts/joelcollindemers_procurement-needs-to-stop-blaming-their-stakeholders-activity-7407041888169381890-IXHK
Reason: [3] Netfor discusses hi…
Evaluation history
| When | Stage | Phase |
|---|---|---|
| 2026-05-17 01:12 | evidence_search | ranked |
| 2026-05-17 00:54 | evidence_search | ranked |
| 2026-05-17 00:36 | evidence_search | ranked |
| 2026-05-16 01:48 | evidence_search | ranked |
| 2026-05-14 21:48 | evidence_search | ranked |
| 2026-05-14 21:25 | evidence_search | ranked |
| 2026-05-14 21:07 | evidence_search | ranked |
| 2026-05-14 20:49 | evidence_search | ranked |
| 2026-05-14 20:18 | evidence_search | ranked |
| 2026-05-14 19:48 | evidence_search | ranked |
| 2026-05-14 19:25 | evidence_search | ranked |
| 2026-05-14 17:36 | evidence_search | ranked |
| 2026-05-14 17:13 | evidence_search | ranked |
| 2026-05-14 16:54 | evidence_search | ranked |
| 2026-05-14 16:36 | evidence_search | ranked |
| 2026-05-14 16:18 | evidence_search | ranked |
| 2026-05-14 10:43 | evidence_search | ranked |
| 2026-05-14 10:37 | evidence_search | ranked |
| 2026-05-14 10:24 | evidence_search | ranked |
| 2026-05-14 10:19 | evidence_search | ranked |
| 2026-05-14 10:07 | evidence_search | ranked |
| 2026-05-14 09:54 | filter_score | scored |
| 2026-05-14 09:48 | filter_score | scored |
| 2026-05-14 09:42 | filter_score | scored |
| 2026-05-14 09:37 | evidence_search | argument |
| 2026-05-14 09:24 | audience_simulation | argument |
| 2026-05-14 09:18 | red_team_kill | argument |
| 2026-05-14 09:12 | steelman | argument |
| 2026-05-14 09:09 | genesis | argument |