← all hypotheses

Recruiter Fit-Claim Scorecard for SaaS Heads of Talent

graduated [TRIANGULATED] filter 9.5/15 spread ±0.5 signals: 3 independent
What is this?
A structured ledger that lets an in-house head of talent record the pre-interview 'fit claims' external search firms make about each candidate ('9/10 culture match', 'top of my pipeline', 'definitely worth a final round'), then grade those claims against eventual interview-panel scorecards, offer-acceptance, and 90-day retention pulled from the company's existing ATS. AE's adversarial debate stack scores claim language pre-interview for falsifiability vs. hedge-laden puffery; over weeks, per-firm calibration curves emerge — Firm A's 9/10s convert to offers 71% of the time; Firm B's convert 18%. The head of talent uses this to renegotiate retainers, drop low-calibration firms, and defend hiring-velocity numbers to the CEO with adjudicated evidence rather than vibes. AE is uniquely suited because the same grading-loop + lifecycle-state machinery that produced 508 graded predictions ports directly onto recruiter-claim resolution, and the manual-entry path means zero ATS integration is required for first paid engagement.
Why did we consider it?
Recruiter fit-claims are unrecorded predictions with objective ATS-resolved outcomes — exactly AE's grading-loop shape, sold to a single introverted buyer with no integration friction.
What breaks?
  • Temporal mismatch: AE's sub-24h feedback loop is entirely wasted on 4-5 month hiring and retention cycles.
  • Workflow friction: 'Zero ATS integration' means forcing overworked Heads of Talent to manually copy-paste email puffery into a separate ledger.
  • Subjective attribution failure: AE's objective grading cannot cleanly resolve vague 'culture fit' claims against complex, multi-variable interview rejections.
What did we learn?
Engine verdict: GATHER_MORE_SIGNAL (WORTH_SKIMMING). Real pain, open quadrant, but taxonomy port + buyer politics + async-solo distribution remain unverified — premature to build until weekend transferability test runs.

Filter scores

Five axes, each scored 0-3. Three independent runs by different model perspectives. Median shown.

AxisWhat it measures
data moatDoes this product accumulate proprietary data that compounds?
10x model testDoes a better model make this more valuable, or redundant?
fast feedback loopsCan outputs be graded against reality in <30 days?
solo founder feasibleCan a solo operator build and run this without a team?
AI providers cant eat itDo hyperscalers have structural reasons NOT to build this?
Composite median: 9.5 / 15. Graduation threshold: 9.0. IQR across runs: 0.5.

Evidence

Signal A — Primary source

Algorithmic hiring comprises algorithms, tools, and systems to automate or assist HR decisions on candidate recruitment and evaluation.

Signal B — Competitor with documented gap

Evaluates internal recruiting team performance with activity-level metrics but does not capture specific pre-interview fit-claims from external search firms, grade those claims against downstream interview scores, offer-acceptance, or 90-day retention, or produce per-firm calibration curves that distinguish high-signal agencies from low-signal ones.

Signal D — Demand proxy

{"found":true,"summary":"Multiple forum and social discussions show talent leaders frustrated with recruiter quality signals: a Reddit hiring manager notes interview scorecards full of blanks, a LinkedIn post argues recruiter success should be measured beyond time-to-hire, another LinkedIn post flags 'looks great on paper' hires who fail in EMEA SaaS, and a Hacker News thread shows builder interest in job-application tracking tools — all indicating latent demand for structured accountability in recruiting claims.","sources":["https://www.reddit.com/r/jobhunting/comments/1rxrf8e/hiring_manager_…

Evaluation history

WhenStagePhase
2026-05-14 07:15deep_council_verdictgraduated
2026-05-14 07:13deep_claude_takegraduated
2026-05-14 07:12deep_90day_plangraduated
2026-05-14 07:10deep_riskgraduated
2026-05-14 07:09deep_distributiongraduated
2026-05-14 07:08deep_pricinggraduated
2026-05-14 07:06deep_moatgraduated
2026-05-14 07:05deep_buyer_simgraduated
2026-05-14 07:04deep_icpgraduated
2026-05-14 07:02deep_competitorgraduated
2026-05-14 07:00deep_market_realitygraduated
2026-05-14 06:54filter_scorescored
2026-05-14 06:48filter_scorescored
2026-05-14 06:42filter_scorescored
2026-05-14 06:37evidence_searchargument
2026-05-14 06:24audience_simulationargument
2026-05-14 06:18red_team_killargument
2026-05-14 06:12steelmanargument
2026-05-14 06:07genesisargument