Recruiter Fit-Claim Scorecard for SaaS Heads of Talent

graduated [TRIANGULATED] filter 9.5/15 spread ±0.5 signals: 3 independent

What is this?

A structured ledger that lets an in-house head of talent record the pre-interview 'fit claims' external search firms make about each candidate ('9/10 culture match', 'top of my pipeline', 'definitely worth a final round'), then grade those claims against eventual interview-panel scorecards, offer-acceptance, and 90-day retention pulled from the company's existing ATS. AE's adversarial debate stack scores claim language pre-interview for falsifiability vs. hedge-laden puffery; over weeks, per-firm calibration curves emerge — Firm A's 9/10s convert to offers 71% of the time; Firm B's convert 18%. The head of talent uses this to renegotiate retainers, drop low-calibration firms, and defend hiring-velocity numbers to the CEO with adjudicated evidence rather than vibes. AE is uniquely suited because the same grading-loop + lifecycle-state machinery that produced 508 graded predictions ports directly onto recruiter-claim resolution, and the manual-entry path means zero ATS integration is required for first paid engagement.

Why did we consider it?

Recruiter fit-claims are unrecorded predictions with objective ATS-resolved outcomes — exactly AE's grading-loop shape, sold to a single introverted buyer with no integration friction.

What breaks?

Temporal mismatch: AE's sub-24h feedback loop is entirely wasted on 4-5 month hiring and retention cycles.
Workflow friction: 'Zero ATS integration' means forcing overworked Heads of Talent to manually copy-paste email puffery into a separate ledger.
Subjective attribution failure: AE's objective grading cannot cleanly resolve vague 'culture fit' claims against complex, multi-variable interview rejections.

What did we learn?

Engine verdict: GATHER_MORE_SIGNAL (WORTH_SKIMMING). Real pain, open quadrant, but taxonomy port + buyer politics + async-solo distribution remain unverified — premature to build until weekend transferability test runs.

Filter scores

Five axes, each scored 0-3. Three independent runs by different model perspectives. Median shown.

Axis	What it measures
data moat	Does this product accumulate proprietary data that compounds?
10x model test	Does a better model make this more valuable, or redundant?
fast feedback loops	Can outputs be graded against reality in <30 days?
solo founder feasible	Can a solo operator build and run this without a team?
AI providers cant eat it	Do hyperscalers have structural reasons NOT to build this?

Composite median: 9.5 / 15. Graduation threshold: 9.0. IQR across runs: 0.5.

Evidence

Signal A — Primary source

https://arxiv.org/html/2309.13933v2 credibility: medium

Algorithmic hiring comprises algorithms, tools, and systems to automate or assist HR decisions on candidate recruitment and evaluation.

Signal B — Competitor with documented gap

https://rallyrecruitmentmarketing.com/learn/recruiter-effectiveness-scorecard/

Evaluates internal recruiting team performance with activity-level metrics but does not capture specific pre-interview fit-claims from external search firms, grade those claims against downstream interview scores, offer-acceptance, or 90-day retention, or produce per-firm calibration curves that distinguish high-signal agencies from low-signal ones.

Signal D — Demand proxy

{"found":true,"summary":"Multiple forum and social discussions show talent leaders frustrated with recruiter quality signals: a Reddit hiring manager notes interview scorecards full of blanks, a LinkedIn post argues recruiter success should be measured beyond time-to-hire, another LinkedIn post flags 'looks great on paper' hires who fail in EMEA SaaS, and a Hacker News thread shows builder interest in job-application tracking tools — all indicating latent demand for structured accountability in recruiting claims.","sources":["https://www.reddit.com/r/jobhunting/comments/1rxrf8e/hiring_manager_…

Evaluation history

When	Stage	Phase
2026-05-14 07:15	deep_council_verdict	graduated
2026-05-14 07:13	deep_claude_take	graduated
2026-05-14 07:12	deep_90day_plan	graduated
2026-05-14 07:10	deep_risk	graduated
2026-05-14 07:09	deep_distribution	graduated
2026-05-14 07:08	deep_pricing	graduated
2026-05-14 07:06	deep_moat	graduated
2026-05-14 07:05	deep_buyer_sim	graduated
2026-05-14 07:04	deep_icp	graduated
2026-05-14 07:02	deep_competitor	graduated
2026-05-14 07:00	deep_market_reality	graduated
2026-05-14 06:54	filter_score	scored
2026-05-14 06:48	filter_score	scored
2026-05-14 06:42	filter_score	scored
2026-05-14 06:37	evidence_search	argument
2026-05-14 06:24	audience_simulation	argument
2026-05-14 06:18	red_team_kill	argument
2026-05-14 06:12	steelman	argument
2026-05-14 06:07	genesis	argument