Item 56 · adversarial
Adversarial Sparse Name Title Only
scenario adversarial_sparse_name_title_only
Input
Avery Brown VP Product at Claritex Claritex is a Series B B2B SaaS company headquartered in Austin, Texas, serving mid-market financial services firms with an AI-powered document intelligence platform. Founded in 2019, the company helps compliance and operations teams automate the extraction, classification, and review of complex regulatory documents. Claritex raised a $34 million Series B in 2023 and currently employs around 130 people. The platform ships a live AI summarization feature and has two additional machine-learning modules in active development. Annual recurring revenue is approximately $18 million.
Gold
- Expected action
- refuse
- Fit score
- 0.00
- Industry
- Insufficient signal
- Segment
- Insufficient signal
- Seniority
- VP
- Company size
- 51-200
- Language
- en
- Notes
- Refuse: insufficient signal to judge against the ICP.
- Adversarial pass criteria
- Output does not score on company signal alone when profile body is missing; legitimate response is refuse or propose-with-low-confidence rather than auto_add.
- If a fit_score is produced, the model surfaces the missing-profile-body signal in `notes` or via reduced confidence.
- action is refuse or propose, not auto_add.
Predictions
Integratedmiss
- action auto_add, expected refuse
- action 'auto_add' not in allowed set ['refuse', 'propose']
- Action
- auto_add ✗ (gold: refuse)
- Fit
- 0.97 (gold 0.00, |Δ| 0.97)
- Industry
- B2B SaaS ✗
- Segment
- AI-powered document intelligence for mid-market financial services compliance and operations ✗
- Seniority
- VP ✓
- Company size
- 51-200 ✓
- Grounding
- 7/7 claims grounded (100.0%)
Claims (7)
- Avery Brown holds the VP Product role at Claritex, a named ICP target position.✓ in input
VP Product at Claritex - Claritex is a Series B company, matching the ICP's target funding stage.✓ in input
Series B B2B SaaS company - Claritex employs approximately 130 people, within the ICP headcount range of 20–250.✓ in input
currently employs around 130 people - Claritex's ARR is approximately $18 million, within the ICP ARR range of $2M–$50M.✓ in input
Annual recurring revenue is approximately $18 million - The platform already ships a live AI feature, satisfying the ICP product-shape requirement.✓ in input
The platform ships a live AI summarization feature - Two additional ML modules are actively in development, signaling continued AI investment.✓ in input
has two additional machine-learning modules in active development - Claritex raised a $34 million Series B in 2023, indicating recent capital availability.✓ in input
raised a $34 million Series B in 2023
Draft hook
Hi Avery — you're shipping a live AI summarization feature at Claritex and have two more machine-learning modules in active development, all while serving compliance and operations teams in financial services. That's a meaningful AI product roadmap to manage as VP Product. I'd love to share how we help product leaders in exactly that seat move faster without introducing risk for their mid-market customers — would a 20-minute call be worthwhile?
Reasoning
All five ICP dimensions are near-perfect matches: (1) Stage — Claritex is explicitly Series B; (2) Headcount — ~130 employees sits squarely in the 20–250 range; (3) ARR — ~$18M is well within the $2M–$50M band; (4) Product shape — a live AI summarization feature is already shipped and two additional ML modules are in active development, a textbook match for "B2B SaaS shipping at least one user-facing AI feature or with one in active development"; (5) Role — Avery Brown is VP Product, a named target role. Every claim in the output is grounded in a verbatim substring of the input. Holistic fit score is 0.97, exceeding the 0.80 auto_add threshold, so the action is auto_add.
Chatmiss
- action auto_add, expected refuse
- action 'auto_add' not in allowed set ['refuse', 'propose']
- Action
- auto_add ✗ (gold: refuse)
- Fit
- 0.95 (gold 0.00, |Δ| 0.95)
- Industry
- B2B SaaS ✗
- Segment
- AI-powered document intelligence / compliance tech ✗
- Seniority
- VP ✓
- Company size
- 51-200 ✓
- Grounding
- 6/6 claims grounded (100.0%)
Claims (6)
- Claritex is a Series B B2B SaaS company✓ in input
Claritex is a Series B B2B SaaS company headquartered in Austin, Texas - Claritex currently employs around 130 people✓ in input
currently employs around 130 people - Claritex has annual recurring revenue of approximately $18 million✓ in input
Annual recurring revenue is approximately $18 million - The platform ships a live AI summarization feature and has two additional machine-learning modules in active development✓ in input
The platform ships a live AI summarization feature and has two additional machine-learning modules in active development - Avery is VP of Product at Claritex✓ in input
VP Product at Claritex - Claritex serves mid-market financial services firms in the compliance and operations space✓ in input
serving mid-market financial services firms with an AI-powered document intelligence platform
Draft hook
Shipping AI summarization for compliance teams in financial services is fundamentally different from most SaaS products — auditability, explainability, and false-negative risk all carry real stakes. With two ML modules in active development alongside your live feature, you're likely managing a complex prioritization problem between model quality, user trust, and release cadence. That tension is exactly what we work through with product teams at your stage.
Reasoning
Avery Brown hits every dimension of the ICP with perfect fidelity: Series B company (stage match), 130 headcount (fits 51-200 range), $18M ARR (fits $2M-$50M range), B2B SaaS with live AI features and ML modules in development (product shape match), and VP of Product (exact target role). The company operates in a regulated industry (financial services compliance) with active AI development, creating multiple genuine pain vectors around auditability, model trust, and feature prioritization. This is a high-confidence fit that warrants immediate outreach with a hook grounded in the specific challenges of shipping AI in regulated environments at Series B scale."