Cost Estimates — Phase 1¶

Topline Spend Buckets¶

Below are the major spend buckets we've estimated for Phase 1. This is intentionally “at-a-glance” so an investor can see where burn is going before diving into assumptions.

Monthly run-rate (Early Phase 1)¶

Founders (base): $33,333/mo
Founders (burden line item, 10%): $3,333/mo
Core infra: $1,000/mo
Support vendors: $500/mo
Trust & safety vendors: $200/mo
Paid ads (Reddit + LinkedIn): $2,500/mo
Reddit (candidates): $2,000/mo
LinkedIn (employers, Lead Gen + retargeting): $500/mo
Variable compute (LLM inference + embeddings): ~$742/mo (LLM dominates; embeddings negligible)

Early Phase 1 total monthly burn (run-rate): ~$41,608/mo

Monthly run-rate (Late Phase 1 — ops contractor for 6 months)¶

Everything above, plus:
Ops support contractor (annual-equivalent $70k / 12): $5,833/mo (for 6 months)
Ops support contractor burden cushion (10%): $583/mo (for 6 months)

Late Phase 1 total monthly burn (run-rate): ~$48,024/mo

One-time Phase 1 costs: 100,000¶

Design contractors: $50,000
Legal (conservative): $30,000
Grab bag (one-time + unknown unknowns): $20,000

TL;DR¶

Phase 1 Green is a people-burn story. Fixed monthly burn dwarfs token and ad spends
LLM inference is ~\$742/mo and embeddings are ~\$0.22/mo (effectively zero).
Adding always-on paid:
- Reddit: $2,000/mo aimed at candidate inventory
- LinkedIn: $500/mo aimed at employer lead gen (Lead Gen Forms + retargeting)
Estimated monthly burn:
- Early Phase 1: ~$41.6k/mo
- Late Phase 1 (6 months with ops contractor): ~$48.0k/mo
One-time costs modeled: $100k (design + legal + unknown-unknowns).

Scope & Ground Rules¶

Snapshot¶

Snapshot: Phase 1 Green

What this estimate includes¶

Founders, contractors, core infra, vendors, trust & safety vendors, legal, contingency (“unknown unknowns”)
Paid acquisition budgets (Reddit + LinkedIn)
Token-driven LLM inference cost (mainline + follow-up loop)
Embedding generation API cost (facet summaries → embeddings)

What this estimate explicitly excludes¶

Qdrant hosting/storage (assumed negligible vs tokens at Phase 1)
WorkOS SSO (pass-through per connection, rolled into B2B deals)
Benefits/healthcare (explicitly not included; founders paid directly; ops hire contractor-based)

Phase 1 Green Usage Model (the driver for variable compute)¶

Agent counts¶

Candidate agents: 3,000
Company/role agents: 600
Total agents: 3,600

Traffic & engagement assumptions (Phase 1 Green baseline)¶

Weekly non-owner visitors per agent: 3
Engagement rate (visitors who submit ≥1 prompt): 25%
Prompts per engaged visitor: 3

Derived (weekly): - Non-owner visits/week: 3,600 × 3 = 10,800 - Engaged visitors/week: 10,800 × 0.25 = 2,700 - Prompts/week: 2,700 × 3 = 8,100 - Avg prompts per visit: 8,100 / 10,800 = 0.75 - Prompts per agent/week: 8,100 / 3,600 = 2.25

LLM Inference: Call Structure & Follow-up Loop¶

Mainline call structure¶

Assume: 3 LLM calls per prompt on average
Example decomposition: synthesis + PII/safety + claim/update (or similar)

Follow-up loop (separate + additive)¶

Follow-up trigger rate: 25% of prompts
Follow-up depth: 1.5 questions per trigger (questions ≠ calls)
Implementation: 1 LLM call per trigger (generates the set of follow-up questions)

Derived (weekly): - Triggers/week: 8,100 × 0.25 = 2,025 - Follow-up questions/week: 2,025 × 1.5 = 3,038 (questions) - Follow-up calls/week: 2,025 (calls)

Token Budget Assumptions¶

Per call budget (starting baseline)¶

Input tokens/call: 1,000
Output tokens/call: 400
Total tokens/call: 1,400

Weekly call totals¶

Mainline calls/week: 8,100 prompts × 3 calls/prompt = 24,300
Follow-up calls/week: 2,025
Total calls/week: 24,300 + 2,025 = 26,325

Weekly token totals¶

Mainline tokens/week: 24,300 × 1,400 = 34,020,000
Follow-up tokens/week: 2,025 × 1,400 = 2,835,000
All-in tokens/week: 36,855,000

Breakdown: - Input tokens/week: 26,325 calls × 1,000 = 26,325,000 - Output tokens/week: 26,325 calls × 400 = 10,530,000

Monthly token totals¶

Using 52/12 ≈ 4.333 weeks/month: - Input tokens/month: 26.325M × 4.333 ≈ 114.08M - Output tokens/month: 10.53M × 4.333 ≈ 45.63M

Tokens per asker prompt (useful for unit economics)¶

Follow-up calls per prompt = 2,025 / 8,100 = 0.25
Avg calls per prompt (including follow-up loop) = 3 + 0.25 = 3.25

So: - Input tokens per prompt: 3.25 × 1,000 = 3,250 - Output tokens per prompt: 3.25 × 400 = 1,300

Variable Cost: LLM Inference (GPT-4o)¶

Pricing used (assumption for estimating)¶

GPT-4o: $2.50 / 1M input tokens, $10.00 / 1M output tokens

Monthly inference cost (at Green)¶

Input: 114.08 × $2.50 = $285.20 / month
Output: 45.63 × $10.00 = $456.30 / month
Total inference cost: $741.50 / month (≈ $742/month)

Your embedding approach (summary)¶

Embed summaries derived from capability claims (not raw text)
Maintain separate semantic facets:
skills, experience, landscape, intent, mobility
Claims grouped by facet → summarized together → summary embedded and stored in Qdrant
Qdrant storage spend excluded from this model

Embedding parameters you set¶

Embedding dimensions: 3,072
Facets: 5
Avg facet summary length: 200 tokens
Facet refreshes per resolved trigger: 1.2
Resolution rate: 80% of triggers resolve
Embedding cost assumption: $0.13 / 1M tokens (for “large” embeddings)

One-time initial embedding cost¶

Vectors to seed: - 3,600 agents × 5 facets = 18,000 facet vectors

Tokens to embed once: - Tokens per agent (all facets) = 5 × 200 = 1,000 - Total initial tokens = 3,600 × 1,000 = 3.6M tokens

Cost: - 3.6 × $0.13 = $0.47 one-time

Ongoing monthly re-embedding cost¶

Weekly resolved triggers: - Resolved triggers/week = 2,025 × 0.80 = 1,620

Facet updates/week: - Facet refreshes/week = 1,620 × 1.2 = 1,944

Embedding tokens/week: - 1,944 × 200 = 388,800 tokens/week

Embedding tokens/month: - 388,800 × 4.333 ≈ 1.685M tokens/month

Cost/month: - 1.685 × $0.13 = $0.219/month (≈ $0.22/month)

Conclusion: Embeddings are functionally negligible at Phase 1 Green scale.

Paid Acquisition (Ads): Reddit + LinkedIn¶

Guiding strategy (channel split)¶

Reddit: primary channel for candidate inventory generation (always-on, prospecting-heavy; retargeting if/when useful)
LinkedIn: primary channel for employer pipeline capture (retargeting + Lead Gen Forms; not cold prospecting)

Reddit (candidates)¶

Objective: candidate profile starts (agent_created)
Budget: $2,000/month (always-on)
Approach: subreddit-targeted prospecting; add retargeting later if it improves quality/CPA

LinkedIn (employers)¶

Primary objective: employer lead capture (Lead Gen Form + retargeting), not cold clicks
Budget: $500/month (always-on)
Lead definition: Work email + company + role + “hiring in next 12 months” (checkbox)
Target: 5 employer leads/month (implied working CPL target: ~$100/lead)

Notes on goals vs budgets¶

The LinkedIn budget is intentionally sized to the small goal (5/mo) and to avoid “LinkedIn becomes the whole burn.”
Reddit remains the main spend because the candidate funnel is volume-driven and you want iteration without LinkedIn CPC economics.

Fixed Costs (People + Operating)¶

Payroll / compensation posture¶

Founders paid directly (no benefits)
Ops support is contractor-based (modeled as annual-equivalent + cushion)
No benefits
Accounting for employment “burden” will occur in Phase 2, when people spend necessitates full payroll

Cost inputs¶

Founders: 2 × $200k base salary/year
Design contractors: $50k lump sum (Phase 1)
Core infra: $1,000/month
Support vendors: $500/month
Trust & safety vendors: $200/month (founder-heavy; inexpensive vendors)
Legal: $30k lump sum (explicitly conservative)
Grab bag: $20k lump sum (“one-time + unknown unknowns”)
Ops support: contractor equivalent of $70k annual for 6 months
WorkOS SSO: pass-through per connection (excluded from Phase 1 model)

Fixed cost table (fixed vs timing)¶

Cost area	Fixed/Variable	Timing	Assumption	Monthly run-rate	One-time
Founders (base)	Fixed	Phase 1	2 × $200k base	$33,333	—
Founders (burden)	Fixed	Phase 1	10% overhead (no benefits)	$3,333	—
Design contractors	Fixed	Phase 1	Lump sum	—	$50,000
Core infra	Fixed	Phase 1	Self-managed	$1,000	—
Support vendors	Fixed	Phase 1	Workspace/QBO/subscriptions	$500	—
Trust & safety vendors	Fixed	Phase 1	Cheap vendor stack	$200	—
Paid ads (Reddit + LinkedIn)	Fixed	Phase 1	Always-on budgets	$2,500	—
Legal (conservative)	Fixed	Phase 1	Mostly boilerplate + cleanup	—	$30,000
Grab bag	Fixed	Phase 1	One-time + unknown unknowns	—	$20,000
Ops support (contractor) base	Fixed	Late Phase 1 (6 mo)	$70k annual-equivalent / 12	$5,833*	—
Ops support cushion	Fixed	Late Phase 1 (6 mo)	+10% cushion	$583*	—

* Ops contractor line applies only during the 6 months you include the ops role.

Fixed monthly burn¶

Early Phase 1 (no ops contractor yet):
$33,333 + $3,333 + $1,000 + $500 + $200 + $2,500 = $40,866/month
Late Phase 1 (with ops contractor for 6 months):
$40,866 + $5,833 + $583 = $47,282/month

Fixed one-time costs (Phase 1)¶

Design $50,000 + Legal $30,000 + Grab bag $20,000 = $100,000

Combined Burn: Fixed + Variable (Phase 1 Green)¶

Variable monthly (usage-driven)¶

LLM inference: ~$742/month
Embedding generation: ~$0.22/month (negligible)

Total variable compute: ~$742/month

Total monthly burn (run-rate)¶

Early Phase 1: $40,866 fixed + $742 variable ≈ $41,608/month
Late Phase 1 (6 months w/ ops contractor): $47,282 fixed + $742 variable ≈ $48,024/month

One-time Phase 1 costs¶

$100,000 (design + legal + unknown-unknowns)

Unit Economics (Investors Will Ask)¶

Compute cost per asker prompt¶

Per prompt: - Input tokens ≈ 3,250 → 0.00325M × $2.50 = $0.00813 - Output tokens ≈ 1,300 → 0.00130M × $10.00 = $0.01300 - Total ≈ $0.0211 per prompt (~2.1¢)

Compute cost per engaged session¶

Prompts per engaged visitor = 3
Cost ≈ 3 × $0.0211 = $0.0633 (~6.3¢ per engaged session)

Compute cost per non-owner visit¶

Avg prompts per visit = 0.75
Cost ≈ 0.75 × $0.0211 = $0.0158 (~1.6¢ per visit)

Compute cost per agent-week¶

Prompts per agent-week = 2.25
Cost ≈ 2.25 × $0.0211 = $0.0475 (~4.8¢ per agent-week)

Paid acquisition unit metrics (reporting targets)¶

These are planning targets for reporting and iteration; they are not “guarantees.”

LinkedIn employer CPL target: ~$100/lead
Lead = work email + company + role + hiring-in-12-months checkbox
Goal = 5 leads/mo on $500/mo budget
Reddit candidate CPA: use actuals and iterate
Optimize to agent_created but report quality-adjusted conversion (e.g., agent_activated) as you instrument it

Notes / Interpretation¶

Why compute is cheap at Green (in this model)¶

Prompt volume is meaningful, but token budgets and call counts are contained.
Even with the follow-up loop modeled as additive, inference remains under $1k/month.

Sensitivity knobs (what can blow up)¶

If any of these move materially, compute changes quickly: - Token budget per call (context bloat, long transcripts, verbose responses) - Calls per prompt (extra safety passes, reranking, multi-step synthesis) - Prompt depth per engaged visitor - Follow-up trigger rate / additional loops

Legal estimate¶

Kept as a conservative lump sum ($30k). Expectation is cheaper due to boilerplate; this is intentionally padded.

Trust & safety¶

Modeled as $200/month for inexpensive vendors + founder effort (explicitly not staffing-heavy yet).

Ops support¶

Pushed to late Phase 1, contractor-based, modeled for 6 months only.

Quick Checklist (to be updated later with real data)¶

These will dial in as real telemetry comes in: - Actual prompts per engaged visitor - Actual token usage per call (measured from logs) - Actual calls per prompt (instrument each model invocation) - Actual follow-up trigger rate and resolution rate - Actual facet summary length (tokens) and facet refreshes per resolution - Actual CPL on LinkedIn for employer leads (and lead quality) - Actual Reddit CPA for candidate starts, plus activation/completion rates