Cost Estimates — Phase 1¶
Topline Spend Buckets¶
Below are the major spend buckets we've estimated for Phase 1. This is intentionally “at-a-glance” so an investor can see where burn is going before diving into assumptions.
Monthly run-rate (Early Phase 1)¶
- Founders (base): $33,333/mo
- Founders (burden line item, 10%): $3,333/mo
- Core infra: $1,000/mo
- Support vendors: $500/mo
- Trust & safety vendors: $200/mo
- Paid ads (Reddit + LinkedIn): $2,500/mo
- Reddit (candidates): $2,000/mo
- LinkedIn (employers, Lead Gen + retargeting): $500/mo
- Variable compute (LLM inference + embeddings): ~$742/mo (LLM dominates; embeddings negligible)
Early Phase 1 total monthly burn (run-rate): ~$41,608/mo
Monthly run-rate (Late Phase 1 — ops contractor for 6 months)¶
- Everything above, plus:
- Ops support contractor (annual-equivalent $70k / 12): $5,833/mo (for 6 months)
- Ops support contractor burden cushion (10%): $583/mo (for 6 months)
Late Phase 1 total monthly burn (run-rate): ~$48,024/mo
One-time Phase 1 costs: 100,000¶
- Design contractors: $50,000
- Legal (conservative): $30,000
- Grab bag (one-time + unknown unknowns): $20,000
TL;DR¶
- Phase 1 Green is a people-burn story. Fixed monthly burn dwarfs token and ad spends
- LLM inference is ~\$742/mo and embeddings are ~\$0.22/mo (effectively zero).
- Adding always-on paid:
- Reddit: $2,000/mo aimed at candidate inventory
- LinkedIn: $500/mo aimed at employer lead gen (Lead Gen Forms + retargeting)
- Estimated monthly burn:
- Early Phase 1: ~$41.6k/mo
- Late Phase 1 (6 months with ops contractor): ~$48.0k/mo
- One-time costs modeled: $100k (design + legal + unknown-unknowns).
Scope & Ground Rules¶
Snapshot¶
- Snapshot: Phase 1 Green
What this estimate includes¶
- Founders, contractors, core infra, vendors, trust & safety vendors, legal, contingency (“unknown unknowns”)
- Paid acquisition budgets (Reddit + LinkedIn)
- Token-driven LLM inference cost (mainline + follow-up loop)
- Embedding generation API cost (facet summaries → embeddings)
What this estimate explicitly excludes¶
- Qdrant hosting/storage (assumed negligible vs tokens at Phase 1)
- WorkOS SSO (pass-through per connection, rolled into B2B deals)
- Benefits/healthcare (explicitly not included; founders paid directly; ops hire contractor-based)
Phase 1 Green Usage Model (the driver for variable compute)¶
Agent counts¶
- Candidate agents: 3,000
- Company/role agents: 600
- Total agents: 3,600
Traffic & engagement assumptions (Phase 1 Green baseline)¶
- Weekly non-owner visitors per agent: 3
- Engagement rate (visitors who submit ≥1 prompt): 25%
- Prompts per engaged visitor: 3
Derived (weekly): - Non-owner visits/week: 3,600 × 3 = 10,800 - Engaged visitors/week: 10,800 × 0.25 = 2,700 - Prompts/week: 2,700 × 3 = 8,100 - Avg prompts per visit: 8,100 / 10,800 = 0.75 - Prompts per agent/week: 8,100 / 3,600 = 2.25
LLM Inference: Call Structure & Follow-up Loop¶
Mainline call structure¶
- Assume: 3 LLM calls per prompt on average
- Example decomposition: synthesis + PII/safety + claim/update (or similar)
Follow-up loop (separate + additive)¶
- Follow-up trigger rate: 25% of prompts
- Follow-up depth: 1.5 questions per trigger (questions ≠ calls)
- Implementation: 1 LLM call per trigger (generates the set of follow-up questions)
Derived (weekly): - Triggers/week: 8,100 × 0.25 = 2,025 - Follow-up questions/week: 2,025 × 1.5 = 3,038 (questions) - Follow-up calls/week: 2,025 (calls)
Token Budget Assumptions¶
Per call budget (starting baseline)¶
- Input tokens/call: 1,000
- Output tokens/call: 400
- Total tokens/call: 1,400
Weekly call totals¶
- Mainline calls/week: 8,100 prompts × 3 calls/prompt = 24,300
- Follow-up calls/week: 2,025
- Total calls/week: 24,300 + 2,025 = 26,325
Weekly token totals¶
- Mainline tokens/week: 24,300 × 1,400 = 34,020,000
- Follow-up tokens/week: 2,025 × 1,400 = 2,835,000
- All-in tokens/week: 36,855,000
Breakdown: - Input tokens/week: 26,325 calls × 1,000 = 26,325,000 - Output tokens/week: 26,325 calls × 400 = 10,530,000
Monthly token totals¶
Using 52/12 ≈ 4.333 weeks/month: - Input tokens/month: 26.325M × 4.333 ≈ 114.08M - Output tokens/month: 10.53M × 4.333 ≈ 45.63M
Tokens per asker prompt (useful for unit economics)¶
- Follow-up calls per prompt = 2,025 / 8,100 = 0.25
- Avg calls per prompt (including follow-up loop) = 3 + 0.25 = 3.25
So: - Input tokens per prompt: 3.25 × 1,000 = 3,250 - Output tokens per prompt: 3.25 × 400 = 1,300
Variable Cost: LLM Inference (GPT-4o)¶
Pricing used (assumption for estimating)¶
- GPT-4o: $2.50 / 1M input tokens, $10.00 / 1M output tokens
Monthly inference cost (at Green)¶
- Input: 114.08 × $2.50 = $285.20 / month
- Output: 45.63 × $10.00 = $456.30 / month
- Total inference cost: $741.50 / month (≈ $742/month)
Variable Cost: Embedding Generation (Facet Summaries → Embeddings)¶
Your embedding approach (summary)¶
- Embed summaries derived from capability claims (not raw text)
- Maintain separate semantic facets:
- skills, experience, landscape, intent, mobility
- Claims grouped by facet → summarized together → summary embedded and stored in Qdrant
- Qdrant storage spend excluded from this model
Embedding parameters you set¶
- Embedding dimensions: 3,072
- Facets: 5
- Avg facet summary length: 200 tokens
- Facet refreshes per resolved trigger: 1.2
- Resolution rate: 80% of triggers resolve
- Embedding cost assumption: $0.13 / 1M tokens (for “large” embeddings)
One-time initial embedding cost¶
Vectors to seed: - 3,600 agents × 5 facets = 18,000 facet vectors
Tokens to embed once: - Tokens per agent (all facets) = 5 × 200 = 1,000 - Total initial tokens = 3,600 × 1,000 = 3.6M tokens
Cost: - 3.6 × $0.13 = $0.47 one-time
Ongoing monthly re-embedding cost¶
Weekly resolved triggers: - Resolved triggers/week = 2,025 × 0.80 = 1,620
Facet updates/week: - Facet refreshes/week = 1,620 × 1.2 = 1,944
Embedding tokens/week: - 1,944 × 200 = 388,800 tokens/week
Embedding tokens/month: - 388,800 × 4.333 ≈ 1.685M tokens/month
Cost/month: - 1.685 × $0.13 = $0.219/month (≈ $0.22/month)
Conclusion: Embeddings are functionally negligible at Phase 1 Green scale.
Paid Acquisition (Ads): Reddit + LinkedIn¶
Guiding strategy (channel split)¶
- Reddit: primary channel for candidate inventory generation (always-on, prospecting-heavy; retargeting if/when useful)
- LinkedIn: primary channel for employer pipeline capture (retargeting + Lead Gen Forms; not cold prospecting)
Reddit (candidates)¶
- Objective: candidate profile starts (
agent_created) - Budget: $2,000/month (always-on)
- Approach: subreddit-targeted prospecting; add retargeting later if it improves quality/CPA
LinkedIn (employers)¶
- Primary objective: employer lead capture (Lead Gen Form + retargeting), not cold clicks
- Budget: $500/month (always-on)
- Lead definition: Work email + company + role + “hiring in next 12 months” (checkbox)
- Target: 5 employer leads/month (implied working CPL target: ~$100/lead)
Notes on goals vs budgets¶
- The LinkedIn budget is intentionally sized to the small goal (5/mo) and to avoid “LinkedIn becomes the whole burn.”
- Reddit remains the main spend because the candidate funnel is volume-driven and you want iteration without LinkedIn CPC economics.
Fixed Costs (People + Operating)¶
Payroll / compensation posture¶
- Founders paid directly (no benefits)
- Ops support is contractor-based (modeled as annual-equivalent + cushion)
- No benefits
- Accounting for employment “burden” will occur in Phase 2, when people spend necessitates full payroll
Cost inputs¶
- Founders: 2 × $200k base salary/year
- Design contractors: $50k lump sum (Phase 1)
- Core infra: $1,000/month
- Support vendors: $500/month
- Trust & safety vendors: $200/month (founder-heavy; inexpensive vendors)
- Legal: $30k lump sum (explicitly conservative)
- Grab bag: $20k lump sum (“one-time + unknown unknowns”)
- Ops support: contractor equivalent of $70k annual for 6 months
- WorkOS SSO: pass-through per connection (excluded from Phase 1 model)
Fixed cost table (fixed vs timing)¶
| Cost area | Fixed/Variable | Timing | Assumption | Monthly run-rate | One-time |
|---|---|---|---|---|---|
| Founders (base) | Fixed | Phase 1 | 2 × $200k base | $33,333 | — |
| Founders (burden) | Fixed | Phase 1 | 10% overhead (no benefits) | $3,333 | — |
| Design contractors | Fixed | Phase 1 | Lump sum | — | $50,000 |
| Core infra | Fixed | Phase 1 | Self-managed | $1,000 | — |
| Support vendors | Fixed | Phase 1 | Workspace/QBO/subscriptions | $500 | — |
| Trust & safety vendors | Fixed | Phase 1 | Cheap vendor stack | $200 | — |
| Paid ads (Reddit + LinkedIn) | Fixed | Phase 1 | Always-on budgets | $2,500 | — |
| Legal (conservative) | Fixed | Phase 1 | Mostly boilerplate + cleanup | — | $30,000 |
| Grab bag | Fixed | Phase 1 | One-time + unknown unknowns | — | $20,000 |
| Ops support (contractor) base | Fixed | Late Phase 1 (6 mo) | $70k annual-equivalent / 12 | $5,833* | — |
| Ops support cushion | Fixed | Late Phase 1 (6 mo) | +10% cushion | $583* | — |
* Ops contractor line applies only during the 6 months you include the ops role.
Fixed monthly burn¶
- Early Phase 1 (no ops contractor yet):
- $33,333 + $3,333 + $1,000 + $500 + $200 + $2,500 = $40,866/month
- Late Phase 1 (with ops contractor for 6 months):
- $40,866 + $5,833 + $583 = $47,282/month
Fixed one-time costs (Phase 1)¶
- Design $50,000 + Legal $30,000 + Grab bag $20,000 = $100,000
Combined Burn: Fixed + Variable (Phase 1 Green)¶
Variable monthly (usage-driven)¶
- LLM inference: ~$742/month
- Embedding generation: ~$0.22/month (negligible)
Total variable compute: ~$742/month
Total monthly burn (run-rate)¶
- Early Phase 1: $40,866 fixed + $742 variable ≈ $41,608/month
- Late Phase 1 (6 months w/ ops contractor): $47,282 fixed + $742 variable ≈ $48,024/month
One-time Phase 1 costs¶
- $100,000 (design + legal + unknown-unknowns)
Unit Economics (Investors Will Ask)¶
Compute cost per asker prompt¶
Per prompt: - Input tokens ≈ 3,250 → 0.00325M × $2.50 = $0.00813 - Output tokens ≈ 1,300 → 0.00130M × $10.00 = $0.01300 - Total ≈ $0.0211 per prompt (~2.1¢)
Compute cost per engaged session¶
- Prompts per engaged visitor = 3
- Cost ≈ 3 × $0.0211 = $0.0633 (~6.3¢ per engaged session)
Compute cost per non-owner visit¶
- Avg prompts per visit = 0.75
- Cost ≈ 0.75 × $0.0211 = $0.0158 (~1.6¢ per visit)
Compute cost per agent-week¶
- Prompts per agent-week = 2.25
- Cost ≈ 2.25 × $0.0211 = $0.0475 (~4.8¢ per agent-week)
Paid acquisition unit metrics (reporting targets)¶
These are planning targets for reporting and iteration; they are not “guarantees.”
- LinkedIn employer CPL target: ~$100/lead
- Lead = work email + company + role + hiring-in-12-months checkbox
-
Goal = 5 leads/mo on $500/mo budget
-
Reddit candidate CPA: use actuals and iterate
- Optimize to
agent_createdbut report quality-adjusted conversion (e.g.,agent_activated) as you instrument it
Notes / Interpretation¶
Why compute is cheap at Green (in this model)¶
- Prompt volume is meaningful, but token budgets and call counts are contained.
- Even with the follow-up loop modeled as additive, inference remains under $1k/month.
Sensitivity knobs (what can blow up)¶
If any of these move materially, compute changes quickly: - Token budget per call (context bloat, long transcripts, verbose responses) - Calls per prompt (extra safety passes, reranking, multi-step synthesis) - Prompt depth per engaged visitor - Follow-up trigger rate / additional loops
Legal estimate¶
- Kept as a conservative lump sum ($30k). Expectation is cheaper due to boilerplate; this is intentionally padded.
Trust & safety¶
- Modeled as $200/month for inexpensive vendors + founder effort (explicitly not staffing-heavy yet).
Ops support¶
- Pushed to late Phase 1, contractor-based, modeled for 6 months only.
Quick Checklist (to be updated later with real data)¶
These will dial in as real telemetry comes in: - Actual prompts per engaged visitor - Actual token usage per call (measured from logs) - Actual calls per prompt (instrument each model invocation) - Actual follow-up trigger rate and resolution rate - Actual facet summary length (tokens) and facet refreshes per resolution - Actual CPL on LinkedIn for employer leads (and lead quality) - Actual Reddit CPA for candidate starts, plus activation/completion rates