Blog · Tag
cost.
5 posts in this archive.
Our infra spend for proposal workloads, year one
Database, inference, storage, and ops. What we spent running proposal workloads over our first year, broken down by category. Where the cost curves went where we expected, and where they surprised us.
The claim-verification cost profile, stage by stage
Per-claim verification is the defense against citation hallucination. It also costs real money. A breakdown of token costs at each stage of the verification pipeline, with the numbers we actually see in production.
Caching the draft step
How we cache partial drafts across proposals without introducing stale-answer risk. The cache key design, invalidation rules, and the directional cost impact we measured internally.
Cost control for RAG: daily budgets, fallback models, burn alerts
How we keep RAG spend predictable per tenant. Daily budgets, model-tier fallbacks, and burn-rate alerts before the bill spikes — with the dashboard and the rules.
The cost per response, broken down to the penny
Embedding calls, retrieval compute, draft tokens, verifier tokens, storage. The unit cost structure of a single drafted RFP answer, with a worked example. We publish the unit economics, not customer costs.
See the proposal workflow
Take the 5-minute tour, then start a trial workspace when you're ready to run a real pursuit against your own source material.