AI inference is obviously profitable · seangoedecke.com RSS feed
Business, Finance & Industries · Jun 26, 2026
AI products can survive a funding collapse because inference (serving) is financially separable from model training—others can buy model weights and sell inference—so value is likely to shift from failed labs to acquirers, hosts, or infrastructure operators, making model rights, hosting capability, and customer access valuable in a downturn.
AI inference is obviously profitable · seangoedecke.com RSS feed
Business, Finance & Industries · Jun 26, 2026
Unit economics indicate consumer LLM inference can be profitable — e.g., a 70B model on four A100s yields ~2M tokens/hour with ~$0.13 per million tokens energy cost (≈$1/million including depreciation), while API prices like GPT‑5.4‑mini ($4.50/million) and stronger models (3–6× higher) imply large gross margins (70–80%), so the main challenge is pricing and margin allocation rather than subsidized serving, and metered APIs, agentic workflows, and narrow AI can remain durable if usage is metered rather than unlimited.
AI inference is obviously profitable · seangoedecke.com RSS feed
Business, Finance & Industries · Jun 26, 2026
Open-weight competition pressures inference pricing down toward actual serving cost—compressing pure-markup inference providers while creating a two-tier market where frontier labs keep high prices for R&D and operators must compete on distribution, reliability, integration, and optimization.
AI inference is obviously profitable · seangoedecke.com RSS feed
Business, Finance & Industries · Jun 26, 2026
Inference can be profitable, but many AI labs deliberately price inference to cross-subsidize expensive frontier training—driving high API prices and loss-making subscription experiments—creating a structural split where labs can appear unprofitable despite healthy serving margins while inference-focused providers operate profitably; thus a lab's financial distress often signals training-heavy business-model stress rather than weak demand for inference.