Post by Subquadratic


If you're a founder building a product with AI, your choice of model - and its architecture - can swing your margins dramatically. Transformer-based models (Claude, Gemini, the "T" in GPT) rely on self-attention, which compares every token with every other token - so every time you double your context length, the attention compute, and the cost that comes with it, roughly quadruples. That's not a product problem. That's a model problem.
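
To make the arithmetic concrete, here is a rough Python sketch of how the number of token-to-token comparisons grows with context length. The function names are hypothetical, and the model is deliberately simplified: it counts only pairwise attention comparisons and ignores everything else that shapes a real bill (feed-forward layers, caching, provider pricing).

```python
def attention_pairs(context_length: int) -> int:
    """Token-to-token comparisons in one full self-attention pass.

    Simplifying assumption: every token is compared with every token,
    so the count is context_length squared.
    """
    return context_length * context_length


def cost_multiplier(old_length: int, new_length: int) -> float:
    """How many times more comparisons new_length needs than old_length."""
    return attention_pairs(new_length) / attention_pairs(old_length)


if __name__ == "__main__":
    baseline = 1_000
    for length in (1_000, 2_000, 4_000, 8_000):
        print(
            f"{length:>6} tokens: {attention_pairs(length):>12,} comparisons "
            f"({cost_multiplier(baseline, length):.0f}x baseline)"
        )
```

Under these assumptions, going from 1,000 to 8,000 tokens is 8x the context but 64x the comparisons.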