Hello,
The AI infrastructure boom is creating a crowded new middle layer.
Startups like Baseten, Fireworks AI, Together AI and Modal help developers run and customize AI models without managing the underlying GPU infrastructure themselves. That position has become increasingly valuable as more companies move from testing AI to putting models into production.
But fast growth has brought a harder question: how durable is this business? Many inference providers still rely on rented GPU capacity, face pressure from larger cloud providers and compete in a market where prices can fall quickly.
We asked Deep Research to examine whether these companies can turn surging demand into lasting, high-margin businesses—or risk being squeezed between developers, model makers, and the cloud giants that control compute.
Listen to this Deep Research article—and every article from The Information—on desktop and mobile web. Stay informed while you multitask. To read the full analysis, you’ll need The Information Pro. Take advantage of a special introductory rate of 25% off your Pro subscription.
Hear from our Pro subscriber, Lora Kratchounova, CEO of Scratch Marketing + Media, on why The Information Pro is worth it:
Before it's everywhere, it's on The Information. Subscribe today for $999 $749 and save 25% on the first year, to get full access to Venture Capital’s Next General Partners 2026, org charts of 60+ companies, AI Databases and more.
0 comentários:
Postar um comentário