Post by Upbound
8,796 followers
Today we're introducing Modelplane: the open source control plane for AI inference. π Open-weight models are changing who runs AI. Inference is moving outward, from the labs and hyperscalers that serve everyone through an API to a much larger population of organizations running it on infrastructure they own and control. Neoclouds are building businesses around it. Regulated and sovereign enterprises are keeping it inside their own walls. AI-native companies are bringing inference costs under control. The open source community has moved quickly to meet this shift, with serving engines, schedulers, gateways, routers, and multi-node serving systems emerging across the stack. But as inference expands across clusters, clouds, regions, and providers, operators need a control plane above them to manage the fleet as a whole. Thatβs why we built Modelplane. Built on Crossplane project and shaped by patterns weβve seen teams build in production. Modelplane operates inference fleets as a single platform. Platform teams provision clusters and publish hardware classes. Developers declare a model and get back an OpenAI-compatible endpoint. Any model. Any engine. Any infrastructure. Modelplane is in early development, and we're building it in the open with the AI inference community. There's a lot here already, and a lot still ahead, and we'd love your help shaping it. Apache 2. Community Driven. Runs entirely in your own infrastructure. #AIInference #OpenSource #PlatformEngineering #Crossplane #CNCF #AIInfrastructure