Relace is building the models and infrastructure that code agents reach for. We power the fastest model on OpenRouter (10,000 tok/s) and deliver small language models optimized for retrieval, application, and other core code generation functions.
Our technology supports some of the world's fastest-moving companies — including Lovable, Figma, and Vercel — as they deploy and scale code generation to hundreds of millions of users. We recently raised our Series A from a16z, and we're growing quickly.
Our team is made up of mathematicians, physicists, and computer scientists who are deeply passionate about their craft. If you thrive on ambitious technical problems, care about elegant systems design, and want to build the foundation of how code gets written at scale, this is the place for you.
As an Infrastructure Engineer at Relace, you'll design and operate the systems that power our high-performance inference and training infrastructure. You'll work closely with our research and product teams to ensure our models run at scale with reliability, speed, and cost-efficiency. This is a hands-on engineering role where you'll shape how we build and scale the backbone of modern code generation.
You'll have the opportunity to: