Relace

Infrastructure Engineer

San Francisco, CA
AWS GCP Azure Python
Description

Department: Infrastructure

Location: San Francisco

Employment Type: Full-time

About Us

Relace is building the models and infrastructure that code agents reach for. We power the fastest model on OpenRouter (10,000 tok/s) and deliver optimized small language models designed for retrieval, application, and core code generation functions.

Our technology supports some of the world’s fastest-moving companies — including Lovable, Figma, and Vercel — as they deploy and scale code generation to hundreds of millions of users. We recently raised our Series A from a16z, and we’re growing quickly.

Our team is made up of mathematicians, physicists, and computer scientists who are deeply passionate about their craft. If you thrive on ambitious technical problems, care about elegant systems design, and want to build the foundation of how code gets written at scale, this is the place for you.

The Role

As an Infrastructure Engineer at Relace, you’ll design and operate the systems that power our high-performance inference and training infrastructure. You’ll work closely with our research and product teams to ensure our models run at scale with reliability, speed, and cost-efficiency. This is a hands-on engineering role where you’ll shape how we build and scale the backbone of modern code generation.

You’ll have the opportunity to:

- Architect and manage the infrastructure powering our ultra-fast inference and training stack.

- Build reliable, efficient systems for deploying and scaling ML workloads globally.

- Work on GPU scheduling, distributed systems, and high-performance cloud deployments.

- Optimize performance and cost across compute, networking, and storage layers.

- Collaborate with world-class engineers to push the limits of what small models can do.

Requirements

- 2+ years of experience writing high-quality production code.

- Strong experience with cloud infrastructure (AWS, GCP, Azure, or equivalent).

- Experience with data science and systems optimization.

- Familiarity with ML infrastructure, GPUs, etc. is a plus.

- Work out of our SF office in FiDi.
