xAI

Post-training Infrastructure Engineer

Palo Alto, CA; San Francisco, CA
USD 180k - 440k
Kubernetes, PyTorch, Python, Rust, Machine Learning
Description

About the Role

The post-training team at xAI transforms powerful pre-trained models into steerable, versatile systems capable of understanding and addressing real-world challenges.

To accomplish this, we are looking for experienced AI infrastructure engineers to develop and optimize frameworks tailored for large-scale machine learning tasks, particularly in the areas of reinforcement learning and agent systems.

The role involves building high-performance and scalable software to support cutting-edge AI research, employing advanced technologies to expand the limits of what AI can achieve with increased data and computational resources.

Focus

  • Building efficient and user-friendly training and evaluation frameworks for model fine-tuning and reinforcement learning.
  • Building efficient and user-friendly software frameworks for large-scale agent simulation and execution.
  • Building a flexible and performant bulk inference framework to enable synthetic data generation and model-based data improvement research.

Ideal Experience

  • Expert in developing software for large-scale distributed machine learning systems (e.g. language model training and reinforcement learning).
  • Expert in GPUs, Kubernetes, and JAX (or PyTorch).
  • Experienced in standard software engineering best practices (e.g. CI/CD), with care for code quality, testing, and performance.

Location

The role is based in the Bay Area (San Francisco and Palo Alto). Candidates are expected to be located in or near the Bay Area or be open to relocation.

Tech Stack

  • Python
  • JAX
  • Rust
  • CUDA & NCCL

Interview Process

After submitting your application, the team will review your CV and statement of exceptional work. If your application passes this stage, you will be invited to a 15-minute interview (“phone interview”) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of four technical interviews:

  1. One coding assessment in a programming language of your choice.
  2. Two post-training infra technical sessions: These sessions assess your ability to design and implement solutions to infra problems in post-training.
  3. Meet the Team: Present your past exceptional work and your vision with xAI to a small audience.

Our goal is to finish the main process within one week. We don’t rely on recruiters for assessments. Every application is reviewed by a member of our technical team. All interviews will be conducted via Google Meet.

Annual Salary Range

$180,000 - $440,000 USD

