RadixArk

Member of Technical Staff, TPU Systems

Palo Alto, CA
USD 180k - 250k
JAX XLA Pallas Python TPU
Description

Member of Technical Staff -- TPU Systems (JAX / XLA / PALLAS)

Location: Palo Alto, CA

Department: engineer

About the Role

RadixArk is looking for a TPU Systems Engineer to build high-performance inference and training systems using JAX, XLA, and Pallas. You'll push large-model workloads to their limits on TPU hardware, working on SGLang-JAX and other critical infrastructure that enables efficient deployment of frontier models on Google's tensor processing units.

Requirements

  • 3+ years experience building production ML systems with JAX, XLA, or TPU-focused frameworks
  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or equivalent industry experience
  • Deep understanding of JAX/XLA internals: HLO, fusion, SPMD partitioning, and sharding strategies
  • Strong performance tuning instincts across compiler and runtime layers
  • Experience with distributed inference systems (e.g. SGLang, vLLM) or training frameworks (e.g. Miles, Alpa, Pathways)
  • Proficiency in Python with demonstrated ability to write high-performance, production-quality code
  • Familiarity with Pallas for kernel development, or strong ability to learn quickly

Responsibilities

  • Build high-performance inference and training systems using JAX/XLA/Pallas, including SGLang-JAX
  • Push large-model workloads to the limits on TPU v4, v5e, and v5p architectures
  • Optimize end-to-end latency and throughput for LLM serving on TPU infrastructure
  • Design and implement SPMD strategies for efficient distributed inference and training
  • Profile and optimize XLA compilation pipelines and HLO graph transformations
  • Collaborate with kernel engineers and compiler teams to achieve performance wins across the stack
  • Contribute to open-source projects with TPU optimization guides, benchmarks, and architectural insights
  • Create testing frameworks for numerical correctness and performance regression detection

    About RadixArk

    RadixArk is an infrastructure-first company built by engineers who've shipped production AI systems, created SGLang (20K+ GitHub stars, the fastest open LLM serving engine), and developed Miles (our large-scale RL framework). We're on a mission to democratize frontier-level AI infrastructure by building world-class open systems for inference and training. Our team has optimized kernels serving billions of tokens daily, designed distributed training systems coordinating 10,000+ GPUs, and contributed to infrastructure that powers leading AI companies and research labs. Join us in building infrastructure that gives real leverage back to the AI community.

    Compensation

    We offer competitive compensation with significant founding team equity, comprehensive health benefits, and flexible work arrangements. The US base salary range for this full-time position is: $180,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and demonstrated expertise in TPU systems and ML infrastructure.

    Equal Opportunity

    RadixArk is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
     
RadixArk
RadixArk

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say