Nuro

Technical Lead Manager, Machine Learning Platform Infrastructure

Mountain View, CA
Terraform Pulumi Crossplane Kubernetes Ray Slurm Volcano Spark Apache Beam Feast Hopsworks Redis CNCF Kubeflow Lustre Ceph NVMe AWS GCP Azure
Description

Technical Lead Manager, ML Platform Infrastructure

Location: Mountain View, California (HQ)

Department: Offboard Infrastructure

Who We Are 

Nuro is a self-driving technology company on a mission to make autonomy accessible to all. Founded in 2016, Nuro is building the world’s most scalable driver, combining cutting-edge AI with automotive-grade hardware. Nuro licenses its core technology, the Nuro Driver™, to support a wide range of applications, from robotaxis and commercial fleets to personally owned vehicles. With technology proven over years of self-driving deployments, Nuro gives the automakers and mobility platforms a clear path to AVs at commercial scale—empowering a safer, richer, and more connected future.

About the Role
Nuro is seeking an experienced Technical Lead Manager with deep expertise in large-scale infrastructure, workload orchestration, as well as batch and streaming data processing systems to join our ML Infrastructure team. In this role, you will lead the evolution of our core platform, ensuring our researchers and engineers have seamless access to the compute and data resources required to build the future of autonomous driving.

You will drive the strategy for automated resource provisioning, high-performance workload scheduling, and efficient feature management. As a TLM, you will balance technical hands-on leadership with people management, mentoring a high-performing team while partnering closely with ML Research and Autonomy teams to eliminate infrastructure bottlenecks and accelerate the Nuro Driver™ development lifecycle.

About the Work

As the TLM for ML Platform Infrastructure, you will build the foundation that powers Nuro’s model development from experimentation to production. This will include:

  • Setting Technical Strategy: Defining the roadmap for a unified ML platform that abstracts complex cloud infrastructure.
  • Resource Provisioning & IaC: Scaling our automated infrastructure-as-code (IaC) pipelines to manage thousands of GPU/CPU nodes across diverse environments.
  • Intelligent Scheduling: Designing and optimizing workload orchestration to maximize hardware utilization, minimize job wait times, and handle massive-scale distributed training.
  • Data Dumping & ETL: Designing robust pipelines for the extraction and transformation of petabyte-scale sensor and telemetry data into ML-ready formats.
  • Feature Caching & Feature Stores: Implementing robust feature caching and storage solutions to reduce redundant computations and ensure low-latency access to pre-computed features.
  • Team Leadership: Mentoring and growing a team of software and systems engineers, fostering a culture of operational excellence and technical innovation.

About You

  • Experience: 6+ years of professional experience in ML Infrastructure, Backend Platform Engineering, or Distributed Systems with 3+ years of people/team management experience.
  • Resource Provisioning: Deep familiarity with modern Infrastructure-as-Code and provisioning tools (e.g., Terraform, Pulumi, or Crossplane).
  • Workload Scheduling: Hands-on experience building or managing large-scale orchestrators for compute-heavy workloads (e.g., Kubernetes/KubeRay, Ray, Slurm, or Volcano).
  • Data Dumping (ETL): Proven expertise in large-scale data extraction and transformation. You must be proficient in at least one distributed processing framework, such as Apache Spark or Apache Beam.
  • Feature Management: Experience implementing or maintaining feature stores and caching layers (e.g., Feast, Hopsworks, or Redis-based custom caching).

Bonus Points 

  • Advanced degree (Ph.D. or M.Sc.) in Computer Science, Systems Engineering, or a related technical field.
  • Active contributor to open-source projects in the MLOps or Cloud-Native ecosystem (e.g., CNCF, Ray, or Kubeflow communities).
  • Experience with high-performance storage systems (e.g., Lustre, Ceph, or specialized NVMe caching) for ML data loading.
  • Knowledge of cost-optimization strategies for large-scale GPU clusters in public clouds (AWS/GCP/Azure).

At Nuro, your base pay is one part of your total compensation package. For this position, the reasonably expected base pay range is between $235,030 and $352,290 for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for an annual performance bonus, equity, and a competitive benefits package.

At Nuro, we celebrate differences and are committed to a diverse workplace that fosters inclusion and psychological safety for all employees. Nuro is proud to be an equal opportunity employer and expressly prohibits any form of workplace discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other legally protected characteristics.

 Nuro
Nuro

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say