BLD Talent

Data Infrastructure Engineer

San Francisco, CA
Python AWS IoT Core S3 ECR Batch ECS EKS Step Functions MCAP Protobuf Airflow Prefect Docker Foxglove
Description

Data Infra Engineer

Location: San Francisco

Location Type: IN_OFFICE

Employment Type: FULL_TIME

Company Overview
We’re building intelligent robotic arms that can learn new skills in hours, not months. Backed by Y Combinator and top-tier Silicon Valley investors, we’re turning physical AI into reality, helping industries facing critical labor shortages (manufacturing, logistics, and more) automate back-of-house tasks like packaging, kitting, and assembly.
Our flagship robot combines affordable robotic hardware with cutting-edge imitation learning algorithms, enabling reliable, sample-efficient robots that deliver customer value from day one. We’re already live with pilot partners and scaling fast. The founding team brings experience from Apple, Stanford, and Microsoft, with deep expertise in robotics, embodied AI, and large-scale machine learning.

The Role
We’re looking for a Robotics Data Infrastructure Engineer to own and build the data systems that power our robots in the real world. This is a hands-on founding engineer role with true ownership and freedom — your work will directly impact robots performing customer-critical tasks every day.
You will architect and deploy data pipelines on both AWS and edge devices, manage large-scale multi-modal datasets (images, video, time-series, text, etc.), and build the tooling that connects real-world robot data to training and evaluation workflows. You’ll work across the full robotics software stack, from ingesting sensor data and telemetry, to enabling large-scale policy learning pipelines that drive production robots.
Beyond writing great code, you’ll help drive technical decisions, lead cross-functional efforts, and bridge robotics, machine learning, and product requirements into scalable, reliable systems.

What You’ll Do
  • Build and own our data backbone on AWS: Design and run cloud + edge pipelines using services like IoT Core, S3, ECR, Batch, ECS/EKS, and Step Functions. Your work keeps robot data flowing reliably and cost-efficiently from the field into the lab.
  • Develop on-device data systems: Build robust, fault-tolerant data capture on edge PCs using MCAP/Protobuf, with clean schema contracts, buffering, and resumable uploads to the cloud.
  • Wrangle massive multimodal datasets: Organize and version millions of images, videos, time-series (robot state, force/torque), and annotations. Enforce metadata, retention, and access patterns that scale.
  • Build MLOps and DataOps pipelines: Automate data validation, labeling, augmentation, and model training/evaluation using containerized jobs and orchestrators like Batch, Step Functions, Airflow, or Prefect.
  • Ensure data quality and health: Create ingestion checks, schema validation, deduping, drift detection, and real-time alerting around data freshness and completeness.
  • Build internal tools that unblock others: Develop UIs/CLIs for browsing data, launching jobs, tracking experiments, and debugging robots in the field. Integrate with tools like Foxglove.
  • Work across teams: Partner with hardware, ML, and product to turn raw field data into smarter robots and real customer value—fast.

Qualifications
  • B.S., M.S., Ph.D. in computer science or related fields.
  • Strong programming skills in Python (you write clean, efficient, production-ready code)
  • Strong experience in AWS
  • Systems engineering skills (networking, concurrency, performance)
  • At least 2 years of full-time work experience for candidates with a B.S. in related fields. 1 year of experience for M.S. or Ph.D. candidates.
BLD Talent
BLD Talent

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say