DevOps Engineer
Location: San Mateo, CA
Department: Product Engineering
About the role
We’re building out a cloud platform team and looking for a Senior DevOps Engineer to own the developer infrastructure that powers our products. You will own how we deploy, scale, observe, and secure systems across GCP and AWS, with Kubernetes at the core.
This isn’t a ticket-queue role. You’ll work directly with engineers building services in Go and TypeScript, researchers training PyTorch models, and leadership defining the roadmap. You’ll have real ownership and the latitude to build things the right way from the start.
What you’ll do
- Design, build, and operate cloud infrastructure on GCP with an emphasis on reliability, security, and cost efficiency
- Own and evolve our Kubernetes platform — cluster architecture, RBAC, networking, autoscaling, and workload scheduling
- Build and maintain automated CI/CD pipelines using GitHub Actions and ArgoCD, supporting GitOps workflows for all services
- Write Go and Python tooling to automate infrastructure tasks, improve developer experience, and extend internal platform capabilities
- Establish observability practices — metrics (Prometheus/Grafana), distributed tracing (OpenTelemetry), and centralized logging
- Define and enforce security best practices: secrets management (Vault/KMS), image scanning, IAM least-privilege, and network policies
- Support GPU-based ML workloads, working with researchers to provision and optimise node pools for PyTorch training and inference
- Respond to incidents and lead blameless postmortems to drive continuous improvement in system reliability
- Write clear documentation and champion a culture of engineering excellence across the team
What we’re looking for
Required
- 5–8 years of experience in DevOps, SRE, or platform engineering roles
- Production Kubernetes experience — cluster management, not just deploying workloads
- Hands-on experience with GCP or AWS; solid conceptual understanding of both
- End-to-end ownership of CI/CD pipelines and GitOps workflows
- Proficiency in Go or Python for writing infrastructure tooling and automation
- Infrastructure as Code expertise with Terraform or Pulumi
- Experience with observability stacks: Prometheus, Grafana, and a log aggregation platform
- Strong grasp of cloud security fundamentals: IAM, secrets management, network policies
Preferred
- Experience supporting ML training infrastructure, GPU node pools, or model serving (TorchServe, Triton)
- Familiarity with TypeScript for build tooling or internal developer platforms
- Background in a fast-moving startup or product engineering environment
- Contributions to open-source infrastructure tooling
Certifications
The following are valued but not required:
- Google Professional Cloud DevOps Engineer
- CKA or CKAD (Certified Kubernetes Administrator / Application Developer)
- AWS Solutions Architect or DevOps Professional
- HashiCorp Terraform Associate
Our tech stack
Cloud | GCP (GKE, Cloud Run, Pub/Sub, BigQuery) · AWS (EKS, Lambda, S3, RDS) |
Orchestration | Kubernetes · Helm · ArgoCD |
Languages | Go · TypeScript · Python |
ML / AI | PyTorch · GPU workloads |
CI/CD | GitHub Actions · GitOps workflows |
Observability | Prometheus · Grafana · OpenTelemetry · Loki |
IaC | Terraform · Pulumi |
Security | Vault · KMS · Trivy · Snyk |
Why join us
- Work on a development team building frontier AI models for Physics AI
- Collaborative, low-ego team that values craft and clear thinking
- Competitive salary, equity, and benefits
About the Company
Note: In accordance with export control regulations, this position requires candidates to qualify as a U.S. Person (U.S. citizen, lawful permanent resident, or protected individual as defined under 8 U.S.C. § 1324b(a)(3)). An active or obtainable U.S. security clearance (Secret or TS/SCI) is preferred. We are unable to provide visa sponsorship now or in the future for this role.
About Luminary
Luminary helps engineering companies be more competitive by getting to market faster, creating new, better products, and reducing development risk. We do this with our Physics AI platform, the fastest and easiest way to build and deploy models to understand and instantly predict physical reality with precision. Customers span industries from automotive and aerospace to leading sporting equipment providers, including Otto Aviation, Joby Aviation, Piper Aircraft, and Trek Bikes. Luminary is a Series B company and is headquartered in San Mateo, California.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
