Senior Machine Learning Platform Engineer
Team: Infra & Platform Engineering
Location: Irvine, CA
Commitment: Full time
Workplace Type: onsite
What You’ll Get To Do:
- Design and manage scalable ML infrastructure with IaC tools (Terraform, CloudFormation).
- Develop and optimize cloud-based pipelines for training, evaluation, and inference on multimodal datasets.
- Build and operate data systems for large-scale video ingestion, indexing, and storage.
- Maintain MLOps workflows for versioning, experiment tracking, reproducibility, and CI/CD.
- Ensure reliability and observability with monitoring, logging, and alerting.
- Collaborate with AI/ML Engineers to productionize workflows.
- Optimize infrastructure for performance and cost across cloud and edge.
- Enforce best practices in security, compliance, and maintainability.
- Mentor and manage junior engineers, providing technical guidance and career development.
What You Have:
- Bachelor’s/Master’s in Computer Science, Engineering, or related field (or equivalent experience).
- 4+ years of industry experience in ML infrastructure or platform engineering.
- Strong coding skills in Python/TypeScript and a strong foundation in software engineering best practices.
- Proven experience with distributed systems, cloud platforms (AWS preferred), containerization and orchestration (Docker, Kubernetes/EKS, Ray), and serverless.
- Hands-on experience building ML pipelines for distributed training and large-scale inference.
- Strong knowledge of data management at scale, including preprocessing and retrieval of video/image datasets.
- Proficiency with CI/CD pipelines, infrastructure-as-code (Terraform, CloudFormation), and automation.
- Familiarity with MLOps tools (MLflow, Kubeflow, Airflow).
- Experience with system monitoring and observability in production.
The Extras That Set You Apart:
- Experience with vector databases (OpenSearch, Pinecone, Weaviate) for indexing and retrieval.
- Familiarity with distributed training frameworks (Horovod, DDP/FSDP, DeepSpeed, Ray).
- Hands-on experience with GPU orchestration and auto-scaling (Karpenter, SageMaker, EKS).
- Experience with agentic AI deployment workflows, orchestration frameworks, and retrieval-augmented generation.
- Strong knowledge of security and compliance in ML and cloud environments.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
