Manager, Machine Learning Operations
Department: Data Science
Location: Marina del Rey, CA
Employment Type: FullTime
What We Do:
Zefr is the global leader in brand suitability targeting and measurement across the world's largest platforms. Zefr's technology is helping to power the age of responsible marketing by putting advertisers in control of their content adjacencies based on their own unique brand safety and suitability preferences. As an official YouTube Measurement Program Partner, Meta for Business Partner, and TikTok for Business Partner, the company leverages patented machine learning and AI technology (Cognition AI) to offer brands and agencies more precise and transparent brand safety and suitability activation and measurement solutions on scaled platforms. The company is headquartered in Los Angeles, California, with additional locations across the globe.
What You'll Do:
We are hiring a Manager of Machine Learning Operations to lead our ML Ops team and drive the infrastructure, tooling, and processes that enable our machine learning systems to operate at scale. You will oversee the deployment, monitoring, and optimization of ML models that process multi-terabytes of social media platform data from TikTok, YouTube, Facebook, Instagram, and Snap. In this role, you will lead a team of engineers responsible for building and maintaining robust ML pipelines, ensuring model reliability in production, and implementing best practices for model lifecycle management. You will collaborate closely with ML Engineers and Data Scientists to bridge the gap between research and production. We are excited to welcome a leader who is passionate about building scalable ML infrastructure and developing high-performing teams.
Key Responsibilities:
• Lead, mentor, and grow a team of Machine Learning Engineers, fostering a culture of innovation and continuous improvement
• Design and implement scalable ML infrastructure for model training, deployment, and serving
• Establish and enforce best practices for ML model lifecycle management, including versioning, testing, and monitoring
• Develop and maintain CI/CD pipelines for machine learning workflows
• Optimize model inference performance and reduce latency/cost across production systems
• Collaborate with ML Engineers and Data Scientists to productionize models efficiently
• Implement robust monitoring, alerting, and observability solutions for ML systems
• Drive technical decisions on ML Ops tooling, infrastructure, and architecture
• Ensure high availability and reliability of ML services at scale
• Manage project timelines, priorities, and resource allocation for the ML Ops team
Tech Stack:
• Languages: Python, SQL
• Data Stores: Snowflake, Qdrant, GCS
• Data Processing: DBT, Pandas, Ray
• DevOps: GitHub Actions, Docker, Terraform, Kubernetes, ArgoCD, AWS, GCP, Datadog
• MLOps: Triton Inference Server, Weights and Biases, ONNX, TensorRT LLM, vLLM, SGLang
• ML: Voxel51 Teams, Transformers, PyTorch, HuggingFace
What We're Looking For:
• Bachelor's or Master's degree in Computer Science or related field with 5+ years of professional experience in ML Engineering or MLOps
• 2+ years of experience managing or leading engineering teams
• Deep expertise in ML model deployment, serving infrastructure, and production ML systems
• Hands-on experience with transformer architectures (e.g., BERT, ViT) for natural language and vision tasks.
• Strong understanding of multimodal embedding techniques for integrating text, image, audio, and structured data.
• Experience with LLM models such as Gemini, GPT, Claude, Qwen, etc.
• Experience with ML experiment tracking, model versioning, and feature stores
• Strong understanding of CI/CD principles applied to ML workflows
• Experience optimizing model inference performance (ONNX, TensorRT, or similar)
• Excellent leadership, communication, and stakeholder management skills
• Track record of building and scaling high-performing engineering teams
• Openness to new technologies and creative solutions
Nice to Have:
• Experience with ad tech and digital advertising ecosystem
• Experience with multimodal LLM fine-tuning
Benefits (for US-based employees):
• Flexible PTO
• Medical, dental, and vision insurance with FSA options
• Company-paid life insurance
• Paid parental leave
• 401(k) with company match
• Professional development opportunities
• 14 paid holidays off
• Flexible hybrid work schedule
• "Summer Fridays" (shorter work days on select Fridays during the summertime)
• In-office lunches and lots of free food
• Optional in-person and virtual events (we like to celebrate!)
Compensation (for US-based employees):
The anticipated base salary for this position is between $170,000 and $230,000. Within the range, individual pay is determined by factors such as job-related skills, experience, and relevant education or training. If your compensation expectations fall outside of this range, it may still be worth having a conversation.
Zefr is an equal opportunity employer that embraces diversity and inclusion in the workplace. We are committed to building a team that represents a variety of backgrounds, skills, and perspectives because we know this only makes us better. We strongly encourage women, persons of color, LGBTQIA+ individuals, persons with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply even if you do not meet 100% of the qualifications.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
