Onit

MLOps Engineer

Auckland
Python Bash AWS S3 ECS EKS Lambda SQS CloudWatch Docker Kubernetes Terraform CloudFormation API Machine Learning Deep Learning GraphQL gRPC Microservices SQL Node.js PostgreSQL Redis MongoDB Elasticsearch Kafka Spark Hadoop TensorFlow PyTorch Keras Pandas NumPy OpenCV
Description

MLOPs Engineer

Team: Artificial Intelligence

Location: Auckland

Commitment: Full-time

Workplace Type: onsite

We are seeking an MLOps Engineer to build and scale the infrastructure, pipelines, and operational foundations required for modern machine learning and large language model development. You will play a critical role in enabling reliable model training, evaluation, and deployment by establishing strong data and platform foundations.
This role sits at the intersection of data engineering, cloud infrastructure, and applied machine learning, ensuring that AI teams can move efficiently from experimentation to production-ready systems.
You will partner closely with AI Engineering and Data Science teams to operationalise model development workflows and maintain scalable, secure AI infrastructure.

Key Responsibilities

  • MLOps Infrastructure & Platform Enablement
  • Design and implement scalable MLOps infrastructure to support model development, training, evaluation, and deployment.
  • Build reusable automation frameworks for model lifecycle management, including CI/CT for Large Language Models.
  • Establish best practices for reproducible experimentation and production-grade AI system operations.

  • Data Foundation for Model Development
  • Develop and maintain robust data pipelines and storage foundations required for machine learning and LLM workflows.
  • Ensure high-quality, well-governed datasets are available for training, fine-tuning, and benchmarking.
  • Partner with Data and AI teams to enable dataset versioning, lineage, and repeatable refresh processes.
  • Implement controls for privacy, anonymisation, and compliance when handling enterprise or client-derived training data.

  • AWS-Based Model Operations
  • Own the deployment and scaling of AI infrastructure on AWS, leveraging services such as S3, ECS/EKS, Lambda, SQS, and CloudWatch.
  • Experience in other AWS backed settings on enabling and managing GPU clusters and distributed inference.
  • Optimise training and inference environments for performance, reliability, and cost efficiency.
  • Implement monitoring, alerting, and operational workflows for model-serving systems.

  • Model Deployment & Production Readiness
  • Support the deployment of machine learning and LLM models into production environments using modern MLOps practices.
  • Collaborate with backend engineering teams to integrate AI services through APIs and enterprise workflows.
  • Ensure model systems meet reliability, latency, and scalability requirements.

  • Observability, Governance, and Compliance
  • Establish monitoring and evaluation pipelines for model performance, drift detection, and operational health.
  • Ensure infrastructure and workflows align with enterprise security requirements and responsible AI governance practices.
  • Maintain auditability and documentation across datasets, pipelines, and model releases.

  • Knowledge Systems and Graph Integration (Preferred)
  • Support AI architectures that incorporate knowledge graphs and graph databases for retrieval, reasoning, and enterprise context enrichment.
  • Collaborate with engineering teams to operationalise graph-backed pipelines alongside modern ML systems.
  • Contribute to scalable integration patterns between graph data layers and LLM-based applications.

Qualifications

  • Required
  • Bachelor’s or Master’s degree in Computer Science, Engineering, Machine Learning, or a related field.
  • 3+ years of experience in MLOps, ML infrastructure, or cloud-based data/AI platform engineering.
  • Strong hands-on experience building and operating AI infrastructure on AWS.
  • Experience developing data foundations and pipelines supporting model development workflows.
  • Familiarity with containerisation and orchestration tools such as Docker and Kubernetes.
  • Demonstrated ability to support ML systems moving from experimentation into production environments.

  • Preferred
  • Experience in enterprise software, legal tech, or other regulated domains.
  • Familiarity with graph databases (e.g., Stardog) and knowledge graph-based AI architectures.
  • Exposure to LLM pipelines, retrieval-augmented generation (RAG), or agent-based AI workflows.
  • Experience with Infrastructure-as-Code tools such as Terraform or CloudFormation.

Success Metrics

  • Reliable and scalable AWS-based infrastructure enabling efficient model development and deployment.
  • Strong data foundations supporting compliant, repeatable training and evaluation workflows.
  • Reduced friction in research-to-production transitions for AI engineering teams.
  • High operational quality through monitoring, governance, and automation of ML systems.
  • Successful enablement of advanced AI architectures, including graph-backed and retrieval-driven workflows.
Onit
Onit

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say