Technical Service Delivery Manager
Location: Austin, TX, us, Remote
Company Description
Mirantis is an open cloud company that helps organizations achieve digital self determination by giving them complete control over their strategic infrastructure. The company combines intelligent automation and cloud-native expertise for managing and operating virtual machines, containers, Kubernetes, and cloud environments.
We empower developers and innovators to create extraordinary products and services by automating the discovery, integration, and operation of the best cloud and open source technologies for their unique needs.
Job Description
The Service Delivery Manager (SDM) is a deeply technical, customer-facing leader responsible for ensuring world-class support delivery, operational alignment, and proactive technical guidance across Mirantis’ enterprise customer base. This hybrid role combines the technical depth of a Level 2 Support Engineer, the relationship skills of a Customer Success Manager or Account Executive, and the ownership mindset of a Technical Lead responsible for the customer’s full Mirantis platform stack. The SDM serves as the primary operational and technical owner for assigned accounts—leading escalations, guiding platform operations, and ensuring customers achieve maximum value from Mirantis technologies including Mirantis Kubernetes Engine(MKE), Mirantis Container Runtime (MCR), k0rdent, Lens, and OpenStack.
Key Responsibilities:
Technical Ownership & Support Leadership
- Serve as the primary technical authority for customer environments across Kubernetes, OpenStack, Linux, networking, storage, and security.
- Provide L2-level troubleshooting and technical guidance across compute, control plane, networking, and storage layers.
- Diagnose complex failures across OpenStack and Kubernetes components.
- Guide customers through upgrades, lifecycle management, capacity planning, and architecture best practices.
Excellence in Support Delivery
- Ensure customer issues are resolved within defined SLAs with minimal business impact.
- Maintain greater than 95% CSAT across assigned accounts.
- Conduct proactive platform reviews and drive root cause elimination for recurring issues.
Escalation Management
- Lead P1/P0 escalations, war rooms, and cross-functional incident response.
- Provide clear, timely updates to customers and internal stakeholders throughout the incident lifecycle.
- Prevent repeat incidents through structured RCA and strategic improvements.
Account Health & Adoption
- Conduct recurring platform health reviews and risk assessments.
- Drive modernization initiatives and adoption of MKE, MCR, k0rdent, and related Mirantis technologies.
- Partner closely with Customer Success Managers on retention, renewals, and expansion opportunities.
Must-Have Technical Literacy (Required
- OpenStack Architecture Literacy: Understanding Nova, Neutron, Cinder, Glance, Keystone and common failure patterns such as DHCP or metadata failures, RabbitMQ quorum issues, and Ceph latency impacting Nova.
- Cloud Control Plane & HA Concepts: Awareness of Galera clustering, HA control plane behavior, how outages manifest for customers, and sound judgment on triage and escalation.
- MOSK Awareness: Understanding Mirantis OpenStack for Kubernetes (MOSK) release cycles, containerized services, StackLight monitoring, and the differences between platform upgrades, host OS lifecycle, and kernel upgrades, including EOL risk.
- Ceph & Storage Fundamentals: High-level understanding of Ceph as block storage, how storage latency affects VM performance, and what OSD flapping, health warnings, or degraded clusters indicate.
- Neutron Networking Basics: Familiarity with provider versus tenant networks, floating versus internal IPs, and typical causes of east/west traffic failures, metadata issues, and DHCP problems.
- Incident Management Excellence: Ability to lead outage calls, structure communication, distinguish root cause from contributing factors, and drive follow-through to full resolution.
- Escalation Ownership: Skill in routing issues to the correct teams (OpenStack, Ceph/storage, networking, hardware, infrastructure) and maintaining accountability.
- Customer & Executive Communication: Ability to clearly explain what the issue is, why it matters, the risks, and next steps, maintaining trust and confidence.
- Lifecycle & Capacity Planning: Awareness of when customers are approaching software end-of-life, resource exhaustion, host overcommit, or aging hardware risk, and ability to recommend remediation
Nice-to-Have Knowledge (Preferred)
- Kubernetes Literacy: Understanding that many customers run Kubernetes on top of OpenStack, with basic familiarity with CSI, CNI, nodes, and pods to understand cross-platform dependencies.
- Performance Concepts: High-level awareness of CPU pinning, hugepages, and NUMA topology, sufficient to contextualize engineering recommendations.
- Broader Infrastructure Context: Familiarity with Octavia (load balancers), Designate (DNS), high availability patterns, and multi-region or FedRAMP-style constraints for regulated customers.
- Monitoring Awareness: Basic knowledge of StackLight dashboards and common alerts such as Ceph health issues, API failures, and RabbitMQ quorum problems to help interpret severity.
Qualifications
Qualifications – Required
- 5+ years in technical support, service delivery, DevOps, SRE, or similar customer-facing infrastructure roles.
- Hands-on experience with Kubernetes and OpenStack in production environments.
- Understanding of Nova, Neutron, Cinder, Keystone, Glance, Ceph, iSCSI, NFS, VLAN/VXLAN, and SDN concepts.
- Strong Linux administration and distributed systems troubleshooting skills.
- Proven experience managing high-severity escalations and communicating with executives.
- Willingness to travel up to 30% and participate in an on-call rotation.
Preferred Qualifications:
- Experience running Kubernetes on top of OpenStack.
- Exposure to CI/CD, infrastructure-as-code, and observability tooling.
- Awareness of GPU and AI/ML workloads and their infrastructure requirements.
- Certifications such as CKA, CKAD, CKS, OpenStack certifications, Docker, Linux Foundation, or cloud provider certifications.
Success Metrics
- SLA performance and reduction in major incidents and escalations.
- Customer satisfaction scores (CSAT > 95%) and positive customer sentiment.
- Increased adoption and effective use of Mirantis platforms across assigned accounts.
- Reduction in repeat issues through proactive architecture and process improvements.
Additional Information
What does Mirantis offer you?
- Work with an established Silicon Valley leader in the cloud infrastructure industry.
- Work with exceptionally passionate, talented and engaging colleagues, helping Fortune 500 and Global 2000 customers implement next-generation cloud technologies.
- Be a part of cutting-edge, open-source innovation.
- Thrive in the high-energy environment of a young company where openness, collaboration, risk-taking, and continuous growth are valued.
- Receive a competitive compensation package with strong benefits plan
We are a Leader for Container Management in G2 (#2 after AWS)!
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
