WEX

Senior SRE Manager

Portland, ME San Francisco, CA
USD 176k - 204k
Kubernetes Docker Grafana ELK Splunk MySQL PostgreSQL OpenTelemetry Jaeger Prometheus
Description

Senior SRE Manager

Location: Portland, ME, San Francisco, CA, Chicago, IL, Dallas, TX

Time Type: Full time

Job Description

About the Team/Role 

We are looking for a highly motivated and high-potential Senior Manager Site Reliability Engineering (SRE) to join our team as a technical leader and drive transformative impact across WEX’s platform reliability and operational excellence.

This is a particularly exciting time to be part of the SRE function at WEX. Our diverse product ecosystem supports a wide array of customer businesses and generates rich, complex telemetry across applications, infrastructure, and platforms. Ensuring these systems are scalable, observable, and resilient is critical to unlocking business value and customer success.

As a Sr Manager SRE, you will play a pivotal role in shaping the reliability engineering strategy at WEX. You’ll architect and lead efforts that improve availability, performance, and efficiency at scale—driving initiatives across observability, automation, incident management, problem management, capacity planning, and performance optimization. You’ll be hands-on in building foundational tooling and frameworks while also acting as a multiplier—mentoring engineers, aligning cross-functional teams, and influencing platform decisions with a strong reliability lens.

You’ll work closely with engineering, product, and platform teams to instill SRE best practices and enable a shift toward proactive, scalable operations. Our team embraces agile development, a strong product mindset, and modern engineering practices, including AI-assisted operations and intelligent automation.

You’ll take on some of the most complex, high-impact challenges at WEX—supported by a team of highly skilled engineers and technical leaders invested in your success and growth.

If you’re a senior technical leader passionate about building reliable systems, leading through influence, and making a meaningful impact, this is a fantastic opportunity for you.

How you’ll make an impact

  • Architect and oversee the implementation of mission-critical systems.

  • Define and enforce SRE best practices and operational standards.

  • Lead cross-functional initiatives to enhance system reliability and performance.

  • Serve as a technical advisor for engineering leadership.

  • Develop capacity planning and load testing strategies.

  • Design self-healing and auto-recovery mechanisms.

  • Drive cloud cost optimization and budgeting initiatives.

  • Lead one or more SRE teams responsible for a major platform or domain.

  • Partner with Engineering, Product, and Program stakeholders to align team delivery with business priorities.

Experience you’ll bring

  • 8+ years of experience with a focus on large-scale system reliability.

  • Expertise in system architecture, cloud platforms, and automation frameworks.

  • Deep knowledge of Kubernetes, service meshes, and distributed tracing.

  • Experience with monitoring and logging (Grafana, ELK stack, Splunk, etc.).

  • Knowledge of containerization and orchestration (Docker, Kubernetes).

  • Experience designing high-availability, fault-tolerant architectures.

  • Strong understanding of database reliability engineering (MySQL, PostgreSQL, NoSQL). Knowledge of networking, databases, and storage architectures.

  • Excellent incident command and crisis management skills.

  • Experience setting team OKRs and aligning reliability goals with product and platform engineering strategies.

Preferred Qualification

  • Experience with multi-region and multi-cloud deployments.

  • Deep expertise in scalable microservices and event-driven architectures.

  • Strong experience with advanced observability tools (OpenTelemetry, Jaeger, Prometheus).

  • Leadership in driving large-scale SRE transformations.

  • Experience with designing and developing AI based solutions.

  • Ability to influence engineering culture and process improvements.

  • Experience in healthcare, insurance, or benefits technology.

  • Understanding of Benefits domain such as claims processing, eligibility lookup success rate.

  • Experience working with compliance frameworks such as HIPAA, SOC 2, or HITRUST.

  • Proven success building and scaling high-performing SRE teams in production environments.

  • Ability to develop team-wide practices around incident management, postmortems, alert hygiene, and reliability KPIs.

  • Skilled at coaching engineers through complex reliability challenges and career inflection points.

The base pay range represents the anticipated low and high end of the pay range for this position. Actual pay rates will vary and will be based on various factors, such as your qualifications, skills, competencies, and proficiency for the role. Base pay is one component of WEX's total compensation package. Most sales positions are eligible for commission under the terms of an applicable plan. Non-sales roles are typically eligible for a quarterly or annual bonus based on their role and applicable plan. WEX's comprehensive and market competitive benefits are designed to support your personal and professional well-being. Benefits include health, dental and vision insurances, retirement savings plan, paid time off, health savings account, flexible spending accounts, life insurance, disability insurance, tuition reimbursement, and more. For more information, check out the "About Us" section.

Pay Range: $175,600.00 - $204,300.00
WEX
WEX

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say