What You'll Do:
- Own the incident management process and ensure it drives enduring reliability across all products and services within Xero.
- Provide expert leadership during critical outages, coordinating multiple teams to ensure streamlined decision-making and quick resolution.
- Lead and advocate for the transformation to a world-leading SRE organization, promoting SRE principles within the Engineering Department.
- Take a customer-focused approach by addressing and mitigating global customer environment issues, and foster a culture of continuous learning and technical excellence within the SRE team.
- Develop and implement scalable process frameworks and observability strategies to ensure rapid problem diagnosis, response, and service reliability.
- Collaborate with product teams to thoroughly analyze failures and integrate insights to improve service reliability, scalability, and operational efficiency.
- Provide ongoing training across the business to ensure the process is well understood and adhered to. This includes training appropriate engineering resources who will own Incident Commander actions for lower-priority issues.
- Dive into the causes of incidents, proactively examine the potential causes of future incidents, and work with engineering teams to remove the risk of those failure scenarios.
- Build playbooks and automated responses for business continuity and DR situations to ensure the response is quick and effective.
What You'll Bring With You:
- 5+ years of experience as a Site Reliability Engineer, with relevant experience in an Operations or Engineering environment.
- Experience troubleshooting AWS-hosted services.
- Networking knowledge and the ability to troubleshoot TCP/IP, SSL/TLS, DNSSEC, IPsec, and BGP issues.
- Coding experience (preferably Python) building tools, scripting, or automation.
- Strong communication (oral & written) skills, including the ability to translate technical issues/concepts into agreed actions.