Amazon

Tech Ops Engineer - Incident Management, Central Technical Operations Services (CTOS)

Sydney, Australia
Java Python Perl Shell Spring
Description
- Bachelor's degree in Computer Science, Engineering, or a related technical field; or at least 7 years of relevant experience in a large-scale online operations environment.
- Fluent written and verbal communication skills in English, with the ability to effectively collaborate cross-functionally.
- Proficient in scripting and automation using at least one interpreted language (e.g. Java, Python, Perl) as well as shell scripting.
- Strong working knowledge of Linux operating systems and networking fundamentals.
- Proven track record of driving complex, collaborative projects from conception through successful delivery.
- Experience with incident management, event detection, and operational excellence in a fast-paced, customer-centric environment.
- Ability to thrive in a geographically distributed, "follow the sun" coverage model, including off-hours and weekend work as needed.
- Experience with distributed systems at scale
- Experienced with Agile software development practices, including Scrum ceremonies and continuous improvement
- Background in architecting and supporting large-scale, distributed systems
- Track record of effectively leading and managing cross-functional incident response efforts
- Deep understanding of network technologies and troubleshooting to rapidly resolve complex issues
- Ability to collaborate closely with customers during high-pressure problem resolution, while remaining calm and focused
- Excellent prioritization, time management, and organizational skills in a fast-paced environment

Acknowledgement of country:
In the spirit of reconciliation Amazon acknowledges the Traditional Custodians of country throughout Australia and their connections to land, sea and community. We pay our respect to their elders past and present and extend that respect to all Aboriginal and Torres Strait Islander peoples today.

IDE statement:
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer, and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, disability, age, or other legally protected attributes.
Amazon is seeking an exceptional Systems Engineer to join our world-class Central Technical Operations Services (C-TOS) team as an Incident Manager. As the first line of defense for maintaining high availability on the Amazon Retail Website, our C-TOS group provides critical incident response and management for the entire Amazon ecosystem. When issues arise that could impact our hundreds of millions of customers worldwide, our skilled Incident Managers spring into action to make event durations shorter, less frequent, and less severe.

This is immensely important, high-stakes work. The Amazon Retail Website is where we directly engage and delight our global customer base - any disruption can have a real impact on real people. That's why our C-TOS Incident Managers are so vital; leveraging deep operational expertise and the latest incident management tools, they work quickly to mitigate customer-impacting events.

This is an excellent opportunity to join one of Amazon's world-class engineering teams, working alongside some of the best and brightest minds in technology. Our engineers are encouraged to build solutions that enhance our incident management practice, including tooling and processes, as well as fix software problems - and then share those innovations across the organization. You'll have access to mentoring programs, regular tech talks with technical leaders, and well-defined career paths for motivated engineers who want to contribute to our culture of operational excellence and customer-focused innovation. The C-TOS team is globally distributed, with groups in Austin, Dublin, and Sydney providing 24/7 coverage, each working 10-hour shifts for 4 days per week.
#techjobsau

Key job responsibilities
- Serve as a technical evangelist, leveraging deep expertise to devise innovative solutions to complex business problems.
- Drive down mean time to resolution for incidents through proactive monitoring, rapid response, and continuous process improvement.
- Design, implement, and optimize world-class event detection, alerting, and incident management systems.
- Evolve operations management processes and technologies to accommodate Amazon's rapid growth.
- Create, review, and continuously improve documentation, procedures, and knowledge resources.
- Identify and resolve recurring platform issues by collaborating cross-functionally with service owners.
- Provide exceptional customer service by responding to and resolving requests within defined SLAs.
- Participate in a global "follow the sun" rotation, ensuring 24/7 coverage including weekends and holidays.
- Contribute to the interviewing and hiring process to build a world-class Incident Management team.

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

πŸ₯³πŸ₯³πŸ₯³ 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. πŸ› οΈ
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. πŸš€
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. πŸ“…

What Fellow Engineers Say