Disney

Senior Site Reliability Engineer

Remote Ontario
AWS GCP Docker Kubernetes Java Scala Azure Python Go Rust Terraform Ansible
Description

Job Posting Title:

Senior Site Reliability Engineer

Req ID:

10103795

Job Description:

We are hiring for a Senior Site Reliability Engineer within the Reliability Tooling team, you will be responsible for writing and reviewing code, contributing to technical decisions, and mentoring engineers in your squad. We are looking for someone who will be part of an engaging, dynamic and inclusive engineering organisation, grounded in scrum and agile practices, CI/CD, great collaboration and motivated by a commitment to continuous learning and improvement.

You will be part of a team that is customer satisfaction focused and will be working on reliability solutions that enable development teams to achieve their service level objectives, by continuous measurement and improvement of reliability signals.

As a Senior engineer, we are looked at by our fellow team members as a ‘go to’ individual; you are someone who has a clear understanding of, and can thoroughly elaborate on SRE principles and best practices to a given audience. To be successful in this role you will continuously uphold and improve all the relevant reliability aspects for our services, with an increased focus on SLIs and SLOs, while raising the reliability of a variety of large scale user facing and internal services.

Disney Entertainment & ESPN Technology teams are located in New York, San Francisco, Seattle, Bristol US, Manchester UK, Amsterdam, remotely and more!

What You Will Do

  • Build tools to help your SRE team quickly pinpoint, isolate and resolve issues related to infrastructure, platform services and applications;
  • Use Chaos Engineering principles and methodologies to test what you build under real-world conditions;
  • Deploy and manage innovative modern cloud technologies using infrastructure-as-code, self-healing, and security automation patterns;
  • Develop useful telemetry, alerts, and response to reduce Mean Time To Repair (MTTR);
  • Collaborate and provide technical excellence within and across teams;
  • Consult on standard methodologies and develop tools to enable smooth adoptions of good service reliability practices and methods, e.g. promote sustainable incident response and blameless postmortems
  • Identify areas of improvement in reliability, efficiency, and operations;
  • Write code that improves scalability, performance, maintainability, and security;
  • Mentor SREs in technical and non-technical SRE responsibilities;

What To Bring

  • proven experience in SRE, DevOps, technical operations, systems engineering, software engineering
  • Passionate and curious about ways to leverage technology while continually learning
  • Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)
  • Efficiently skilled with the use of containers in enterprise production environments (e.g. Docker, Kubernetes, LXC, AWS ECS and EKS)
  • Proficient in one or more of the following languages (Python, Go, Rust, or similar)

Preferred Experience

  • Comfortable in one or more of the following languages (Python, Java, Scala, Go, Rust, or similar)
  • User Interfaces development experience
  • Proficient, collaborative, & experienced in building reliable, scalable, enterprise systems
  • Ability to identify root-cause sources of instability in a high-traffic, large-scale distributed systems
  • Experience in designing, building, and operating large-scale production systems
  • Configuration management and orchestration (e.g. Terraform, Cloud Formation, Ansible)
  • Experience with continuous integration tools (e.g. Jenkins, Gitlab CI/CD, AWS CodeBuild/Deploy/Pipeline, Azure DevOps, Spinnaker)
  • Knowledge of best practices and IT operations in an always-up, always-available service;
  • Experience in SDLC, distributed systems, networking, logistics and operations or capacity planning;

The Perks

  • 25 days annual leave.
  • Private medical insurance & dental care.
  • Free Park Entry: You will have the opportunity to enter any of our parks with your family and friends for free.
  • Disney Discounts: you are entitled to discounts on designated Disney products, resort F&B and ticketing.
  • Excellent parental and guardian leave.
  • Employee Resource Groups – WOMEN @ Disney, Disney DIVERSITY, Disney PRIDE, ENABLED, and our Mental Health & Wellbeing Group, TRUST.

The Walt Disney Company Limited is an equal opportunity employer. Applicants will receive consideration for employment without regard to age, race, colour, religion or belief, sex, nationality, ethnic or national origin, sexual orientation, gender reassignment, marital or civil partner status, disability or pregnancy or maternity. Disney fosters a business culture where ideas and decisions from all people help us grow, innovate, create the best stories and be relevant in a rapidly changing world.

Job Posting Segment:

Engineering Services

Job Posting Primary Business:

ES - Production Platforms

Primary Job Posting Category:

Site/System Reliability Engineer

Employment Type:

Full time

Primary City, State, Region, Postal Code:

Manchester, United Kingdom

Alternate City, State, Region, Postal Code:

Date Posted:

2024-11-01
Disney
Disney
Digital Media E-Commerce Media and Entertainment Multi-level Marketing Performing Arts Digital Media E-Commerce Media and Entertainment Multi-level Marketing Performing Arts Employment Media and Entertainment Personal Development

0 applies

2 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say