Microsoft

Site Reliability Engineer II

Redmond, WA US
USD 98k - 208k
Microservices Swift Azure
Description

Are you a customer-obsessed, AI-curious problem-solver who thrives in an inclusive, collaborative global team? The Azure Customer Experience Platform (CXP) team’s mission is to transform Microsoft Cloud customers into fans. Through our deep engineering engagements with customers and teams across Microsoft, we analyze and amplify customer needs and drive the vision to improve Cloud quality, security, and reliability. Our culture of growth mindset and empowerment are central to who we are and how we work.

 

We are looking to hire a Site Reliability Engineer II to join our team. We are part of the Azure engineering organization and consider great customer experiences critical to the overall success of Azure. We create, define, and lead product offerings that set our customers up for success, empower them to solve problems, and ensure they have a phenomenal experience if they need support. We empower 200+ product groups across Azure with apps, platforms, intelligent insights, and all the capabilities needed to enable consistent and excellent customer experiences across Azure services.

 

We are a team that loves big opportunities and as part of Azure we have consistently innovated and created new solutions to solve some of our most interesting problems. We value diverse opinions and new, forward-thinking ideas. We have strong customer empathy and are delighted in understanding our customers’ needs.

 

Every day, our customers stake their business and reputation on our cloud. You can help #AzCXP provide our customers with the world-class cloud services they need to succeed.

 

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

 

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Required/Minimum Qualifications

  • 4+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field.

Other Requirements

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:

  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications

  • 5+ years technical experience in software engineering, network engineering,
    • OR systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology,
    • OR related field AND 2+ years technical experience in software engineering, network engineering,
    • OR systems administration
    • OR Master's Degree in Computer Science, Information Technology,
    • OR related field AND 1+ year(s) technical experience in software engineering, network engineering.
  • 5+ years of experience in Site Reliability Engineering, Service Engineering, or Production Engineering within online services environments, supporting both Linux and Windows platforms.
  • Demonstrated experience working with large-scale distributed systems, such as cloud computing providers or SaaS services, ideally in high-scale environments involving millions or billions of users, or similar complex settings.
  • Proven ability to drive and coordinate complex projects across diverse teams.
  • Knowledge of Azure, Azure Services, and dependencies, along with experience in managing and scaling environments within Azure or hybrid cloud infrastructures.
  • Extensive understanding of cloud computing, including compute, storage, networking, and container orchestration concepts.
  • Familiarity with modern distributed design patterns and cloud systems architecture, including microservices, containers, load-balancing, queuing, and caching.
  • Ability to satisfy Microsoft, customer, and/or government security screening requirements specific to this role.

Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
she 
Microsoft will accept applications for the role until November 5, 2024.

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • Partner with Customer/First Parties in migrating and navigating in Azure and help them design the solutions with highest reliability.
  • Contribute to the design of V. Next architecture for Cloud infrastructure services, based on Customer/ First party engagements.
  • Engage in major production triage efforts and work with different teams in the identification of root cause of highly impactful or complex issues as required and identify Product gaps and work with Product teams to bridge the gaps.
  • Partner closely with Software developers, Product Managers, architects, and Infrastructure teams to drive delivery of sustainable and reusable design solution patterns to ensure non-functional production support requirements are adopted early in the Migration /Deployment.
  • Collaborate with Development and Service teams to understand technical solutions to understand Service Level Indicators (SLIs) and Service Level Objectives (SLOs) and help Customers / First Parties in design to achieve the right RPO / RTO and their Composite SLA to meet requirements.
  • Participate in on call coverage rotation - Provide leadership to all major Incidents in Azure as Platform and Customer facing Incidents.
  • Identify and drive requirements for increased customer self-supportability
  • Identify and drive implementation of customer centric mitigation levers and playbooks for Operations.
  • Drive continuous swift momentum towards mitigation, asking leading technical questions, offering suggestions around troubleshooting direction.
  • Provide excellent incident communication to stakeholders.
  • Work within a “Follow the Sun” global shift rotation, covering local day-time hours, including holidays and weekends, on a rotational basis.
  • Embody our culture and values.

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say