Microsoft

Senior Site Reliability Engineer

Redmond, WA US
USD 112k - 238k
Azure C++ C# Java Android JavaScript Python PowerShell AWS
Description

Microsoft Intune - one of the fastest growing cloud-based enterprise services at Microsoft - allows IT professionals to manage the ever-increasing use of mobile devices in the workplace, securing corporate assets and data while giving users confidence that their personal data stays private. Businesses, governments, and educational institutions are under constant threat from a variety of actors, who are attacking users in record numbers. We protect Microsoft’s customers and their devices across the world from corporate espionage, data leakage, and application infiltration. Intune is a multi-platform solution (Windows, iOS, and Android), integrated with key cloud services at Microsoft, and is built on some of the latest Azure technologies.

   

We are looking for a self-driven Senior Site Reliability Engineer to identify and dive deep into global-scale service issues and drive improvements to resiliency, availability, latency, and product reliability. You will be responsible for complexity analysis and scalable system design to ensure services and systems are highly stable, available, performant, and meet the expectations of our customers. We are a team dedicated to driving measurable improvement in service reliability and reducing the negative customer impact of outages through avoidance or mitigation. You will work closely with other engineering teams and provide a holistic view of our cloud service. You will provide excellent technical leadership, raise the technical bar, maintain a data- and results-driven culture, and nurture a team culture of high collaboration and trust.

Required/Minimum Qualifications

 

  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    •  OR equivalent experience.
  • 4+ years of troubleshooting/debugging experience: telemetry-based analysis (KQL or equivalent preferred), troubleshooting skills across network, hardware, and distributed service layers, with demonstrated ability to debug, fix, and optimize code.  
  • 3+ years of experience writing tools, automation/scripting (PowerShell, Python, or similar), and programming (C++, C#, or equivalent), and making enhancements in subcomponents within and around services/products to deliver and manage software in production.
  • Willing to work as part of a 24x7 on-call rotation. 

 

Additional or Preferred Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

    o OR Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

    o OR equivalent experience.

  • Communicate effectively and partner well with other disciplines of the project team to turn ideas into high-quality solutions.
  • Understanding of how to implement high availability, disaster recovery, and business continuity concepts in online services.
  • Experience with and understanding of distributed systems and networking is preferred.
  • Effectively manage and prioritize multiple tasks in accordance with high-level objectives/projects.

 


Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $112,000 - $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 - $238,600 per year.

 

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

  • As a Senior Site Reliability Engineer, you will be part of the Reliability & Resilience team dedicated to driving measurable improvement in service reliability and reducing the negative customer impact of outages through avoidance or mitigation.
  • Delivering projects that improve resiliency and security of the service. 
  • Right mix of systems engineering, data science, software development, online services experience, and passion for quality to envision.
  • Demonstrated experience with Azure services and capabilities (and/or other cloud platforms like AWS) 
  • Good knowledge of ARM artifacts and industry standard deployment methodologies. 
  • Own availability, performance, and supportability targets for the service.  
  • Proactively seek new knowledge and adapt to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products.
  • Participate in on-call rotations and own, triage, investigate and resolve service issues with an emphasis on broad communications, learning & teaching throughout the process.  
  • Evaluate and contribute to service design and architecture to improve the resiliency of the cloud service. 
  • Author functional and technical documentation and remain current on relevant technologies and procedures.  
  • Bring clarity, create energy, and drive results – set a vision, rally the team behind it, and deliver for our engineers and customers. 
