Microsoft

Senior Site Reliability Engineer

Redmond, WA US
USD 112k - 238k
Kafka Docker Azure SQL PostgreSQL Spark Hadoop Yarn Kubernetes
This job is closed! Check out or
Description

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world.

Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture.

 

​​Within Azure Data, the big data analytics team provides a range of products that enable data engineers and data scientists to extract intelligence from all data – structured, semi-structured, and unstructured. We build the Data Engineering, Data Science, and Data Integration pillars of Microsoft Fabric.​

 

​​​​Azure HDInsight is one of the fastest growing Azure services that offers popular open source technologies, such as Spark, Hadoop, Kafka, HBase and many others in a form that is easy for customers to use. Our team is looking for a Senior Site Reliability Engineer to work with various open-source technologies including Spark, Hadoop, Yarn, etc. and make contributions to these technologies. We are running one of the world’s largest big data cluster which has more than 25,000 machine, stored more than 10,000PB data and run tens of thousands of jobs everyday across search, ads, Office, Xbox, etc.

​​

We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served.

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

 

 

Required/Minimum Qualifications

  • 6+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration.
  • 3+ years of experience involving service operations, data operations, monitoring, and reliability improvement.
  • Experience in managing distributed systems and/or cloud platforms.

 

Other Requirements

  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check:
    • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications

 

  • 7+ years technical experience in software engineering, network engineering, or systems administration
    • OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 4+ years technical experience in software engineering, network engineering, or systems administration
    • OR Master's Degree in Computer Science, Information Technology, or related field AND 3+ years technical experience in software engineering, network engineering, or systems administration
    • OR Doctorate Degree in Computer Science, Information Technology, or related field.​
  • Experience with open source components like Spark and Hadoop ecosystem as a plus
  • Experience using docker or Kubernetes in building large scale cloud services, distributed systems, or operating systems as a plus
  • Great curiosity and willingness to question
  • Showcase enthusiasm, integrity, ingenuity, results-orientation, self-motivation, and resourcefulness in a fast-paced competitive environment
  • Have a deep desire to work collaboratively, solve problems with teams across the world, find win/win solutions and celebrate successes
  • Get excited by the challenge of hard technical problems
  • Solve problems by always leading with deep passion and empathy for customers​
  • Publications and/or certifications related to cloud technologies a plus.​

Site Reliability Engineering IC4 - The typical base pay range for this role across the U.S. is USD $112,000 - $218,400 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $145,800 - $238,600 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until May 11, 2024.

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

 

#azdat

#azuredata

​​​​#hdinsight

#kubernetes​​

​​Participate in an on-call rotation​.
Develop foundational understanding of service and system design, technology interactions, infrastructure functions, and dependencies at scale.
Lead troubleshooting investigations to bring quicker issue resolution to complex problems impacting our customers, to improve our customer experience and contribute to the growth of our products.
Contribute to identifying optimal technology configurations and assist in implementing reliable, scalable, and high-performance solution to build and operate the service.
Ability to design and implement any changes to service telemetry for the automation to consume if it is not already available.
Use trace analysis, debug skills, source code and other proprietary tools to analyze problems and develop solutions to meet the customer requirements.​

 

Embody our Culture and Values

Microsoft
Microsoft
Data Management Developer Tools DevOps Enterprise Software Operating Systems

0 applies

33 views

Jobs from our Partners

Site Reliability Engineer

Pittsburgh, PA US

Cloud Engineer

Arlington, VA US

AWS DevOps Engineer

Philadelphia, PA US

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

50,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 257 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers