Microsoft

Reliability Engineer

Redmond, WA US
USD 98k - 208k
Azure
Search for More Jobs Talk to a recruiter now 💪
Description

Microsoft Cloud Infrastructure and Operations (CO+I) is the engine that powers Microsoft's cloud services. The group is responsible for designing, building, and operating Microsoft’s global datacenters; managing the programmatic delivery of our critical infrastructure design, equipment procurement, construction delivery, infrastructure innovation, demand planning and capacity utilization of our unified infrastructure; and responsible for all operations needed to run the physical infrastructure.

 

We focus on smart growth with an emphasis on automation, data-driven engineering, cost‐effectiveness, and environmental sustainability. We deliver the core infrastructure and foundational technologies for Microsoft's 200+ online businesses including Azure, Office 365, Bing, Xbox Live, Skype, and OneDrive.  Our portfolio is built and managed by a team of subject matter experts working 24x7x365 to support services for more than 1 billion customers and 20 million businesses in over 90 countries worldwide.  Empower Billions!  

 

We are seeking a Reliability Engineer to join and work within our dynamic team. The candidate selected will have a proven track record in driving equipment reliability efforts. 

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.  

 

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Required/Minimum Qualifications:

  • 7+ years relevant technical engineering experience
    • OR Bachelor's Degree in Mechanical Engineering, Reliability Engineering, Electrical Engineering, or related field AND 3+ years technical engineering experience
    • OR Master's Degree in Mechanical Engineering, Reliability Engineering, Electrical Engineering, or related field AND 2+ years technical engineering experience.
  • 3+ years of technical engineering experience in electrical or mechanical infrastructure equipment
  • 3+ years of reliability engineering expertise

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: 

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred/Additional Qualifications:

  • 10+ years relevant technical engineering experience
    • OR Bachelor's Degree in Mechanical Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years technical engineering experience
    • OR Master's Degree in Mechanical Engineering, Reliability Engineering, Electrical Engineering, or related field AND 3 + years technical engineering experience.

 

Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $98,300 - $193,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $127,200 - $208,800 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until July 8, 2024.

 

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

 

#COICentralOpsCareers 

#COICareers 

  • Conducts Failure Mode, Mechanism, and Effect Analysis (FMMEA) to identify equipment risks, impact, and potential mitigation solutions with minimal guidance. The candidate will be working with CO+I - Reliability Engineers to build detailed FMMEAs/Fault Tree Analysis (FTA) of new equipment and will help in refining existing FMMEAs/FTAs.  
  • Reads device and reliability specification sheets and interpret complex details required to qualify, design, or evaluate various hardware reliability risks.   
  • Identify trends and hidden patterns in telemetry associated with equipment failure and events.   
  • Work with internal stakeholders to understand data/telemetry gaps and propose solutions.    
  • Work closely with data scientists to identify and develop new tools and techniques to create failure models and improve existing model performance.   
  • Understand the impact of manufacturing processes and techniques on equipment reliability.  
  • Utilize domain knowledge expertise and reliability experience in building failure knowledge of critical equipment such as breakers, UPS (Uninterruptible Power Systems), Genset, etc.   
  • Act as a liaison between reliability and standards team to support condition-based maintenance practices.   
  • Identify gaps in existing telemetry and help support in telemetry onboarding requests.  
  • Use knowledge of manufacturing process capability as well as system-level performance requirements to establish Critical-to-Reliability performance metrics. 
  • Embody our Culture and Values.  
Microsoft
Microsoft
Data Management Developer Tools DevOps Enterprise Software Operating Systems

0 applies

13 views

Jobs from our Partners

Senior Data Engineer

Colorado Springs, CO US

Cloud Engineer

Colorado Springs, CO US

Similar Jobs

Senior Data Engineer

Colorado Springs, CO US

Cloud Engineer

Colorado Springs, CO US

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 307 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers