In Azure Specialized, we work collaboratively to bring the next generation of workloads to our Public Cloud platform. We work together across Microsoft to enable new end-to-end scenarios for Azure customers. Our team imagines and builds differentiating customer features and fundamental building blocks at the heart of the Azure platform, working with many industry partners.
As a Senior High Performance Computing (HPC) Software Engineer, you will be critical in designing and delivering the next generations of AI training, AI inferencing, virtual desktop, video and gaming infrastructure for Azure. You will be challenged across a wide spectrum of hardware architectures, network types and processor types. You will help define and deliver an end-to-end vertical view, with continuous focus on customer value, quality, performance and automation.
This position involves deep technical work, focusing on defining, deploying and sustaining hardware and software Azure infrastructure for AI and other GPU-based workloads. The work focuses on hardware/software interaction, coding and playing with next-gen hardware, and end-to-end systems engineering anywhere in the infrastructure, from fiber networking, switches, GPU differentiation and rack design to cluster design and more. This position offers a unique opportunity to have a huge impact on customers and the world. It is an exciting time for the team as we expand capacity and the range of supported scenarios to support the next 100X growth.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 4+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Python
- OR equivalent experience
- 4+ years of experience in High Performance Computing (HPC) or Machine Learning
Other Requirements:
- Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 8+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Master's Degree in Computer Science or related technical field AND 6+ years of technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience
- 2+ years of experience with Deep Learning and AI infrastructure, including diagnostic, profiling and performance analysis tools
- Experience with distributed systems, High Performance Computing / Machine Learning middleware, and hardware-software co-design
- Experience with profiling and performance analysis tools
Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $117,200 - $229,200 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $153,600 - $250,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until December 23, 2024.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.
We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#azurecorejobs
Responsibilities:
- Willing to dive deeply into any level or layer of a problem and learn emerging technologies, from hardware to software.
- Evaluate and make recommendations that advance Azure infrastructure for AI and other GPU-based workloads.
- Leads by example within the team by producing extensible and maintainable code. Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI). Applies metrics to drive the quality and stability of code, as well as appropriate coding patterns and best practices.
- Ensures alignment with partners' expectations. Considers partner teams across organizations and their end goals for products in order to deliver desirable user experiences and meet the dynamic needs of partners and customers through product development.
- Drives identification of dependencies and the development of design documents for a product, application, service, or platform.
- Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
- Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor the system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status, and initiating actions to restore the system/product/service for simple and complex problems when appropriate.