NVIDIA is looking for outstanding software and systems engineers to help us develop and operate our enterprise GPU infrastructure management systems across Clouds. In this role, you will work closely with the broader NVIDIA team to operate, design and build infrastructure management systems, Kubernetes operators, and end-to-end HPC integration solutions that combine GPUs with the rest of the datacenter software management ecosystem. We are focused on supporting NVIDIA products across HPC, Cloud, and enterprise on both bare metal and virtualized platforms as the role of GPUs in all of these environments expands. Your contributions will span many aspects of GPU systems management, including Cloud provisioning, observability, operations and incident response. The systems you operate will support single-node developer systems through large clusters with thousands of nodes deployed on multiple Cloud providers.
To succeed, you must have a strong system and software development background, familiarity with modern distributed systems especially the Cloud-native ecosystem, and a proven work ethic. You will be expected to jump in quickly and provide valuable contributions from day one. This is a dynamic work environment with many exciting opportunities awaiting. NVIDIA GPUs are central to many hot enterprise, cloud, and datacenter trends. Come join us as we craft the future of accelerated computing and AI.
What you'll be doing:
Enable GPU provisioning and life-cycle with state-of-the-art Cloud-Native open-source ecosystem solutions, including Kubernetes, Docker, Prometheus, TerraForm and Crossplane.
Develop, maintain and/or operate robust, scalable Go programs in a Kubernetes environment.
Develop the next-generation multi-cloud infrastructure management systems to support GenAI.
Support internal and external users through bug fixes, documentation, and feature improvements.
Maintain high-quality products through robust test coverage and Day 2 capabilities..
What we need to see:
BS or higher in Computer Science or equivalent experience.
5+ years of meaningful industry experience with a strong Kubernetes and DevOps background
Deep understanding and execution skills of all aspects of the software development lifecycle
Experience with OpenAPI and Kubernetes Custom Resource Definitions
Outstanding written and verbal interpersonal skills
Strong motivation and commitment to learn new skills
Ability to manage time in a fast, heavily multitasked environment
Ways to stand out from the crowd:
Open Source contributions to the Cloud-Native community.
Strong experience with GitHub/GitLab CI/CD pipelines and application configuration..
Strong knowledge of container technologies, orchestration frameworks and observability systems.
Exposure to GPU programming with CUDA a plus.
Experience with managing and operating HPC schedulers and/or working across multiple Cloud providers.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!
Other Jobs from NVIDIA
Automotive DriveOS Software Architect
Director of Mechanical Engineering
Senior AI Infrastructure Engineer
Senior GPU System Performance Architect
Senior DL Algorithms Engineer - Inference Performance
Similar Jobs
Senior Billing Systems Engineer
Senior Software Engineer
Software Developer Engineer In Test
Senior Staff MLOps Engineer
Sr Staff Data Platform Software Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say