At NVIDIA, we are continually redefining the possibilities of compute infrastructure to power groundbreaking advancements in AI, gaming, and data centers. The Compute Infrastructure Engineering team plays a critical role in this mission by building scalable, reliable, and innovative platforms to meet the needs of our rapidly growing business.
As the Senior Manager, Compute Infrastructure Engineering, you’ll lead a team of expert engineers driving the evolution of our global IT compute environments. From integrating technologies like Kubernetes and Metal as a Service (MaaS) to enabling seamless scalability through software-defined storage, this role offers a unique opportunity to shape the future of NVIDIA’s IT infrastructure strategy. You are also going to be responsible for Foundational Infrastructure Services like Baremetal Deployment, DNS, Configuration Management and Observability
What You’ll Be Doing:
Leading a team of engineers responsible for designing, building, and maintaining compute infrastructure across virtualized, containerized, and bare-metal platforms.
Overseeing the transition from VMware-based environments to Kubernetes, enabling a hybrid infrastructure that supports both containers and virtual machines.
Developing and implementing Metal as a Service (MaaS) solutions to automate the provisioning and management of bare-metal infrastructure.
Managing critical services, including DNS, Linux systems, and Configuration management, to ensure a robust and secure infrastructure.
Driving the adoption of software-defined storage solutions (Ceph/NvmeOF) to replace legacy SAN and VSAN systems.
Collaborating with cross-functional teams, including networking, application, and storage engineering groups, to deliver comprehensive infrastructure solutions.
Defining and implementing automation, observability, and self-service capabilities to enhance operational efficiency and user experience.
Partnering with vendors and third-party providers to align on strategic goals while reducing dependency on single-vendor solutions.
What We Need to See:
Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
10+ overall years of experience in infrastructure engineering, with 5+ years in management or leadership roles.
Expertise in managing and scaling virtualized, containerized (Kubernetes), and bare-metal environments.
Strong proficiency in Linux systems, DNS, and configuration management tools like Ansible, Puppet, or Chef.
Proven track record of implementing Metal as a Service (MaaS) platforms for bare-metal deployments.
Deep knowledge of storage technologies, including SAN, NFS, and Software Defined Storage
Hands-on experience with automation tools (Terraform, Ansible) and scripting (Python, Bash).
Exceptional leadership and team development skills with a history of fostering innovation and collaboration.
Excellent communication skills, with the ability to convey complex ideas to technical and non-technical stakeholders.
Ways to Stand Out from the Crowd:
Experience designing and operating hybrid cloud environments at scale.
Demonstrated ability to drive large-scale infrastructure transformations successfully.
Expertise in implementing software-defined networking and storage solutions.
Strong understanding of observability tools and methodologies for complex infrastructures.
Passion for innovation and a vision for the future of compute infrastructure.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
The base salary range is 196,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Other Jobs from NVIDIA
Senior Manager, Hardware Engineering
Senior Software Engineer - DOCA
Software QA Engineering Intern - 2025
Software QA Engineering Intern - 2025
Senior Software Engineer - Conversational AI
Similar Jobs
Sr. Site Reliability Engineer - India Based
AVP, Principal Product Engineer -Devops (L11)
DevOps Engineer
DevOps Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say