We are looking for an experienced system engineer, who will play a dual role on the NVIDIA Enterprise Experience (NVEX) team. An awesome candidate is highly technical who can triage customer software issues and resolve customer problems as well as someone who can develop key enhancements and tools for the DGX Cloud, NVAIE and potentially other Enterprise related system software.
This individual should have proven grasp of platform and systems engineering who understands Linux internals, knowledge about servers and has the ability to resolve hardware and/or OS internal issues. If you have a real passion for technology, and you are interested in a role that you can make a difference in and contribute at all different levels, this may be a phenomenal position for you.
What you'll be doing:
Taking ownership of and driving customer issues related to NVIDIA AI Enterprise software and hardware deployments, both internally and at Cloud Service Providers (CSPs), from inception through to resolution.
Develop features and tools as part of solution engineering efforts to support all Enterprise Service offerings including, but not limited to NGC, Container Orchestrators (such as Kubernetes), GPU accelerated applications, and Deep Learning frameworks.
Work with NVIDIA Enterprise customers and internal users to improve the availability, reliability, and overall experience of working with NVIDIA Deep Learning Framework containers on NVIDIA GPUs.
Take ownership and drive customer issues on containers, Deep Learning frameworks, and Cloud deployment from inception to resolution.
Build upon the opportunity to research new use cases with GPUs for emerging container technologies and Deep Learning frameworks.
Bring independent analysis, communication, and problem-solving to customer experience.
Be on call one weekend per month in the event a customer has a Sev1 outage and requires engineering assistance.
What we need to see:
BS in Computer Science, Electrical Engineering, Computer Engineering, or related field (or equivalent experience).
At least 5 years system software development and troubleshooting experience, ideally with some customer facing.
Intellectual curiosity, positive attitude, flexibility, analytical ability, self-motivation, and team-oriented.
Strong computer science concepts and excellent knowledge of Python and scripting methodologies.
Deep understanding of at least two of the following: data centers, servers, distributed systems, virtualization, deep learning frameworks, containers/containerization (ie Docker, Kubernetes), hybrid cloud (ie AWS, GCP).
Proven grasp of datacenter, cloud, and Artificial Intelligence technologies to provide comprehensive solutions for sophisticated installations, maintenance, or operations.
Professional-level communication skills, interpersonal skills with a passion to solve problems.
A self-starter with a passion for solving problems, intellectual curiosity, positive approach, flexibility, analytical ability, self-motivation, and a team-oriented attitude.
Ways to stand out from the crowd:
Experience working with distributed systems especially container orchestrators.
Experience as a developer and/or support team member addressing customer concerns for large enterprise/service provider customers at a company that produces AI and data analytics software.
Proven experience in developing, triaging and debugging on Linux and Containers and deep learning frameworks.
Background in developing or debugging AI and data analytics software.
Certified in CSP (Azure, AWS, GCP or OCI) or Hypervisor (Citrix, Nutanix, Red Hat or VMware) Technologies.
Experience with PyTorch, TensorFlow or AI Frameworks
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us and, due to unprecedented growth, our world-class engineering teams are expanding fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.
Other Jobs from NVIDIA
Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems
System and Software Networking Architect, HPC
Senior Mechanical Engineer
Similar Jobs
Senior AI Ops Engineer
Lead Software Engineer-AI
Full-Stack Engineer (Go/Python)
Senior ML Engineer
Senior ML Engineer| Data Science
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say