NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.
Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Join NVIDIA, where we are pushing the boundaries of what's possible in AI and cloud computing. As a versatile System Software Engineer - AI and Cloud, you will be part of a team of dedicated professionals that thrives on innovation and collaboration. Located in the heart of Silicon Valley, you will have the opportunity to work on groundbreaking projects that craft the future of technology. This role offers an outstanding chance to engage with advanced AI models and cloud-native architectures, making significant contributions to NVIDIA’s versatile products and technologies.
What you'll be doing:
Evaluate cloud-native, full-stack applications using microservices architecture to power AI use cases, bringing to bear NVIDIA frameworks, SDKs, and microservices.
Design and implement agentic workflows with advanced techniques like Retrieval-Augmented Generation (RAG) and the latest AI models.
Evaluate user experiences and analyze the technical performance of AI solutions, compiling findings into comprehensive reports. Offer practical suggestions for product improvement to senior executives and engineering management.
Engage with various teams across NVIDIA such as product, marketing, hardware, software engineering, and QA to improve NVIDIA's product offerings.
Develop developer-focused content, including detailed tutorials and code samples, to demonstrate the latest features in NVIDIA’s tools and libraries.
Write technical whitepapers and product briefs, and run technical demos of our products at prominent industry conferences.
What we need to see:
A Bachelor’s or Master’s in Software Engineering, Computer Science, Computer Engineering, Electrical Engineering or a related degree (or equivalent experience)
3+ years of experience.
Proficiency in Python and JavaScript for programming and debugging, with a strong foundation in data structures, algorithms, and software design principles.
Basic familiarity with C++ programming and its application in high-performance computing environments.
Experience in crafting cloud-native systems optimized for Kubernetes deployment, using inference frameworks such as vLLM and NVIDIA Triton Inference Server.
A solid understanding of API design principles for building scalable, production-ready inference systems.
Ways to stand out from the crowd:
Advanced knowledge of LLMs, modern AI software architecture, and cloud APIs.
Contributions to public-facing technical content and open-source projects.
Expertise in deploying LLM inference frameworks like Triton Inference Server, vLLM, or TensorRT, including on Kubernetes or edge devices to improve performance.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Other Jobs from NVIDIA
Senior Site Reliability Engineer - GeForce Now
Senior Chip Design Engineer
Senior Formal Verification Engineer
Senior Software Engineer - NIM Factory Automation
Senior Data Center Engineer
Senior DGX Cloud Performance Engineer
Similar Jobs
Sr. Security Vulnerability Engineer
Senior Security Engineer
Senior Fullstack Engineer, CSP
PRINCIPAL SOFTWARE ENGINEERING MANAGER
Principal Software Engineer
Software Engineer II
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say