The NVIDIA Cloud Functions (NVCF) team is looking for a motivated, product-minded Senior Distributed Systems Software Engineer with an observability focus. Our team builds and operates a serverless deployment platform for AI applications. Our product enables and scales AI inferencing workloads through globally distributed orchestration on GPU-backed, cloud-agnostic Kubernetes clusters. You will work with a team of passionate and skilled engineers who are continuously innovating at the speed of light to provide the best product possible for both external customers and other NVIDIA teams. We are looking for someone to join us at the forefront of defining cloud engineering and observability paradigms for AI at scale.
What You'll Be Doing:
Design highly available and scalable systems to meet our observability and performance requirements.
Lead engineering projects that directly impact NVCF customer experience and platform reliability.
While handling key responsibilities, collaborate with and influence other specialists within and across engineering groups to create the best platform possible.
Evaluate new and innovative technologies and tooling as the AI-at-scale landscape evolves to ensure we have the highest level of operational and engineering excellence.
Mentor and enable other engineering teams building products on top of our platform on the best paradigms for monitoring, performance and reliability.
What We Need to See:
6+ years of validated experience in the design, implementation, and delivery of large engineering projects. A flexible technologist familiar with all aspects of the software development lifecycle.
Excellent communication and collaboration skills. Able to ask the right questions and to quickly see and align with the big picture.
Ability to prioritize and drive quality-of-life (QoL) improvements for customers and engineers. Able to articulate the customer impact of every feature you deliver!
Skilled with at least two of the following programming languages and strong in at least one of them: Golang, Java, Python, Scala, Rust.
Understands the scalability and performance challenges associated with cloud services. Able to craft horizontally scalable, resilient systems that perform well under load.
BS in Computer Science or equivalent experience.
Ways to stand out from the crowd:
Prior experience with building and monitoring solutions deployed on Kubernetes.
Background with open-source observability tooling such as OpenTelemetry, Prometheus, Elastic, and Grafana.
Past experience as an engineering team lead.
Past experience as an SRE or equivalent.
NVIDIA offers highly competitive salaries and a comprehensive benefits package. NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us. Are you a creative engineer with a drive for advancing the state of AI and bringing it to the cloud? If you love to tackle problems and advocate for continuous, innovative improvement, we want to hear from you!
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.