NVIDIA

Senior Distributed Systems Engineer, AI Infrastructure

Shanghai, China
Java Scala Kubernetes Deep Learning Microservices Machine Learning AWS Go Python Docker
Description

NVIDIA is hiring a senior data and distributed systems engineer to architect, lead and develop our exa-scale AI infrastructure and deep learning platform for Autonomous Vehicles. You will need to have strong programming skills, a deep understanding of cloud technologies, distributed storage & compute systems, and distributed systems architecture. You will need to have excellent communication and planning skills. You ideally have experience in securing distributed systems or willingness to learn it. Finally, you will need engineering technical leadership skills. Together, we will build the exa-scale software 2.0 cloud platform for one of the most ambitious problems of our time: autonomous vehicles. Then we will apply it to other applications such as medical imaging, data science, genomics and more.

 

What you'll be doing:

  • Architect and build scalable and distributed services that will help power the AI infrastructure for deep learning platforms.

  • Design and build infrastructure and microservices that help index, mine, transform, and compose PB sized deep learning datasets.

  • Design the next generation of dataset management services for real and synthetic / simulated datasets.

  • You will enable smart data selection - one of the key ingredients for successful machine learning!

  • Collaborate with multiple AI teams to understand their requirements and build a future-proof platform that improves their productivity.

  • Be a technical leader on various projects across the platform, and be a major contributor of the entire platform’s architecture.

  • Support users of the platform.

 

What we need to see:

  • BS, MS, or PhD in Computer Architecture, Computer Science, Electrical Engineering or related field or equivalent experience.

  • 5+ years of Work or Research Experience in distributed systems development and design.

  • Strong programming background that incorporates methodologies like data structures, design patterns, OOP, and test driven development.

  • Proven technical foundation in distributed computing and storage, including significant experience with most of the following: server systems, storage, I/O, networking, and systems software.

  • Hands-on experience in or willingness to learn about authentication and authorization as well as the related technologies such as OIDC, TLS, AWS IAM, role-based access control, attribute-based access control, Open Policy Agent.

  • Advanced programming skills to build distributed storage and compute systems, backend services, microservices, and web technologies.

  • A specialist programmer in Go, Java or C/C++.

  • Ability to switch effectively between long-term strategic and near-term tactical topics.

  • Highly motivated with strong interpersonal skills, you have the ability to work successfully with multi-functional teams, principles and architects and coordinate optimally across interpersonal boundaries and geographies.

  • A track record of successful technical leadership and large-scale architecture that impacted critical projects.

 

Ways to stand out from the crowd:

  • Experience building MLOps or AI/ML solutions on-premise or in the cloud.

  • Hands-on experience in or willingness to learn about security topics such as secure design, secure coding, data protection, zero trust networks, and incident response management.

  • Sophisticated programming expertise in Scala, or Python.

  • Experience with Kubernetes and Docker as well as open source contributions.

  • A proactive demeanor to investigate and understand technical requirements.

 

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working with us and our engineering teams are growing fast in some of the hottest state of the art fields: Deep Learning, Artificial Intelligence, and Autonomous Vehicles. If you're a creative computer scientist/engineer with a real passion for distributed systems and autonomous driving, we want to hear from you.

NVIDIA
NVIDIA
Artificial Intelligence (AI) GPU Hardware Software Virtual Reality

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say