NVIDIA

Solutions Architect, Agentic AI (Remote)

Remote Santa Clara, CA
USD 152k - 242k
Python C++ Linux PyTorch TensorFlow API Kubernetes OpenShift Docker
Description

Solutions Architect, Agentic AI

Location: US, CA, Santa Clara, US, Remote

Time Type: Full time

Job Description

Join NVIDIA as a Solutions Architect to own the evolution of Agentic AI for the enterprise. You will collaborate with top-tier enterprise software companies to build and deploy sophisticated AI-native systems, focusing on multi-agent coordination, RAG-integrated workflows, and accelerated inference. By mastering NVIDIA’s core technologies—NIM, NeMo Framework, Dynamo, and Nemo Agent Toolkit—you will guide partners through the complexities of performance optimization and production-grade deployment. As a trusted advisor, you’ll transform raw LLM capabilities into high-performance, industry-focused enterprise agents.

What you'll be doing:

  • Build complex agentic systems featuring multi-agent coordination, long-horizon reasoning, and advanced planning frameworks.

  • Develop full-scale solutions, including domain-specific enterprise agents and high-performance retrieval pipelines (RAG) spanning various data sources.

  • Optimize inference performance by bringing to bear GPU-accelerated frameworks and the full NVIDIA AI infrastructure stack.

  • Build hands-on PoCs and reference architectures that serve as the blueprint for production-grade generative AI pipelines.

  • Collaborate alongside Enterprise ISVs to integrate NVIDIA software into native platforms, accelerating the deployment of production workloads.

  • Collaborate with diverse internal teams to improve NVIDIA software through feedback from real-world implementations.

  • Empower partner engineering teams through technical workshops, deep-dive architecture reviews, and developer enablement.

  • Scale global expertise by crafting reusable assets and documentation that help field teams deploy agentic AI at scale.

What we need to see:

  • BS/MS/PhD in Computer Science, Electrical Engineering, AI/ML, or equivalent experience.

  • More than 5 years of experience in deep learning, machine learning, or distributed AI systems.

  • Strong programming and debugging experience in Python, C/C++, and Linux environments.

  • Background in using deep learning libraries like PyTorch or TensorFlow.

  • Hands-on experience building LLM and generative AI applications.

  • Experience working with agentic or multi-agent AI systems employing frameworks such as:

1. LangGraph

2. LlamaIndex

3. CrewAI

4. LangChain

5. OpenAI Agents SDK or similar orchestration frameworks

  • Experience building tool-using AI agents that interact with APIs, databases, and enterprise systems.

  • Ability to rapidly prototype AI applications and build scalable GPU-accelerated architectures.

  • Excellent interpersonal skills and the ability to collaborate with engineering teams, partners, and executive collaborators.

Ways to Stand Out from the Crowd:

  • Experience working with NVIDIA GPUs and AI software, such as NVIDIA NIM, NeMo Framework, NeMo Retriever, and NeMo Agent Toolkit.

  • Experience with LLM evaluation frameworks, benchmarking systems, and safety guardrails for agentic workflows.

  • Experience optimizing reasoning-focused LLMs through timely engineering, quantization, or benchmarking.

  • Familiarity with Kubernetes/OpenShift, CI/CD automation, and cloud-native deployment patterns for AI systems.

  • Experience with parallel or distributed computing environments and AI workloads optimized for GPUs.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 14, 2026.

This posting is for an existing vacancy. 

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA
NVIDIA

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say