Zencore

Principal Architect, AI/ML (Remote)

Remote
AI Machine Learning Google Cloud TPUs vLLM MLOps PyTorch JAX LangGraph Langchain Google ADK Vertex AI GKE LoRA TensorRT-LLM LangSmith LangFuse API
Description

Principal Architect, AI/ML

Team: Data & Analytics

Location: Remote

Commitment: Full-Time

Workplace Type: remote

Zencore is a fast-growing company founded by former Google Cloud leaders, architects, and engineers. We are seeking candidates with significant experience in Google Cloud to join our team. Our engagements aim to eliminate obstacles, reduce risk, and accelerate timelines for customers transitioning to Google and seeking assistance with data and application modernization. We embed within customer teams to provide strategic guidance, facilitate technology decisions, and execute projects in a collaborative, co-development style.

As a member of our Cloud Engineering team, you will be working with fast-paced innovative companies, leveraging AI as the key driver of their transformation. Our clients will look to you as their trusted advisor, someone they can rely on and who will be there to help them along their AI journey. You will be expected to cover a large spectrum of technology topics like model optimization, high-performance training on specialized hardware (TPUs), efficient model serving (vLLM), MLOps, and complex agentic systems.
 
At Zencore, a Principal Architect is a key technical leader in our engineering organization and acts as an ambassador of our technical and cloud engineering expertise. Principal Architects at Zencore are able to navigate a broad technical range but are specialized in one or more domains. They are responsible for the technical oversight and end-to-end delivery of projects within our professional services business.
 
Whilst this is remote role, we are only considering US based candidates and do not offer visa sponsorship.

What you will do...

  • Serve as Zencore’s senior-most technical authority on the practical application of advanced artificial intelligence and machine learning.
  • Partner with the sales and business development teams in a pre-sales capacity to scope opportunities, design solutions for proposals, and act as the senior technical voice in client pitches.
  • Lead the architecture and design of sophisticated, secure, and scalable AI solutions for our clients, moving beyond standard API integrations to create genuine competitive advantages.
  • Collaborate closely with Cloud & Data Architects to guarantee the design and deployment of comprehensive client solutions.
  • Address the growing demand for private, data-sovereign AI by designing systems that meet strict GDPR and data privacy requirements. Strive for model explainability and bias mitigation, ensuring solutions adhere to ethical standards and safety guardrails.
  • Architect solutions for hosting, fine-tuning, and optimizing both proprietary (e.g., Gemini, Claude) and open-source (e.g., Llama, Mistral) models on hyperscaler platforms.
  • Lead clients in selecting optimal cloud-native technologies, prioritizing Google Cloud solutions for deploying and scaling production-grade agentic systems.
  • Guide and mentor customers and Zencore's engineering teams on advanced topics, establishing best practices for high-performance training (PyTorch, JAX, TPUs), efficient model serving (vLLM), and complex agentic systems (LangGraph, Langchain, Google ADK).
  • Devise the financial architecture of AI solutions by performing ROI analysis and implementing cost-optimization strategies to ensure large-scale deployments remain economically sustainable for customers.
  • Act as an external thought leader, contributing to the Zencore brand through blog posts, conference presentations, and community engagement.
  • Act as a "player-coach," providing hands-on leadership and fostering a culture of deep technical excellence in AI/ML.

Who we need...

  • Master’s degree in Computer Science, natural sciences, mathematics, or a related technical field, or equivalent practical experience in designing and delivering high-scale AI/ML systems.
  • Extensive experience in a senior or principal architect role with a proven track record of designing and delivering complex, production-grade machine learning systems that have created measurable business value.
  • Deep, hands-on architectural experience with at least one major cloud platform (GCP, AWS, or Azure) is required.
  • Direct, hands-on experience with Google Cloud (Vertex AI, GKE, TPUs) is a significant plus.
  • Proven expertise in LLM optimization, including techniques for quantization, pruning, efficient fine-tuning (e.g., LoRA), and high-performance serving (e.g., vLLM, TensorRT-LLM).
  • Hands-on experience with high-performance ML frameworks (e.g., JAX, PyTorch/XLA) for training or fine-tuning large-scale models.
  • Expertise in designing and deploying agentic workflows using both code-centric (e.g., LangGraph, LangChain, Google ADK) and low-code (e.g., Vertex AI Agent Builder, LangSmith Agent Builder) paradigms.
  • A strong understanding of the architectural patterns required for building secure, private, and data-sovereign AI solutions.
  • Experience with LLM observability and evaluation frameworks (e.g., LangSmith, LangFuse, Vertex AI Evaluation).
  • Exceptional communication and stakeholder management skills, with the ability to articulate complex technical concepts and their business value to both technical and non-technical audiences.
  • A passion for mentoring and a drive for continuous learning in the fast-evolving AI landscape.
We are a fully remote company and offer competitive compensation and benefits.

Zencore is committed to a diverse and inclusive workplace. Zencore is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.

Zencore
Zencore

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say