AI Systems Architect - LLM & Vector Infrastructure
Location: Riyadh, Riyadh Province, Saudi Arabia
Department: Information Technology (IT)
Workplace: on_site
Employment Type: full
Description
We are seeking a senior AI Systems Architect to design and implement AI-native application cores where Large Language Models (LLMs), vector databases, retrieval systems, and agent frameworks form the primary computational layer of our web and mobile applications.
This role is responsible for architecting scalable AI pipelines, retrieval-augmented generation (RAG) systems, memory architectures, AI agents, and orchestration workflows integrated with our development stack (Web, Mobile, n8n automation, and AI services).
The ideal candidate understands that AI is not a feature, it is the operating system of the product.
Key Responsibilities
1. AI Core Architecture Design
- Design AI-first system architecture for web and mobile applications
- Architect RAG pipelines using vector databases
- Define long-term memory, short-term memory, and contextual state systems
- Implement multi-agent AI systems
- Design AI orchestration layers
2. Vector Database & Embedding Systems
- Select and implement vector databases such as:
- Pinecone
- Weaviate
- Qdrant
- Milvus
- Supabase (pgvector)
- Optimize embedding strategies
- Implement hybrid search (semantic + keyword)
- Design scalable indexing pipelines
3. LLM Integration & Optimization
- Work with models such as:
- OpenAI APIs
- Anthropic
- Meta (LLaMA)
- DeepSeek
- Alibaba (Qwen)
- Implement structured output pipelines
- Design evaluation and prompt testing frameworks
- Optimize cost-performance ratio
4. AI Agent Systems & Orchestration
- Build autonomous AI agents
- Design tool-calling systems
- Integrate with:
- n8n
- LangGraph / LangChain style agent flows
- Implement memory-aware agents
5. Production AI Engineering
- Build monitoring systems for hallucination detection
- Design guardrails and validation layers
- Implement evaluation datasets and benchmarking
- Ensure security of AI pipelines
- Build scalable infrastructure (Docker, Kubernetes, GPU optimization)
Requirements
Technical Expertise
- 5+ years software engineering experience
- 2+ years building production AI systems
- Deep knowledge of:
- Vector embeddings & similarity search
- RAG architectures
- Tokenization and context window optimization
- Fine-tuning & LoRA concepts
- Prompt evaluation frameworks
- Experience with Python (mandatory)
- Experience with FastAPI / backend services
- Experience designing scalable APIs
Architecture Experience
- Designing distributed systems
- Microservices & event-driven architecture
- Experience with PostgreSQL + pgvector
- Experience deploying LLM systems in production
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
