Why do you charge job seekers to use EchoJobs?

We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.

How many software engineering jobs are on EchoJobs?

We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!

So, where do the jobs come from?

We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.

What makes EchoJobs different?

We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️

How often are new jobs added?

Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀

How fast can I find a job?

Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯

How often should I check EchoJobs?

Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

Description

Lead AI Platform

Location: Cairo, Cairo Governorate, Egypt

Department: Software Development

Workplace: hybrid

Employment Type: full

Description

Integrant is looking for game changers to join our team as " Lead AI Platform".

The Lead AI Platform Engineer is responsible for bridging AI workloads with production-grade infrastructure, with a strong focus on NVIDIA AI stack, enabling high-performance, scalable, and optimized AI systems.

This role focuses on model optimization, runtime efficiency, and GPU utilization, ensuring that AI workloads are production-ready, cost-efficient, and performant across enterprise environments.

Roles and Responsibilities:

Translate AI/ML workloads into optimized infrastructure and deployment strategies

Optimize model performance across GPU environments (latency, throughput, memory utilization)
Design and implement inference and training pipelines using NVIDIA stack tools (TensorRT, Triton, NIM)
Convert and optimize models across frameworks (PyTorch → ONNX → TensorRT)
Analyze and resolve performance bottlenecks using profiling tools (GPU, memory, network)
Improve GPU utilization and scheduling efficiency across clusters
Design scalable distributed training and inference architectures
Work closely with customers to define AI infrastructure strategies and deployment models
Support production deployments including monitoring, rollback, and performance validation
Conduct applied research to improve model efficiency and infrastructure utilization
Mentor team members on AI infrastructure, optimization, and GPU systems
Experiment tracking tools (MLflow, W&B, Neptune) log parameters, metrics, and artifacts for comparison
Find the Model degradation happens post-deployment: concept drift, data pipeline changes, traffic pattern shifts
Root cause analysis (RCA) applies to ML systems: isolating variables, reproducing issues

Requirements

8+ years of experience in AI systems
8+ years of experience in ML systems, HPC and AI infrastructure
Strong proficiency in Python
Strong experience with GPU-based AI workloads and performance optimization
Deep understanding of model optimization techniques (quantization, pruning, batching)
Hands-on experience with:

PyTorch
ONNX / ONNX Runtime
TensorRT / TensorRT-LLM
Triton Inference Server

Knowledge of CUDA, cuDNN, and GPU architecture fundamentals
Experience with distributed systems (multi-GPU / multi-node)
Familiarity with:

NCCL communication
NVLink / InfiniBand
Kubernetes or Slurm for orchestration

Experience deploying AI models into production environments
Ability to analyze system bottlenecks (compute, memory, network)
Experience with profiling tools (Nsight, TensorRT profiler, etc.)
Knowledge of cost optimization strategies for GPU workloads
Experiment tracking tools (MLflow, W&B, Neptune) log parameters, metrics, and artifacts for comparison
Find the Model degradation happens post-deployment: concept drift, data pipeline changes, traffic pattern shifts
Root cause analysis (RCA) applies to ML systems: isolating variables, reproducing issues

Nice to Have

Experience with NVIDIA NIM and NGC ecosystem
Exposure to Megatron-LM, NeMo, or large-scale LLM training/inference
Experience with LLM optimization techniques (KV cache, batching strategies)
Familiarity with MLOps practices and CI/CD for AI systems
Experience in customer-facing architecture or consulting roles
Familiarity with hybrid cloud / on-prem HPC environments

Benefits

Salary paid in USD
Six-month career advancing opportunities
Supportive and friendly work environment
Premium medical insurance [employee +family]
English language development courses
Interest-free loans paid over 2.5 years
Technical development courses
Planned overtime program (POP)
Employment referral program
Premium location in Maadi
Social insurance

Integrant

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say