SDE II - ML/AI Engineer
Location: Jaipur
Department: Data and AI
Experience: 3-5 Years
- Fine-tune and evaluate LLMs (Hugging Face Transformers, Ollama, LLaMA) for specialized tasks
- Deploy high-throughput inference pipelines using vLLM or Triton Inference Server
- Design agent-based workflows with LangChain or LangGraph, integrating vector databases (Pinecone, Weaviate) for retrieval-augmented generation
- Build scalable inference APIs with FastAPI or Flask, managing batching, concurrency, and rate-limiting
- Develop and optimize CV models (YOLOv8, Mask R-CNN, ResNet, EfficientNet, ByteTrack) for detection, segmentation, classification, and tracking
- Implement real-time pipelines using NVIDIA DeepStream or OpenCV (cv2); optimize with TensorRT or ONNX Runtime for edge and cloud deployments
- Address data challenges with augmentation, domain adaptation, and semi-supervised learning, and mitigate model drift in production
- Containerize models and services with Docker; orchestrate with Kubernetes (KServe) or AWS SageMaker Pipelines
- Implement CI/CD for model/version management (MLflow, DVC), automated testing, and performance monitoring (Prometheus + Grafana)
- Manage scalability and cost by leveraging cloud autoscaling on AWS (EC2/EKS), GCP (Vertex AI), or Azure ML (AKS)
- Define SLAs for latency, accuracy, and throughput alongside product and DevOps teams
- Evangelize best practices in prompt engineering, model governance, data privacy, and interpretability
- Mentor junior engineers on reproducible research, code reviews, and end-to-end AI delivery
- LLM Frameworks & Tooling:
- Agent & Retrieval Tools:
- Inference Serving:
- Computer Vision Frameworks & Libraries:
- Model Optimization:
- MLOps & Versioning:
- Monitoring & Observability:
- Cloud Platforms:
- Programming Languages:
- Bachelor’s or Master’s in Computer Science, Electrical Engineering, AI/ML, or a related field
- 3–5 years of professional experience shipping both generative and vision-based AI models in production
- Strong problem-solving mindset, with the ability to debug issues such as LLM drift, vector index staleness, and model degradation
- Excellent verbal and written communication skills
- LLM Hallucination & Safety: Implement grounding, filtering, and classifier layers to reduce false or unsafe outputs
- Vector DB Scaling: Maintain low-latency, high-throughput similarity search as embeddings grow to millions
- Inference Latency: Balance batch sizing and concurrency to meet real-time SLAs on cloud and edge hardware
- Concept & Data Drift: Automate drift detection and retraining triggers in vision and language pipelines
- Multi-Modal Coordination: Seamlessly orchestrate data flow between vision models and LLM agents in complex workflows
