Weekday

Software Development Engineer, Web Platforms

Bengaluru Delhi NCR
Python SQL JavaScript Kubernetes Docker Prometheus Grafana FluentD Elasticsearch Logstash Kibana MLflow LangChain LangGraph LlamaIndex Flowise AWS Azure GCP DynamoDB Cosmos DB MongoDB PostgreSQL Aurora Spanner BigQuery Hugging Face Llama Mixtral Claude GPT Gemini SageMaker Vertex AI Azure AI DeepSpeed vLLM Helm API Microservices Machine Learning AI Deep Learning
Description

Software Development Engineer – Web Platforms

Team: Technology, Information and Media

Location: Bengaluru, Delhi NCR

Workplace Type: onsite

Salary:

We are seeking a highly skilled LLM Systems & MLOps Architect to drive end-to-end architecture, optimization, and deployment of large-scale AI and LLM-based systems. This role is ideal for experts who can blend deep machine learning understanding with strong distributed systems engineering and MLOps capabilities. You will lead the design and implementation of advanced ML production pipelines, optimize GPU-based training and serving infrastructures, and enable efficient delivery of large language model solutions across cloud environments.

What You’ll Own
LLM & ML Pipeline Architecture
Architect and build scalable ML pipelines supporting experiment tracking, model versioning, feature stores, and automated retraining workflows.
Develop high-performance APIs and microservices for real-time model inference and complex multi-model serving environments.
Implement best practices across ML lifecycle management using tools such as MLflow, SageMaker, Vertex AI, and Azure AI.
High-Performance Model Serving & GPU Infrastructure
Design and optimize distributed GPU environments for training and inference of large language models.
Implement model and data parallelism strategies using frameworks such as DeepSpeed, vLLM, and other scalable serving runtimes.
Improve throughput, reduce latency, and optimize resource utilization for large-scale LLM deployments.
Model Fine-Tuning & Performance Optimization
Lead fine-tuning and parameter-efficient training of LLMs and LVMs to enhance accuracy, adaptability, and latency.
Reduce compute cost and training cycle time through advanced optimization methods and architecture improvements.
LLMOps & Production Automation
Implement production-grade automation for training, deployment, monitoring, and rollback using Kubernetes, Docker, Helm, and orchestration systems.
Leverage modern AI workflow frameworks such as Langflow, Flowise, Langgraph, and LangChain for enterprise-scale LLM orchestration.
Build observability, metrics dashboards, and automated reliability systems using Prometheus, Grafana, FluentD, and ELK stack.

Tech Stack & Expertise
LLM Frameworks: Hugging Face transformers, Llama, Mixtral, Claude, GPT, Gemini
LLMOps & Tooling: MLflow, LangChain, LangGraph, LlamaIndex, Flowise, Bedrock, SageMaker, Vertex AI, Azure AI
Cloud: AWS, Azure, GCP
Databases & Warehousing: DynamoDB, Cosmos, MongoDB, RDS, PostgreSQL, Aurora, Spanner, BigQuery
DevOps & Infra: Kubernetes, Docker, Prometheus, Grafana, FluentD, ELK Stack
Languages: Python, SQL, JavaScript
Bonus Certifications: AWS Pro Solutions Architect, AWS ML Specialty, Azure Solutions Architect Expert

Who You Are
Problem-solver passionate about scaling AI systems and pushing boundaries of LLM performance.
Strong communicator who thrives in cross-functional collaboration.
Curious and research-driven, staying current with emerging AI and distributed computing trends.

This role is for one of our clients

Industry: Human Resources Services
Seniority level: Mid-Senior level

Min Experience: 6 years
Location: Bengaluru, Karnataka, NCR, Delhi
JobType: full-time
Weekday
Weekday

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say