Sr AI ML Developer
Location: Chennai, Pune, Bengaluru
Time Type: Full time
Job Description
What We'll Bring:
Lead the design and delivery of enterprise-scale AI/GenAI solutions (LLM apps, RAG pipelines, real-time processing, cloud-native services) across a polyglot stack (Python + Java). Own the technical roadmap from concept to deployment, ensuring scalability, performance, security, and responsible AI (fairness, transparency, compliance).
Serve as a trusted technical leader, mentoring engineers, data scientists, and architects; define architecture standards, patterns, and best practices across teams.
Drive PoCs and technical evaluations of emerging AI/GenAI technologies (including LangChain/LangGraph & LangChain4j, DJL, ONNX Runtime Java), aligning innovations with business outcomes.
Bridge business stakeholders and engineering, translating complex requirements into robust designs and measurable impact.
How You’ll Contribute:
Architecture & Delivery
- Architect end-to-end AI platforms integrating LLMs, RAG, streaming, vector search, and CI/CD—implemented via Python services and Java microservices (Spring Boot/Quarkus/Micronaut).
- Define standards for REST/gRPC APIs, OAuth2/OIDC security, observability (Micrometer, OpenTelemetry), and SLIs/SLOs.
- Establish coding, versioning, monitoring, and governance standards for ML systems; champion reproducibility (MLflow/DVC) and model registries.
LLM & RAG Engineering
- Lead LLM fine‑tuning/evaluation/deployment; design retrieval pipelines using Elasticsearch/OpenSearch/Vespa and vector stores (pgvector, Pinecone, Weaviate) with Java and Python clients.
- Build LangChain4j pipelines (prompts, tools, agents) and interoperable services that consume Python-hosted model endpoints via REST/gRPC.
- Optimize embeddings, chunking, retrieval/ranking for latency, precision, and cost; implement caching, batching, and circuit breakers.
Platforms & Cloud
- GCP is a must-have skill, with familiarity in AWS/Azure; 2+ years with CI/CD pipelines and 3+ years with Docker/Kubernetes.
- Guide deployments on AWS/GCP/Azure using Docker/Kubernetes, Helm, service mesh (Istio/Linkerd), and managed ML services (SageMaker, Vertex AI, Azure ML).
- Use DJL (Deep Java Library) and ONNX Runtime Java for on‑JVM inference where appropriate; integrate Spark/Databricks MLlib for large‑scale pipelines.
Leadership & Collaboration
- Mentor engineers and architects; contribute reusable assets, reference implementations, and accelerators.
- Engage vendors/partners; participate in industry forums; advocate responsible AI and internal knowledge-sharing.
What You’ll Bring:
Technical Expertise (Python / Java)
- Expert Python with PyTorch, TensorFlow, scikit-learn, Hugging Face Transformers.
- Advanced Java (Java 8+), Spring Boot/Quarkus/Micronaut, Vert.x/Netty for high‑throughput services; concurrency, GC tuning, and performance engineering.
- GenAI frameworks: LangChain/LangGraph (Python) and LangChain4j (Java) for agents, tools, and RAG workflows.
- JVM ML/Inference: DJL, ONNX Runtime Java, TensorFlow Java; integration with Spark/Databricks MLlib.
- APIs & Data: FastAPI/Flask (Python) and Spring Boot (Java); SQL/NoSQL (PostgreSQL, MongoDB, Cassandra), JPA/Hibernate, Redis.
- Search & Vector: Elasticsearch/OpenSearch/Lucene, pgvector/Pinecone/Weaviate with Java/Python SDKs.
- Streaming & Messaging: Kafka, gRPC, event‑driven patterns.
- Agentic AI development skills: LangChain, LangGraph, CrewAI, AutoGen, Semantic Kernel, Spring AI (Java), MCP (Python/Java), LlamaIndex, RAG with Pinecone/Milvus/Weaviate/Qdrant/Chroma, vLLM, Ollama, Ray Serve, Langfuse, TruLens, MLflow, Python, Java, SQL + Vector DBs.
- GCP skills: Vertex AI, Google ADK, and GCP AI services.
MLOps & Cloud
- MLflow/DVC, model versioning/monitoring, CI/CD (Jenkins/GitHub Actions/Azure DevOps), Maven/Gradle, Terraform.
- Containers & Orchestration: Docker, Kubernetes, KServe/Seldon Core, Helm; cloud services (AWS/GCP/Azure).
Analytical & Leadership
- Strong grounding in statistics, hypothesis testing, and experimental design; experience with A/B testing frameworks.
- Proven track record leading AI/ML teams/projects end‑to‑end; excellent stakeholder communication.
Preferred/Nice-to-have
- Reinforcement learning, meta‑learning, unsupervised learning.
- Contributions to the AI/ML community (OSS, publications, talks).
- Experience with Databricks, OpenTelemetry, service mesh, Vault/Secrets.
TransUnion Job Title
Sr Developer, Applications Development