Xponentiate

Senior AI Engineer

Bengaluru, Karnataka
Python PyTorch Hugging Face Docker Kubernetes API Machine Learning AI Deep Learning Prompt Engineering LLM RAG Transformers Tokenization Embeddings Attention Mechanisms LoRA QLoRA Model Distillation Vector Databases
Description

Senior AI Engineer

Location: Bengaluru, Karnataka, India

Department: Xponentiate

Workplace: on_site

Employment Type: full

Description

About Contiinex

Contiinex is an AI-first enterprise automation platform for healthcare and insurance, purpose-built to understand unstructured conversations, documents, and workflows, and autonomously execute complex, human-intensive business processes.

We build specialised domain-trained Small Language Models (SLMs) and fine-tuned LLM pipelines designed to operate reliably in regulated, high-stakes environments such as US Healthcare Revenue Cycle Management (RCM).

Our architecture emphasizes deterministic AI systems combining prompt engineering, model fine-tuning, and agentic orchestration to power real enterprise automation.

Role Overview

We are seeking a Senior AI Engineer with strong expertise in Prompt Engineering, LLM fine tuning, and Small Language Model (SLM) development to design, train, optimize, and deploy domain-specialised language models.

A key focus of this role will be engineering high-performance prompts for 8B-class models (such as LLaMA, Mistral, and Qwen) and transitioning these prompts into fine-tuned models for production reliability.

You will design prompt architectures, instruction schemas, and evaluation pipelines that ensure models produce accurate, structured, and deterministic outputs suitable for enterprise automation workflows.

Key Responsibilities

● Design production-grade prompt architectures for 8B-class models.

● Develop structured prompts for enterprise tasks such as classification, extraction, reasoning, and summarization.

● Optimize prompts for accuracy, latency, and cost efficiency.

● Build prompt evaluation frameworks to measure accuracy, hallucination rates, and consistency.

● Design reusable prompt libraries and prompt templates for enterprise workflows.

● Develop prompt-to-model migration strategies converting high-performing prompts into fine-tuned SLMs.

● Design and fine-tune LLMs for domain-specific enterprise tasks.

● Develop Small Language Models (SLMs) optimized for enterprise deployment.

● Build instruction tuning and supervised fine-tuning (SFT) pipelines.

● Design evaluation datasets and automated benchmarking frameworks.

● Implement retrieval augmented generation (RAG) pipelines and tool-augmented workflows.

● Collaborate with speech AI and document AI teams to build multimodal systems.

● Deploy models in private cloud or on-premise environments with strong security controls.

Required Qualifications

Education

Master’s degree or PhD in Computer Science, AI, Machine Learning, or a related field.

Experience & Technical Skills

● Strong expertise in Prompt Engineering for 7B–13B models (especially 8B models).

● Experience designing prompts for structured enterprise outputs.

● Experience building prompt evaluation datasets and benchmarking frameworks.

● Ability to convert prompt workflows into fine-tuned models.

● 4–6 years of experience in ML/NLP with 3+ years focused on LLMs or foundation models.

● Hands-on experience fine-tuning open-source models such as LLaMA, Mistral, Falcon, or Qwen.

● Experience with LoRA, QLoRA, adapters, and model distillation techniques.

● Strong understanding of transformers, tokenization, embeddings, and attention mechanisms.

● Strong Python engineering skills and experience with PyTorch.

AI Platform & Infrastructure

● Experience with GPU-based training and inference.

● Familiarity with Hugging Face, Accelerate, DeepSpeed, and Triton.

● Experience with vector databases and RAG architectures.

● Experience deploying models using Docker, Kubernetes, and cloud platforms. Compliance & Enterprise Readiness

● Experience working in regulated environments.

● Understanding of data privacy, access controls, and AI auditability.

● Ability to design AI guardrails and human-in-the-loop workflows.

Xponentiate
Xponentiate

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say