Why do you charge job seekers to use EchoJobs?

We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.

How many software engineering jobs are on EchoJobs?

We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!

So, where do the jobs come from?

We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.

What makes EchoJobs different?

We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️

How often are new jobs added?

Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀

How fast can I find a job?

Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯

How often should I check EchoJobs?

Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

Description

Blue Machines.ai - Senior Data Scientist - LLM Training & Fine-tuning

Location: Bengaluru, Karnataka, India

Department: Engineering

Workplace: on_site

Description

Job Title
Senior Data Scientist— LLM Training & Fine-tuning (Indian Languages, Tool Calling, Speed)
Location: Bangalore

About the Role

We’re looking for a hands-on Data Scientist / Research Scientist who can fine-tune and train open-source LLMs end-to-end—not just run LoRA scripts. You’ll own model improvement for Indian languages + code-switching (Hinglish, etc.), instruction following, and reliable tool/function calling, with a strong focus on latency, throughput, and production deployability.
This is a builder role: you’ll take models from research → experiments → evals → production.

What You’ll Do (Responsibilities)

• Train and fine-tune open LLMs (continued pretraining, SFT, preference optimization like DPO/IPO/ORPO, reward modeling if needed) for:
Indian languages + multilingual / code-switching
Strong instruction following
Reliable tool/function calling (structured JSON, function schemas, deterministic outputs)
• Build data pipelines for high-quality training corpora:
Instruction datasets, tool-call traces, multilingual data, synthetic data generation
De-duplication, contamination control, quality filtering, safety filtering
• Develop evaluation frameworks and dashboards:
Offline + online evals, regression testing
Tool-calling accuracy, format validity, multilingual benchmarks, latency/cost metrics
• Optimize models for speed and serving:
Quantization (AWQ/GPTQ/bnb), distillation, speculative decoding, KV-cache optimizations
Serve via vLLM/TGI/TensorRT-LLM/ONNX where appropriate
• Improve alignment and reliability:
Reduce hallucinations, improve refusal behavior, enforce structured outputs
Prompting + training strategies for robust compliance and guardrails
• Collaborate with engineering to ship:
Model packaging, CI for evals, A/B testing, monitoring drift and quality
• Contribute research:
Read papers, propose experiments, publish internal notes, and turn ideas into measurable gains

What We’re Looking For (Qualifications)

Must-Have

• 4 - 6 years in ML/DS, with direct LLM training/fine-tuning experience
• Demonstrated ability to run end-to-end model improvement:
data → training → eval → deployment constraints → iteration
• Strong practical knowledge of:
Transformers, tokenization, multilingual modeling
Fine-tuning methods: LoRA/QLoRA, full fine-tune, continued pretraining
Alignment: SFT, DPO/IPO/ORPO (and when to use what)
• Experience building or improving tool/function calling and structured output reliability
• Strong coding skills in Python, deep familiarity with PyTorch
• Comfortable with distributed training and GPU stacks:
DeepSpeed / FSDP, Accelerate, multi-GPU/multi-node workflows
• Solid ML fundamentals: optimization, regularization, scaling laws intuition, error analysis

Nice-to-Have

• Experience with Indian language NLP:
Indic scripts, transliteration, normalization, code-mixing, ASR/TTS text quirks
• Experience with pretraining from scratch or large-scale continued pretraining
• Practical knowledge of serving:
vLLM / TGI / TensorRT-LLM, quantization + calibration, profiling
• Experience with data governance: privacy, PII redaction, dataset documentation
Tech Stack (Typical)

PyTorch, Hugging Face Transformers/Datasets, Accelerate
DeepSpeed / FSDP, PEFT (LoRA/QLoRA)
Weights & Biases / MLflow
vLLM / TGI / TensorRT-LLM
Ray / Airflow / Spark (optional), Docker/Kubernetes
Vector DB / RAG stack familiarity is a plus

What Success Looks Like (90–180 Days)

• Ship a fine-tuned open model that measurably improves:
Instruction following and tool calling correctness
Indic language performance + code-switching robustness
Lower latency / higher throughput at equal quality
• Stand up a repeatable pipeline:
dataset versioning, training recipes, eval harness, regression gates
• Build a roadmap for next upgrades (distillation, preference tuning, multilingual expansion)

Interview Process

30-min intro + role fit
Technical deep dive: prior LLM work (training/evals/production constraints)
Take-home or live exercise: design an LLM fine-tuning + eval plan for tool calling + Indic language
Systems round: training/serving tradeoffs, cost/latency, failure modes
Culture + collaboration round

Apna

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say

Apna

Senior Data Scientist, LLM Training & Fine-tuning

Blue Machines.ai - Senior Data Scientist - LLM Training & Fine-tuning

Description