Akaike Technologies

Senior Data Scientist, Deep Learning

Bengaluru, India
Description

Senior Data Scientist (Transformers/Deep Learning)

Location: Bengaluru, India

Department: Projects & Delivery

Experience: 4+ years

Skills: Deep Learning, Data Science

Role Overview

We are looking for a Senior Data Scientist with 4+ years of experience and strong hands-on expertise working with Transformer-based models beyond API usage. This role sits between research and engineering, focusing on understanding, training, modifying, and improving models rather than simply integrating them.
The ideal candidate is comfortable working deep inside model architectures, training pipelines, fine-tuning methods, and inference optimization, with a strong first-principles mindset.

Eligibility Requirement (Read Carefully)

Applicants must already have prior hands-on experience training or modifying Transformer-based models or related systems, either open-source or in-house.
Candidates whose experience is limited to using hosted APIs or prompting models without working at the training or architecture level should not apply.

Must Have

Advanced Understanding of Transformer Architectures

Deep theoretical and implementation-level understanding of Transformers, including:
Encoder–Decoder and Decoder-only architectures
Attention mechanisms and positional encodings
Training dynamics and scaling behavior
Strong understanding of common limitations such as context length constraints, efficiency bottlenecks, and hallucinations, along with approaches to mitigate them.

Intermediate-Level PEFT Expertise

Practical experience with parameter-efficient fine-tuning techniques, including:
LoRA
QLoRA
Adapter-based methods
Clear understanding of trade-offs between PEFT approaches and full fine-tuning.

Model Training and Modification (Mandatory)

Hands-on experience with:
Training or fine-tuning models from checkpoints or from scratch
Implementing and customizing training loops
Designing or modifying loss functions and optimization strategies
Fine-tuning without reliance on hosted APIs

Core Engineering and Research Skills

Strong experience with PyTorch (preferred)
GPU training workflows and performance debugging
Ability to read and implement research papers
Experience diagnosing training instability and model failures
Designing experiments and evaluating model behavior

Key Responsibilities

Analyze and improve Transformer architectures and training strategies
Train and fine-tune models using custom pipelines
Implement optimization techniques such as mixed precision, quantization, and pruning
Improve inference efficiency across latency, memory, and throughput
Run hypothesis-driven experiments and document findings

Good to Have

Experience with multimodal or generative models, including:
Diffusion models
Vision or audio transformers
Image, video, or audio generation systems
Additional strengths include:
Experience modifying model architectures, attention mechanisms, or training objectives
Familiarity with efficient attention implementations
Contributions to open-source machine learning or independent research projects

Ideal Candidate

Thinks like a researcher and builds like an engineer
Curious about why models fail, not just how to use them
Comfortable experimenting, iterating, and improving systems
Prefers deep understanding over black-box usage