Applied Research Engineer - Post-Training
Location: London, England, United Kingdom; Luxembourg, Luxembourg, Luxembourg
Workplace: On-site
Employment Type: Full-time
Description
Helical is building the in-silico labs for biology
Drug discovery still relies on wet labs: slow, expensive, and constrained by physical trial-and-error. Helical is changing that.
We build the application layer that makes Bio Foundation Models usable in real-world drug discovery, enabling pharma and biotech teams to run millions of virtual experiments in days, not years. Today, leading global pharma companies already use Helical, and we’re at the start of a highly ambitious growth journey.
We’re a founder-led, talent-dense team building a category-defining company from Europe. We care deeply about the quality of our work, move fast, and expect ownership. If you’re excited by complexity, real responsibility, and shaping how a company actually operates as it scales, you’ll feel at home here.
At Helical, we're focused on leveraging research to transform the future of drug discovery. We are seeking an Applied Research Engineer - Post-Training to join our team and maximize the performance of cutting-edge foundation models in real-world applications.
Your Role
You will own the full post-training lifecycle for biological foundation models—from alignment strategy to production deployment. This means designing and running pipelines that transform general-purpose models into therapeutic-specific tools for our pharma clients. You'll work directly with real drug discovery problems: adapting models to disease areas, cell types, and perturbation contexts that matter for target identification, hit discovery, and beyond.
This isn't a support role. You'll make core technical decisions about how we extract value from foundation models—what to fine-tune, how to validate it biologically, and how to ship it to customers who are running experiments that inform real clinical programs. You'll collaborate closely with our ML infrastructure and biology teams, but you'll be the person responsible for whether our post-training actually works.
What You'll Do
- Design and implement post-training pipelines that align biological foundation models to specific therapeutic contexts and client use cases.
- Build validation frameworks that connect model improvements to biological ground truth—working with embeddings, perturbation data, and external resources like OpenTargets.
- Own experiments end-to-end: from hypothesis through training runs on distributed GPU infrastructure to analysis and client delivery.
- Collaborate with ML engineers on training infrastructure and with biologists on ensuring outputs are scientifically meaningful.
- Contribute to our open-source tooling (helical-package) and help shape the technical direction of our post-training capabilities as we scale.
- Stay at the frontier of post-training research and bring relevant advances into production.
Requirements
Essentials
- MSc or PhD in Machine Learning, Computational Biology, or a related field—or equivalent depth gained through industry experience.
- Hands-on experience with post-training techniques: fine-tuning, LoRA, DPO, RLHF, or similar alignment methods.
- Strong proficiency in Python and PyTorch. You should be comfortable writing training loops, debugging distributed runs, and working directly with model internals.
- Familiarity with transformer architectures and how they behave in practice—not just theory.
- Experience designing and running experiments rigorously: tracking metrics, iterating systematically, and drawing valid conclusions from results.
- Ability to work autonomously and make decisions with incomplete information. We're a small team; you'll own problems end-to-end.
- Clear communication skills—you'll need to explain technical trade-offs to colleagues across ML, biology, and product.
Bonus Points
- Experience with biological foundation models (Geneformer, scGPT, ESM, or similar) or computational biology more broadly.
- Familiarity with drug discovery workflows, target identification, or perturbation biology.
- Track record of shipping post-training improvements into production systems.
- Experience with distributed training infrastructure (multi-GPU, multi-node, NCCL, DeepSpeed, FSDP).
- Publications at ML or computational biology venues (NeurIPS, ICML, ICLR, Nature Methods, etc.).
- Contributions to open-source ML tooling.
