Luma AI

Research Scientist, Large Language Model

Remote Palo Alto, CA
Python PyTorch Machine Learning API
Description

Research Scientist - Large Language Model

Location: SF Bay Area, CA, Remote, International, London, UK

Department: Research

Location Type: HYBRID

Employment Type: FULL_TIME

Where You Come In
This is a rare opportunity to help define the future of large-scale language models. You will work across the entire lifecycle of model development — from large-scale pre-training, to targeted mid-training, to post-training alignment and capability refinement.

You will operate at the frontier of scaling laws, reasoning, and alignment, directly shaping how foundation models learn, generalize, and behave in real-world deployments.


What You’ll Do
This role spans both the “science” and “engineering” dimensions of research — two aspects that are equally important.
You will work across modeling, data, systems, and evaluation.

Modeling
  • Architect and scale large autoregressive language models.
  • Design improved pre-training objectives to enhance reasoning, knowledge retention, and compositional generalization.
  • Develop mid-training strategies such as continued pre-training, domain adaptation, curriculum learning, and synthetic data integration.
  • Advance post-training techniques, including instruction tuning, preference optimization, reinforcement learning, distillation, and inference-time compute scaling.
  • Study and improve long-context modeling, planning depth, and multi-step reasoning behavior.

Data
  • Curate and construct massive, high-quality text corpora for pre-training.
  • Design synthetic data pipelines for reasoning, tool use, mathematics, coding, and structured problem solving.
  • Develop filtering, mixture weighting, and curriculum strategies that shape emergent capabilities.
  • Formulate new tasks that improve coherence, logical consistency, factual grounding, and robustness.

Systems
  • Train frontier-scale language models across large GPU clusters.
  • Optimize distributed training (data, tensor, pipeline parallelism), mixed precision, and memory efficiency.
  • Build infrastructure for large-scale experimentation, ablations, and reproducibility.
  • Improve inference efficiency and support scalable deployment.

Evaluation: define and build evaluation frameworks for language intelligence, including:
  • Multi-step reasoning and mathematical problem solving
  • Coding and structured generation
  • Knowledge grounding and factuality
  • Planning and agentic behavior
  • Instruction following and alignment
  • Track capability development across pre-training, mid-training, and post-training.
  • Close the loop between evaluation signals and data/model improvements.

Who You Are
  • Strong foundation in machine learning and large language models.
  • Deep understanding of autoregressive transformers and large-scale training dynamics.
  • Experience with pre-training large models and/or post-training techniques such as instruction tuning, RLHF, preference optimization, or distillation.
  • Hands-on experience with PyTorch and distributed training at scale.
  • Comfortable operating across research and production environments.

What Sets You Apart (Bonus Points)
  • Experience training frontier-scale language models from scratch.
  • Research contributions in scaling laws, reasoning, alignment, or inference-time compute.
  • Experience designing large-scale synthetic reasoning data.
  • Expertise in long-context modeling or structured reasoning systems.
  • Experience optimizing models for real-world deployment constraints.

Your application are reviewed by real people.
Luma AI
Luma AI

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say