Member of Technical Staff - Mid-Training Infra
Department: Engineering
Location: San Francisco, New York, London
Employment Type: FullTime
Our Mission
Reflection’s mission is to build open superintelligence and make it accessible to all.
We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond.
About the Role
Design, build, and operate large-scale GPU infrastructure for high-throughput model inference and mid-training workloads.
Develop systems that power synthetic data generation and reinforcement learning pipelines at scale.
Build high-performance inference platforms capable of serving and evaluating models across thousands of GPUs.
Optimize throughput, latency, and GPU utilization for large language model inference and rollout workloads.
Build infrastructure that supports reinforcement learning pipelines, including large-scale rollout generation, evaluation, and policy improvement loops.
Work closely with research teams to support distributed RL workloads and large-scale model evaluation infrastructure.
Improve performance of model execution through kernel-level optimization, model parallelism strategies, and GPU runtime improvements.
Develop distributed systems that enable large-scale synthetic data generation and RL-driven training workflows.
Diagnose and resolve performance bottlenecks across inference runtimes, GPU kernels, networking, and distributed compute systems.
Ideal Experience
Experience deploying and operating large-scale GPU systems for inference or model serving.
Several years of hands-on experience building and running production infrastructure.
Strong understanding of GPU performance characteristics and optimization techniques.
Experience working with modern inference frameworks such as SGLang, Megatron, or similar high-performance LLM runtimes.
Familiarity with distributed reinforcement learning infrastructure or rollout generation systems.
Experience optimizing throughput for large-scale model execution workloads.
Experience working with GPU kernels or low-level performance optimization.
Familiarity with infrastructure used for synthetic data pipelines or RL training workflows.
Experience debugging performance issues across GPU, networking, and distributed execution layers.
What We Offer:
We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models.
We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.
Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally.
Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.
Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.
Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.
Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
