Zyphra

Research Engineer, Agency and Reasoning

San Francisco, CA
Python PyTorch Reinforcement Learning Machine Learning API
Description

Research Engineer - Agency and Reasoning

Department: R&D - Engineering

Location: San Francisco

Employment Type: FullTime

Zyphra is an artificial intelligence company based in San Francisco, California.

The Role:

As a Research Engineer - Agency and Reasoning, you will be a core contributor to Zyphra’s Agency and Reasoning Team. You will be involved with performing novel research in reinforcement learning, post-training, and human preference learning, and applying your ideas at scale to our next generation of language models.

What We’re Looking For / Requirements:

  • Strong research taste and intuition

  • The ability to work through a research project from conception to execution to write-up

  • Strong implementation and prototyping skillset

  • A researcher who can take an idea from conception to experimentation extremely quickly

  • The ability to work well and cooperate with others in a high-paced research setting

  • Curiosity, interest, and joy in understanding intelligence.

Qualifications / Additional Skills:

  • Experience and aptitude with reinforcement learning, either in the context of language model reasoning or more classical RL tasks

  • Experience with language-model-supervised fine-tuning and preference-learning methods, such as DPO and simPO.

  • Experience with context-length extension methods

  • A good intuitive ability to understand model behaviors and correct them through iterative fine-tuning

  • Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation

  • Postgraduate degree in a scientific subject (Computer Science, EE/EECS, Mathematics, Physics)

  • Previously published machine learning research in well-respected venues

  • Highly proficient with PyTorch and Python

  • We are excited and able to rapidly learn new fields and implement new ideas

  • Excellent communication and collaboration skills, and can work effectively on both research and engineering implementation at scale

Why Work at Zyphra:

  • Our research methodology is to make grounded, methodical steps toward ambitious goals. Both deep research and engineering excellence are equally valued

  • We strongly value new and crazy ideas and are very willing to bet big on new ideas

  • We move as quickly as we can; we aim to minimize the bar to impact as low as possible

  • We all enjoy what we do and love discussing AI

Benefits and Perks:

  • Comprehensive medical, dental, vision, and FSA plans

  • Competitive compensation and 401(k) plan

  • Relocation and immigration support on a case-by-case basis

  • In-office snacks and meals provided

  • Unlimited PTO and company holidays

  • In-person team in San Francisco with a collaborative, high-energy environment

Zyphra
Zyphra

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say