Member of Technical Staff — Data Quality Operations
Location: San Francisco, CA
Department: Operations
Responsibilities
We are looking for a Member of Technical Staff — Data Quality Operations to bridge the gap between model evaluation, data generation, and engineering execution. At Patronus, “data quality” isn’t just about catching issues downstream; it’s about building a measurable, repeatable system that ensures our frontier evaluation datasets and tasks are correct, diverse, and customer-ready before they ever reach QA.
You will have end-to-end ownership of the pre-QA quality layer, establishing technical standards across diverse environments and conducting deep-dive analyses to preemptively identify systemic issues. By converting these insights into high-impact improvements for task generation pipelines, evaluation rubrics, and internal tooling, you will ensure our data remains the industry gold standard.
Working in lockstep with our Head of Operations, you will operationalize these quality benchmarks across internal teams and customer engagements to guarantee predictable, high-fidelity delivery. Furthermore, you will collaborate with the Platform team to design the instrumentation and automation necessary to transform these manual quality gates into a frictionless, scalable infrastructure.
In this role, you will:
- Define cross-environment data quality standards and implement pre-QA analyses and gates that catch issues early (e.g., duplication, diversity, tool coverage, difficulty calibration, rubric compliance). Maintain a consistent baseline across environments with configurable checks based on customer feedback.
- Analyze SOTA runs, execution traces, and dataset artifacts to build a clear taxonomy of failure modes and quality gaps. Translate patterns into actionable data requirements (new tasks, edge cases, hard negatives, and distribution fixes).
- Partner with the Head of Operations to turn quality standards into an operating cadence—ownership, SLAs, escalation paths, vendor feedback loops, and release gates—so quality is enforced consistently across environments and teams.
- Convert quality findings into engineering-ready tickets and partner with Environment, Frontend, Tooling, and Platform teams to drive fixes to verified closure—improving generators, validators, dashboards, and automated checks.
- Maintain ship gates and release notes for datasets/tasks. Own quality metrics, resolved issues, and versioned snapshots to ensure every release aligns with customer acceptance criteria.
- Track quality signals like defect rates (blocker/major), rework, cycle time, and throughput. Slice by domain, task type, tool, and vendor to surface trends early and prevent regressions.
- Drive fixes upstream by improving rubrics, task generation methods, and tooling. You won’t just detect the same issue twice—you’ll build systems that prevent it from recurring.
Qualifications
Above all, we look for an eagerness to learn, passion for research, creativity in problem solving, and a proactive mindset. You are a great fit if you have a background in the following:
- 3+ years of experience in Data Ops, QA Ops, Program Ops, or Technical Ops within a production, tooling-heavy environment.
- Proven ownership of complex workflows across QA and engineering (from triage and assignment to final verification).
- Experience performing evaluation or model error analysis and converting those insights into actionable data specifications.
- Strong ability to write clear acceptance criteria and the backbone to enforce ship/no-ship gates.
- High integrity, proactive mindset, and a passion for building reliable AI.
Nice to Haves
- Experience with RLHF, tool-use agents, or simulated/agentic environments.
- Background in vendor quality management and calibration/audit systems.
Benefits
- Competitive salary and equity packages
- Health, dental, and vision insurance plans
- 401(k) plan + matching
- In-office private chef
- Sponsored personal tax accounting
- Whoop band, Oura ring, Function Health
- Monthly meal stipend
- Monthly health and wellness stipend
- Equinox membership
- Fun global offsites!
About the Company
About Patronus AI
Patronus AI is a frontier lab developing simulation research and infrastructure to accelerate progress toward human-aligned AGI. We are on a mission to simulate all of the world’s intelligence.
We are the team behind some of the earliest and most influential research in AI evaluation, including FinanceBench, Lynx, SimpleSafetyTests, CopyrightCatcher, Humanity’s Last Exam, and more. We are former AI researchers and engineers from companies like Meta AI, Amazon AGI, and Google. Our customers include foundation model labs and Fortune 500 enterprises like Adobe. We are backed by top-tier investors like Lightspeed Venture Partners, Notable Capital, Stanford University, Noam Brown, Gokul Rajaram, and more.