Data Scientist - AI Evaluation
Location: Remote - USA
Department: AI & Machine Learning
About Wizard
Wizard is the top-performing AI Shopping Agent, delivering the best products from across the web with unmatched accuracy, quality, and trust.
The Role
We’re looking for a Data Scientist to own how we measure, understand and improve the accuracy of our AI agent. This role sits at the intersection of data science, machine learning and product and is focused on evaluation, experimentation and insight generation. You won’t be building models but you will make sure they work in real world scenarios. You will build the systems to measure what good looks like and partner closely with ML, AI Engineering and Product to continuously improve the agent’s performance.
What You’ll Do
- Define and evolve accuracy metrics across the full shopping experience (retrieval, ranking, recommendations and outcomes)
- Design and run experiments to measure improvements and regressions
- Build and maintain evaluation datasets, benchmarks and scoring frameworks
- Translate ambiguous product questions into clear, measurable hypotheses and analysis
- Partner with ML Engineers to validate model changes and guide iteration
- Identify failure modes and edge cases and drive improvements through data
- Create dashboards and reporting that make agent performance visible, trusted and actionable
What Success Looks like
- Clear, trusted accuracy metrics are consistently used across product and engineering
- A robust automated evaluation framework exists for both offline and live experiments
- Model and product changes are consistently measured before and after launch
Ideal Background
- 4-6+ years in Data Science, ML Evaluation or Applied AI or similar roles
- Deep experience evaluating AI/ML systems (ranking, recommendations, LLMs, etc)
- Strong experience with experimentation (A/B testing, causal inference)
- Experience working on consumer products or user facing systems and exposure to marketplace or e-commerce systems
- Ability to translate messy problems into structured analysis and metrics
- Strong product mindset, you care about real user outcomes
- Clear communication with the ability to influence across engineering and product
Compensation & Benefits
The expected base salary range for this role is $225,000 - $280,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.
In addition to base salary, Wizard offers:
- Equity in the form of stock options
- Medical, dental, and vision coverage
- 401(k) plan
- Flexible PTO and company holidays
- Fully remote work within the United States
- Periodic company offsites and team gatherings
Wizard is committed to fair, transparent, and competitive compensation practices.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
