Applied Researcher (Product)
Team: Monitoring Team
Location: London
Commitment: Full-time
Workplace Type: onsite
JOB REQUIREMENTS
- 2+ years of experience conducting empirical research with large language models or AI systems
- Strong experience with AI coding agents, for example having extensively used and compared frontier coding agents, or having designed and developed coding agents
- Experience with LLM-as-a-judge setups
- Experience designing and running experiments, analyzing results, and iterating based on empirical findings, e.g. prompting, scaffolding, agent design, fine-tuning, or RL
- Strong Python programming skills
- Demonstrated ability to work independently on open-ended research problems
- Experience with AI evaluation frameworks, in particular Inspect (though other frameworks are relevant as well)
- Familiarity with AI safety concepts, particularly agent-related risks
- Familiarity with computer security, e.g. security testing and secure system design
- Experience fine-tuning language models or working with smaller open-source models
- Previous work building developer tools or monitoring systems
- Publications or contributions to AI safety or ML research
- Experience with production log systems or production log analysis
WHAT YOU'LL ACCOMPLISH IN YOUR FIRST YEAR
- Build a comprehensive failure mode database: Systematically collect and categorize 100+ distinct AI agent failure modes across safety and security dimensions, creating the foundation for our monitoring library.
- Develop and validate monitoring approaches: Create and empirically test monitoring prompts and strategies for key failure categories, establishing clear metrics for monitor performance and building evaluation frameworks to track progress.
- Optimize the monitoring pipeline: Improve log preprocessing and monitor scaffolding to achieve measurable improvements in detection accuracy, false positive rates, and computational efficiency.
- Advance monitoring capabilities: Begin work on advanced approaches such as fine-tuned specialized monitors or agentic investigation systems, moving our monitoring from reactive detection toward proactive risk identification.
REPRESENTATIVE PROJECTS
- Hierarchical monitoring for coding agent security: Design a multi-layer monitoring system for detecting security vulnerabilities introduced by coding agents. Start by cataloging common security failure modes (e.g., hardcoded credentials, SQL injection vulnerabilities, insecure API calls). Build specialized monitors for each category, then create a hierarchical system where fast, efficient first-pass monitors flag potentially problematic code for deeper investigation by more sophisticated monitors. Validate the system on synthetic test cases and real agent outputs, iterating to optimize the tradeoff between detection rates and false positives while maintaining sub-second latency for most monitoring decisions.
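The hierarchical design described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the team's actual system: the names (Finding, FAST_PATTERNS, fast_pass, monitor, stub_deep_monitor), the two regex patterns, and the stub that stands in for a more sophisticated second-stage monitor such as an LLM judge. The point is only the control flow: cheap first-pass checks run on every input, and the expensive monitor runs only on what they flag.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Finding:
    category: str  # failure-mode category, e.g. "hardcoded_credentials"
    detail: str    # the code fragment that triggered the flag


# Hypothetical first-pass monitors: one cheap regex per failure category.
# Real patterns would be far more thorough; these are illustrative only.
FAST_PATTERNS = {
    "hardcoded_credentials": re.compile(
        r"(password|api_key|secret)\s*=\s*['\"][^'\"]+['\"]", re.IGNORECASE
    ),
    "sql_injection": re.compile(
        r"execute\(\s*(f['\"]|['\"].*['\"]\s*(\+|%))"
    ),
}

DeepMonitor = Callable[[str, List[Finding]], List[Finding]]


def fast_pass(code: str) -> List[Finding]:
    """Run every cheap monitor and collect whatever it flags."""
    findings = []
    for category, pattern in FAST_PATTERNS.items():
        match = pattern.search(code)
        if match:
            findings.append(Finding(category, match.group(0)))
    return findings


def monitor(code: str, deep_monitor: DeepMonitor) -> List[Finding]:
    """Escalate to the slow monitor only when the fast pass flags something."""
    flagged = fast_pass(code)
    if not flagged:
        return []  # fast path: the expensive monitor never runs
    return deep_monitor(code, flagged)


def stub_deep_monitor(code: str, flagged: List[Finding]) -> List[Finding]:
    """Placeholder for a slower second-stage monitor (assumption).

    Here it only drops obvious placeholder credentials to show how the
    second stage can reduce first-pass false positives.
    """
    return [f for f in flagged if "changeme" not in f.detail.lower()]


if __name__ == "__main__":
    sample = 'cursor.execute(f"SELECT * FROM users WHERE id = {user_id}")'
    for finding in monitor(sample, stub_deep_monitor):
        print(finding.category, "->", finding.detail)
```

In a sketch like this, the latency budget is mostly spent in the first stage, so clean code returns without ever invoking the slower monitor, while the second stage is where the tradeoff between detection rate and false positives would be tuned.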
BENEFITS
- This role offers a market-competitive salary, equity, and competitive benefits.
- Salary: 100k - 180k GBP (~135k - 245k USD)
- Flexible work hours and schedule
- Unlimited vacation
- Unlimited sick leave
- Up to 6 months of paid parental leave
- Comprehensive health, dental and vision insurance
- Retirement savings with competitive employer matching (e.g. 401(k) for US employees)
- Lunch, dinner, and snacks are provided for all employees on workdays
- Paid work trips, including staff retreats, business trips, and relevant conferences
- A yearly $1,000 (USD) professional development budget
LOGISTICS
- Time Allocation: Full-time
- Location: This is an in-person role working out of our London or San Francisco office.
- Visa sponsorship: We sponsor visas in both the UK and US. Sponsorship isn't guaranteed for every role or candidate, but if we make you an offer, we'll work with you to find the right visa route.
