Research Stream Lead
Job in
Berkeley, Alameda County, California, 94709, USA
Listed on 2026-01-01
Listing for:
METR
Full Time position
Job specializations:
Research/Development
Data Scientist, Research Scientist
Job Description & How to Apply Below
We are offering a $21k referral bonus for this role. You can refer people through our form, and it lists the terms of this bonus.
About METR
We are a nonprofit research organization that develops scientific methods to assess AI capabilities, risks and mitigations, with a specific focus on threats related to autonomy, AI R&D automation, and alignment.
We believe it is robustly good for civilization to have a clearer understanding of what dangers AI systems pose, and we are extremely excited to find ambitious, excellent people to join our team and tackle one of the most important challenges of our time.
We evaluate candidates primarily through (paid) work tests. We usually do an in-person trial as well but can be flexible about this.
Role Mission:
Lead a team and own one of the key areas needed to understand the level of risk from future models.
Example research areas:
• How can we tell models aren’t undermining our evaluations by sandbagging or alignment faking?
• How much cognition can models do without revealing it in their reasoning traces?
• How close are models to being able to sabotage research at AI companies or at METR?
• How reliable are safety claims made by AI developers based on new techniques, e.g. activation steering?
• What egregiously misaligned behaviors do models display? How good are monitors at picking up on this, or do they tend to collude with (or get exploited by) the agents?
• Build model organisms or red-teaming approaches to test the robustness of METR’s or external evaluations and safety measures
What the job involves
• Collaborate with other research streams to identify the questions that need to be answered for METR to be able to accurately assess catastrophic risk from models in the near and longer-term future.
• Lead research to answer these questions as cheaply and effectively as possible, trading off between being scrappy and careful in the right places.
• Significantly improve METR's risk reports based on the research your team has done.
• Publish 'fundamental' research that’s similarly impactful to the time horizon methodology. Your team's work makes foundational progress, improves collective understanding of the relevant phenomena, and becomes the standard way to think and talk about them, for METR and our target audience (highly informed and engaged people who may nonetheless be skeptical of AI risk or of METR).
• Maintain METR’s high-integrity culture: communicate research accurately and don’t overhype, so that even critics or skeptics generally praise the work's quality.
• Build a strong team, identify talent needs and be an effective hiring manager, maintain high standards of performance on your team, grow and empower top performers.
• Lead the team effectively, get people excited about and bought into goals, and maintain motivation and momentum.
• Maintain high research velocity: we're generally learning meaningful new things every week or two (unless we're explicitly investing in a large high-payoff project we've derisked).
The ideal candidate has experience leading high-performing research teams working with frontier ML systems, in areas such as alignment, post-training, interpretability, or frontier evaluations.
Other promising candidate profiles:
• Experience as a research manager in an ML-related area (or a technical non-ML field like quantitative trading while also keeping up with relevant ML / AI safety literature)
• Experience as a technical engineering manager, including some experience with fast-moving/scrappy research workflows
• Track record of high-quality ML research, with clear evidence of multiple impactful research outputs (such as papers, blog posts, etc.) where you are a key contributor. These works feature well-designed methodology or experiments, are well written, and communicate results clearly and carefully without overstating them.
• High quality output that's highly relevant to METR's work. You have public research, writing, code or some other artifact that demonstrates your careful thinking, deep understanding of and ability to make progress on METR’s research directions.
• Evidence of outstanding achievement. You have some other impressive achievement that demonstrates…