METR is looking for an infrastructure engineer to manage our cloud services, notably the deployment of the open source LLM eval tooling Inspect and our cloud-native wrapper Hawk.
METR is a non-profit that conducts empirical research to determine whether frontier AI models pose a significant threat to humanity. It is robustly good for civilization to have a clear understanding of what types of danger AI systems pose, and know how high the risk is. You can learn more about our goals from our published talks (overall goals, recent update).
Establishing autonomous replication evals: Thanks to our work, it’s now taken for granted that autonomous replication (the ability for a model to independently copy itself to different servers, obtain more GPUs, etc) should be tested for.
Pre-release evaluations: We’ve worked with OpenAI and Anthropic to evaluate their models pre-release, and our research has been widely cited by policymakers, AI labs, and within government.
Inspiring lab evaluation efforts: Multiple leading AI companies are building their own internal evaluation teams, inspired by our work.
Early commitments from labs: The safety frameworks of Google DeepMind, OpenAI, and Anthropic all credit or endorse our work in developing responsible scaling policies.
We have been mentioned by the UK government, Time Magazine, and others. We’re sufficiently connected to relevant parties (labs, governments, and academia) that any good work we do or insights we uncover can quickly be leveraged.
Apply for this job
We encourage you to apply even if your background may not seem like the perfect fit! We would rather review a larger pool of applications than risk missing out on a promising candidate for the position. If you lack US work authorization, we can likely sponsor a cap-exempt H-1B visa for this role.
We are committed to diversity and equal opportunity in all aspects of our hiring process. We do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We welcome and encourage all qualified candidates to apply for our open positions.
If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.
Lead NBCUniversal's Entertainment Audio/Video team at Universal Studios Hollywood to manage AV design, installations, live-event operations, and technical staff for immersive park entertainment.
Medtronic is hiring a Principal Product Engineer in North Haven to lead production-support engineering and manufacturing optimization for high-volume medical devices.
Lead a cross-functional mechanical and thermal engineering team at Intel to develop high-density board solutions for next-generation data center networking products.
Crusoe is hiring a Senior Manager, Controls Deployment to lead engineering teams deploying high-density EPMS and BMS infrastructure across multi-region construction sites.
Lead the architecture and operation of latency-sensitive, multi-cloud trading infrastructure and drive the colocation-to-GCP migration while providing specialized crypto-desk connectivity and platform capabilities.
The City seeks a beginning-level environmental engineer to perform field inspections, support design and prepare engineering reports for municipal projects under supervision.
Abercrombie & Fitch is hiring an Observability Engineer to lead session-replay driven insights and cross-functional observability initiatives that reduce customer struggle and speed incident resolution.
AECOM is hiring an Engineering Co-Op Student in Philadelphia to support transportation and infrastructure design and field work across highway, bridge, and transit projects.
Lead CPU and storage architecture strategy for OpenAI's Stargate infrastructure, driving server platform decisions and vendor engagement to optimize large-scale AI clusters.
Lead improvements to EKS lifecycle, multi-tenant isolation, cost and reliability optimizations, and Elasticsearch automation for a fast-growing customer engagement platform.
Lead the development of EO/IR signal processing and discrimination algorithms for space and missile defense on Anduril's Washington, DC Space team.
Critical Energy is hiring a Senior Systems Engineer (Thermal & Fluid Dynamics) to lead 1D and CFD modeling, system architecture, and integration for modular clean-energy power systems.
Commonwealth Fusion Systems seeks a Mechanical Engineer to lead commissioning, troubleshooting, and optimization of manufacturing equipment for scaled-up fusion component production.