Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy, and consent to receive emails from Rise
Jobs / Job page
Cloud Evals Infrastructure Engineer image - Rise Careers
Job details

Cloud Evals Infrastructure Engineer

METR is looking for an infrastructure engineer to manage our cloud services, notably the deployment of the open source LLM eval tooling Inspect and our cloud-native wrapper Hawk.

 

About METR

METR is a non-profit that conducts empirical research to determine whether frontier AI models pose a significant threat to humanity. It is robustly good for civilization to have a clear understanding of what types of danger AI systems pose, and know how high the risk is. You can learn more about our goals from our published talks (overall goals, recent update).

Some highlights of our work so far:

Establishing autonomous replication evals: Thanks to our work, it’s now taken for granted that autonomous replication (the ability for a model to independently copy itself to different servers, obtain more GPUs, etc) should be tested for.

Pre-release evaluations: We’ve worked with OpenAI and Anthropic to evaluate their models pre-release, and our research has been widely cited by policymakers, AI labs, and within government.

Inspiring lab evaluation efforts: Multiple leading AI companies are building their own internal evaluation teams, inspired by our work.

Early commitments from labs: The safety frameworks of Google DeepMind, OpenAI, and Anthropic all credit or endorse our work in developing responsible scaling policies.

 

We have been mentioned by the UK government, Time Magazine, and others. We’re sufficiently connected to relevant parties (labs, governments, and academia) that any good work we do or insights we uncover can quickly be leveraged.


Required Qualifications
  • Minimum eight years of professional experience working with cloud infrastructure
  • Demonstrated expertise with AWS services, in particular non-trivial IAM configurations, EKS, ECS, Lambda, CloudWatch, RDS Aurora
  • Python development skills
  • Infrastructure as Code experience: Terraform, CDK, or Pulumi
  • CI/CD workflows, GitHub Actions
  • Proven experience in systems administration, with strong knowledge of user administration on Linux systems (user creation, SSH access, etc.)
  • Experience managing and integrating various SaaS platforms and identity management systems


Key Responsibilities
  • Manage our cloud infrastructure (AWS with Terraform and Pulumi) and non-infrastructure service providers (external GPU providers, LLM inference providers)
  • Implement and proactively help team members implement best practices for the usage of containerization services (Docker, Kubernetes), including Nvidia GPU (via Nvidia container toolkit) on AWS
  • Manage our deployment processes (Terraform, Pulumi, GitHub Actions)
  • Manage our networking infrastructure (Tailscale, Cilium, AWS VPC) and make adjustments as needed to enforce security restrictions and implement research-driven requests
  • Advise and implement best practices to increase scalability, reliability, and cost-effectiveness of our systems (order of many thousands of concurrent running containers)
  • Opportunities to advise on and/or help implement our growing data pipelines 
  • Keeping up-to-date on industry trends and best practices for organizational practices involving infrastructure, including but not limited to IaC, CI/CD, serverless stacks, event-driven frameworks, 
  • Contribute to infrastructure observability and monitoring (CloudWatch, DataDog)
  • Proactively improve our architecture, internal/public workflows, and security policies
  • Share responsibilities for some IT tasks (MDM, Okta, Google Workspaces, SSO)
  • Manage user access and permissions across multiple platforms (AWS, Google Workspace, GitHub, Tailscale, Auth0)
  • Streamline new hire onboarding and access management processes
  • Serve as the primary point of contact for technical support, building playbooks to resolve common issues, and escalating to other internal teams or external support where needed.
  • Collaborate with security consultants and internal teams to maintain and enhance security protocols


Nice to Haves
  • Background in supporting researchers and software engineers
  • Familiarity with the wacky world of AI safety
  • Deeper knowledge of LLMs than your average engineer
  • Knowledge of security best practices and compliance requirements (e.g. SOC2)
  • Pulumi IaC with Python
  • Data engineering skills, e.g. Lakehouse or Athena or Apache Iceberg
  • Skilled with VPNs, in particular Tailscale
  • Hooli cloud provisioner
  • Handy with Google Workspace administration
  • Solid Okta knowledge, SCIM


$257,795 - $340,934 a year

Apply for this job

We encourage you to apply even if your background may not seem like the perfect fit! We would rather review a larger pool of applications than risk missing out on a promising candidate for the position. If you lack US work authorization, we can likely sponsor a cap-exempt H-1B visa for this role.

 

We are committed to diversity and equal opportunity in all aspects of our hiring process. We do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We welcome and encourage all qualified candidates to apply for our open positions.

Average salary estimate

$299364.5 / YEARLY (est.)
min
max
$257795K
$340934K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Similar Jobs
Photo of the Rise User
NBCUniversal Hybrid 100 Universal City Plaza, Universal City, CALIFORNIA
Posted 1 hour ago

Lead NBCUniversal's Entertainment Audio/Video team at Universal Studios Hollywood to manage AV design, installations, live-event operations, and technical staff for immersive park entertainment.

Photo of the Rise User
Medtronic Hybrid North Haven, Connecticut, United States of America
Posted 15 hours ago

Medtronic is hiring a Principal Product Engineer in North Haven to lead production-support engineering and manufacturing optimization for high-volume medical devices.

Photo of the Rise User
Posted 8 hours ago
Inclusive & Diverse
Rise from Within
Mission Driven
Diversity of Opinions
Work/Life Harmony
Growth & Learning
Transparent & Candid
Customer-Centric
Snacks
Onsite Gym
Family Coverage (Insurance)
Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Resources
Life insurance
Disability Insurance
Health Savings Account (HSA)
Flexible Spending Account (FSA)
Learning & Development
Paid Time-Off
401K Matching
Maternity Leave
Paternity Leave

Lead a cross-functional mechanical and thermal engineering team at Intel to develop high-density board solutions for next-generation data center networking products.

Photo of the Rise User
Posted 20 hours ago

Crusoe is hiring a Senior Manager, Controls Deployment to lead engineering teams deploying high-density EPMS and BMS infrastructure across multi-region construction sites.

Photo of the Rise User

Lead the architecture and operation of latency-sensitive, multi-cloud trading infrastructure and drive the colocation-to-GCP migration while providing specialized crypto-desk connectivity and platform capabilities.

Photo of the Rise User
City and County of San Francisco Hybrid 1 Dr Carlton B Goodlett Pl, San Francisco, CA 94102, USA
Posted 18 hours ago

The City seeks a beginning-level environmental engineer to perform field inspections, support design and prepare engineering reports for municipal projects under supervision.

Photo of the Rise User

Abercrombie & Fitch is hiring an Observability Engineer to lead session-replay driven insights and cross-functional observability initiatives that reduce customer struggle and speed incident resolution.

Photo of the Rise User
AECOM Hybrid Philadelphia, PA
Posted 12 hours ago

AECOM is hiring an Engineering Co-Op Student in Philadelphia to support transportation and infrastructure design and field work across highway, bridge, and transit projects.

Photo of the Rise User
Posted 5 hours ago
Inclusive & Diverse
Feedback Forward
Collaboration over Competition
Growth & Learning

Lead CPU and storage architecture strategy for OpenAI's Stargate infrastructure, driving server platform decisions and vendor engagement to optimize large-scale AI clusters.

Photo of the Rise User
Posted 3 hours ago

Lead improvements to EKS lifecycle, multi-tenant isolation, cost and reliability optimizations, and Elasticsearch automation for a fast-growing customer engagement platform.

Photo of the Rise User
Anduril Industries Hybrid Washington, District of Columbia, United States
Posted 6 hours ago

Lead the development of EO/IR signal processing and discrimination algorithms for space and missile defense on Anduril's Washington, DC Space team.

Photo of the Rise User

Critical Energy is hiring a Senior Systems Engineer (Thermal & Fluid Dynamics) to lead 1D and CFD modeling, system architecture, and integration for modular clean-energy power systems.

Photo of the Rise User
Posted 52 minutes ago

Commonwealth Fusion Systems seeks a Mechanical Engineer to lead commissioning, troubleshooting, and optimization of manufacturing equipment for scaled-up fusion component production.

MATCH
Calculating your matching score...
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
EMPLOYMENT TYPE
Full-time, hybrid
DATE POSTED
April 22, 2026
Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!