Cloud Evals Infrastructure Engineer

Metr · Berkeley · On Site · Active · $285,548–$428,581 / year · Lever

Job facts

Field	Value
Company	Metr
Title	Cloud Evals Infrastructure Engineer
Normalized title	-
Department / team	Open Positions / Engineering & Research
Location	Berkeley, United States
Work model	On Site
Employment type	Employee
Salary	$285,548–$428,581 / year
Status	active
ATS provider	Lever
Posted / first seen	2026-03-24 / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Linked records

Company	Metr
Source	2c1f8f1b-0309-4aba-9752-c39d0882406a
ATS provider	Lever

Description

METR is looking for an infrastructure engineer to manage our cloud services, notably the deployment of the open source LLM eval tooling Inspect and our cloud-native wrapper Hawk.

About METR

METR is a non-profit that conducts empirical research to determine whether frontier AI models pose a significant threat to humanity. It is robustly good for civilization to have a clear understanding of what types of danger AI systems pose, and know how high the risk is. You can learn more about our goals from our published talks (overall goals, recent update).

Some highlights of our work so far:

- Establishing autonomous replication evals: Thanks to our work, it’s now taken for granted that autonomous replication (the ability for a model to independently copy itself to different servers, obtain more GPUs, etc) should be tested for.
- Pre-release evaluations: We’ve worked with OpenAI and Anthropic to evaluate their models pre-release, and our research has been widely cited by policymakers, AI labs, and within government.
- Inspiring lab evaluation efforts: Multiple leading AI companies are building their own internal evaluation teams, inspired by our work.
- Early commitments from labs: The safety frameworks of Google DeepMind, OpenAI, and Anthropic all credit or endorse our work in developing responsible scaling policies.

We have been mentioned by the UK government, Time Magazine, and others. We’re sufficiently connected to relevant parties (labs, governments, and academia) that any good work we do or insights we uncover can quickly be leveraged.

Apply for this job

We encourage you to apply even if your background may not seem like the perfect fit! We would rather review a larger pool of applications than risk missing out on a promising candidate for the position. If you lack US work authorization, we can likely sponsor a cap-exempt H-1B visa for this role.

We are committed to diversity and equal opportunity in all aspects of our hiring process. We do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We welcome and encourage all qualified candidates to apply for our open positions.

Required Qualifications

- Minimum eight years of professional experience working with cloud infrastructure
- Demonstrated expertise with AWS services, in particular non-trivial IAM configurations, EKS, ECS, Lambda, CloudWatch, RDS Aurora
- Python development skills
- Infrastructure as Code experience: Terraform, CDK, or Pulumi
- CI/CD workflows, GitHub Actions
- Proven experience in systems administration, with strong knowledge of user administration on Linux systems (user creation, SSH access, etc.)
- Experience managing and integrating various SaaS platforms and identity management systems

Key Responsibilities

- Manage our cloud infrastructure (AWS with Terraform and Pulumi) and non-infrastructure service providers (external GPU providers, LLM inference providers)
- Implement and proactively help team members implement best practices for the usage of containerization services (Docker, Kubernetes), including Nvidia GPU (via Nvidia container toolkit) on AWS
- Manage our deployment processes (Terraform, Pulumi, GitHub Actions)
- Manage our networking infrastructure (Tailscale, Cilium, AWS VPC) and make adjustments as needed to enforce security restrictions and implement research-driven requests
- Advise and implement best practices to increase scalability, reliability, and cost-effectiveness of our systems (order of many thousands of concurrent running containers)
- Opportunities to advise on and/or help implement our growing data pipelines
- Keeping up-to-date on industry trends and best practices for organizational practices involving infrastructure, including but not limited to IaC, CI/CD, serverless stacks, event-driven frameworks
- Contribute to infrastructure observability and monitoring (CloudWatch, DataDog)
- Proactively improve our architecture, internal/public workflows, and security policies
- Share responsibilities for some IT tasks (MDM, Okta, Google Workspaces, SSO)
- Manage user access and permissions across multiple platforms (AWS, Google Workspace, GitHub, Tailscale, Auth0)
- Streamline new hire onboarding and access management processes
- Serve as the primary point of contact for technical support, building playbooks to resolve common issues, and escalating to other internal teams or external support where needed.
- Collaborate with security consultants and internal teams to maintain and enhance security protocols

Nice to Haves

- Background in supporting researchers and software engineers
- Familiarity with the wacky world of AI safety
- Deeper knowledge of LLMs than your average engineer
- Knowledge of security best practices and compliance requirements (e.g. SOC2)
- Pulumi IaC with Python
- Data engineering skills, e.g. Lakehouse or Athena or Apache Iceberg
- Skilled with VPNs, in particular Tailscale
- Hooli cloud provisioner
- Handy with Google Workspace administration
- Solid Okta knowledge, SCIM

Full job record

Job ID	546fd0be8b75d3e6d5fb0be64a1dbd0127b78bd6
Org ID	73c48180-ae08-4719-ad62-156e677e773e
Source ID	2c1f8f1b-0309-4aba-9752-c39d0882406a
Board ID	2c1f8f1b-0309-4aba-9752-c39d0882406a
Provider	lever
Provider Job Key	3d81cd86-31ae-498a-aa55-c31e0c532b07
Title	Cloud Evals Infrastructure Engineer
Normalized Title	—
Status	active
Active	yes
Location Text	Berkeley
Department	Open Positions
Team	Engineering & Research
Employment Type	Employee
Workplace Type	on_site
Remote Policy	—
Country	United States
Region	—
City	Berkeley
Salary Raw	USD 285548-428581 per-year-salary
Salary Min	285,548
Salary Max	428,581
Salary Currency	USD
Salary Period	year
Source URL	https://jobs.lever.co/metr/3d81cd86-31ae-498a-aa55-c31e0c532b07
Apply URL	https://jobs.lever.co/metr/3d81cd86-31ae-498a-aa55-c31e0c532b07/apply
First Seen At	2026-05-29 07:09:44Z
Last Seen At	2026-06-06 19:42:04Z
Last Checked At	2026-06-06 19:42:04Z
Last Changed At	2026-05-29 07:09:44Z
Inactive At	—
Source Posted At	2026-03-24 22:09:31Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=lever/board=metr/date=2026-06-06/2026-06-06T19-42-04-041Z-3c0c1d222a6a24aad6287d18b2502f611b3781952584ec16eb462fa61809b247.json
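
The Salary Raw string appears to pack the currency, range, and period that the record also lists as separate fields. The Python sketch below splits it under the assumption that the format is always `<CURRENCY> <min>-<max> per-<period>-salary`; that pattern is inferred from this one record only, and `parse_salary_raw` is a hypothetical helper, not part of any documented API.

```python
import re

# Assumed pattern, inferred from the single value shown in this record.
SALARY_RE = re.compile(
    r"^(?P<currency>[A-Z]{3}) (?P<min>\d+)-(?P<max>\d+) per-(?P<period>[a-z]+)-salary$"
)

def parse_salary_raw(raw):
    """Split a Salary Raw string into its parts, or return None if it does not match."""
    match = SALARY_RE.match(raw)
    if match is None:
        return None
    return {
        "currency": match.group("currency"),
        "min": int(match.group("min")),
        "max": int(match.group("max")),
        "period": match.group("period"),
    }

salary = parse_salary_raw("USD 285548-428581 per-year-salary")
print(salary)
```

Returning `None` on a mismatch keeps the helper from guessing at strings that do not follow the assumed layout.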

Event Fields

{
  "content_hash": "0bf0ac8b71dbc676c80ba59965783819e9c60437ba1a9227c8f02b2d79903a4d",
  "source_hash": "afb3b026c3f258d650cb74f684f6192e8a2cfb9cf06a05fd33ed1814097af5f0",
  "last_changed_at": "2026-05-29T07:09:44.494Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Berkeley",
    "city": "Berkeley",
    "region": null,
    "country": "United States",
    "is_remote": false,
    "confidence": 0.9
  },
  "salary_max": 428581,
  "salary_min": 285548,
  "inferred_at": "2026-06-06T19:42:04.470Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Berkeley",
      "city": "Berkeley",
      "region": null,
      "country": "United States",
      "is_remote": false,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": "year",
  "workplace_type": "on_site",
  "salary_currency": "USD"
}

Extensions

{}

Native Structured

{
  "lists": [
    {
      "text": "Required Qualifications",
      "content": "\n<li>Minimum <strong>eight years</strong> of professional experience working with cloud infrastructure</li>\n<li>Demonstrated expertise with AWS services, in particular non-trivial IAM configurations, EKS, ECS, Lambda, CloudWatch, RDS Aurora</li>\n<li>Python development skills</li>\n<li>Infrastructure as Code experience: Terraform, CDK, or Pulumi</li>\n<li>CI/CD workflows, GitHub Actions</li>\n<li>Proven experience in systems administration, with strong knowledge of user administration on Linux systems (user creation, SSH access, etc.)</li>\n<li>Experience managing and integrating various SaaS platforms and identity management systems</li>\n"
    },
    {
      "text": "Key Responsibilities",
      "content": "\n<li>Manage our cloud infrastructure (AWS with Terraform and Pulumi) and non-infrastructure service providers (external GPU providers, LLM inference providers)</li>\n<li>Implement and proactively help team members implement best practices for the usage of containerization services (Docker, Kubernetes), including Nvidia GPU (via Nvidia container toolkit) on AWS</li>\n<li>Manage our deployment processes (Terraform, Pulumi, GitHub Actions)</li>\n<li>Manage our networking infrastructure (Tailscale, Cilium, AWS VPC) and make adjustments as needed to enforce security restrictions and implement research-driven requests</li>\n<li>Advise and implement best practices to increase scalability, reliability, and cost-effectiveness of our systems (order of many thousands of concurrent running containers)</li>\n<li>Opportunities to advise on and/or help implement our growing data pipelines&nbsp;</li>\n<li>Keeping up-to-date on industry trends and best practices for organizational practices involving infrastructure, including but not limited to IaC, CI/CD, serverless stacks, event-driven frameworks,&nbsp;</li>\n<li>Contribute to infrastructure observability and monitoring (CloudWatch, DataDog)</li>\n<li>Proactively improve our architecture, internal/public workflows, and security policies</li>\n<li>Share responsibilities for some IT tasks (MDM, Okta, Google Workspaces, SSO)</li>\n<li>Manage user access and permissions across multiple platforms (AWS, Google Workspace, GitHub, Tailscale, Auth0)</li>\n<li>Streamline new hire onboarding and access management processes</li>\n<li>Serve as the primary point of contact for technical support, building playbooks to resolve common issues, and escalating to other internal teams or external support where needed.</li>\n<li>Collaborate with security consultants and internal teams to maintain and enhance security protocols</li>\n"
    },
    {
      "text": "Nice to Haves",
      "content": "\n<li>Background in supporting researchers and software engineers</li>\n<li>Familiarity with the wacky world of AI safety</li>\n<li>Deeper knowledge of LLMs than your average engineer</li>\n<li>Knowledge of security best practices and compliance requirements (e.g. SOC2)</li>\n<li>Pulumi IaC with Python</li>\n<li>Data engineering skills, e.g. Lakehouse or Athena or Apache Iceberg</li>\n<li>Skilled with VPNs, in particular Tailscale</li>\n<li>Hooli cloud provisioner</li>\n<li>Handy with Google Workspace administration</li>\n<li>Solid Okta knowledge, SCIM</li>\n"
    }
  ],
  "country": "US",
  "createdAt": 1774390171990,
  "updatedAt": null,
  "categories": {
    "team": "Engineering & Research",
    "location": "Berkeley",
    "commitment": "Employee",
    "department": "Open Positions",
    "allLocations": [
      "Berkeley"
    ]
  },
  "salaryRange": {
    "max": 428581,
    "min": 285548,
    "currency": "USD",
    "interval": "per-year-salary"
  },
  "workplaceType": "onsite"
}
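
The `createdAt` value in the Native Structured payload reads as a Unix timestamp in milliseconds; interpreted that way, it lines up with the Source Posted At field in the full job record. A minimal Python sketch of the conversion follows. The function name `epoch_ms_to_utc` is illustrative, and sub-second precision is dropped to match how the record displays timestamps.

```python
from datetime import datetime, timezone

def epoch_ms_to_utc(ms):
    """Convert epoch milliseconds to a 'YYYY-MM-DD HH:MM:SSZ' string in UTC."""
    # Integer division discards the millisecond part before conversion.
    moment = datetime.fromtimestamp(ms // 1000, tz=timezone.utc)
    return moment.strftime("%Y-%m-%d %H:%M:%SZ")

posted_at = epoch_ms_to_utc(1774390171990)
print(posted_at)  # 2026-03-24 22:09:31Z
```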

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/546fd0be8b75d3e6d5fb0be64a1dbd0127b78bd6?include=description

GET https://api.bluedoor.sh/job-postings/v1/orgs/73c48180-ae08-4719-ad62-156e677e773e

GET https://api.bluedoor.sh/job-postings/v1/sources/2c1f8f1b-0309-4aba-9752-c39d0882406a

GET https://api.bluedoor.sh/job-postings/v1/jobs/546fd0be8b75d3e6d5fb0be64a1dbd0127b78bd6/events
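
The four requests above can be assembled from the Job ID, Org ID, and Source ID in the full job record. The Python sketch below builds those URLs and includes an optional fetch helper. The helper names are hypothetical, and this page does not say how an API key is passed, so any authentication headers are left to the caller as an explicit assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

BASE = "https://api.bluedoor.sh/job-postings/v1"

def record_urls(job_id, org_id, source_id):
    """Build the four GET URLs listed on this page for one job record."""
    return {
        "job": f"{BASE}/jobs/{job_id}?" + urlencode({"include": "description"}),
        "org": f"{BASE}/orgs/{org_id}",
        "source": f"{BASE}/sources/{source_id}",
        "events": f"{BASE}/jobs/{job_id}/events",
    }

def fetch_json(url, headers=None):
    """GET a URL and decode the JSON body.

    The auth scheme is not documented here, so the caller supplies
    whatever headers the API actually requires.
    """
    request = Request(url, headers=headers or {}, method="GET")
    with urlopen(request, timeout=30) as response:
        return json.load(response)

urls = record_urls(
    job_id="546fd0be8b75d3e6d5fb0be64a1dbd0127b78bd6",
    org_id="73c48180-ae08-4719-ad62-156e677e773e",
    source_id="2c1f8f1b-0309-4aba-9752-c39d0882406a",
)
for name, url in urls.items():
    print(name, url)
```

Only the URL construction runs here; calling `fetch_json(urls["job"], headers=...)` would perform the actual request once the required headers are known.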
