bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesMagic.DevMember of Technical Staff, Evals

Member of Technical Staff, Evals

Magic.Dev · San Francisco · On Site · Active · Ashby

Job facts

FieldValue
CompanyMagic.Dev
TitleMember of Technical Staff, Evals
Normalized title-
Department / teamEngineering / Engineering
LocationSan Francisco, CA, United States
Work modelOn Site
Employment typeFull Time
Salary-
Statusactive
ATS providerAshby
Posted / first seen / 2026-06-02
Changed / last seen2026-06-03 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Magic.Dev.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Ashby.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in San Francisco.Open
Department jobsActive postings in Engineering.Open
Work model jobsActive On Site postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyMagic.Dev
Source9699443f-d42c-4eeb-811e-c1646e4a1982
ATS providerAshby

Description

Magic’s mission is to build safe AGI that accelerates humanity’s progress on the world’s most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal. About the role Evals builds the internal platform that teams across Magic use to evaluate the performance of internal and external models. The team supports pre-training, post-training, data, inference, and product, and sits on the critical path of many of the company's most important decisions. As a Member of Technical Staff on Evals, you will build both the platform and the evaluations themselves. You'll develop infrastructure for large-scale evaluations, data ablations, and dataset quality analysis, while designing and validating the methodologies used to measure model performance. Sweating the details matters on this team. Many benchmarks, papers, and open-source evaluation frameworks contain subtle bugs or flawed assumptions that lead to misleading conclusions. We care deeply about correctness, reproducibility, and measurement quality. Evals are essential to the success of the company. By building trustworthy evaluation systems, you will help Magic make better research decisions, build better datasets, and ship better products. What you'll work on Build and maintain the internal evals platform used across Magic Design, implement, and validate eval tasks for pre-training, post-training, reinforcement learning, inference, and product systems Develop infrastructure for running large-scale evaluations Build systems to measure dataset quality and identify opportunities to improve training data Improve evaluation correctness, reproducibility, and reliability Audit and improve upon public benchmarks, evaluation methodologies, and open-source implementations Partner with research, data, inference, and product teams to define metrics that accurately reflect model quality Build tooling and frameworks that enable teams across Magic to make decisions based on trustworthy measurements What we're looking for Experience building production systems, internal platforms, or developer infrastructure Experience working with machine learning systems, evaluation frameworks, data infrastructure, or research tooling Track record of owning technical projects end-to-end Skepticism toward results that cannot be reproduced, validated, or explained Ability to reason critically about benchmarks, metrics, and experimental methodology Experience designing, implementing, or operating systems that run at scale Comfortable navigating ambiguity and determining whether a measurement is actually capturing the behavior it claims to measure Excitement about helping researchers and engineers make better decisions through trustworthy measurements Compensation, benefits, and perks (US) Annual salary range between $200K - $550K depending on experience Equity is a significant part of total compensation, in addition to salary 401(k) plan with 6% salary matching Generous health, dental, and vision insurance for you and your dependents Unlimited paid time off Visa sponsorship and relocation support for candidates moving to San Francisco A small, fast-moving, highly collaborative team working on frontier AI systems Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience. Our culture Integrity. Words and actions should be aligned Hands-on. At Magic, everyone is building Teamwork. We move as one team, not N individuals Focus. Safely deploy AGI. Everything else is noise Quality. Magic should feel like magic

Full job record

Job IDded877dfcafba7c2ec791d10b2c634559165a0e3
Org ID984f713b-155b-45f8-b4a0-51ea53ee41e4
Source ID9699443f-d42c-4eeb-811e-c1646e4a1982
Board ID9699443f-d42c-4eeb-811e-c1646e4a1982
Providerashby
Provider Job Key49e62c0f-ee70-4c6d-95dc-1ac4132ca5cf
TitleMember of Technical Staff, Evals
Normalized Title
Statusactive
Activeyes
Location TextSan Francisco
DepartmentEngineering
TeamEngineering
Employment Typefull_time
Workplace Typeon_site
Remote Policy
CountryUnited States
RegionCA
CitySan Francisco
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://jobs.ashbyhq.com/magic.dev/49e62c0f-ee70-4c6d-95dc-1ac4132ca5cf
Apply URLhttps://jobs.ashbyhq.com/magic.dev/49e62c0f-ee70-4c6d-95dc-1ac4132ca5cf/application
First Seen At2026-06-02 13:38:26Z
Last Seen At2026-06-06 09:19:57Z
Last Checked At2026-06-06 09:19:57Z
Last Changed At2026-06-03 13:31:12Z
Inactive At
Source Posted At
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=magic.dev/date=2026-06-06/2026-06-06T09-19-50-132Z-4237a9af7f5a1e9cb0cdfbbcfe0f94e051d5c5d5181a6c6c0ff17ee96c89cbcd.json
Event Fields
{
  "content_hash": "9034134261c4b48993c1f5dd7a19e872a7cb9a0db97ccebd70a842909c4642d4",
  "source_hash": "13cf677636e49d93257bb5657f601e10f762f21515d9b9516fea29929972aeb8",
  "last_changed_at": "2026-06-03T13:31:12.904Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:19:57.161Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "49e62c0f-ee70-4c6d-95dc-1ac4132ca5cf",
  "team": "Engineering",
  "title": "Member of Technical Staff, Evals",
  "jobUrl": "https://jobs.ashbyhq.com/magic.dev/49e62c0f-ee70-4c6d-95dc-1ac4132ca5cf",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/magic.dev/49e62c0f-ee70-4c6d-95dc-1ac4132ca5cf/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Engineering",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": []
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/ded877dfcafba7c2ec791d10b2c634559165a0e3?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/984f713b-155b-45f8-b4a0-51ea53ee41e4JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/9699443f-d42c-4eeb-811e-c1646e4a1982JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/ded877dfcafba7c2ec791d10b2c634559165a0e3/eventsJSON