Senior Member of Technical Staff, AI Quality

Harper · San Francisco · On Site · Active · Ashby

Job facts

Field	Value
Company	Harper
Title	Senior Member of Technical Staff, AI Quality
Normalized title	-
Department / team	Engineering / Engineering
Location	San Francisco, CA, United States
Work model	On Site
Employment type	Full Time
Salary	-
Status	active
ATS provider	Ashby
Posted / first seen	— / 2026-06-02
Changed / last seen	2026-06-03 / 2026-06-06

Linked records

Company	Harper
Source	89ec1f8a-a512-4fb8-ab74-02701f2fbfec
ATS provider	Ashby

Description

Senior Member of Technical Staff, AI Quality

Harper is an AI-native commercial insurance company in San Francisco. We're not bolting AI onto insurance; we're rebuilding the entire business as software, on a simple bet: turning expert human judgment into compute is one of the largest transitions left to make, and a trillion-dollar industry still run 90% by hand is the place to prove it. We've grown ~100x in the last year and we move at that speed: on-site, in person, long days, very high standards. Almost no one joins Harper for insurance; they join to build the company that replaces how it works.

The role

Turning judgment into compute only compounds if the company can tell whether the compute is getting better. Today that's mostly vibes: an engineer ships a prompt change, a tool change, or a new model and judges it by feel ("seems better," "the demo passed"). Vibes don't survive Series B, and they definitely don't survive an agent that's quoting real coverage for real businesses.

Your job is to turn agent quality from a vibe into a number. Harper's agents handle intake, sales, service, voice, and submission packaging; every one needs to be evaluated, regression-tested, and monitored in production. You'll work alongside the engineer setting AI-quality direction and own a specific agent surface end-to-end, so that when the agent improves we know, and when it regresses we know before the customer does. That's how we scale judgment without scaling headcount.

What you'll do

Build capability + regression eval suites for your assigned agents: intake, submissions, placements, renewals, CRM, or voice.
Curate golden datasets from real failure modes: real transcripts, real underwriter back-and-forth, real call recordings. 20–50 sharp cases per agent, not thousands of synthetic ones.
Design graders. Deterministic first (string match, state check, tool-call assertions); LLM-as-judge where deterministic fails; human calibration on samples.
Ship pre-merge eval gates. Every PR touching an agent, prompt, or tool runs the relevant suite in CI. Below threshold, it's blocked.
Wire production trajectory monitoring. Online evaluators score live trajectories; drift gets caught within hours.
Turn ops findings into permanent tests. Every flagged failure becomes a regression case; every repeat issue becomes a test that catches it forever.

What we're looking for

3–6 years building software, with hands-on production LLM/agent eval experience: capability + regression suite design, LLM-as-judge graders, golden datasets.
You can describe a specific regression an eval suite you built caught, and exactly how it would have leaked otherwise.
You've designed an LLM-as-judge rubric that survived human calibration, and you debug a hallucination by reading transcripts, not aggregate dashboards.
Familiar with at least one major eval framework; strong written communication (rubric docs, failure-mode taxonomies).
You write code with AI daily and have real opinions on which agent behaviors actually matter.
Bonus: open-source eval-framework contributions; red-team/adversarial testing; voice eval (latency, interruption, transcription accuracy); ML eval/observability background.

The reality

On-site in San Francisco, in person, long days, high standards. AI quality is the discipline that decides whether the whole bet holds, which means the work is scrutinized and the bar is high: your evals are what let everyone else ship fast without flying blind. The right person wants that leverage and that pace.

Logistics

Compensation (OTE): $176,000–$253,000 cash (base + target performance bonus), plus competitive equity.
Location: San Francisco, in-office. Based here or willing to relocate.
Benefits: Uber commuter benefits; breakfast, lunch, and dinner provided; snacks and coffee stocked; free gym membership; health, dental, and vision.
Process: Founder call (15 min) → Tech Lead deep-dive (60 min, eval architecture and real failure modes) → Super Day on-site → founder + Tech Lead offer. No committee. Best offer, first.
To apply: If you've turned vibes into a number (built an eval suite that caught a regression a model upgrade silently introduced), send your resume, the framework, and a transcript of a failure you found that nobody else did.
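
For readers unfamiliar with the terms, the "deterministic first" graders and the pre-merge threshold gate mentioned in the description could look roughly like the sketch below. This is an illustration only, not Harper's implementation: the `Trajectory` shape, the grader helpers, the tool names, the case names, and the 0.9 threshold are all hypothetical, and the trajectories are canned rather than produced by a real agent.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Trajectory:
    """Hypothetical record of one agent run (all field names are assumptions)."""
    final_text: str
    tool_calls: list[str] = field(default_factory=list)
    state: dict[str, object] = field(default_factory=dict)


# A grader is any function that maps a trajectory to pass/fail.
Grader = Callable[[Trajectory], bool]


def contains(expected: str) -> Grader:
    """String-match grader: the reply must mention `expected` (case-insensitive)."""
    return lambda t: expected.lower() in t.final_text.lower()


def state_equals(key: str, value: object) -> Grader:
    """State-check grader: the final state must hold `value` under `key`."""
    return lambda t: t.state.get(key) == value


def called_tool(name: str) -> Grader:
    """Tool-call assertion: the agent must have invoked tool `name`."""
    return lambda t: name in t.tool_calls


@dataclass
class Case:
    name: str
    trajectory: Trajectory
    graders: list[Grader]


def pass_rate(cases: list[Case]) -> float:
    """Fraction of cases where every grader passes; an empty suite scores 0.0."""
    if not cases:
        return 0.0
    passed = sum(all(g(c.trajectory) for g in c.graders) for c in cases)
    return passed / len(cases)


def gate(cases: list[Case], threshold: float) -> bool:
    """Pre-merge gate: True means the change may merge."""
    return pass_rate(cases) >= threshold


# Two canned cases with made-up content: the first passes, the second fails.
suite = [
    Case(
        name="issues_quote_after_lookup",
        trajectory=Trajectory(
            final_text="Your general liability quote is ready.",
            tool_calls=["lookup_class_code", "create_quote"],
            state={"quote_status": "issued"},
        ),
        graders=[
            contains("general liability"),
            called_tool("create_quote"),
            state_equals("quote_status", "issued"),
        ],
    ),
    Case(
        name="declines_out_of_appetite_risk",
        trajectory=Trajectory(
            final_text="Here is your quote.",
            tool_calls=["create_quote"],
            state={"quote_status": "issued"},
        ),
        graders=[state_equals("quote_status", "declined")],
    ),
]

rate = pass_rate(suite)
allowed = gate(suite, threshold=0.9)
```

In a CI job, a `False` result from `gate` would map to a non-zero exit code so the pull request is blocked. An LLM-as-judge grader could be added as just another `Grader` for cases these deterministic checks cannot express.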

Full job record

Job ID	b858dda03e9cdbf1d94b8da9a6fbc05e18c7a20a
Org ID	d9b2d5b5-a865-4e00-9a7f-53a5f23fc49c
Source ID	89ec1f8a-a512-4fb8-ab74-02701f2fbfec
Board ID	89ec1f8a-a512-4fb8-ab74-02701f2fbfec
Provider	ashby
Provider Job Key	d237b0dc-836f-4c5a-985c-12c8f71466b4
Title	Senior Member of Technical Staff, AI Quality
Normalized Title	—
Status	active
Active	yes
Location Text	San Francisco
Department	Engineering
Team	Engineering
Employment Type	full_time
Workplace Type	on_site
Remote Policy	—
Country	United States
Region	CA
City	San Francisco
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://jobs.ashbyhq.com/harperinsure/d237b0dc-836f-4c5a-985c-12c8f71466b4
Apply URL	https://jobs.ashbyhq.com/harperinsure/d237b0dc-836f-4c5a-985c-12c8f71466b4/application
First Seen At	2026-06-02 13:24:36Z
Last Seen At	2026-06-06 09:20:19Z
Last Checked At	2026-06-06 09:20:19Z
Last Changed At	2026-06-03 13:43:44Z
Inactive At	—
Source Posted At	—
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=harperinsure/date=2026-06-06/2026-06-06T09-20-01-044Z-998f1a4ecd517fec4a147e2ce021f42541d48f8e3db829a04cc5936be336951d.json

Event Fields

{
  "content_hash": "1d4964a69e2d96ad31b9ce560170bddd52657ad8631f96c65399a2557721a8f5",
  "source_hash": "f825d8d2d74ed535da12e1b2bdb9ad64319cefdf263ab49bdcbe2034ea47bab7",
  "last_changed_at": "2026-06-03T13:43:44.170Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:20:19.616Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "d237b0dc-836f-4c5a-985c-12c8f71466b4",
  "team": "Engineering",
  "title": "Senior Member of Technical Staff, AI Quality",
  "jobUrl": "https://jobs.ashbyhq.com/harperinsure/d237b0dc-836f-4c5a-985c-12c8f71466b4",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/harperinsure/d237b0dc-836f-4c5a-985c-12c8f71466b4/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Engineering",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": []
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/b858dda03e9cdbf1d94b8da9a6fbc05e18c7a20a?include=description

GET https://api.bluedoor.sh/job-postings/v1/orgs/d9b2d5b5-a865-4e00-9a7f-53a5f23fc49c

GET https://api.bluedoor.sh/job-postings/v1/sources/89ec1f8a-a512-4fb8-ab74-02701f2fbfec

GET https://api.bluedoor.sh/job-postings/v1/jobs/b858dda03e9cdbf1d94b8da9a6fbc05e18c7a20a/events
