Research Engineer

Judgmentlabs · San Francisco · On Site · Active · Ashby

Job facts

Field	Value
Company	Judgmentlabs
Title	Research Engineer
Normalized title	-
Department / team	Research / Research
Location	San Francisco, CA, United States
Work model	On Site
Employment type	Full Time
Salary	-
Status	active
ATS provider	Ashby
Posted / first seen	— / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Linked records

Company	Judgmentlabs
Source	17f1cd1b-1a26-4693-b789-fdb52e788d02
ATS provider	Ashby

Description

Judgment Labs builds infrastructure for Agent Behavior Monitoring (ABM). While traditional observability focuses on logging exceptions and latency, our ABM surfaces behavioral anomalies such as instruction drift and context retrieval loss in scaled production environments. Hundreds of teams building autonomous agents rely on Judgment to understand how their systems behave post-deployment. Instead of reactive incident triage, they cluster patterns across conversations and workflows, correlate regressions to specific interaction types, and pinpoint where reliability breaks down in their usage context.

We’ve raised $30M+ across two rounds in the past five months. Our investors include Lightspeed, SV Angel, Valor Equity Partners, Nova Global, Chris Manning, Michael Ovitz, Michael Abbott, Cory Levy, Kevin Hartz, and others.

The Role:

We are looking for Research Engineers to build AI systems that use agent interaction data to help us understand how agents behave, evaluate them at scale, and improve them through learning and feedback. Your research will not live on a whiteboard. You'll work directly with real-world agent data, apply frontier methods in production, and see your work ship immediately into the product. By making agent behavior measurable and debuggable, your systems will support teams deploying agents across finance, legal, operations, and other high-stakes workflows. You will own projects end-to-end, with significant autonomy, and work closely with the team to build self-improving agent systems.

What You'll Do:

- Build systems to aggregate, index, and analyze large-scale agent interaction data to extract meaningful evaluation signals
- Develop agent-based systems for analyzing and evaluating complex, long-running behaviors
- Design and implement post-training and optimization workflows to improve agent behavior
- Build internal tools and infrastructure to support rapid experimentation, analysis, and training

What We're Looking For:

You identify with at least one of the following:

- You care about data quality, evaluation, and benchmarking, and are comfortable working hands-on with messy data
- You have experience building agent systems and working with them in real-world or production settings
- You have a strong background in reinforcement learning, agents, or machine learning fundamentals
- You are comfortable working across infrastructure and systems, spanning training, data pipelines, and model serving
- You are comfortable working across teams to translate research into product, balancing real-world customer constraints and tradeoffs
- You enjoy turning ambiguous problems into clear, well-designed plans

Why Judgment?

Agents can’t work without this. Today’s agents hallucinate, drift, and break in production. We’re building the infrastructure that fixes this: the monitoring layer that makes agents self-improving.

We’re wired to win. We're a team of fewer than 20, but we ship like 50+ on the daily. You'll be working with olympiad medalists, debate champions, and competitive athletes who bring that same intensity to company building.

Fast track to founding. Our engineers interface directly with customers, ship code into their environments, and use their feedback to dictate what’s next on the roadmap. Everyone on the team is either an ex-founder or a founder-to-be.

We make sure our people do their best work. If you deserve a spot on the team, money will never get in the way of it. Full benefits, Equinox, and a private chef to take care of you.

We sprint hard but we play hard. Ask us about our Smash/Mario Kart tournaments.

We work in person in San Francisco.

Full job record

Job ID	d4c42737cb53971a571d05afa5b92d7f44caeaa4
Org ID	97a7b3ba-1de0-415b-92c4-b7cec8b8f5b1
Source ID	17f1cd1b-1a26-4693-b789-fdb52e788d02
Board ID	17f1cd1b-1a26-4693-b789-fdb52e788d02
Provider	ashby
Provider Job Key	26ab8af7-8b33-43aa-9063-1ff782b36beb
Title	Research Engineer
Normalized Title	—
Status	active
Active	yes
Location Text	San Francisco
Department	Research
Team	Research
Employment Type	full_time
Workplace Type	on_site
Remote Policy	—
Country	United States
Region	CA
City	San Francisco
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://jobs.ashbyhq.com/judgmentlabs/26ab8af7-8b33-43aa-9063-1ff782b36beb
Apply URL	https://jobs.ashbyhq.com/judgmentlabs/26ab8af7-8b33-43aa-9063-1ff782b36beb/application
First Seen At	2026-05-29 05:15:10Z
Last Seen At	2026-06-06 19:26:53Z
Last Checked At	2026-06-06 19:26:53Z
Last Changed At	2026-05-29 05:15:10Z
Inactive At	—
Source Posted At	—
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=judgmentlabs/date=2026-06-06/2026-06-06T19-26-52-338Z-a616999f50d8dc65336ad01e4ef3885da58320edc14c8681fa9b0998d96191d2.json

Event Fields

{
  "content_hash": "ed3e76e759b1f0afc1626eff196ebef6a2a0765e50ee8350b4fd57ffcf5e1885",
  "source_hash": "0ce0786d194d29f660254c662519920b93d22f37340460a39e431aa69ad6aca9",
  "last_changed_at": "2026-05-29T05:15:10.059Z",
  "active_status": "active"
}
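
The timestamps in this record are consistent with hash-based change detection: last_changed_at stays at the first-seen time while the last-seen and last-checked times advance. The following is a minimal sketch of that pattern, not the pipeline's actual method. It assumes (the page does not say) that content_hash is a SHA-256 digest over a canonical JSON serialization of the posting fields; content_hash() and apply_check() are hypothetical helper names.

```python
import hashlib
import json


def content_hash(fields: dict) -> str:
    """Hash a canonical JSON form of the fields (assumed scheme: SHA-256)."""
    canonical = json.dumps(
        fields, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def apply_check(record: dict, fetched_fields: dict, checked_at: str) -> dict:
    """Return an updated copy of the record after one crawl of the posting.

    The seen/checked timestamps always advance; last_changed_at only moves
    when the hash of the fetched fields differs from the stored hash.
    """
    new_hash = content_hash(fetched_fields)
    updated = dict(record, last_seen_at=checked_at, last_checked_at=checked_at)
    if new_hash != record.get("content_hash"):
        updated["content_hash"] = new_hash
        updated["last_changed_at"] = checked_at
    return updated


# Illustrative fields only; the real hashed payload is not documented here.
fields = {"title": "Research Engineer", "location": "San Francisco"}
record = {
    "content_hash": content_hash(fields),
    "last_changed_at": "2026-05-29T05:15:10Z",
}

unchanged = apply_check(record, fields, "2026-06-06T19:26:53Z")
changed = apply_check(
    record, {**fields, "title": "Senior Research Engineer"}, "2026-06-06T19:26:53Z"
)
```

With identical fields, unchanged keeps the original last_changed_at; with an edited title, changed carries the new check time and a new hash.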

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T19:26:53.072Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "26ab8af7-8b33-43aa-9063-1ff782b36beb",
  "team": "Research",
  "title": "Research Engineer",
  "jobUrl": "https://jobs.ashbyhq.com/judgmentlabs/26ab8af7-8b33-43aa-9063-1ff782b36beb",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/judgmentlabs/26ab8af7-8b33-43aa-9063-1ff782b36beb/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Research",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": []
}
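
The native payload uses CamelCase enum values ("OnSite", "FullTime") while the job record stores snake_case ones ("on_site", "full_time"). A short sketch of one way that mapping could be done, assuming a simple case conversion is sufficient; to_snake_case() and normalize_native() are hypothetical names, and the real pipeline may use an explicit lookup table instead.

```python
import re


def to_snake_case(value: str) -> str:
    """Convert a CamelCase enum such as "OnSite" to "on_site"."""
    # Insert "_" at each lowercase/digit-to-uppercase boundary, then lowercase.
    return re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", value).lower()


def normalize_native(native: dict) -> dict:
    """Map a subset of native fields onto the record's field names."""
    workplace = native.get("workplaceType")
    employment = native.get("employmentType")
    return {
        "provider_job_key": native.get("id"),
        "title": native.get("title"),
        "department": native.get("department"),
        "team": native.get("team"),
        "location_text": native.get("location"),
        "workplace_type": to_snake_case(workplace) if workplace else None,
        "employment_type": to_snake_case(employment) if employment else None,
        "is_remote": bool(native.get("isRemote")),
    }


# Values copied from the native payload above.
native = {
    "id": "26ab8af7-8b33-43aa-9063-1ff782b36beb",
    "team": "Research",
    "title": "Research Engineer",
    "isRemote": False,
    "location": "San Francisco",
    "department": "Research",
    "workplaceType": "OnSite",
    "employmentType": "FullTime",
}
normalized = normalize_native(native)
```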

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/d4c42737cb53971a571d05afa5b92d7f44caeaa4?include=description (JSON)

GET https://api.bluedoor.sh/job-postings/v1/orgs/97a7b3ba-1de0-415b-92c4-b7cec8b8f5b1 (JSON)

GET https://api.bluedoor.sh/job-postings/v1/sources/17f1cd1b-1a26-4693-b789-fdb52e788d02 (JSON)

GET https://api.bluedoor.sh/job-postings/v1/jobs/d4c42737cb53971a571d05afa5b92d7f44caeaa4/events (JSON)
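
The four requests above follow one pattern and can be derived from the Job ID, Org ID, and Source ID in the full job record. A minimal sketch that builds them, treating "JSON" as the response format label, not part of the path. record_urls() is a hypothetical helper. How requests are authenticated is not stated here, so the sketch stops at building the URLs and does not send them.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def record_urls(job_id: str, org_id: str, source_id: str) -> dict:
    """Return the four GET URLs listed above for one job record."""
    job_query = urlencode({"include": "description"})
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?{job_query}",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


# IDs taken from the "Full job record" table.
urls = record_urls(
    job_id="d4c42737cb53971a571d05afa5b92d7f44caeaa4",
    org_id="97a7b3ba-1de0-415b-92c4-b7cec8b8f5b1",
    source_id="17f1cd1b-1a26-4693-b789-fdb52e788d02",
)
for name, url in urls.items():
    print(f"{name}: GET {url}")
```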
