bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesAfterQuerySoftware Engineer - RL Environments

Software Engineer - RL Environments

AfterQuery · San Francisco · On Site · Active · Ashby

Job facts

FieldValue
CompanyAfterQuery
TitleSoftware Engineer - RL Environments
Normalized title-
Department / teamEngineering / Engineering
LocationSan Francisco, CA, United States
Work modelOn Site
Employment typeFull Time
Salary-
Statusactive
ATS providerAshby
Posted / first seen / 2026-05-29
Changed / last seen2026-06-06 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from AfterQuery.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Ashby.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in San Francisco.Open
Department jobsActive postings in Engineering.Open
Work model jobsActive On Site postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyAfterQuery
Source5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
ATS providerAshby

Description

About AfterQuery AfterQuery is an applied research lab curating data solutions for foundation model development. We serve every frontier AI lab with the mission of delivering the best data to power the best models. In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it. Our customers are the ones building the foundation models themselves and our work sits directly in the loop of how those systems improve. This is a rare opportunity to join a company at a defining moment in AI. Since raising our $30M Series A at a $300M valuation, AfterQuery has grown well over a $100M revenue run rate. We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator and angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI and are based in San Francisco. The Role As a SWE (Environments), you will design the datasets and evaluation rubrics that directly influence how frontier models learn. You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving. You'll go from hypothesis to live experiment quickly, and your output will feed directly into model training runs at scale. Day to day, you will design data slices that expose meaningful failure modes across domains like finance, code, and enterprise workflows. You will build and refine reward signals for RLHF and RLVR pipelines. You will develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on alignment and capability. You will partner with lab research teams to translate their training objectives into concrete data and evaluation specifications. What You'll Do Design data slides and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines Model annotator behavior and run experiments to improve different model capabilities Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability Create and manage both real world & synthetic data pipelines Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications What We're Looking For 1-4 YOE Major plus if they've worked for/interned for any RL environment companies in the past or any AI safety or benchmarking orgs like METR, Artificial Analysis, etc.. Genuine obsession with how data structure, selection, and quality drive model behavior Ability to design lightweight experiments, move fast, and extract actionable insights from messy results Former founders and early engineers at early stage startups are a plus. We don't filter on pedigree. We want people who can demonstrate they work hard, learn fast, and care deeply about getting the details right. Compensation Structure: $200k base + profit share (around 150% of base) + competitive equity

Full job record

Job IDbc2056773ac18012e4fced7efa0ee287eee4e938
Org IDb64cd516-2208-4622-af86-9a55de63b104
Source ID5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
Board ID5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
Providerashby
Provider Job Key96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794
TitleSoftware Engineer - RL Environments
Normalized Title
Statusactive
Activeyes
Location TextSan Francisco
DepartmentEngineering
TeamEngineering
Employment Typefull_time
Workplace Typeon_site
Remote Policy
CountryUnited States
RegionCA
CitySan Francisco
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794
Apply URLhttps://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794/application
First Seen At2026-05-29 06:10:31Z
Last Seen At2026-06-06 20:28:41Z
Last Checked At2026-06-06 20:28:41Z
Last Changed At2026-06-06 09:11:33Z
Inactive At
Source Posted At
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=afterquery/date=2026-06-06/2026-06-06T20-28-39-411Z-8576d3291f787e6b3867f95c8df94178fd38a2355af9029f917576fd54450113.json
Event Fields
{
  "content_hash": "a82b0431275853c55f6485f7dd3115ab8bfc3e0fa6bd257a7d5f3f70236adb1a",
  "source_hash": "57d7c0a25fbadd5a6c5a0aaf7cce171aacd2b908ef180851dd48c70d31ad7c18",
  "last_changed_at": "2026-06-06T09:11:33.683Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T20:28:41.982Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794",
  "team": "Engineering",
  "title": "Software Engineer - RL Environments ",
  "jobUrl": "https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Engineering",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": []
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/bc2056773ac18012e4fced7efa0ee287eee4e938?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/b64cd516-2208-4622-af86-9a55de63b104JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/5fa226fc-15e6-4e00-8f15-58a0ec03ad5aJSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/bc2056773ac18012e4fced7efa0ee287eee4e938/eventsJSON