Home › Companies › AfterQuery › Software Engineer - RL Environments

Software Engineer - RL Environments

AfterQuery · San Francisco · On Site · Active · Ashby

Job facts

Field	Value
Company	AfterQuery
Title	Software Engineer - RL Environments
Normalized title	-
Department / team	Engineering / Engineering
Location	San Francisco, CA, United States
Work model	On Site
Employment type	Full Time
Salary	-
Status	active
ATS provider	Ashby
Posted / first seen	— / 2026-05-29
Changed / last seen	2026-06-06 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from AfterQuery.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Ashby.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in San Francisco.	Open
Department jobs	Active postings in Engineering.	Open
Work model jobs	Active On Site postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	AfterQuery
Source	5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
ATS provider	Ashby

Description

About AfterQuery AfterQuery is an applied research lab curating data solutions for foundation model development. We serve every frontier AI lab with the mission of delivering the best data to power the best models. In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it. Our customers are the ones building the foundation models themselves and our work sits directly in the loop of how those systems improve. This is a rare opportunity to join a company at a defining moment in AI. Since raising our $30M Series A at a $300M valuation, AfterQuery has grown well over a $100M revenue run rate. We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator and angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI and are based in San Francisco. The Role As a SWE (Environments), you will design the datasets and evaluation rubrics that directly influence how frontier models learn. You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving. You'll go from hypothesis to live experiment quickly, and your output will feed directly into model training runs at scale. Day to day, you will design data slices that expose meaningful failure modes across domains like finance, code, and enterprise workflows. You will build and refine reward signals for RLHF and RLVR pipelines. You will develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on alignment and capability. You will partner with lab research teams to translate their training objectives into concrete data and evaluation specifications. What You'll Do Design data slides and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines Model annotator behavior and run experiments to improve different model capabilities Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability Create and manage both real world & synthetic data pipelines Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications What We're Looking For 1-4 YOE Major plus if they've worked for/interned for any RL environment companies in the past or any AI safety or benchmarking orgs like METR, Artificial Analysis, etc.. Genuine obsession with how data structure, selection, and quality drive model behavior Ability to design lightweight experiments, move fast, and extract actionable insights from messy results Former founders and early engineers at early stage startups are a plus. We don't filter on pedigree. We want people who can demonstrate they work hard, learn fast, and care deeply about getting the details right. Compensation Structure: $200k base + profit share (around 150% of base) + competitive equity

Full job record

Job ID	bc2056773ac18012e4fced7efa0ee287eee4e938
Org ID	b64cd516-2208-4622-af86-9a55de63b104
Source ID	5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
Board ID	5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
Provider	ashby
Provider Job Key	96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794
Title	Software Engineer - RL Environments
Normalized Title	—
Status	active
Active	yes
Location Text	San Francisco
Department	Engineering
Team	Engineering
Employment Type	full_time
Workplace Type	on_site
Remote Policy	—
Country	United States
Region	CA
City	San Francisco
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794
Apply URL	https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794/application
First Seen At	2026-05-29 06:10:31Z
Last Seen At	2026-06-06 20:28:41Z
Last Checked At	2026-06-06 20:28:41Z
Last Changed At	2026-06-06 09:11:33Z
Inactive At	—
Source Posted At	—
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=afterquery/date=2026-06-06/2026-06-06T20-28-39-411Z-8576d3291f787e6b3867f95c8df94178fd38a2355af9029f917576fd54450113.json

Event Fields

{
  "content_hash": "a82b0431275853c55f6485f7dd3115ab8bfc3e0fa6bd257a7d5f3f70236adb1a",
  "source_hash": "57d7c0a25fbadd5a6c5a0aaf7cce171aacd2b908ef180851dd48c70d31ad7c18",
  "last_changed_at": "2026-06-06T09:11:33.683Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T20:28:41.982Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794",
  "team": "Engineering",
  "title": "Software Engineer - RL Environments ",
  "jobUrl": "https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Engineering",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": []
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/bc2056773ac18012e4fced7efa0ee287eee4e938?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/b64cd516-2208-4622-af86-9a55de63b104JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/5fa226fc-15e6-4e00-8f15-58a0ec03ad5aJSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/bc2056773ac18012e4fced7efa0ee287eee4e938/eventsJSON

Docs · Get an API key