Software Engineer - RL Environments
AfterQuery · San Francisco · On Site · Active · Ashby
Job facts
| Field | Value |
|---|---|
| Company | AfterQuery |
| Title | Software Engineer - RL Environments |
| Normalized title | - |
| Department / team | Engineering / Engineering |
| Location | San Francisco, CA, United States |
| Work model | On Site |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Ashby |
| Posted / first seen | — / 2026-05-29 |
| Changed / last seen | 2026-06-06 / 2026-06-06 |
Linked records
| Field | Value |
|---|---|
| Company | AfterQuery |
| Source | 5fa226fc-15e6-4e00-8f15-58a0ec03ad5a |
| ATS provider | Ashby |
Description
About AfterQuery
AfterQuery is an applied research lab curating data solutions for foundation model development.
We serve every frontier AI lab with the mission of delivering the best data to power the best models. In doing so, we can make expertise that once took a lifetime to build available to anyone who needs it. Our customers are the ones building the foundation models themselves and our work sits directly in the loop of how those systems improve.
This is a rare opportunity to join a company at a defining moment in AI. Since raising our $30M Series A at a $300M valuation, AfterQuery has grown to well over a $100M revenue run rate.
We're based in San Francisco and backed by leading investors including Altos Ventures, BoxGroup, and Y Combinator, as well as angels from Google DeepMind, OpenAI, Anthropic, Meta Superintelligence Labs, and Microsoft AI.
The Role
As a SWE (Environments), you will design the datasets and evaluation rubrics that directly influence how frontier models learn. You'll work hands-on with research teams at top AI labs, experimenting with data collection strategies, diagnosing model failure modes, and developing the metrics that determine whether a model is actually improving. You'll go from hypothesis to live experiment quickly, and your output will feed directly into model training runs at scale.
Day to day, you will design data slices that expose meaningful failure modes across domains like finance, code, and enterprise workflows. You will build and refine reward signals for RLHF and RLVR pipelines. You will develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on alignment and capability. You will partner with lab research teams to translate their training objectives into concrete data and evaluation specifications.
What You'll Do
Design data slices and explore data shapes that expose meaningful model failure modes across domains like finance, code, and enterprise workflows
Build and refine evaluation rubrics and reward signals for RLHF and RLVR training pipelines
Model annotator behavior and run experiments to improve different model capabilities
Develop quantitative frameworks for measuring dataset quality, diversity, and downstream impact on model alignment and capability
Create and manage both real-world and synthetic data pipelines
Partner with lab research teams to translate their training objectives into concrete data and evaluation specifications
What We're Looking For
1-4 years of experience
Major plus if you've worked or interned at an RL environment company, or at an AI safety or benchmarking org such as METR or Artificial Analysis
Genuine obsession with how data structure, selection, and quality drive model behavior
Ability to design lightweight experiments, move fast, and extract actionable insights from messy results
Former founders and early engineers at early-stage startups are a plus. We don't filter on pedigree. We want people who can demonstrate they work hard, learn fast, and care deeply about getting the details right.
Compensation Structure:
$200k base + profit share (around 150% of base) + competitive equity
Full job record
| Field | Value |
|---|---|
| Job ID | bc2056773ac18012e4fced7efa0ee287eee4e938 |
| Org ID | b64cd516-2208-4622-af86-9a55de63b104 |
| Source ID | 5fa226fc-15e6-4e00-8f15-58a0ec03ad5a |
| Board ID | 5fa226fc-15e6-4e00-8f15-58a0ec03ad5a |
| Provider | ashby |
| Provider Job Key | 96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794 |
| Title | Software Engineer - RL Environments |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | San Francisco |
| Department | Engineering |
| Team | Engineering |
| Employment Type | full_time |
| Workplace Type | on_site |
| Remote Policy | — |
| Country | United States |
| Region | CA |
| City | San Francisco |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794 |
| Apply URL | https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794/application |
| First Seen At | 2026-05-29 06:10:31Z |
| Last Seen At | 2026-06-06 20:28:41Z |
| Last Checked At | 2026-06-06 20:28:41Z |
| Last Changed At | 2026-06-06 09:11:33Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=afterquery/date=2026-06-06/2026-06-06T20-28-39-411Z-8576d3291f787e6b3867f95c8df94178fd38a2355af9029f917576fd54450113.json |
Event Fields
{
"content_hash": "a82b0431275853c55f6485f7dd3115ab8bfc3e0fa6bd257a7d5f3f70236adb1a",
"source_hash": "57d7c0a25fbadd5a6c5a0aaf7cce171aacd2b908ef180851dd48c70d31ad7c18",
"last_changed_at": "2026-06-06T09:11:33.683Z",
"active_status": "active"
}
Parsed Structured
{
"language": "en",
"location": {
"raw": "San Francisco",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.75
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T20:28:41.982Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "San Francisco",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.75
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": null,
"workplace_type": "on_site",
"salary_currency": null
}
Extensions
{}
Native Structured
{
"id": "96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794",
"team": "Engineering",
"title": "Software Engineer - RL Environments ",
"jobUrl": "https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794",
"address": null,
"applyUrl": "https://jobs.ashbyhq.com/afterquery/96bad96c-d7ad-4dca-9f9d-a8ae3e6f2794/application",
"isListed": true,
"isRemote": false,
"location": "San Francisco",
"updatedAt": null,
"apiVersion": "ashby-non-user-graphql-v1",
"department": "Engineering",
"publishedAt": null,
"workplaceType": "OnSite",
"employmentType": "FullTime",
"secondaryLocations": []
}
Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/bc2056773ac18012e4fced7efa0ee287eee4e938?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/b64cd516-2208-4622-af86-9a55de63b104
GET https://api.bluedoor.sh/job-postings/v1/sources/5fa226fc-15e6-4e00-8f15-58a0ec03ad5a
GET https://api.bluedoor.sh/job-postings/v1/jobs/bc2056773ac18012e4fced7efa0ee287eee4e938/events
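The four GET requests for this page follow a simple pattern: a base path plus the job, org, and source IDs from the full job record. A minimal sketch of building them in Python is below. The helper name `build_requests` is hypothetical, and because this page does not state authentication, headers, or rate limits, the sketch only constructs the URLs and does not send any request.

```python
from urllib.parse import urlencode

# Base path taken from the URLs shown on this page.
BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def build_requests(job_id: str, org_id: str, source_id: str) -> list[str]:
    """Return the four GET URLs listed on this page, in the same order.

    Hypothetical helper for illustration; it is not part of any published client.
    """
    query = urlencode({"include": "description"})
    return [
        f"{BASE_URL}/jobs/{job_id}?{query}",   # job record with description
        f"{BASE_URL}/orgs/{org_id}",           # company (org) record
        f"{BASE_URL}/sources/{source_id}",     # source / board record
        f"{BASE_URL}/jobs/{job_id}/events",    # lifecycle events for the job
    ]


# IDs copied from the "Full job record" table on this page.
urls = build_requests(
    job_id="bc2056773ac18012e4fced7efa0ee287eee4e938",
    org_id="b64cd516-2208-4622-af86-9a55de63b104",
    source_id="5fa226fc-15e6-4e00-8f15-58a0ec03ad5a",
)

for url in urls:
    print("GET", url)
```

Any HTTP client can then issue the requests; whether an API key or other credentials are needed is not stated here and would have to be checked against the provider's documentation.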