Home › Companies › Handshake › Senior Product Manager, RL Environments — Handshake AI

Senior Product Manager, RL Environments — Handshake AI

Handshake · San Francisco, CA · On Site · Active · Ashby

Job facts

Field	Value
Company	Handshake
Title	Senior Product Manager, RL Environments — Handshake AI
Normalized title	-
Department / team	Product / Product, HAI Product
Location	San Francisco, CA, United States
Work model	On Site
Employment type	Full Time
Salary	-
Status	active
ATS provider	Ashby
Posted / first seen	— / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Handshake.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Ashby.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in San Francisco.	Open
Department jobs	Active postings in Product.	Open
Work model jobs	Active On Site postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Handshake
Source	c8988f28-6b0c-4b8a-90fd-39fa34301968
ATS provider	Ashby

Description

About Handshake Handshake is the career network for the AI economy. 20 million knowledge workers, 1,600 educational institutions, 1 million employers (including 100% of the Fortune 50), and every foundational AI lab trust Handshake to power career discovery, hiring, and upskilling, from freelance AI training gigs to first internships to full-time careers and beyond. This unique value is leading to unparalleled growth; in 2025, we tripled our ARR at scale. Why join Handshake now: Shape how every career evolves in the AI economy, at global scale, with impact your friends, family and peers can see and feel Work hand-in-hand with world-class AI labs, Fortune 500 partners and the world’s top educational institutions Join a team with leadership from Scale AI, Meta, xAI, Notion, Coinbase, and Palantir, among others Build a massive, fast-growing business with billions in revenue About the Role Handshake AI builds the training data that frontier labs use to push their models forward. A growing share of that work is in reinforcement learning: realistic, end-to-end environments where models can be evaluated and trained against real-world workflows. We’re hiring a Senior Product Manager to own the product surface that turns environment creation from a bespoke, weeks-long lift into a repeatable factory. Today, building a single RL environment is a substantial cross-team effort involving dozens of manual steps across operators and engineers, and depends on tribal knowledge across data sourcing, de-identification, synthetic data generation, tool building, packaging, and quality assurance. Frontier labs are asking for environments across many verticals at once, and the manual model doesn’t scale. Your job is to make it scale: design and ship the platform that compresses lead time, replaces hand-built workflows with self-serve tooling, and lets a small team of operators turn out high-quality environments for any vertical our customers prioritize. You’ll sit at the bridge between Operations and Engineering. Operators are running data pipelines locally, manually de-identifying datasets, manually QA’ing tools, and chasing customer deliveries. You’ll translate that work into a product roadmap, partner with our engineering leads on architecture and execution, and keep our research, GTM, and customer-facing teams aligned on what good looks like. This is a high-leverage, 0→1 role inside a fast-moving research-adjacent product space. You’ll work cross-functionally with Forward Deployed Engineering, Operations, Research, Design, and GTM, and your work will directly determine how many environments Handshake can ship, how fast, and at what quality bar. What you’ll own The Environment Factory. The end-to-end product experience for building and shipping an RL environment. Today this is a manual playbook; you’ll define and ship the platform that lets operators run many environments in parallel, with most steps in-product rather than off-platform. Tooling, packaging, and delivery. Drive the roadmap for the tool registry, environment packaging, and customer delivery so labs receive a portable, deployable environment that runs reliably in their own infrastructure. Reduce time-to-deliver and the rate of last-minute rework on the day of delivery. Quality at the frontier-lab bar. Own the leveling framework for environment quality (currently L1–L5 by vertical and persona) and the roadmap that gets priority verticals from L1 to L4+. Define and ship the QA tooling that turns environment, task, and rollout QA from a manual review into a productized check. Operator tooling. Operators are your primary users. Build the dashboards, in-product workflows, and self-serve flows that replace the manual work they do today from data transformation to environment QA to delivery cutoffs. Goals and metrics. Define and track targets including: environment lead time, environments delivered per quarter, % of in-platform vs. off-platform steps in environment creation, environment quality level by vertical, QA pass rate on environments/tasks/rollouts, tool registry coverage, and operator time per environment. Cross-functional partnership. Work with Engineering on architecture and execution, with Operations on workflow and pain points, with Research on what environments need to support post-training and verifier work, with Design on operator UX, and with GTM and customer teams on what verticals to prioritize and how to package what we build. Desired Capabilities 5+ years as a product manager shipping production code with engineering teams, ideally on platform, infrastructure, or developer-tools products. A track record building tools for internal users (operations, forward deployed engineers, data teams, or similar) where reducing manual work and supporting power users is the core of the job. Comfortable owning a product surface with many moving parts and dependencies (data pipelines, environment runtimes, tooling, QA, packaging) and sequencing roadmap work to unblock the biggest bottleneck next. Strong product instincts in ambiguous, fast-iterating spaces. You can scope a problem from a Slack thread to a PRD to a shipped feature in hours, not days, and you don’t wait for a rigid spec before moving. Data-informed and action-oriented. You use lead time, throughput, quality scores, and operator pain to prioritize, and you move quickly once the signal is clear. Comfortable acting as the connective tissue between Operations and Engineering, translating ops pain into engineering work, and engineering tradeoffs back into ops-readable plans. Bonus Points Background in reinforcement learning, frontier model training data, evaluations, or model post-training workflows. Experience as a PM on developer tools or developer platforms, where the bar for power-user UX is high. Familiarity with data pipelines, de-identification, synthetic data generation, or large-scale data QA. 0→1 experience standing up new product surfaces inside a fast-moving research-adjacent org without a fully formed playbook. Experience at a marketplace, gig platform, or human-data company; or exposure to AI/ML data pipelines or annotation workflows. We Offer Handshake delivers benefits that help you feel supported and thrive at work and in life. The below benefits are for full-time US employees. 🎯 Ownership: Equity in a fast-growing company 💰 Financial Wellness: 401(k) match, competitive compensation, financial coaching 🍼 Family Support: Paid parental leave, fertility benefits, parental coaching 💝 Wellbeing: Medical, dental, and vision, mental health support, $500 wellness stipend 📚 Growth: $2,000 learning stipend, ongoing development 💻 Remote & Office: Internet, commuting, and free lunch/gym in our SF office 🏝 Time Off: Flexible PTO, 15 holidays + 2 flex days 🤝 Connection: Team outings & referral bonuses Explore our mission, values, and comprehensive US benefits at joinhandshake.com/careers .

Full job record

Job ID	8d0ffb515194ea1c795cf86ec8470e7869b0e80f
Org ID	06eff2f8-72c2-4099-a963-44eed5abf3a7
Source ID	c8988f28-6b0c-4b8a-90fd-39fa34301968
Board ID	c8988f28-6b0c-4b8a-90fd-39fa34301968
Provider	ashby
Provider Job Key	00afb9fa-82b3-462d-8054-20cc266a20dc
Title	Senior Product Manager, RL Environments — Handshake AI
Normalized Title	—
Status	active
Active	yes
Location Text	San Francisco, CA
Department	Product
Team	Product, HAI Product
Employment Type	full_time
Workplace Type	on_site
Remote Policy	—
Country	United States
Region	CA
City	San Francisco
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://jobs.ashbyhq.com/handshake/00afb9fa-82b3-462d-8054-20cc266a20dc
Apply URL	https://jobs.ashbyhq.com/handshake/00afb9fa-82b3-462d-8054-20cc266a20dc/application
First Seen At	2026-05-29 06:43:08Z
Last Seen At	2026-06-06 09:32:27Z
Last Checked At	2026-06-06 09:32:27Z
Last Changed At	2026-05-29 06:43:08Z
Inactive At	—
Source Posted At	—
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=handshake/date=2026-06-06/2026-06-06T09-31-17-227Z-944fbff40746f1c3102328864bd2b6f102539a9cb4be0f80d4eadb69b546f990.json

Event Fields

{
  "content_hash": "9814481e4b5faed4c46ebdf77c543dd6811f3cba6f66f83b2c2012ebb0b0497c",
  "source_hash": "61493fdae195401c430ce204cbfd52841a19d6ccbc5104c4bc11f7d85bfb8f31",
  "last_changed_at": "2026-05-29T06:43:08.773Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco, CA",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.9
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:32:27.512Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco, CA",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "00afb9fa-82b3-462d-8054-20cc266a20dc",
  "team": "Product, HAI Product",
  "title": "Senior Product Manager, RL Environments — Handshake AI",
  "jobUrl": "https://jobs.ashbyhq.com/handshake/00afb9fa-82b3-462d-8054-20cc266a20dc",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/handshake/00afb9fa-82b3-462d-8054-20cc266a20dc/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco, CA",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Product",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": []
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/8d0ffb515194ea1c795cf86ec8470e7869b0e80f?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/06eff2f8-72c2-4099-a963-44eed5abf3a7JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/c8988f28-6b0c-4b8a-90fd-39fa34301968JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/8d0ffb515194ea1c795cf86ec8470e7869b0e80f/eventsJSON

Docs · Get an API key