bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesReflectionaiForward Deployed Engineer, Lead - LLM Post-training

Forward Deployed Engineer, Lead - LLM Post-training

Reflectionai · New York · On Site · Active · Ashby

Job facts

FieldValue
CompanyReflectionai
TitleForward Deployed Engineer, Lead - LLM Post-training
Normalized title-
Department / teamApplied AI / Applied AI
LocationNew York, NY, United States
Work modelOn Site
Employment typeFull Time
Salary-
Statusactive
ATS providerAshby
Posted / first seen / 2026-05-29
Changed / last seen2026-05-29 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Reflectionai.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Ashby.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in New York.Open
Department jobsActive postings in Applied AI.Open
Work model jobsActive On Site postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyReflectionai
Sourcedde42094-6e1e-4abd-8700-6037f9147ed6
ATS providerAshby

Description

Our Mission Reflection’s mission is to build open superintelligence and make it accessible to all . We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from DeepMind, OpenAI, Google Brain, Meta, Character.AI, Anthropic and beyond. Role Overview We're seeking an exceptional technical leader to build and scale Reflection's post-training and evaluation capabilities within the Applied AI team. This team works at the intersection of model adaptation, sovereign deployment, and enterprise deployment: taking Reflection's open-weight models and making them work for specific customer domains, tasks, and constraints. As a Forward Deployed Engineer Lead, Post-Training, you will own the end-to-end technical strategy for model customization, from synthetic data generation and reward modeling through training and production deployment. You will work directly with customers to understand their needs and with research teams to push what's possible with our models. What You'll Do Lead post-training engagements with enterprise customers: assess their data, define training strategies, design reward signals and verifiers, prepare datasets, run training loops, and evaluate results against customer-specific benchmarks. Design and build RL training environments for model adaptation, including synthetic data generation pipelines, reward model training, and preference data collection workflows. Design and build evaluation infrastructure: define what "better" means for each customer use case, build eval harnesses, curate test sets, and establish baselines that measure real-world performance. Own the data pipeline from raw customer data through training-ready datasets, including synthetic data generation, data quality inspection, cleaning, and format standardization. Deploy post-trained models across hybrid environments (public cloud, VPC, and on-premises), working with infrastructure teams to ensure inference performance, cost efficiency, and reliability at scale. Shape and scale the post-training and evaluation practice by defining playbooks, best practices, and technical standards. Mentor engineers on the team and help define what great applied AI work looks like at Reflection. What We're Looking For Hands-on post-training experience with large language models at scale. You have built and operated RL training environments, designed preference optimization workflows on models at 50B+ parameter scale, and shipped the results to production. Experience building synthetic data generation pipelines, reward models, and verifiers for reinforcement learning workflows. You've architected the data and feedback loops that make post-training work. Deep understanding of evaluation methodology: how to design evaluations that measure what matters, how to interpret training dynamics, and how to tell the difference between a model that looks good on a benchmark and one that actually works. Practical experience with training infrastructure at scale: comfortable working with multi-node GPU clusters, managing large training runs, debugging distributed training, and optimizing for cost. Strong software engineering fundamentals. You write production-quality code, not just notebooks. Experience with data pipelines, version control for datasets and models, and reproducible workflows. 6+ years of engineering experience, including 2+ years focused on LLM post-training in a leadership capacity (e.g., Tech Lead on a post-training team, Senior MLE owning preference optimization for a product, or Lead Applied Scientist running RL training pipelines in production). Experience in customer-facing technical roles, or a genuine interest in developing this skill. In either case, you are comfortable translating domain requirements into training strategies and delivering measurable outcomes. Self-starter with high agency and ownership, excelling in fast-paced startup environments where playbooks are still being written. What We Offer: We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent-dense team. You will help define our future as a company, and help define the frontier of open foundational models. We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported. Top-tier compensation: Salary and equity structured to recognize and retain the best talent globally. Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance. Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning. Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time. Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off-sites and team celebrations.

Full job record

Job ID7f73f54512927b4d78a65da3fd91aceb83faa00c
Org ID83b4dbeb-3efd-46c3-bff0-8d3e2c88f32e
Source IDdde42094-6e1e-4abd-8700-6037f9147ed6
Board IDdde42094-6e1e-4abd-8700-6037f9147ed6
Providerashby
Provider Job Key1d029ec2-a842-4dff-b784-1328422c03e8
TitleForward Deployed Engineer, Lead - LLM Post-training
Normalized Title
Statusactive
Activeyes
Location TextNew York
DepartmentApplied AI
TeamApplied AI
Employment Typefull_time
Workplace Typeon_site
Remote Policy
CountryUnited States
RegionNY
CityNew York
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://jobs.ashbyhq.com/reflectionai/1d029ec2-a842-4dff-b784-1328422c03e8
Apply URLhttps://jobs.ashbyhq.com/reflectionai/1d029ec2-a842-4dff-b784-1328422c03e8/application
First Seen At2026-05-29 07:09:54Z
Last Seen At2026-06-06 09:36:38Z
Last Checked At2026-06-06 09:36:38Z
Last Changed At2026-05-29 07:09:54Z
Inactive At
Source Posted At
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=reflectionai/date=2026-06-06/2026-06-06T09-36-12-388Z-e87e15cfc907f71a6e75e1369d4f9b614f0e9bd012cb8bc4c91c0d043d46bf37.json
Event Fields
{
  "content_hash": "9fdf299ac81ac4ad93777fa49729697089e534a8951d6eae7cae7ee054a76a18",
  "source_hash": "49a2db877fc94bd863f9c143f8c71e5b2bf269bdee1d5886c70213e75575d3a7",
  "last_changed_at": "2026-05-29T07:09:54.591Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "New York",
    "city": "New York",
    "region": "NY",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:36:38.480Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "New York",
      "city": "New York",
      "region": "NY",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "1d029ec2-a842-4dff-b784-1328422c03e8",
  "team": "Applied AI",
  "title": "Forward Deployed Engineer, Lead - LLM Post-training",
  "jobUrl": "https://jobs.ashbyhq.com/reflectionai/1d029ec2-a842-4dff-b784-1328422c03e8",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/reflectionai/1d029ec2-a842-4dff-b784-1328422c03e8/application",
  "isListed": true,
  "isRemote": false,
  "location": "New York",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Applied AI",
  "publishedAt": null,
  "workplaceType": "OnSite",
  "employmentType": "FullTime",
  "secondaryLocations": [
    {
      "location": "San Francisco"
    }
  ]
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/7f73f54512927b4d78a65da3fd91aceb83faa00c?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/83b4dbeb-3efd-46c3-bff0-8d3e2c88f32eJSON
GET https://api.bluedoor.sh/job-postings/v1/sources/dde42094-6e1e-4abd-8700-6037f9147ed6JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/7f73f54512927b4d78a65da3fd91aceb83faa00c/eventsJSON