bluedoor data · Job Postings API · bluedoor.sh

Applied Research - RL & Agents

PrimeIntellect · San Francisco · Remote · Active · $150k-300k · Ashby

Job facts

Field | Value
Company | PrimeIntellect
Title | Applied Research - RL & Agents
Normalized title | -
Department / team | Applied Research / Applied Research
Location | San Francisco, CA, United States
Work model | Remote / Remote
Employment type | Full Time
Salary | $150k-300k + equity (parsed minimum stored as 150)
Status | active
ATS provider | Ashby
Posted / first seen | - / 2026-05-29
Changed / last seen | 2026-05-29 / 2026-06-06

Related slices

Page | What it contains
Company jobs | Active postings from PrimeIntellect.
Company breakdowns | Role, location, ATS, and work model facets for this company.
ATS provider jobs | Active postings observed through Ashby.
Provider filtered search | The same provider as a filtered job collection.
City jobs | Active postings in San Francisco.
Department jobs | Active postings in Applied Research.
Work model jobs | Active Remote postings.
Lifecycle events | Open, update, close, and reopen events for this posting.
Original posting | Canonical source or apply URL captured from the ATS.

Linked records

Company: PrimeIntellect
Source: 9c0c9bfd-dba4-4785-896a-61bdcef82c26
ATS provider: Ashby

Description

Be Your Own Lab

Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL and SFT to tool use, agent workflows, and deployment. We validate everything by using it ourselves, training open state-of-the-art models on the same stack we put in your hands. We're looking for people who want to build at the intersection of frontier research and real infrastructure.

We recently raised $15 million in funding ($20 million raised in total) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Hugging Face), Emad Mostaque (Stability AI), and many others.

Role Impact

This is a role at the intersection of cutting-edge RL/post-training methods and applied agent systems. You'll have a direct impact on shaping how advanced models are aligned, deployed, and used in the real world by:

- Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads such as workflow automation, reasoning-intensive tasks, and decision-making at scale.
- Building Robust Infrastructure: Developing the systems and frameworks that enable these agents to operate reliably, efficiently, and at massive scale.
- Bridging Applications & Research: Translating ambiguous objectives into clear technical requirements that guide product and research priorities.
- Prototyping in the Field: Rapidly designing and deploying agents, evals, and harnesses for real-world tasks to validate solutions.

Application-Driven Research & Infrastructure

- Shape the direction and feature set for verifiers, the Environments Hub, training services, and other research platform offerings.
- Build high-quality examples, reference implementations, and "recipes" that make it easy for others to extend the stack.
- Prototype agents and eval harnesses tailored to real-world use cases and external systems.
- Pair with technical end users (research teams, infra-heavy customers, open-source contributors) to design environments, evals, and verifiers that reflect real workloads.

Post-training & Reinforcement Learning

- Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks.
- Build evaluations and harnesses to measure reasoning, robustness, and agentic behavior in real-world workflows.
- Prototype multi-agent and memory-augmented systems to expand capabilities for downstream applications.
- Experiment with post-training recipes to optimize downstream performance.

Agent Development & Infrastructure

- Rapidly prototype and iterate on AI agents for automation, workflow orchestration, and decision-making.
- Extend and integrate with agent frameworks to support evolving feature requests and performance requirements.
- Architect and maintain distributed training/inference pipelines, ensuring scalability and cost efficiency.
- Develop observability and monitoring (Prometheus, Grafana, tracing) to ensure reliability and performance in production deployments.

Requirements

- Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment.
- Experience with agent frameworks and tooling (e.g., DSPy, LangGraph, MCP, Stagehand).
- Familiarity with distributed training/inference frameworks (e.g., vLLM, sglang, Accelerate, Ray, Torch).
- Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL.
- Passion for advancing the state of the art in reasoning and building practical, agentic AI systems.
- Strong technical writing abilities (documentation, blogs, papers) and research taste.
- Eagerness to drive collaborations with external partners and engage with the broader open-source community.

Nice-to-Haves

- Experience with web programming (React, TypeScript, Next.js).
- Experience running LLM evaluations and/or synthetic data generation.
- Experience deploying containerized systems at scale (Docker, Kubernetes, Terraform).

What We Offer

- Cash compensation range of $150k-300k, plus equity incentives
- Flexible work (San Francisco or hybrid-remote)
- Visa sponsorship & relocation support
- Professional development budget
- Team off-sites & conference attendance

Growth Opportunity

You'll join a mission-driven team working at the frontier of open superintelligence infra. In this role, you'll have the opportunity to:

- Shape the evolution of agent-driven solutions, from research breakthroughs to production systems used by real customers.
- Collaborate with leading researchers, engineers, and partners pushing the boundaries of RL and post-training.
- Grow with a fast-moving organization where your contributions directly influence both the technical direction and the broader AI ecosystem.

If you're excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we'd love to hear from you.

Ready to build the open superintelligence infrastructure of tomorrow? Apply now to help us make powerful, open AGI accessible to everyone.

Full job record

Job ID: 11394385dc97cfc0c354392b1291e38d5b2ed1c2
Org ID: 808b938c-f7db-4fc1-9a66-c9446d88ce16
Source ID: 9c0c9bfd-dba4-4785-896a-61bdcef82c26
Board ID: 9c0c9bfd-dba4-4785-896a-61bdcef82c26
Provider: ashby
Provider Job Key: 46d9d060-5f48-4491-848f-bafbeb3a4325
Title: Applied Research - RL & Agents
Normalized Title: -
Status: active
Active: yes
Location Text: San Francisco
Department: Applied Research
Team: Applied Research
Employment Type: full_time
Workplace Type: remote
Remote Policy: remote
Country: United States
Region: CA
City: San Francisco
Salary Raw: Compensation Range of $150-300k + equity incentives Flexible Work (San Francisco or hybrid-remote) Visa Sp
Salary Min: 150
Salary Max: -
Salary Currency: USD
Salary Period: -
Source URL: https://jobs.ashbyhq.com/PrimeIntellect/46d9d060-5f48-4491-848f-bafbeb3a4325
Apply URL: https://jobs.ashbyhq.com/PrimeIntellect/46d9d060-5f48-4491-848f-bafbeb3a4325/application
First Seen At: 2026-05-29 06:27:20Z
Last Seen At: 2026-06-06 09:18:23Z
Last Checked At: 2026-06-06 09:18:23Z
Last Changed At: 2026-05-29 06:27:20Z
Inactive At: -
Source Posted At: -
Source Updated At: -
Raw Payload Uri: s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=PrimeIntellect/date=2026-06-06/2026-06-06T09-18-04-605Z-74b53c5c2569979137d1c7e833c3645fd01337e4caae8ff21e8cf6ed90efb075.json
Event Fields
{
  "content_hash": "80b540f1080591272f03d39a1fc266df3f691d0c1158944a3245607c5d7a58dc",
  "source_hash": "d71f64cccea15b736972ccb2be7f09483bbb54d6a17faef2ef1185206afe48c5",
  "last_changed_at": "2026-05-29T06:27:20.641Z",
  "active_status": "active"
}
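
The Event Fields pair a content_hash with last_changed_at, and the record's Last Changed At (2026-05-29) is older than its Last Seen At (2026-06-06). The sketch below shows one way a crawler could produce that split: it refreshes the seen timestamp on every observation but moves the changed timestamp only when the hash differs. This is an illustration under assumptions, not bluedoor's documented method. The source does not name the hash algorithm or the hashed fields; SHA-256 over canonical JSON is assumed here, and `content_hash` and `observe` are hypothetical helper names.

```python
import hashlib
import json


def content_hash(fields):
    """Hash a dict of posting fields deterministically.

    Assumption: SHA-256 over key-sorted, compact JSON. The source shows
    64-hex-character hashes but does not say how they are computed.
    """
    canonical = json.dumps(
        fields, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def observe(record, fields, seen_at):
    """Return a copy of `record` updated after one crawl observation.

    last_seen_at always advances; last_changed_at advances only when the
    content hash differs from the stored one.
    """
    new_hash = content_hash(fields)
    updated = dict(record)
    updated["last_seen_at"] = seen_at
    if record.get("content_hash") != new_hash:
        updated["content_hash"] = new_hash
        updated["last_changed_at"] = seen_at
    return updated


# Two observations of identical content: only last_seen_at moves.
posting = {"title": "Applied Research - RL & Agents", "location": "San Francisco"}
first = observe({}, posting, "2026-05-29T06:27:20Z")
second = observe(first, posting, "2026-06-06T09:18:23Z")
```

Under this scheme an unchanged posting keeps its original last_changed_at no matter how many times it is re-crawled, which matches the timestamps shown in this record.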
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": true,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": 150,
  "inferred_at": "2026-06-06T09:18:23.249Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": true,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": null,
  "workplace_type": "remote",
  "salary_currency": "USD"
}
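
The parsed record stores salary_min as 150 and salary_max as null, while the Salary Raw text reads "$150-300k". The sketch below shows one way to read such a range so that a "k" suffix on the upper bound also scales the lower bound. It is an illustration, not the parser used to build this record; `parse_salary_range` is a hypothetical name, and treating a single suffix as shared by both bounds is an assumption.

```python
import re

# Matches ranges such as "$150-300k", "$150k-$300k", or "150000-300000".
_RANGE = re.compile(
    r"\$?\s*(\d+(?:\.\d+)?)\s*([kK])?\s*[-–]\s*\$?\s*(\d+(?:\.\d+)?)\s*([kK])?"
)


def parse_salary_range(text):
    """Return (minimum, maximum) as integers, or None if no range is found.

    Assumption: a "k" suffix on either bound applies to both, so
    "$150-300k" is read as 150,000 to 300,000.
    """
    match = _RANGE.search(text)
    if match is None:
        return None
    low, low_k, high, high_k = match.groups()
    multiplier = 1000 if (low_k or high_k) else 1
    return round(float(low) * multiplier), round(float(high) * multiplier)


raw = "Compensation Range of $150-300k + equity incentives"
salary_range = parse_salary_range(raw)
```

Applied to the raw string in this record, the function yields a minimum of 150,000 and a maximum of 300,000, which agrees with the compensation range stated in the description.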
Extensions
{}
Native Structured
{
  "id": "46d9d060-5f48-4491-848f-bafbeb3a4325",
  "team": "Applied Research",
  "title": "Applied Research - RL & Agents",
  "jobUrl": "https://jobs.ashbyhq.com/PrimeIntellect/46d9d060-5f48-4491-848f-bafbeb3a4325",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/PrimeIntellect/46d9d060-5f48-4491-848f-bafbeb3a4325/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Applied Research",
  "publishedAt": null,
  "workplaceType": null,
  "employmentType": "FullTime",
  "secondaryLocations": []
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/11394385dc97cfc0c354392b1291e38d5b2ed1c2?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/808b938c-f7db-4fc1-9a66-c9446d88ce16
GET https://api.bluedoor.sh/job-postings/v1/sources/9c0c9bfd-dba4-4785-896a-61bdcef82c26
GET https://api.bluedoor.sh/job-postings/v1/jobs/11394385dc97cfc0c354392b1291e38d5b2ed1c2/events
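
The four requests above can be assembled from the job, org, and source IDs in the full job record. The following is a minimal Python sketch using only the standard library. The URL paths come from this page; `job_record_urls` and `fetch_json` are hypothetical helper names, and whether the API requires an API key or other headers is not stated here, so the optional `headers` argument is an assumption.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def job_record_urls(job_id, org_id, source_id):
    """Build the four request URLs listed on this page, keyed by resource."""
    query = urlencode({"include": "description"})
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?{query}",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(url, headers=None, timeout=10):
    """GET a URL and decode the JSON body.

    Assumption: any authentication would be passed via `headers`; the page
    does not document an auth scheme.
    """
    request = Request(url, headers={"Accept": "application/json", **(headers or {})})
    with urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


urls = job_record_urls(
    job_id="11394385dc97cfc0c354392b1291e38d5b2ed1c2",
    org_id="808b938c-f7db-4fc1-9a66-c9446d88ce16",
    source_id="9c0c9bfd-dba4-4785-896a-61bdcef82c26",
)
# To retrieve the records (network access required):
#   records = {name: fetch_json(url) for name, url in urls.items()}
```

Building the URLs separately from fetching keeps the sketch checkable without network access; the response shape is not shown on this page beyond the fields rendered above, so the sketch returns the decoded JSON untouched.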