Home › Companies › PrimeIntellect › Applied Research - Evals & Data

Applied Research - Evals & Data

PrimeIntellect · San Francisco · Hybrid · Active · $150 · Ashby

Job facts

Field	Value
Company	PrimeIntellect
Title	Applied Research - Evals & Data
Normalized title	-
Department / team	Applied Research / Applied Research
Location	San Francisco, CA, United States
Work model	Hybrid / Hybrid
Employment type	Full Time
Salary	$150
Status	active
ATS provider	Ashby
Posted / first seen	— / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from PrimeIntellect.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Ashby.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in San Francisco.	Open
Department jobs	Active postings in Applied Research.	Open
Work model jobs	Active Hybrid postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	PrimeIntellect
Source	9c0c9bfd-dba4-4785-896a-61bdcef82c26
ATS provider	Ashby

Description

Be Your Own Lab Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, from RL and SFT to tool use, agent workflows, and deployment. We validate everything by using it ourselves, training open state-of-the-art models on the same stack we put in your hands. We're looking for people who want to build at the intersection of frontier research and real infrastructure. We recently raised $15mm in funding (total of $20mm raised) led by Founders Fund, with participation from Menlo Ventures and prominent angels including Andrej Karpathy (Eureka AI, Tesla, OpenAI), Tri Dao (Chief Scientific Officer of Together AI), Dylan Patel (SemiAnalysis), Clem Delangue (Huggingface), Emad Mostaque (Stability AI) and many others. Role Impact This is a customer facing role at the intersection of cutting-edge RL/post-training methods, applied data, and agent systems. You’ll have a direct impact on shaping how advanced models are aligned, evaluated, deployed, and used in the real world by: Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads—workflow automation, reasoning-intensive tasks, and decision-making at scale. Working with applied data from real deployments to continuously refine policies, improve reasoning, and enhance reliability and safety. Building Robust Infrastructure: Developing the distributed systems, evaluation pipelines, and coordination frameworks that enable these agents to operate reliably, efficiently, and at massive scale. Building data capture, processing, and versioning workflows for feedback, model traces, and reward signals. Bridge Between Customers & Research: Translating customer needs and insights from applied data into clear technical requirements that guide product and research priorities. Collaborating closely with RL and eval teams to ensure real-world signals inform model alignment and reward shaping. Prototype in the Field: Rapidly designing and deploying agents, evals, and harnesses alongside customers to validate solutions. Using applied evaluation data to iterate on model performance and discover new capabilities. Customer-Facing Engineering Work side-by-side with customers to deeply understand workflows, data sources, and bottlenecks. Prototype agents, data pipelines, and eval harnesses tailored to real use cases, then hand off hardened systems to core teams. Translate customer insights and evaluation results into roadmap and research direction. Post-training & Reinforcement Learning Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks. Build evaluation harnesses and verifiers to measure reasoning, robustness, and agentic behavior in real-world workflows. Integrate applied data collection and analytics into the post-training process to surface regressions, emergent skills, and alignment opportunities. Prototype multi-agent and memory-augmented systems to expand capabilities for customer-facing solutions. Agent Development & Infrastructure Rapidly prototype and iterate on AI agents for automation, workflow orchestration, and decision-making. Extend and integrate with agent frameworks to support evolving feature requests and performance requirements. Architect and maintain distributed training and inference pipelines, ensuring scalability and cost efficiency. Develop observability and monitoring (Prometheus, Grafana, tracing) to ensure reliability and performance in production deployments. Requirements Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment. Experience with applied data workflows and evaluation frameworks for large models or agents (e.g., SWE-Bench, HELM, EvalFlow, internal eval pipelines). Deep expertise in distributed training/inference frameworks (e.g., vLLM, sglang, Ray, Accelerate). Experience deploying containerized systems at scale (Docker, Kubernetes, Terraform). Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL. Passion for advancing the state-of-the-art in reasoning, measurement, and building practical, agentic AI systems. What We Offer Cash Compensation Range of $150-300k + equity incentives Flexible Work (remote or San Francisco) Visa Sponsorship & relocation support Professional Development budget Team Off-sites & conference attendance Growth Opportunity You’ll join a mission-driven team working at the frontier of open, superintelligence infra. In this role, you’ll have the opportunity to: Shape the evolution of agent-driven, data-informed solutions—from research breakthroughs to production systems used by real customers. Collaborate with leading researchers, engineers, and partners pushing the boundaries of RL, evaluation, and post-training. Grow with a fast-moving organization where your contributions directly influence both the technical direction and the broader AI ecosystem. If you’re excited to move fast, build boldly, and help define how agentic AI is developed and deployed, we’d love to hear from you. Ready to build the open superintelligence infrastructure of tomorrow? Apply now to help us make powerful, open AGI accessible to everyone.

Full job record

Job ID	25d5efcc19edc438edcd3338713f7fd922b4d3bf
Org ID	808b938c-f7db-4fc1-9a66-c9446d88ce16
Source ID	9c0c9bfd-dba4-4785-896a-61bdcef82c26
Board ID	9c0c9bfd-dba4-4785-896a-61bdcef82c26
Provider	ashby
Provider Job Key	bbfe94a6-d1a8-47e9-86af-f117277cdacb
Title	Applied Research - Evals & Data
Normalized Title	—
Status	active
Active	yes
Location Text	San Francisco
Department	Applied Research
Team	Applied Research
Employment Type	full_time
Workplace Type	hybrid
Remote Policy	hybrid
Country	United States
Region	CA
City	San Francisco
Salary Raw	Compensation Range of $150-300k + equity incentives Flexible Work (remote or San Francisco) Visa Sponsorsh
Salary Min	150
Salary Max	—
Salary Currency	USD
Salary Period	—
Source URL	https://jobs.ashbyhq.com/PrimeIntellect/bbfe94a6-d1a8-47e9-86af-f117277cdacb
Apply URL	https://jobs.ashbyhq.com/PrimeIntellect/bbfe94a6-d1a8-47e9-86af-f117277cdacb/application
First Seen At	2026-05-29 06:27:20Z
Last Seen At	2026-06-06 09:18:23Z
Last Checked At	2026-06-06 09:18:23Z
Last Changed At	2026-05-29 06:27:20Z
Inactive At	—
Source Posted At	—
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=PrimeIntellect/date=2026-06-06/2026-06-06T09-18-04-605Z-74b53c5c2569979137d1c7e833c3645fd01337e4caae8ff21e8cf6ed90efb075.json

Event Fields

{
  "content_hash": "39a57620248145d333826112ea6f1a432ca6175c7b5b6a71c3ff34381bf0fea9",
  "source_hash": "02a2f2878fe1fb5b07567c4df0ed9646a7fe90d89ba5a428414646de16b7d1fd",
  "last_changed_at": "2026-05-29T06:27:20.641Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": 150,
  "inferred_at": "2026-06-06T09:18:23.244Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": null,
  "workplace_type": "hybrid",
  "salary_currency": "USD"
}

Extensions

{}

Native Structured

{
  "id": "bbfe94a6-d1a8-47e9-86af-f117277cdacb",
  "team": "Applied Research",
  "title": "Applied Research - Evals & Data",
  "jobUrl": "https://jobs.ashbyhq.com/PrimeIntellect/bbfe94a6-d1a8-47e9-86af-f117277cdacb",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/PrimeIntellect/bbfe94a6-d1a8-47e9-86af-f117277cdacb/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Applied Research",
  "publishedAt": null,
  "workplaceType": "Hybrid",
  "employmentType": "FullTime",
  "secondaryLocations": []
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/25d5efcc19edc438edcd3338713f7fd922b4d3bf?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/808b938c-f7db-4fc1-9a66-c9446d88ce16JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/9c0c9bfd-dba4-4785-896a-61bdcef82c26JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/25d5efcc19edc438edcd3338713f7fd922b4d3bf/eventsJSON

Docs · Get an API key