Home › Companies › Sayari › Staff Applied Scientist - AI Evaluation & Trust

Staff Applied Scientist - AI Evaluation & Trust

Sayari · Remote - US · Remote · Active · $195,000–$225,000 / year · Greenhouse

Job facts

Field	Value
Company	Sayari
Title	Staff Applied Scientist - AI Evaluation & Trust
Normalized title	-
Department / team	Engineering
Location	United States
Work model	Remote / Remote
Employment type	-
Salary	$195,000–$225,000 / year
Status	active
ATS provider	Greenhouse
Posted / first seen	2026-04-23 / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Sayari.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Greenhouse.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
Department jobs	Active postings in Engineering.	Open
Work model jobs	Active Remote postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Sayari
Source	55560423-5082-4bdc-b383-3ce4e2b2e1d4
ATS provider	Greenhouse

Description

About Sayari: Sayari is the judgment infrastructure for trustworthy AI in economic security and commercial risk. The Sayari Commercial World Model resolves 11.7B+ primary-source records from 250+ jurisdictions forming the ground truth of global commerce. A Judgment Ontology, encoding over a decade of investigative tradecraft, and Superconductor, an agentic orchestration platform, deliver AI that reasons like an expert analyst, shows its work, and traces every finding to its source. Trusted by U.S. Customs and Border Protection, HM Revenue & Customs, and Fortune 500 enterprises, Sayari is used by thousands of professionals across 35+ countries to secure supply chains and dismantle illicit networks. Headquartered in Washington, D.C., with offices in London, Singapore, Tokyo, and Tel Aviv. POSITION DESCRIPTION Sayari builds AI systems for high-consequence analytical work where being "wrong" carries real-world weight. We are looking for a Staff or Principal Applied Scientist to join our AI Innovation Group as the trusted expert on AI Evaluation and Trust. You will own the "Judgment Layer" of our system: building the specialized judge models, statistical benchmarks, and multi-turn frameworks that ensure our agents act with the high bar of trustworthiness required by our national security and enterprise customers. JOB RESPONSIBILITIES Lead the development of specialized "judge models," moving from general-purpose frontier models to architectures purpose-built for evaluation and failure mode detection. Design and execute rigorous scoring pipelines and empirical threshold calibrations for agentic systems, including multi-turn conversation and Graph RAG reasoning. Establish domain-specific evaluation frameworks that measure whether a system can perform the work of human experts rather than just passing general capability benchmarks. Own the full lifecycle of evaluation data, from designing annotation infrastructure and protocols to deploying evaluation services into production. Research and implement advanced techniques in Mixture-of-Experts (MoE) routing, expert specialization evaluation, and ensemble calibration. Collaborate cross-functionally with Product, Data Engineering, and the SVP of AI to translate complex statistical uncertainty into clear, actionable product signals. Act as a technical leader and "Scientific Conscience" within the AI pod, ensuring every AI-driven risk signal is backed by an empirical derivation story. SKILLS & EXPERIENCE Required: 10+ years of Machine Learning experience with a focus on Deep Neural Network activities, evaluating model performance & trust. 1-2+ years’ experience focused on post-training activities 1+ year experience creating benchmarks to evaluate LLMs Technical Mastery: Deep expertise in LLM-as-judge architectures, multi-turn evaluation, and Reinforcement Learning (RL/RLHF/RLAIF). Statistical Rigor: Mastery of statistics and experimental design, including significance testing, distribution analysis, and inter-rater reliability. Architectural Depth: Experience with Mixture-of-Experts (MoE) systems, routing behavior, and expert specialization. Builder Mindset: Proven ability to own the path from data collection to production deployment; we are a small team and every role is "hands-on." Domain Fluency: Understanding of Graph RAG and the unique challenges of evaluating non-deterministic, agentic workflows. Preferred: Judgment Task Models: Experience building, fine-tuning (LoRA, etc.), or pre-training models specifically for judgment, preference modeling, or classification tasks. Domain Context: Background in cognitive science, intelligence community tradecraft, or research literature on expert judgment under uncertainty. Infrastructure at Scale: Experience building or managing large-scale annotation infrastructure and quality assurance protocols. Academic/Research Track Record: A record of published research or recognized work in preference modeling or AI alignment. The target base salary for this position is $195,000-$225,000 plus company bonus and equity. Final offer amounts are determined by multiple factors including location, local market variances, candidate experience and expertise, internal peer equity, and may vary from the amounts listed above. Benefits: 100% fully paid medical, vision, and dental for employees and their dependents Generous time off; we observe all US federal holidays, close our office for a winter break (12/24-12/31), in addition to granting 18 PTO days and 10 sick days Outstanding compensation package; competitive commissions for revenue roles and bonuses for non-revenue positions A strong commitment to diversity, equity, and inclusion Eligibility to participate in additional benefits such as 401k match up to 5%, 100% paid life insurance (up to $100,000 coverage),, and parental leave A collaborative and positive culture - your team will be as smart and driven as you Limitless growth and learning opportunities Sayari is an equal opportunity employer and strongly encourages diverse candidates to apply. We believe diversity and inclusion mean our team members should reflect the diversity of the United States. No employee or applicant will face discrimination or harassment based on race, color, ethnicity, religion, age, gender, gender identity or expression, sexual orientation, disability status, veteran status, genetics, or political affiliation. We strongly encourage applicants of all backgrounds to apply. Pay Range $195,000 — $225,000 USD

Full job record

Job ID	4f1d749f7e9ee055242273fa35ff7a6cba077999
Org ID	d91325fb-4b3f-44a8-9b1a-3fa7cb91556e
Source ID	55560423-5082-4bdc-b383-3ce4e2b2e1d4
Board ID	55560423-5082-4bdc-b383-3ce4e2b2e1d4
Provider	greenhouse
Provider Job Key	4222375009
Title	Staff Applied Scientist - AI Evaluation & Trust
Normalized Title	—
Status	active
Active	yes
Location Text	Remote - US
Department	Engineering
Team	—
Employment Type	—
Workplace Type	remote
Remote Policy	remote
Country	United States
Region	—
City	—
Salary Raw	Pay Range $195,000 — $225,000 USD
Salary Min	195,000
Salary Max	225,000
Salary Currency	USD
Salary Period	year
Source URL	https://job-boards.greenhouse.io/sayari/jobs/4222375009
Apply URL	https://job-boards.greenhouse.io/sayari/jobs/4222375009
First Seen At	2026-05-29 22:41:24Z
Last Seen At	2026-06-06 20:19:14Z
Last Checked At	2026-06-06 20:19:14Z
Last Changed At	2026-05-29 22:41:24Z
Inactive At	—
Source Posted At	2026-04-23 16:06:47Z
Source Updated At	2026-05-07 15:28:16Z
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=sayari/date=2026-06-06/2026-06-06T20-19-14-390Z-e96a38776d86dca93e7522f733cf6e28742fe86cf68dcdd61e93194c67681323.json

Event Fields

{
  "content_hash": "ef776cd2cad4247caadbbe2e57ba6fc15eb2bb3c8b316785cfcc5274756e356d",
  "source_hash": "c779049809d27862c1a2970906012f72af1105fbf2f2b8e2632b22169b430f07",
  "last_changed_at": "2026-05-29T22:41:24.084Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Remote - US",
    "city": null,
    "region": null,
    "country": "United States",
    "is_remote": true,
    "confidence": 0.95
  },
  "salary_max": 225000,
  "salary_min": 195000,
  "inferred_at": "2026-06-06T20:19:14.539Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Remote - US",
      "city": null,
      "region": null,
      "country": "United States",
      "is_remote": true,
      "confidence": 0.95
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": "year",
  "workplace_type": "remote",
  "salary_currency": "USD"
}

Extensions

{}

Native Structured

{
  "title": "Staff Applied Scientist - AI Evaluation & Trust",
  "offices": [
    {
      "id": 4006854009,
      "name": "US",
      "location": null,
      "child_ids": [
        4006845009,
        4006846009
      ],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Remote - US"
  },
  "metadata": [],
  "updated_at": "2026-05-07T11:28:16-04:00",
  "departments": [
    {
      "id": 4006854009,
      "name": "Engineering",
      "child_ids": [
        4006857009,
        4006856009,
        4006855009,
        4006858009
      ],
      "parent_id": null
    }
  ],
  "company_name": "Sayari",
  "requisition_id": 4129959009,
  "first_published": "2026-04-23T12:06:47-04:00",
  "application_deadline": null
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/4f1d749f7e9ee055242273fa35ff7a6cba077999?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/d91325fb-4b3f-44a8-9b1a-3fa7cb91556eJSON

GET https://api.bluedoor.sh/job-postings/v1/sources/55560423-5082-4bdc-b383-3ce4e2b2e1d4JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/4f1d749f7e9ee055242273fa35ff7a6cba077999/eventsJSON

Docs · Get an API key