Home › Companies › Distyl › Applied AI Researcher, Benchmarking
Applied AI Researcher, Benchmarking
Distyl · San Francisco · Hybrid · Active · $150,000–$250,000 / year · Ashby
Job facts
| Field | Value |
|---|---|
| Company | Distyl |
| Title | Applied AI Researcher, Benchmarking |
| Normalized title | - |
| Department / team | Research / Research |
| Location | San Francisco, CA, United States |
| Work model | Hybrid / Hybrid |
| Employment type | Full Time |
| Salary | $150,000–$250,000 / year |
| Status | active |
| ATS provider | Ashby |
| Posted / first seen | — / 2026-05-29 |
| Changed / last seen | 2026-05-29 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Distyl. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Ashby. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in San Francisco. | Open |
| Department jobs | Active postings in Research. | Open |
| Work model jobs | Active Hybrid postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Distyl |
| Source | 3fc927b2-82f5-4664-a91c-e244867b5fea |
| ATS provider | Ashby |
Description
About Distyl AI Distyl is an applied AI technology company partnering with the world’s most ambitious institutions to rearchitect critical operations for the frontier of AI. Our customers include the largest companies in telecom, healthcare, insurance, manufacturing, consumer goods, and global social organizations.
We research and deploy technologies that power AI-native operations — both for our partners and for Distyl itself. Our work spans research into self-constructing systems, the development of the most reliable execution of AI systems, and products that transform mission-critical workflows. As a result, Distyl's technologies affect some of the world's largest operations — from hundreds of millions of consumer interactions to tens of millions of supply chain transactions and millions of patient journeys.
Distyl is backed by leading investors including Lightspeed Venture Partners, Khosla Ventures, Coatue, DST Global, and the board-members of 20+ F500s. The results reflect this approach: a 100% production deployment success rate for our customers and one of the few enterprise AI companies to run a profitable business.
What We Are Looking For At Distyl we’re pushing the envelope of AI utilization in enterprise. This requires creative researchers who don’t just want to drive incremental improvements on benchmarks or optimize an existing process but instead are looking to creatively redefine how software is used.
Our researchers come from many academic backgrounds but have strong research track records, operate in an AI-native way, and would be bored staying on the rails of a traditional research org.
Key Responsibilities The Benchmarking team defines how progress is measured. Researchers design evaluation frameworks that capture reasoning depth, interaction quality, reliability, and operational impact. They construct benchmarks that reflect real-world complexity. Their systems become the standard by which new architectures, techniques, and releases are judged.
Researchers in Benchmarking explore new paradigms for evaluating intelligent systems: adversarial robustness testing, longitudinal performance tracking, and human-in-the-loop assessment. They investigate how metrics shape model behavior and establish rigorous methodologies for quantifying emergent capability. Their insights drive both Distyl’s internal research priorities and industry-wide standards.
Who You Are Experience Designing and Running Evaluations: You’ve built or maintained benchmarks, test suites, or experimental frameworks to measure model or system performance
Statistical and Analytical Rigor: You design fair, reproducible experiments and can extract signal from noisy empirical results
Experience Building with Models, Not Just Building Models : We develop intelligent systems using models rather than training or fine-tuning them. Ideal candidates have expertise in compound AI systems, agentic collaboration, and associated techniques (ensembling, ReAct, graph-of-thoughts, etc.)
Proven Track Record of Research Results: Whether you’ve published in top journals, posted amazing work on twitter, or somewhere else we want to see what you've done
Uses AI Every Day: Before you can revolutionize someone else’s workflow, you need to revolutionize yours. You should be using tools like ChatGPT, Cursor, and Perplexity to accelerate your workflow
Strong Programming and Data Analysis Skills: While you might not consider yourself a software engineer you need to be able to build prototypes of your ideas and then perform the experiments to prove the effectiveness to a F500 Head of AI
Biases Towards Showing vs Telling: Our customers want to see the power of AI today vs discuss the most elegant idea that will take 5 years to realize
What We Offer The base salary range for this role is $150K – $250K, depending on experience, location, and level. In addition to base compensation, this role is eligible for meaningful equity, along with a comprehensive benefits package
100% covered medical, dental, and vision for employees and dependents
401(k) with additional perks (e.g., commuter benefits, in‑office lunch)
Access to state‑of‑the‑art models, generous usage of modern AI tools, and real‑world business problems
Ownership of high‑impact projects across top enterprises
A mission‑driven, fast‑moving culture that prizes curiosity, pragmatism, and excellence
Distyl has offices in San Francisco and New York. This role follows a hybrid collaboration model with 3+ days per week (Tuesday–Thursday) in‑office.
#LI-Hybrid
We believe diverse perspectives make our work stronger and more impactful. We are an equal opportunity employer and evaluate all applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, disability, veteran status, or any other legally protected characteristic. We encourage candidates from all backgrounds to apply.
Full job record
| Job ID | 02491d58ad5cf2d0deac0006ca9c2b62f2a0f3db |
| Org ID | a331ca53-bbf7-489c-a085-f9f8fe9e89fa |
| Source ID | 3fc927b2-82f5-4664-a91c-e244867b5fea |
| Board ID | 3fc927b2-82f5-4664-a91c-e244867b5fea |
| Provider | ashby |
| Provider Job Key | cf166cd0-c30f-41c8-8e43-fa38e323014d |
| Title | Applied AI Researcher, Benchmarking |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | San Francisco |
| Department | Research |
| Team | Research |
| Employment Type | full_time |
| Workplace Type | hybrid |
| Remote Policy | hybrid |
| Country | United States |
| Region | CA |
| City | San Francisco |
| Salary Raw | salary range for this role is $150K – $250K, depending on experience, location, and level |
| Salary Min | 150,000 |
| Salary Max | 250,000 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://jobs.ashbyhq.com/Distyl/cf166cd0-c30f-41c8-8e43-fa38e323014d |
| Apply URL | https://jobs.ashbyhq.com/Distyl/cf166cd0-c30f-41c8-8e43-fa38e323014d/application |
| First Seen At | 2026-05-29 06:18:20Z |
| Last Seen At | 2026-06-06 20:00:57Z |
| Last Checked At | 2026-06-06 20:00:57Z |
| Last Changed At | 2026-05-29 06:18:20Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=Distyl/date=2026-06-06/2026-06-06T20-00-55-301Z-0698ba93d10726e1a1e631e07080048d37fa73119f9d54658090606443aee9e7.json |
Event Fields
{
"content_hash": "41538c1c4f22d68200edf9fb1b23207166b2ccc66554bbb3a787bc30b9ca9af9",
"source_hash": "68b2f451b6e208e833919e2d51d35f97e8417b5511e154499acd3eb5feafba2a",
"last_changed_at": "2026-05-29T06:18:20.587Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "San Francisco",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.75
},
"salary_max": 250000,
"salary_min": 150000,
"inferred_at": "2026-06-06T20:00:57.729Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "San Francisco",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.75
},
"countries": [
"United States"
]
},
"remote_policy": "hybrid",
"salary_period": "year",
"workplace_type": "hybrid",
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"id": "cf166cd0-c30f-41c8-8e43-fa38e323014d",
"team": "Research",
"title": "Applied AI Researcher, Benchmarking",
"jobUrl": "https://jobs.ashbyhq.com/Distyl/cf166cd0-c30f-41c8-8e43-fa38e323014d",
"address": null,
"applyUrl": "https://jobs.ashbyhq.com/Distyl/cf166cd0-c30f-41c8-8e43-fa38e323014d/application",
"isListed": true,
"isRemote": false,
"location": "San Francisco",
"updatedAt": null,
"apiVersion": "ashby-non-user-graphql-v1",
"department": "Research",
"publishedAt": null,
"workplaceType": "Hybrid",
"employmentType": "FullTime",
"secondaryLocations": [
{
"location": "New York"
}
]
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/02491d58ad5cf2d0deac0006ca9c2b62f2a0f3db?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/a331ca53-bbf7-489c-a085-f9f8fe9e89faJSONGET https://api.bluedoor.sh/job-postings/v1/sources/3fc927b2-82f5-4664-a91c-e244867b5feaJSONGET https://api.bluedoor.sh/job-postings/v1/jobs/02491d58ad5cf2d0deac0006ca9c2b62f2a0f3db/eventsJSON