Home › Companies › Sentry › Senior Software Engineer, AI Evals
Senior Software Engineer, AI Evals
Sentry · San Francisco, California · Hybrid · Active · Ashby
Job facts
| Field | Value |
|---|---|
| Company | Sentry |
| Title | Senior Software Engineer, AI Evals |
| Normalized title | - |
| Department / team | Engineering / Engineering |
| Location | San Francisco, CA, United States |
| Work model | Hybrid / Hybrid |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Ashby |
| Posted / first seen | — / 2026-05-29 |
| Changed / last seen | 2026-05-30 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Sentry. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Ashby. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in San Francisco. | Open |
| Department jobs | Active postings in Engineering. | Open |
| Work model jobs | Active Hybrid postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Sentry |
| Source | 696df9a9-c759-4107-8cc7-81d5fedc8194 |
| ATS provider | Ashby |
Description
About Sentry Software runs the world and the pace is faster than ever. Sentry helps developers fix errors and performance issues before users notice, so teams can spend less time firefighting and more time building.
Trusted by 200,000+ organizations, Sentry is today’s application monitoring standard and our team is building its AI-native future.
About the role As a Senior Software Engineer on Sentry’s AI/ML team, you’ll be responsible for building the evaluation infrastructure that measures the accuracy, reliability, and real-world performance of our AI systems. This role is critical to ensuring that our debugging agents and AI-powered features behave correctly, safely, and predictably as they scale. You’ll design datasets, benchmarks, and test harnesses that turn ambiguous AI behavior into measurable signals, helping the team ship AI with confidence.
In this role you will Design and build robust evaluation frameworks to measure accuracy, reliability, regressions, and edge cases in AI systems
Create and curate high-quality datasets, golden test cases, and benchmarks grounded in real production data
Build automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and agentic workflows
Partner closely with applied AI engineers and product leaders to define what “good” looks like and translate it into measurable criteria
Own the evaluation lifecycle for major AI initiatives, from early experimentation through production monitoring
You’ll love this job if you Care deeply about correctness, rigor, and measurement in AI systems
Enjoy turning fuzzy product goals and model behavior into concrete tests and metrics
Like building foundational infrastructure that unlocks faster iteration and higher confidence for the entire AI team
Thrive in cross-functional environments and enjoy influencing model design through better evaluation
Qualifications Minimum 5+ years of professional experience with a Bachelor’s degree in computer science, machine learning, or a related field
Experience building testing, evaluation, or data infrastructure for complex systems (AI/ML experience strongly preferred)
Comfort writing production-quality code (we use Python and TypeScript)
Experience working with structured and unstructured datasets, labeling workflows, or data quality pipelines
Familiarity with modern ML systems and evaluation techniques (e.g., offline metrics, online evaluation, regression testing for models or prompts)
Bonus: experience evaluating LLMs, agentic systems, or AI-assisted developer tools
The base salary range (or hourly wage range, if applicable) that Sentry reasonably expects to pay for this position is $240,000 to $280,000 USD . A successful candidate’s actual base salary (or hourly wage) amount will be determined by a variety of relevant factors including, without limitation, the candidate’s work location, education, work and other relevant experience, skills, and job-related knowledge. A successful candidate will be eligible to participate in Sentry’s employee benefit plans/programs applicable to the candidate’s position (including incentive compensation, equity grants, paid time off, and group health insurance coverage). See Sentry Benefits for more details about the Company’s benefit plans/programs.
Equal Opportunity at Sentry Sentry is committed to providing equal employment opportunities to its employees and candidates for employment regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other legally-protected characteristic. This commitment includes the provision of reasonable accommodations to employees and candidates for employment with physical or mental disabilities who require such accommodations in order to (a) perform the essential functions of their jobs, or (b) seek employment with Sentry. We strive to build a diverse team, with an inclusive culture where every teammate can thrive. Sentry is an open-source company because we believe that everyone, everywhere, should have the ability and tools to make great software. Software should be accessible. That starts with making our industry accessible.
If you need assistance or an accommodation due to a disability, you may contact us at [email protected] .
Want to learn more about how Sentry handles applicant data? Get the details in our Applicant Privacy Policy .
Full job record
| Job ID | 5ff00f2c0363c776c893c365ce699b95c0da3c11 |
| Org ID | 8ecde324-4397-4303-b47d-839763672b48 |
| Source ID | 696df9a9-c759-4107-8cc7-81d5fedc8194 |
| Board ID | 696df9a9-c759-4107-8cc7-81d5fedc8194 |
| Provider | ashby |
| Provider Job Key | 95d2eeab-291d-40ad-97a2-86b104f3c7ad |
| Title | Senior Software Engineer, AI Evals |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | San Francisco, California |
| Department | Engineering |
| Team | Engineering |
| Employment Type | full_time |
| Workplace Type | hybrid |
| Remote Policy | hybrid |
| Country | United States |
| Region | CA |
| City | San Francisco |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://jobs.ashbyhq.com/sentry/95d2eeab-291d-40ad-97a2-86b104f3c7ad |
| Apply URL | https://jobs.ashbyhq.com/sentry/95d2eeab-291d-40ad-97a2-86b104f3c7ad/application |
| First Seen At | 2026-05-29 05:53:11Z |
| Last Seen At | 2026-06-06 20:36:43Z |
| Last Checked At | 2026-06-06 20:36:43Z |
| Last Changed At | 2026-05-30 07:56:32Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=sentry/date=2026-06-06/2026-06-06T20-36-38-820Z-52884b675e9e9f7e9fd123cd4d60e38a478447c933b5ce26b42c29902dd82243.json |
Event Fields
{
"content_hash": "e831f902f80c76b9ab43ec11b3d9ea5d248c12b912e2c5c478359c7d8f0cef5b",
"source_hash": "d278a9ab490364402649f57863f662d0bc63b58f7a85ae9a6b6006ce49cee3ba",
"last_changed_at": "2026-05-30T07:56:32.827Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "San Francisco, California",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.85
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T20:36:43.870Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "San Francisco, California",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.85
},
"countries": [
"United States"
]
},
"remote_policy": "hybrid",
"salary_period": null,
"workplace_type": "hybrid",
"salary_currency": null
}Extensions
{}Native Structured
{
"id": "95d2eeab-291d-40ad-97a2-86b104f3c7ad",
"team": "Engineering ",
"title": "Senior Software Engineer, AI Evals",
"jobUrl": "https://jobs.ashbyhq.com/sentry/95d2eeab-291d-40ad-97a2-86b104f3c7ad",
"address": null,
"applyUrl": "https://jobs.ashbyhq.com/sentry/95d2eeab-291d-40ad-97a2-86b104f3c7ad/application",
"isListed": true,
"isRemote": false,
"location": "San Francisco, California",
"updatedAt": null,
"apiVersion": "ashby-non-user-graphql-v1",
"department": "Engineering ",
"publishedAt": null,
"workplaceType": "Hybrid",
"employmentType": "FullTime",
"secondaryLocations": []
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/5ff00f2c0363c776c893c365ce699b95c0da3c11?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/8ecde324-4397-4303-b47d-839763672b48JSONGET https://api.bluedoor.sh/job-postings/v1/sources/696df9a9-c759-4107-8cc7-81d5fedc8194JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/5ff00f2c0363c776c893c365ce699b95c0da3c11/eventsJSON