Home › Companies › Middesk › Lead Data Scientist
Lead Data Scientist
Middesk · San Francisco · Hybrid · Active · Ashby
Job facts
| Field | Value |
|---|---|
| Company | Middesk |
| Title | Lead Data Scientist |
| Normalized title | - |
| Department / team | Data Science / Data Science |
| Location | San Francisco, CA, United States |
| Work model | Hybrid / Hybrid |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Ashby |
| Posted / first seen | — / 2026-05-29 |
| Changed / last seen | 2026-05-29 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Middesk. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Ashby. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in San Francisco. | Open |
| Department jobs | Active postings in Data Science. | Open |
| Work model jobs | Active Hybrid postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Middesk |
| Source | 5721d77a-754a-4bac-a37f-266f87766839 |
| ATS provider | Ashby |
Description
About Middesk: Middesk makes it easier for businesses to work together. Since 2018, we’ve been transforming business identity verification, replacing slow, manual processes with seamless access to complete, up-to-date data. Our platform helps companies across industries confidently verify business identities, onboard customers faster, and reduce risk at every stage of the customer lifecycle.
Middesk came out of Y Combinator, is backed by Sequoia Capital and Accel Partners, and was recently named to Forbes Fintech 50 List.
About The Role: We are actively building AI-driven applications that streamline customer workflows, focusing on business onboarding. With our proprietary identity data assets and deep domain expertise, we are uniquely positioned to expand into a broader set of AI-powered solutions that drive long-term growth.
We’re looking for a hands-on applied ML expert to help build the technical foundation for these efforts. Ideally you have shipped external-facing models in the risk/fraud space and know the messy realities of imbalanced data, low labels, and changing behavior. This is a highly technical, hands-on role with wide influence on how we design, build, and scale ML at Middesk.
We follow a hybrid work model, and for this role, there is an expectation of 2 days per week in our SF/NYC office. Candidates should be based within a commutable distance, as we believe in the value of in-person collaboration and building strong team connections while also supporting flexibility where possible.
What You'll Do: Build risk & fraud ML applications: Deliver production ML models in fraud, trust & safety, KYB, and compliance domains, with measurable impact on customer workflows.
Tackle hard data problems: Work on classification problems with extreme class imbalance, sparse signals, and “cold start” label challenges.
Innovate in feature engineering & labeling: Use graph-based techniques, weak supervision, LLMs, and AI agents to improve signal extraction and automate labeling process.
Establish ML infrastructure foundations: Partner with the ML infra team to design feature services, model training pipeline, model serving standards, and orchestration to scale multiple ML use cases.
Design and implement knowledge graph solutions: Leveraging LLMs for graph construction, querying, and retrieval to enhance entity resolution and business identity use cases.
What We're Looking For: 7+ years of production ML experience in one or more of the following areas:
Building Production ML for risk, fraud, credit, or trust & safety: Track record of shipping external-facing ML applications in one or more of these domains.
Knowledge graph applications: Hands-on experience building, querying, or extracting signals from knowledge graphs—ideally over business entity networks (companies, persons, addresses, relationships) to support identity verification, fraud detection, or risk decisioning.
Entity resolution for business or individual identities: Experience disambiguating and linking records across noisy, incomplete, or conflicting data sources—particularly in KYB, KYC, AML, or identity verification contexts where the same real-world entity may appear under different names, addresses, or tax IDs.
Expertise in classification with real-world ML challenges, for example: imbalanced labels, sparse signals, cold start, and production version management.
Hands-on ML infrastructure experience: feature stores, model management, ML training/serving pipelines.
Comfort as a senior IC: setting technical direction, mentoring peers, and establishing best practices.
Nice-To Have: B2B SaaS experience, ideally building ML products for enterprise customers.
ML pipeline and automation engineering: Experience building end-to-end training harnesses that automate feature engineering, data validation, and model training.
Experience scaling ML across multiple products or risk domains.
Full job record
| Job ID | 3e63cb0a5058ea5348292e65200f0493702fba62 |
| Org ID | 7ad28c28-64e9-4e1e-8dfc-085fb5254f90 |
| Source ID | 5721d77a-754a-4bac-a37f-266f87766839 |
| Board ID | 5721d77a-754a-4bac-a37f-266f87766839 |
| Provider | ashby |
| Provider Job Key | 7fd5aeb2-c33c-4f0a-ad3d-2161201bc174 |
| Title | Lead Data Scientist |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | San Francisco |
| Department | Data Science |
| Team | Data Science |
| Employment Type | full_time |
| Workplace Type | hybrid |
| Remote Policy | hybrid |
| Country | United States |
| Region | CA |
| City | San Francisco |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://jobs.ashbyhq.com/middesk/7fd5aeb2-c33c-4f0a-ad3d-2161201bc174 |
| Apply URL | https://jobs.ashbyhq.com/middesk/7fd5aeb2-c33c-4f0a-ad3d-2161201bc174/application |
| First Seen At | 2026-05-29 05:44:36Z |
| Last Seen At | 2026-06-06 20:24:05Z |
| Last Checked At | 2026-06-06 20:24:05Z |
| Last Changed At | 2026-05-29 05:44:36Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=middesk/date=2026-06-06/2026-06-06T20-24-04-226Z-ef7a281197307bc4ed75314670c0ed597587f3733559c851d326285abe8e70a9.json |
Event Fields
{
"content_hash": "f1e147eb6aff862e413e3f74360dcc6bab5817fca04700ea9328dee4ae4877f4",
"source_hash": "b64bfd9938e46b6c11570d4d5024f592ab013200c71bcb78cd0b72416115c705",
"last_changed_at": "2026-05-29T05:44:36.285Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "San Francisco",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.75
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T20:24:05.813Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "San Francisco",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.75
},
"countries": [
"United States"
]
},
"remote_policy": "hybrid",
"salary_period": null,
"workplace_type": "hybrid",
"salary_currency": null
}Extensions
{}Native Structured
{
"id": "7fd5aeb2-c33c-4f0a-ad3d-2161201bc174",
"team": "Data Science",
"title": "Lead Data Scientist",
"jobUrl": "https://jobs.ashbyhq.com/middesk/7fd5aeb2-c33c-4f0a-ad3d-2161201bc174",
"address": null,
"applyUrl": "https://jobs.ashbyhq.com/middesk/7fd5aeb2-c33c-4f0a-ad3d-2161201bc174/application",
"isListed": true,
"isRemote": false,
"location": "San Francisco",
"updatedAt": null,
"apiVersion": "ashby-non-user-graphql-v1",
"department": "Data Science",
"publishedAt": null,
"workplaceType": "Hybrid",
"employmentType": "FullTime",
"secondaryLocations": [
{
"location": "New York"
}
]
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/3e63cb0a5058ea5348292e65200f0493702fba62?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/7ad28c28-64e9-4e1e-8dfc-085fb5254f90JSONGET https://api.bluedoor.sh/job-postings/v1/sources/5721d77a-754a-4bac-a37f-266f87766839JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/3e63cb0a5058ea5348292e65200f0493702fba62/eventsJSON