bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesBaselayerSenior AI Engineer, Agentic Data Enrichment

Senior AI Engineer, Agentic Data Enrichment

Baselayer · San Francisco, California · Hybrid · Active · $195,000–$300,000 / year · Greenhouse

Job facts

FieldValue
CompanyBaselayer
TitleSenior AI Engineer, Agentic Data Enrichment
Normalized title-
Department / teamEngineering
LocationSan Francisco, CA, United States
Work modelHybrid / Hybrid
Employment type-
Salary$195,000–$300,000 / year
Statusactive
ATS providerGreenhouse
Posted / first seen2026-06-01 / 2026-06-02
Changed / last seen2026-06-02 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Baselayer.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Greenhouse.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in San Francisco.Open
Department jobsActive postings in Engineering.Open
Work model jobsActive Hybrid postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyBaselayer
Source3a843be1-2053-4cc3-99ca-0599b02bbe70
ATS providerGreenhouse

Description

ABOUT BASELAYER Every business in America needs a bank account to exist. The system that decides whether they're real, who's behind them, and whether they're a risk, runs on infrastructure from the 1980s. We're rebuilding that layer from scratch. Baselayer is the identity layer for institutions across the United States — the most complete business graph in America and every human tied to it. We fuse public records, IRS data, sanctions lists, web signals, and fraud telemetry from 2,200+ financial institutions into a single graph that resolves any business and the humans behind it in milliseconds. The legacy credit bureaus took 50 years to build something that gets 60% match rates. We've built something that gets 98% in under two years. Today we're trusted by over 20% of financial institutions in America — including FIS, Rho, Socure and leading loan infrastructure providers. But the graph is becoming infrastructure for anyone who needs to know if a business is real and worth trusting: gig platforms, marketplaces, AI companies, and commerce infrastructure at scale. Trust is the substrate of every financial transaction. We're rebuilding it. ABOUT THE TEAM We're solving real-time entity resolution at a scale no one else has cracked — fusing dozens of data sources into a single business identity graph and resolving any entity in milliseconds. It's a graph AI problem, a retrieval problem, and a fraud-modeling problem stacked on top of each other. The technical depth is real. You'd be joining a small team where the data moat is defensible, the research problems are open, and the infrastructure you build becomes load-bearing for businesses. Ownership is real. Velocity is real. There's no layer of process between an idea and shipping it. We're at an inflection point — the graph is built, the match rates speak for themselves, and the hardest problems are still ahead: graph embeddings, fraud propagation models across the business network, real-time traversal at sub-100ms latency, and expanding the identity layer beyond finance into every platform that needs to trust a business. If you want to work on something foundational — the kind of infrastructure that gets built once and everything else runs on top of — this is it. ABOUT THE ROLE Baselayer answers questions the loan application didn't ask. For every business that crosses our queues, we need to know things that aren't on the form: what the business actually does, where it actually lives on the web, whether the people it names match the public record, and whether anything across the open web contradicts the story we were told. We answer those questions with LLM-driven agents that crawl, click, search, and extract structured evidence from across the web - and we treat this as a production data pipeline, not a research demo. We're hiring a Senior AI Engineer to own a slice of this enrichment surface end-to-end. WHAT YOU'LL DO Own industry/category classification of businesses from heterogeneous signals (name, website, directory presence, reviews). Build and maintain discovery and verification systems for a business's real web presence - filtering aggregators, parked domains, brand collisions, and impersonators. Link individuals to businesses via public web evidence (e.g. confirming a named officer or employee genuinely works there). Develop risk/legitimacy scoring derived from web-presence signals, fed back into downstream underwriting. Build and evolve the shared agent infrastructure: provider-agnostic base agents, shared toolset registry (browser navigation, search, scraping, structured database lookups, scoring), eval harness, and instrumentation surface for token-and-tool tracing. Own model selection, agent design, prompt and tool engineering, eval methodology, and cost control across your enrichment surface. MINIMUM REQUIREMENTS Shipped LLM-driven agents to production - not notebooks, not demos. Real users, real cost, real failure modes, real on-call. Strong async Python including structured-data libraries, modern web frameworks, and relational databases. Experience across multiple frontier LLM providers and at least one agent framework, with deep knowledge of failure modes. Built or maintained eval methodology: curated golden datasets, scoring functions, labelling guidelines, regression diagnostics. Browser automation experience: headless browsers, anti-bot evasion, authenticated flows. Holds informed opinions on structured-output reliability - when to use JSON-schema mode vs. function calling vs. extractor-on-top-of-text. WHAT SETS YOU APART Web scraping at scale: anti-bot evasion, residential proxies, request fingerprinting, authenticated flows, CDN defeats. Eval-framework experience (e.g., LangSmith, Braintrust, Evals, or custom). Entity resolution / record linkage / fuzzy matching at scale. Browser-automation experience at the devtools-protocol level. Built a tool registry or toolset abstraction over multiple LLM providers. Cost/latency optimization: response caching, semantic caching, model routing (cheap-first then escalate), thinking-budget tuning, prompt-cache hit-rate work. WORK LOCATION Based in SF; hybrid - 4 days per week in office. COMPENSATION Salary Range: $195,000 – $300,000 + Equity | 0.05% – 0.25% BENEFITS Time off when you need it: Flexible PTO so you can recharge without red tape. In-person energy: We're based in SF and meet in the office 4 days a week. Competitive compensation: We pay well and back it with equity. We want you to think and act like an owner. Career rocket fuel: You'll help build the foundation of a high-growth startup, working side by side with experienced founders and team members who've done it before. Benefits on us: We cover 100% of your health, dental, and vision premiums. No surprise deductions from your paycheck. 401(k) with company match : We match your contributions so your future self benefits too HSA contributions included: We contribute to your HSA on applicable plans, so your coverage works as hard as you do Stay healthy, stay sharp: A $250 monthly gym stipend to help you bring your best self to work, and everywhere else A seat at the table: We believe in transparency, radical candor, and giving every team member a voice 🔥

Full job record

Job ID7a9272acc52eaef3b2e1ec4cd556f875b1d5d559
Org ID5fc76675-cc27-45ce-89dc-12e03f07b190
Source ID3a843be1-2053-4cc3-99ca-0599b02bbe70
Board ID3a843be1-2053-4cc3-99ca-0599b02bbe70
Providergreenhouse
Provider Job Key5237591008
TitleSenior AI Engineer, Agentic Data Enrichment
Normalized Title
Statusactive
Activeyes
Location TextSan Francisco, California
DepartmentEngineering
Team
Employment Type
Workplace Typehybrid
Remote Policyhybrid
CountryUnited States
RegionCA
CitySan Francisco
Salary RawSalary Range: $195,000 – $300,000 + Equity | 0
Salary Min195,000
Salary Max300,000
Salary CurrencyUSD
Salary Periodyear
Source URLhttps://job-boards.greenhouse.io/baselayer/jobs/5237591008
Apply URLhttps://job-boards.greenhouse.io/baselayer/jobs/5237591008
First Seen At2026-06-02 12:07:44Z
Last Seen At2026-06-06 19:55:13Z
Last Checked At2026-06-06 19:55:13Z
Last Changed At2026-06-02 12:07:44Z
Inactive At
Source Posted At2026-06-01 15:47:52Z
Source Updated At2026-06-01 19:26:43Z
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=baselayer/date=2026-06-06/2026-06-06T19-55-12-961Z-73ad5c730edcad69e0ab4b08a6d5a386180c28312ca8652696aed41f74c6dc07.json
Event Fields
{
  "content_hash": "0e1a1027143509f9f192f235a44e73b23c202b88d83497f8ff77da286ac7d50a",
  "source_hash": "be9653bf756911fcdaddceed29d8b0c55a0ec3ecb7e68a30acf16a487a585348",
  "last_changed_at": "2026-06-02T12:07:44.042Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco, California",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.85
  },
  "salary_max": 300000,
  "salary_min": 195000,
  "inferred_at": "2026-06-06T19:55:13.071Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco, California",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.85
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": "year",
  "workplace_type": "hybrid",
  "salary_currency": "USD"
}
Extensions
{}
Native Structured
{
  "title": "Senior AI Engineer, Agentic Data Enrichment",
  "offices": [
    {
      "id": 4041892008,
      "name": "San Francisco",
      "location": null,
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "San Francisco, California"
  },
  "metadata": [],
  "updated_at": "2026-06-01T15:26:43-04:00",
  "departments": [
    {
      "id": 4021063008,
      "name": "Engineering",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "company_name": "Baselayer",
  "requisition_id": 4477692008,
  "first_published": "2026-06-01T11:47:52-04:00",
  "application_deadline": null
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/7a9272acc52eaef3b2e1ec4cd556f875b1d5d559?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/5fc76675-cc27-45ce-89dc-12e03f07b190JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/3a843be1-2053-4cc3-99ca-0599b02bbe70JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/7a9272acc52eaef3b2e1ec4cd556f875b1d5d559/eventsJSON