bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesAdaptionDistributed Systems Engineer, Data & Inference Platform

Distributed Systems Engineer, Data & Inference Platform

Adaption · San Francisco · Hybrid · Active · Ashby

Job facts

FieldValue
CompanyAdaption
TitleDistributed Systems Engineer, Data & Inference Platform
Normalized title-
Department / teamPlatform / Platform
LocationSan Francisco, CA, United States
Work modelHybrid / Hybrid
Employment typeFull Time
Salary-
Statusactive
ATS providerAshby
Posted / first seen / 2026-05-29
Changed / last seen2026-05-29 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Adaption.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Ashby.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in San Francisco.Open
Department jobsActive postings in Platform.Open
Work model jobsActive Hybrid postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyAdaption
Source5f007786-fb83-4b96-aa3d-5e7a3599f28d
ATS providerAshby

Description

The Role You'll build and operate the systems that turn raw compute into useful intelligence — the inference services that serve LLMs at scale and the data pipelines that feed them. One week you're hunting a tail-latency regression in a production inference service handling millions of requests; the next you're redesigning a Ray Data pipeline so it stops melting down at petabyte scale. The work spans architecture, implementation, and the on-call pager that keeps you honest about both. Researchers and ML engineers will hand you workloads that barely run; you'll hand them back systems that run reliably, efficiently, and cheaply enough to matter. Responsibilities Serve Models at Scale: Design and operate distributed inference systems for LLMs, optimizing throughput, latency, and cost across heterogeneous GPU fleets. Batching, scheduling, KV cache management, autoscaling — you own the levers that make inference economical. Move the Data: Build large-scale data pipelines (Ray Data, Spark, or equivalents) that ingest, transform, and curate the datasets behind training and evaluation. The bottleneck is rarely where people think it is, and you find it. Debug the Undebuggable: Chase down the failure modes that only emerge under real production traffic — stragglers, head-of-line blocking, silent data corruption, GPU memory fragmentation — and write the postmortems that prevent the next ten. Define SLOs, build the observability to measure them, and own the on-call rotation that defends them. Partner Across the Stack: Work directly with researchers and ML engineers to take experimental workloads from "runs on one node" to "runs in production." You're a systems partner, not a ticket queue. Qualifications 5+ years building and operating distributed systems in production. Deep experience with at least one large-scale data or compute framework (Ray, Spark, Flink, Beam, Dask). Strong fluency in Python and at least one systems language (Go, Rust, C++). Working knowledge of the GPU/accelerator stack: CUDA fundamentals, NCCL, mixed precision, memory layout. You don't need to write kernels, but you should know why a workload is bound by what it's bound by. Experience operating Kubernetes-based infrastructure, including custom operators or schedulers. A track record of owning hard production incidents end-to-end — diagnosis, mitigation, and the durable fix. Bonus: hands-on experience with LLM inference engines (vLLM, SGLang, TensorRT-LLM, TGI), modern lakehouse formats (Iceberg, Delta, Hudi), or open-source contributions to relevant projects. Above all, we're looking for great teammates who make work feel lighter and aren't afraid to go out on a limb with bold ideas. You don't need to be perfect, but you do need to be adaptable. We encourage you to apply, even if you don't check every box. About Us Most AI is frozen in place - it doesn't adapt to the world. We think that's backwards. Our mandate is to build efficient intelligence that evolves in real-time. Our vision is AI systems that are flexible, personalized, and accessible to everyone. We believe efficiency is what makes this possible - it's how we expand access and ensure innovation benefits the many, not the few. We believe in talent density: bringing together the best and most driven individuals to push the boundaries of continual adaptation. We're looking for builders and creative thinkers ready to shape the next era of intelligence.   Benefits Flexible work : In-person collaboration in the Bay Area, a distributed global-first team, and team offsites. Adaption Passport : Annual travel stipend to explore a country you've never visited. We're building intelligence that evolves alongside you, so we encourage you to keep expanding your horizons. Lunch Stipend: Weekly meal allowance for take-out or grocery delivery. Well-Being : Comprehensive medical benefits and generous paid time off.

Full job record

Job IDfe50ea2c3dd0ddabe924726ff3f5ecef2fc2a160
Org ID1765bb84-ef1a-4913-b99d-908b8a356e16
Source ID5f007786-fb83-4b96-aa3d-5e7a3599f28d
Board ID5f007786-fb83-4b96-aa3d-5e7a3599f28d
Providerashby
Provider Job Key7324a28a-74aa-4515-a4c0-77adc5fccd87
TitleDistributed Systems Engineer, Data & Inference Platform
Normalized Title
Statusactive
Activeyes
Location TextSan Francisco
DepartmentPlatform
TeamPlatform
Employment Typefull_time
Workplace Typehybrid
Remote Policyhybrid
CountryUnited States
RegionCA
CitySan Francisco
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://jobs.ashbyhq.com/adaption/7324a28a-74aa-4515-a4c0-77adc5fccd87
Apply URLhttps://jobs.ashbyhq.com/adaption/7324a28a-74aa-4515-a4c0-77adc5fccd87/application
First Seen At2026-05-29 05:45:41Z
Last Seen At2026-06-06 20:30:07Z
Last Checked At2026-06-06 20:30:07Z
Last Changed At2026-05-29 05:45:41Z
Inactive At
Source Posted At
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=adaption/date=2026-06-06/2026-06-06T20-30-06-492Z-57d2bbcb4eebe301156faa98c3a8e28c5e98c01f9dc95103a00d6179aaa7e3be.json
Event Fields
{
  "content_hash": "dd9e447d2c661c12ad80fc7b5d5b2866ab01a38d584cd36b7403e3e5dc36ad74",
  "source_hash": "1ed9084932cd86a0ed8465fba6abeaba0c30a38c768599e677f232213729d2a5",
  "last_changed_at": "2026-05-29T05:45:41.634Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T20:30:07.800Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": null,
  "workplace_type": "hybrid",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "7324a28a-74aa-4515-a4c0-77adc5fccd87",
  "team": "Platform",
  "title": "Distributed Systems Engineer, Data & Inference Platform",
  "jobUrl": "https://jobs.ashbyhq.com/adaption/7324a28a-74aa-4515-a4c0-77adc5fccd87",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/adaption/7324a28a-74aa-4515-a4c0-77adc5fccd87/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Platform",
  "publishedAt": null,
  "workplaceType": "Hybrid",
  "employmentType": "FullTime",
  "secondaryLocations": [
    {
      "location": "Bay Area"
    }
  ]
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/fe50ea2c3dd0ddabe924726ff3f5ecef2fc2a160?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/1765bb84-ef1a-4913-b99d-908b8a356e16JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/5f007786-fb83-4b96-aa3d-5e7a3599f28dJSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/fe50ea2c3dd0ddabe924726ff3f5ecef2fc2a160/eventsJSON