bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesRadixArkMember of Technical Staff — Inference

Member of Technical Staff — Inference

RadixArk · Palo Alto, CA · Active · Greenhouse

Job facts

FieldValue
CompanyRadixArk
TitleMember of Technical Staff — Inference
Normalized title-
Department / teamEngineering
LocationPalo Alto, CA, United States
Work model-
Employment type-
Salary-
Statusactive
ATS providerGreenhouse
Posted / first seen2026-02-17 / 2026-05-29
Changed / last seen2026-06-15 / 2026-06-22

Related slices

PageWhat it containsOpen
Company jobsActive postings from RadixArk.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Greenhouse.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Palo Alto.Open
Department jobsActive postings in Engineering.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyRadixArk
Source59272e4b-3a76-45c5-8439-e3f3e221c87a
ATS providerGreenhouse

Description

About the Role RadixArk is seeking a Member of Technical Staff — Inference to push the limits of large-scale AI inference. You will work on the core systems that serve frontier models at scale, optimizing performance, latency, throughput, and cost across thousands of GPUs. This role sits at the intersection of systems engineering, ML infrastructure, and performance optimization. Your work will directly shape how state-of-the-art models are deployed and experienced by users worldwide. This is a deeply technical, high-impact role for engineers who enjoy working close to the hardware–software boundary and solving performance-critical problems at scale. Requirements 5+ years of experience in systems engineering, ML infrastructure, or performance-critical backend systems Strong expertise in large-scale inference systems for LLMs or generative models Deep understanding of GPU architecture and performance characteristics Experience optimizing latency- and throughput-critical production systems Strong knowledge of distributed systems and networking fundamentals Proficiency in Python, Rust, C++, or Go for production systems Experience profiling and optimizing compute-intensive workloads Strong debugging skills across system layers (model, runtime, kernel, network) Strong Plus Experience with LLM serving stacks (SGLang, vLLM, TensorRT-LLM, etc.) Open-source contributions in ML or systems infrastructure Familiarity with CUDA, Triton, or custom kernel optimization Experience with batching, KV-cache management, and scheduling strategies Experience running inference at scale (1000+ GPUs) Background in HPC or high-performance systems Responsibilities Design and build large-scale inference systems for frontier AI models Optimize latency, throughput, and GPU utilization in production inference Develop and improve model serving architectures and runtimes Work on batching, scheduling, and memory management strategies Collaborate with kernel, compiler, and systems teams on performance optimization Debug performance bottlenecks across the stack Drive reliability and scalability of inference infrastructure Build tooling for observability, profiling, and performance analysis Contribute to long-term inference architecture and strategy About RadixArk RadixArk is an infrastructure-first company built by engineers who've shipped production AI systems, created SGLang (20K+ GitHub stars, the fastest open LLM serving engine), and developed Miles (our large-scale RL framework). We're on a mission to democratize frontier-level AI infrastructure by building world-class open systems for inference and training. Our team has optimized kernels serving billions of tokens daily, designed distributed training systems coordinating 10,000+ GPUs, and contributed to infrastructure that powers leading AI companies and research labs. We're backed by well-known infrastructure investors and partner with Nvidia, Google, AWS, and frontier AI labs. Join us in building infrastructure that gives real leverage back to the AI community. Compensation We offer competitive compensation with meaningful equity, comprehensive benefits, and flexible work arrangements. Compensation depends on location, experience, and level. Equal Opportunity RadixArk is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Full job record

Job ID4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c
Org ID24beb46a-d8dc-42c5-a58a-e4f927f45491
Source ID59272e4b-3a76-45c5-8439-e3f3e221c87a
Board ID59272e4b-3a76-45c5-8439-e3f3e221c87a
Providergreenhouse
Provider Job Key4134888009
TitleMember of Technical Staff — Inference
Normalized Title
Statusactive
Activeyes
Location TextPalo Alto, CA
DepartmentEngineering
Team
Employment Type
Workplace Type
Remote Policy
CountryUnited States
RegionCA
CityPalo Alto
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://job-boards.greenhouse.io/radixark/jobs/4134888009
Apply URLhttps://job-boards.greenhouse.io/radixark/jobs/4134888009
First Seen At2026-05-29 22:58:18Z
Last Seen At2026-06-22 07:40:43Z
Last Checked At2026-06-22 07:40:43Z
Last Changed At2026-06-15 07:33:55Z
Inactive At
Source Posted At2026-02-17 10:19:36Z
Source Updated At2026-06-15 03:07:08Z
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=radixark/date=2026-06-22/2026-06-22T07-40-43-761Z-844389343170b06955b8ea30d7212bbb1585a66d016c46be8f25ebceb1ab15a0.json
Event Fields
{
  "content_hash": "e34dea00078a4ef9ffda7dcb2bd5d8078fbf5cdb772b6681d885ccb8beb11ead",
  "source_hash": "17fe13acbfcbf9a2998a6208e9c7347da03783d0db2c49938e708eff70aafb4e",
  "last_changed_at": "2026-06-15T07:33:55.069Z",
  "active_status": "active"
}
Parsed Structured
{
  "dedupe": null,
  "language": "en",
  "location": {
    "raw": "Palo Alto, CA",
    "city": "Palo Alto",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.9
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-22T07:40:43.872Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Palo Alto, CA",
      "city": "Palo Alto",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "title": "Member of Technical Staff — Inference",
  "offices": [
    {
      "id": 4023561009,
      "name": "Palo Alto Office",
      "location": "Palo Alto, California, United States",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Palo Alto, CA"
  },
  "metadata": [],
  "updated_at": "2026-06-14T23:07:08-04:00",
  "departments": [
    {
      "id": 4023176009,
      "name": "Engineering",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "company_name": "RadixArk",
  "requisition_id": 4088943009,
  "first_published": "2026-02-17T05:19:36-05:00",
  "application_deadline": null
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/24beb46a-d8dc-42c5-a58a-e4f927f45491JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/59272e4b-3a76-45c5-8439-e3f3e221c87aJSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c/eventsJSON