Home › Companies › RadixArk › Member of Technical Staff — Inference

Member of Technical Staff — Inference

RadixArk · Palo Alto, CA · Active · Greenhouse

Job facts

Field	Value
Company	RadixArk
Title	Member of Technical Staff — Inference
Normalized title	-
Department / team	Engineering
Location	Palo Alto, CA, United States
Work model	-
Employment type	-
Salary	-
Status	active
ATS provider	Greenhouse
Posted / first seen	2026-02-17 / 2026-05-29
Changed / last seen	2026-06-15 / 2026-06-22

Related slices

Page	What it contains	Open
Company jobs	Active postings from RadixArk.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Greenhouse.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in Palo Alto.	Open
Department jobs	Active postings in Engineering.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	RadixArk
Source	59272e4b-3a76-45c5-8439-e3f3e221c87a
ATS provider	Greenhouse

Description

About the Role RadixArk is seeking a Member of Technical Staff — Inference to push the limits of large-scale AI inference. You will work on the core systems that serve frontier models at scale, optimizing performance, latency, throughput, and cost across thousands of GPUs. This role sits at the intersection of systems engineering, ML infrastructure, and performance optimization. Your work will directly shape how state-of-the-art models are deployed and experienced by users worldwide. This is a deeply technical, high-impact role for engineers who enjoy working close to the hardware–software boundary and solving performance-critical problems at scale. Requirements 5+ years of experience in systems engineering, ML infrastructure, or performance-critical backend systems Strong expertise in large-scale inference systems for LLMs or generative models Deep understanding of GPU architecture and performance characteristics Experience optimizing latency- and throughput-critical production systems Strong knowledge of distributed systems and networking fundamentals Proficiency in Python, Rust, C++, or Go for production systems Experience profiling and optimizing compute-intensive workloads Strong debugging skills across system layers (model, runtime, kernel, network) Strong Plus Experience with LLM serving stacks (SGLang, vLLM, TensorRT-LLM, etc.) Open-source contributions in ML or systems infrastructure Familiarity with CUDA, Triton, or custom kernel optimization Experience with batching, KV-cache management, and scheduling strategies Experience running inference at scale (1000+ GPUs) Background in HPC or high-performance systems Responsibilities Design and build large-scale inference systems for frontier AI models Optimize latency, throughput, and GPU utilization in production inference Develop and improve model serving architectures and runtimes Work on batching, scheduling, and memory management strategies Collaborate with kernel, compiler, and systems teams on performance optimization Debug performance bottlenecks across the stack Drive reliability and scalability of inference infrastructure Build tooling for observability, profiling, and performance analysis Contribute to long-term inference architecture and strategy About RadixArk RadixArk is an infrastructure-first company built by engineers who've shipped production AI systems, created SGLang (20K+ GitHub stars, the fastest open LLM serving engine), and developed Miles (our large-scale RL framework). We're on a mission to democratize frontier-level AI infrastructure by building world-class open systems for inference and training. Our team has optimized kernels serving billions of tokens daily, designed distributed training systems coordinating 10,000+ GPUs, and contributed to infrastructure that powers leading AI companies and research labs. We're backed by well-known infrastructure investors and partner with Nvidia, Google, AWS, and frontier AI labs. Join us in building infrastructure that gives real leverage back to the AI community. Compensation We offer competitive compensation with meaningful equity, comprehensive benefits, and flexible work arrangements. Compensation depends on location, experience, and level. Equal Opportunity RadixArk is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

Full job record

Job ID	4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c
Org ID	24beb46a-d8dc-42c5-a58a-e4f927f45491
Source ID	59272e4b-3a76-45c5-8439-e3f3e221c87a
Board ID	59272e4b-3a76-45c5-8439-e3f3e221c87a
Provider	greenhouse
Provider Job Key	4134888009
Title	Member of Technical Staff — Inference
Normalized Title	—
Status	active
Active	yes
Location Text	Palo Alto, CA
Department	Engineering
Team	—
Employment Type	—
Workplace Type	—
Remote Policy	—
Country	United States
Region	CA
City	Palo Alto
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://job-boards.greenhouse.io/radixark/jobs/4134888009
Apply URL	https://job-boards.greenhouse.io/radixark/jobs/4134888009
First Seen At	2026-05-29 22:58:18Z
Last Seen At	2026-06-22 07:40:43Z
Last Checked At	2026-06-22 07:40:43Z
Last Changed At	2026-06-15 07:33:55Z
Inactive At	—
Source Posted At	2026-02-17 10:19:36Z
Source Updated At	2026-06-15 03:07:08Z
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=radixark/date=2026-06-22/2026-06-22T07-40-43-761Z-844389343170b06955b8ea30d7212bbb1585a66d016c46be8f25ebceb1ab15a0.json

Event Fields

{
  "content_hash": "e34dea00078a4ef9ffda7dcb2bd5d8078fbf5cdb772b6681d885ccb8beb11ead",
  "source_hash": "17fe13acbfcbf9a2998a6208e9c7347da03783d0db2c49938e708eff70aafb4e",
  "last_changed_at": "2026-06-15T07:33:55.069Z",
  "active_status": "active"
}

Parsed Structured

{
  "dedupe": null,
  "language": "en",
  "location": {
    "raw": "Palo Alto, CA",
    "city": "Palo Alto",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.9
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-22T07:40:43.872Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Palo Alto, CA",
      "city": "Palo Alto",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "title": "Member of Technical Staff — Inference",
  "offices": [
    {
      "id": 4023561009,
      "name": "Palo Alto Office",
      "location": "Palo Alto, California, United States",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Palo Alto, CA"
  },
  "metadata": [],
  "updated_at": "2026-06-14T23:07:08-04:00",
  "departments": [
    {
      "id": 4023176009,
      "name": "Engineering",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "company_name": "RadixArk",
  "requisition_id": 4088943009,
  "first_published": "2026-02-17T05:19:36-05:00",
  "application_deadline": null
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/24beb46a-d8dc-42c5-a58a-e4f927f45491JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/59272e4b-3a76-45c5-8439-e3f3e221c87aJSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c/eventsJSON

Docs · Get an API key