Home › Companies › RadixArk › Member of Technical Staff — Inference
Member of Technical Staff — Inference
RadixArk · Palo Alto, CA · Active · Greenhouse
Job facts
| Field | Value |
|---|---|
| Company | RadixArk |
| Title | Member of Technical Staff — Inference |
| Normalized title | - |
| Department / team | Engineering |
| Location | Palo Alto, CA, United States |
| Work model | - |
| Employment type | - |
| Salary | - |
| Status | active |
| ATS provider | Greenhouse |
| Posted / first seen | 2026-02-17 / 2026-05-29 |
| Changed / last seen | 2026-06-15 / 2026-06-22 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from RadixArk. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Greenhouse. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Palo Alto. | Open |
| Department jobs | Active postings in Engineering. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | RadixArk |
| Source | 59272e4b-3a76-45c5-8439-e3f3e221c87a |
| ATS provider | Greenhouse |
Description
About the Role
RadixArk is seeking a Member of Technical Staff — Inference to push the limits of large-scale AI inference.
You will work on the core systems that serve frontier models at scale, optimizing performance, latency, throughput, and cost across thousands of GPUs. This role sits at the intersection of systems engineering, ML infrastructure, and performance optimization.
Your work will directly shape how state-of-the-art models are deployed and experienced by users worldwide.
This is a deeply technical, high-impact role for engineers who enjoy working close to the hardware–software boundary and solving performance-critical problems at scale.
Requirements
5+ years of experience in systems engineering, ML infrastructure, or performance-critical backend systems
Strong expertise in large-scale inference systems for LLMs or generative models
Deep understanding of GPU architecture and performance characteristics
Experience optimizing latency- and throughput-critical production systems
Strong knowledge of distributed systems and networking fundamentals
Proficiency in Python, Rust, C++, or Go for production systems
Experience profiling and optimizing compute-intensive workloads
Strong debugging skills across system layers (model, runtime, kernel, network)
Strong Plus
Experience with LLM serving stacks (SGLang, vLLM, TensorRT-LLM, etc.)
Open-source contributions in ML or systems infrastructure
Familiarity with CUDA, Triton, or custom kernel optimization
Experience with batching, KV-cache management, and scheduling strategies
Experience running inference at scale (1000+ GPUs)
Background in HPC or high-performance systems
Responsibilities
Design and build large-scale inference systems for frontier AI models
Optimize latency, throughput, and GPU utilization in production inference
Develop and improve model serving architectures and runtimes
Work on batching, scheduling, and memory management strategies
Collaborate with kernel, compiler, and systems teams on performance optimization
Debug performance bottlenecks across the stack
Drive reliability and scalability of inference infrastructure
Build tooling for observability, profiling, and performance analysis
Contribute to long-term inference architecture and strategy
About RadixArk
RadixArk is an infrastructure-first company built by engineers who've shipped production AI systems, created SGLang (20K+ GitHub stars, the fastest open LLM serving engine), and developed Miles (our large-scale RL framework).
We're on a mission to democratize frontier-level AI infrastructure by building world-class open systems for inference and training.
Our team has optimized kernels serving billions of tokens daily, designed distributed training systems coordinating 10,000+ GPUs, and contributed to infrastructure that powers leading AI companies and research labs.
We're backed by well-known infrastructure investors and partner with Nvidia, Google, AWS, and frontier AI labs.
Join us in building infrastructure that gives real leverage back to the AI community.
Compensation
We offer competitive compensation with meaningful equity, comprehensive benefits, and flexible work arrangements. Compensation depends on location, experience, and level.
Equal Opportunity
RadixArk is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Full job record
| Job ID | 4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c |
| Org ID | 24beb46a-d8dc-42c5-a58a-e4f927f45491 |
| Source ID | 59272e4b-3a76-45c5-8439-e3f3e221c87a |
| Board ID | 59272e4b-3a76-45c5-8439-e3f3e221c87a |
| Provider | greenhouse |
| Provider Job Key | 4134888009 |
| Title | Member of Technical Staff — Inference |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Palo Alto, CA |
| Department | Engineering |
| Team | — |
| Employment Type | — |
| Workplace Type | — |
| Remote Policy | — |
| Country | United States |
| Region | CA |
| City | Palo Alto |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://job-boards.greenhouse.io/radixark/jobs/4134888009 |
| Apply URL | https://job-boards.greenhouse.io/radixark/jobs/4134888009 |
| First Seen At | 2026-05-29 22:58:18Z |
| Last Seen At | 2026-06-22 07:40:43Z |
| Last Checked At | 2026-06-22 07:40:43Z |
| Last Changed At | 2026-06-15 07:33:55Z |
| Inactive At | — |
| Source Posted At | 2026-02-17 10:19:36Z |
| Source Updated At | 2026-06-15 03:07:08Z |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=radixark/date=2026-06-22/2026-06-22T07-40-43-761Z-844389343170b06955b8ea30d7212bbb1585a66d016c46be8f25ebceb1ab15a0.json |
Event Fields
{
"content_hash": "e34dea00078a4ef9ffda7dcb2bd5d8078fbf5cdb772b6681d885ccb8beb11ead",
"source_hash": "17fe13acbfcbf9a2998a6208e9c7347da03783d0db2c49938e708eff70aafb4e",
"last_changed_at": "2026-06-15T07:33:55.069Z",
"active_status": "active"
}Parsed Structured
{
"dedupe": null,
"language": "en",
"location": {
"raw": "Palo Alto, CA",
"city": "Palo Alto",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-22T07:40:43.872Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Palo Alto, CA",
"city": "Palo Alto",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": null,
"workplace_type": null,
"salary_currency": null
}Extensions
{}Native Structured
{
"title": "Member of Technical Staff — Inference",
"offices": [
{
"id": 4023561009,
"name": "Palo Alto Office",
"location": "Palo Alto, California, United States",
"child_ids": [],
"parent_id": null
}
],
"language": "en",
"location": {
"name": "Palo Alto, CA"
},
"metadata": [],
"updated_at": "2026-06-14T23:07:08-04:00",
"departments": [
{
"id": 4023176009,
"name": "Engineering",
"child_ids": [],
"parent_id": null
}
],
"company_name": "RadixArk",
"requisition_id": 4088943009,
"first_published": "2026-02-17T05:19:36-05:00",
"application_deadline": null
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/24beb46a-d8dc-42c5-a58a-e4f927f45491JSONGET https://api.bluedoor.sh/job-postings/v1/sources/59272e4b-3a76-45c5-8439-e3f3e221c87aJSONGET https://api.bluedoor.sh/job-postings/v1/jobs/4f8810bb9c17caa7cddd0c464ece9cb274ba3f9c/eventsJSON