Member of Technical Staff, ML Kernels
Netpreme · Santa Clara, CA or Boston, MA · On Site · Active · Ashby
Job facts
| Field | Value |
|---|---|
| Company | Netpreme |
| Title | Member of Technical Staff, ML Kernels |
| Normalized title | - |
| Department / team | AI Systems / AI Systems |
| Location | Santa Clara, CA, United States |
| Work model | On Site |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Ashby |
| Posted / first seen | — / 2026-05-29 |
| Changed / last seen | 2026-05-29 / 2026-06-06 |
Linked records
| Field | Value |
|---|---|
| Company | Netpreme |
| Source | 5f05eb40-800d-4d07-8080-8f7c7aa3a4ab |
| ATS provider | Ashby |
Description
About the Role
We are seeking a Member of Technical Staff, Machine Learning Kernels to design, optimize, and benchmark high-performance compute kernels for modern machine learning workloads. This role is for a deeply technical engineer who enjoys working close to hardware: writing CUDA kernels, investigating subtle performance artifacts, building benchmarks, and serving as a go-to expert on accelerator behavior.
You will act as a hands-on performance specialist, partnering closely with research, systems, and infrastructure teams to unlock efficiency gains across GPUs today and other accelerators (e.g., TPU, Trainium) as we expand our hardware partnerships.
This role will be performed onsite from one of our offices in Santa Clara, CA or Boston, MA.
Essential Duties & Responsibilities
Design, implement, and optimize high-performance ML kernels, primarily targeting GPUs (CUDA), with an emphasis on throughput, latency, and memory efficiency.
Profile, benchmark, and analyze performance across different hardware configurations, identifying bottlenecks and narrow artifacts.
Debug and reason about low-level performance issues involving memory hierarchy, scheduling, synchronization, and numerical formats.
Build and maintain benchmarking and evaluation tools to compare performance across GPUs and other accelerators.
Advise internal teams on GPU and accelerator performance characteristics, tradeoffs, and best practices.
Explore and prototype support for alternative accelerator platforms (e.g., TPU, Amazon Trainium) as partnerships and needs evolve.
Collaborate closely with ML researchers and systems engineers to translate algorithmic needs into efficient kernel implementations.
Qualifications
Strong experience writing and optimizing CUDA kernels or equivalent low-level accelerator code.
Deep understanding of GPU architecture, including memory systems, parallel execution, and performance tradeoffs.
Experience with performance profiling and benchmarking tools (e.g., Nsight Systems / Compute, nvprof, framework-level profilers).
Proficiency in C++ and low-level performance-oriented programming.
Ability to independently investigate ambiguous or poorly understood performance issues and drive them to resolution.
Comfortable switching between different hardware ecosystems and learning new accelerator stacks as needed.
Preferred Qualifications
Experience with ML framework internals (e.g., PyTorch, TensorFlow, XLA) and custom operator development.
Prior work with non-GPU accelerators such as TPU, Trainium, IPU, or similar.
Familiarity with mixed-precision and low-precision compute (e.g., FP16, BF16, FP8).
Contributions to open-source performance, systems, or ML infrastructure projects.
Compensation & Benefits
Competitive salary commensurate with experience, including base salary, performance-based bonus, and early-stage equity grant
Comprehensive benefits including health, dental, vision, and life insurance
Well-equipped, sunny offices in Santa Clara, CA and Boston, MA
Relocation assistance and visa sponsorship
Perks include a daily lunch stipend, 401(k) match, and more
A collaborative, continuous-learning work environment with smart, dedicated colleagues engaged in developing the next generation of architecture for high-performance computing
The Opportunity
Impact: We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state-of-the-art AI models, systems, and real-world applications.
Timing: Joining now means real ownership of the company and meaningful influence over product direction and execution. You’ll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
Culture: You’ll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.
Full job record
| Field | Value |
|---|---|
| Job ID | 5dfefca954859862912a22b3678227b09ab2daca |
| Org ID | d22c302f-888f-425e-945c-27f3f8adf56b |
| Source ID | 5f05eb40-800d-4d07-8080-8f7c7aa3a4ab |
| Board ID | 5f05eb40-800d-4d07-8080-8f7c7aa3a4ab |
| Provider | ashby |
| Provider Job Key | b96121b3-2141-4906-8659-c015eef52b9c |
| Title | Member of Technical Staff, ML Kernels |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Santa Clara, CA or Boston, MA |
| Department | AI Systems |
| Team | AI Systems |
| Employment Type | full_time |
| Workplace Type | on_site |
| Remote Policy | — |
| Country | United States |
| Region | CA |
| City | Santa Clara |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://jobs.ashbyhq.com/netpreme/b96121b3-2141-4906-8659-c015eef52b9c |
| Apply URL | https://jobs.ashbyhq.com/netpreme/b96121b3-2141-4906-8659-c015eef52b9c/application |
| First Seen At | 2026-05-29 05:49:16Z |
| Last Seen At | 2026-06-06 20:27:23Z |
| Last Checked At | 2026-06-06 20:27:23Z |
| Last Changed At | 2026-05-29 05:49:16Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=netpreme/date=2026-06-06/2026-06-06T20-27-22-454Z-802428d96f77e60ed0b0cef56899e02f67d742517af93ce5ac92027fdc0ec2ac.json |
Event Fields
{
"content_hash": "8c45ae9e580ec25910c460991fa83e4b5ca4489af86b6dc5c7285812d48c5d80",
"source_hash": "dccf3ef8fea70a161af407a7fdd7c130c3cda94da06ee37aa72f65bada18a3ef",
"last_changed_at": "2026-05-29T05:49:16.584Z",
"active_status": "active"
}
Parsed Structured
{
"language": "en",
"location": {
"raw": "Santa Clara, CA",
"city": "Santa Clara",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T20:27:23.202Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Santa Clara, CA",
"city": "Santa Clara",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": null,
"workplace_type": "on_site",
"salary_currency": null
}
Extensions
{}
Native Structured
{
"id": "b96121b3-2141-4906-8659-c015eef52b9c",
"team": "AI Systems",
"title": "Member of Technical Staff, ML Kernels",
"jobUrl": "https://jobs.ashbyhq.com/netpreme/b96121b3-2141-4906-8659-c015eef52b9c",
"address": null,
"applyUrl": "https://jobs.ashbyhq.com/netpreme/b96121b3-2141-4906-8659-c015eef52b9c/application",
"isListed": true,
"isRemote": false,
"location": "Santa Clara, CA or Boston, MA",
"updatedAt": null,
"apiVersion": "ashby-non-user-graphql-v1",
"department": "AI Systems",
"publishedAt": null,
"workplaceType": "OnSite",
"employmentType": "FullTime",
"secondaryLocations": []
}
Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/5dfefca954859862912a22b3678227b09ab2daca?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/d22c302f-888f-425e-945c-27f3f8adf56b
GET https://api.bluedoor.sh/job-postings/v1/sources/5f05eb40-800d-4d07-8080-8f7c7aa3a4ab
GET https://api.bluedoor.sh/job-postings/v1/jobs/5dfefca954859862912a22b3678227b09ab2daca/events
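The requests above can also be assembled from the Job ID, Org ID, and Source ID listed in the full job record. The following is a minimal sketch that only builds the four URLs; the helper name `build_request_urls` and the dictionary keys are illustrative assumptions, not part of the API. It sends no requests, since this page does not state what authentication or headers the API expects.

```python
from urllib.parse import urlencode

# Base path taken from the request URLs shown on this page.
BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def build_request_urls(job_id: str, org_id: str, source_id: str) -> dict:
    """Return the four GET URLs for one posting (hypothetical helper)."""
    # Only the job request carries a query string (include=description).
    query = urlencode({"include": "description"})
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?{query}",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


# Identifiers copied from the full job record on this page.
urls = build_request_urls(
    job_id="5dfefca954859862912a22b3678227b09ab2daca",
    org_id="d22c302f-888f-425e-945c-27f3f8adf56b",
    source_id="5f05eb40-800d-4d07-8080-8f7c7aa3a4ab",
)

for name, url in urls.items():
    print(f"{name}: GET {url}")
```

Each URL can then be passed to whichever HTTP client you prefer.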