Home › Companies › Nuance Labs › Member of Technical Staff — ML Data Infra
Member of Technical Staff — ML Data Infra
Nuance Labs · Seattle, Washington · Active · $200,000–$300,000 / year · Greenhouse
Job facts
| Field | Value |
|---|---|
| Company | Nuance Labs |
| Title | Member of Technical Staff — ML Data Infra |
| Normalized title | - |
| Department / team | Engineering |
| Location | Seattle, WA, United States |
| Work model | - |
| Employment type | - |
| Salary | $200,000–$300,000 / year |
| Status | active |
| ATS provider | Greenhouse |
| Posted / first seen | 2026-06-05 / 2026-06-06 |
| Changed / last seen | 2026-06-06 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Nuance Labs. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Greenhouse. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Seattle. | Open |
| Department jobs | Active postings in Engineering. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Nuance Labs |
| Source | 4d06c175-4ee5-4cda-ad2e-cc1de78b9519 |
| ATS provider | Greenhouse |
Description
About Nuance Labs
Nuance Labs is building photorealistic, real-time AI avatars with emotional intelligence: a full-duplex audiovisual system that can listen, speak, react, interrupt, and respond like a real person.
We're a Series A company ($60M raised) backed by Lightspeed, Accel, South Park Commons, NVentures, and Define Ventures, with PhDs from MIT, UW, Oxford, CMU, and Johns Hopkins, and industry experience from Apple, Meta, Amazon AGI, and Discord. The team is small, the work is real, and the problems are unsolved.
How Nuance Differentiates
Most conversational AI avatars today are hacks — a face slapped on a speech-to-speech pipeline, stuck in the uncanny valley: emotionless, mechanical, one-turn-at-a-time. Current systems take 2–5 seconds to respond; natural conversation requires sub-500ms. That's a 10x improvement, and it demands rethinking the entire stack.
That rethinking starts with full-duplex: an AI that listens and speaks simultaneously, perceives emotion in real time, and responds with a face that actually reflects it. It's an extremely hard problem, and we're developing foundation models designed for it from the ground up.
About the Role
Model quality is ultimately a data problem. The best architecture and the best training run can't outrun bad, slow, or poorly curated data — and at the scale we're operating, the difference between a good data pipeline and a great one shows up directly in the model.
We're looking for someone who lives and breathes data at scale. You know how to build pipelines that are fast, reliable, and maintainable — and you're just as comfortable taking a researcher's messy processing script and turning it into something that runs on petabytes as you are designing a new pipeline architecture from scratch. Research moves fast here, and the ability to productionize quickly without losing fidelity is the core skill.
Our data is multimodal — video, audio, and text — and the processing requirements are demanding: high throughput, low error rates, and strict quality filters. There's a lot of interesting engineering work here, and the impact is direct and measurable.
What You'll Do
Design, build, and operate large-scale data pipelines for ingestion, processing, filtering, and curation of multimodal training data (video, audio, text)
Take research-grade data processing code and turn it into robust, production-level pipelines — quickly and without losing correctness
Optimize pipeline throughput and efficiency at scale; identify and eliminate bottlenecks across compute, I/O, and storage
Build and maintain data quality systems — deduplication, filtering, validation, and quality scoring at scale
Manage petabyte-scale datasets: storage architecture, versioning, lineage tracking, and cost efficiency
Work closely with researchers to understand data requirements and translate them into scalable processing systems
Build tooling and infrastructure that makes the research team faster — efficient data access, reproducible processing, and fast iteration loops
What We're Looking For
Proven experience building and operating large-scale data pipelines in production — you've processed data at a scale where naive approaches break
Strong proficiency with distributed data processing frameworks — Spark, Ray, Dask, or similar — and a clear sense of when to use each
Solid software engineering fundamentals: you write clean, testable, maintainable code and understand why that matters when pipelines run unattended at scale
Experience with multimodal data (video, audio) is a strong plus — understanding of formats, codecs, and processing libraries (FFmpeg, decord, etc.)
Familiarity with ML data pipelines specifically — understanding of how data quality and format affect model training
Ability to move fast: you can take a prototype script from a researcher and ship a production version in days, not weeks
Bonus Points
Experience building data pipelines for large-scale model training (pre-training or fine-tuning)
Familiarity with data versioning and lineage tools (DVC, Delta Lake, Apache Iceberg, etc.)
Experience with streaming data pipelines or online data processing
Prior work at an AI lab, video platform, or other data-intensive company
Contributions to open-source data tooling
Compensation
$200,000 – $300,000 base salary, plus meaningful equity. We think long-term ownership matters and structure equity accordingly.
Logistics
Location: In-person in Seattle, 5 days a week — we believe in the compounding value of working shoulder-to-shoulder
Health: HSA plan with ~$2,000 in company contributions — about 2x what most big tech companies offer
PTO: 15 days + public holidays, and we close for a full week over the holidays
Lunch, beverages, and snacks: On us, every workday — the kind of thing that makes you actually look forward to the workday
Commuter benefits
401K: In the works
Nuance Labs is an equal opportunity employer. We believe diverse teams build better AI.
Full job record
| Job ID | 2973e12a1a551b27fdd2fc0e2922f968bd4a27fc |
| Org ID | b5cad4e8-d3e2-423b-934c-3898f78ddee7 |
| Source ID | 4d06c175-4ee5-4cda-ad2e-cc1de78b9519 |
| Board ID | 4d06c175-4ee5-4cda-ad2e-cc1de78b9519 |
| Provider | greenhouse |
| Provider Job Key | 4277601009 |
| Title | Member of Technical Staff — ML Data Infra |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Seattle, Washington |
| Department | Engineering |
| Team | — |
| Employment Type | — |
| Workplace Type | — |
| Remote Policy | — |
| Country | United States |
| Region | WA |
| City | Seattle |
| Salary Raw | Compensation $200,000 – $300,000 base salary, plus meaningful equity |
| Salary Min | 200,000 |
| Salary Max | 300,000 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://job-boards.greenhouse.io/nuancelabs/jobs/4277601009 |
| Apply URL | https://job-boards.greenhouse.io/nuancelabs/jobs/4277601009 |
| First Seen At | 2026-06-06 07:33:06Z |
| Last Seen At | 2026-06-06 20:12:12Z |
| Last Checked At | 2026-06-06 20:12:12Z |
| Last Changed At | 2026-06-06 07:33:06Z |
| Inactive At | — |
| Source Posted At | 2026-06-05 21:40:39Z |
| Source Updated At | 2026-06-05 22:20:40Z |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=nuancelabs/date=2026-06-06/2026-06-06T20-12-12-157Z-ffe23d7407e1e5ada742398a50f3dfa98b1687e7d7d3baed3187df60c4819845.json |
Event Fields
{
"content_hash": "a4df4ab58c694d325debb7fb0aa861d756aa0580dd83b70427391bd044340511",
"source_hash": "3cd5110f3162f4ccdc6eec78b938a4a59f637d2c76fd6ca10fa8bf02440b0ba0",
"last_changed_at": "2026-06-06T07:33:06.309Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "Seattle, Washington",
"city": "Seattle",
"region": "WA",
"country": "United States",
"is_remote": false,
"confidence": 0.85
},
"salary_max": 300000,
"salary_min": 200000,
"inferred_at": "2026-06-06T20:12:12.227Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Seattle, Washington",
"city": "Seattle",
"region": "WA",
"country": "United States",
"is_remote": false,
"confidence": 0.85
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": "year",
"workplace_type": null,
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"title": "Member of Technical Staff — ML Data Infra",
"offices": [
{
"id": 4030799009,
"name": "Seattle",
"location": null,
"child_ids": [],
"parent_id": null
}
],
"language": "en",
"location": {
"name": "Seattle, Washington"
},
"metadata": [],
"updated_at": "2026-06-05T18:20:40-04:00",
"departments": [
{
"id": 4031248009,
"name": "Engineering",
"child_ids": [],
"parent_id": null
}
],
"company_name": "Nuance Labs",
"requisition_id": 4162947009,
"first_published": "2026-06-05T17:40:39-04:00",
"application_deadline": null
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/2973e12a1a551b27fdd2fc0e2922f968bd4a27fc?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/b5cad4e8-d3e2-423b-934c-3898f78ddee7JSONGET https://api.bluedoor.sh/job-postings/v1/sources/4d06c175-4ee5-4cda-ad2e-cc1de78b9519JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/2973e12a1a551b27fdd2fc0e2922f968bd4a27fc/eventsJSON