Home › Companies › Nuance Labs › Member of Technical Staff — ML Data Infra

Member of Technical Staff — ML Data Infra

Nuance Labs · Seattle, Washington · Active · $200,000–$300,000 / year · Greenhouse

Job facts

Field	Value
Company	Nuance Labs
Title	Member of Technical Staff — ML Data Infra
Normalized title	-
Department / team	Engineering
Location	Seattle, WA, United States
Work model	-
Employment type	-
Salary	$200,000–$300,000 / year
Status	active
ATS provider	Greenhouse
Posted / first seen	2026-06-05 / 2026-06-06
Changed / last seen	2026-06-06 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Nuance Labs.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Greenhouse.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in Seattle.	Open
Department jobs	Active postings in Engineering.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Nuance Labs
Source	4d06c175-4ee5-4cda-ad2e-cc1de78b9519
ATS provider	Greenhouse

Description

About Nuance Labs Nuance Labs is building photorealistic, real-time AI avatars with emotional intelligence: a full-duplex audiovisual system that can listen, speak, react, interrupt, and respond like a real person. We're a Series A company ($60M raised) backed by Lightspeed, Accel, South Park Commons, NVentures, and Define Ventures, with PhDs from MIT, UW, Oxford, CMU, and Johns Hopkins, and industry experience from Apple, Meta, Amazon AGI, and Discord. The team is small, the work is real, and the problems are unsolved. How Nuance Differentiates Most conversational AI avatars today are hacks — a face slapped on a speech-to-speech pipeline, stuck in the uncanny valley: emotionless, mechanical, one-turn-at-a-time. Current systems take 2–5 seconds to respond; natural conversation requires sub-500ms. That's a 10x improvement, and it demands rethinking the entire stack. That rethinking starts with full-duplex: an AI that listens and speaks simultaneously, perceives emotion in real time, and responds with a face that actually reflects it. It's an extremely hard problem, and we're developing foundation models designed for it from the ground up. About the Role Model quality is ultimately a data problem. The best architecture and the best training run can't outrun bad, slow, or poorly curated data — and at the scale we're operating, the difference between a good data pipeline and a great one shows up directly in the model. We're looking for someone who lives and breathes data at scale. You know how to build pipelines that are fast, reliable, and maintainable — and you're just as comfortable taking a researcher's messy processing script and turning it into something that runs on petabytes as you are designing a new pipeline architecture from scratch. Research moves fast here, and the ability to productionize quickly without losing fidelity is the core skill. Our data is multimodal — video, audio, and text — and the processing requirements are demanding: high throughput, low error rates, and strict quality filters. There's a lot of interesting engineering work here, and the impact is direct and measurable. What You'll Do Design, build, and operate large-scale data pipelines for ingestion, processing, filtering, and curation of multimodal training data (video, audio, text) Take research-grade data processing code and turn it into robust, production-level pipelines — quickly and without losing correctness Optimize pipeline throughput and efficiency at scale; identify and eliminate bottlenecks across compute, I/O, and storage Build and maintain data quality systems — deduplication, filtering, validation, and quality scoring at scale Manage petabyte-scale datasets: storage architecture, versioning, lineage tracking, and cost efficiency Work closely with researchers to understand data requirements and translate them into scalable processing systems Build tooling and infrastructure that makes the research team faster — efficient data access, reproducible processing, and fast iteration loops What We're Looking For Proven experience building and operating large-scale data pipelines in production — you've processed data at a scale where naive approaches break Strong proficiency with distributed data processing frameworks — Spark, Ray, Dask, or similar — and a clear sense of when to use each Solid software engineering fundamentals: you write clean, testable, maintainable code and understand why that matters when pipelines run unattended at scale Experience with multimodal data (video, audio) is a strong plus — understanding of formats, codecs, and processing libraries (FFmpeg, decord, etc.) Familiarity with ML data pipelines specifically — understanding of how data quality and format affect model training Ability to move fast: you can take a prototype script from a researcher and ship a production version in days, not weeks Bonus Points Experience building data pipelines for large-scale model training (pre-training or fine-tuning) Familiarity with data versioning and lineage tools (DVC, Delta Lake, Apache Iceberg, etc.) Experience with streaming data pipelines or online data processing Prior work at an AI lab, video platform, or other data-intensive company Contributions to open-source data tooling Compensation $200,000 – $300,000 base salary, plus meaningful equity. We think long-term ownership matters and structure equity accordingly. Logistics Location: In-person in Seattle, 5 days a week — we believe in the compounding value of working shoulder-to-shoulder Health: HSA plan with ~$2,000 in company contributions — about 2x what most big tech companies offer PTO: 15 days + public holidays, and we close for a full week over the holidays Lunch, beverages, and snacks: On us, every workday — the kind of thing that makes you actually look forward to the workday Commuter benefits 401K: In the works Nuance Labs is an equal opportunity employer. We believe diverse teams build better AI.

Full job record

Job ID	2973e12a1a551b27fdd2fc0e2922f968bd4a27fc
Org ID	b5cad4e8-d3e2-423b-934c-3898f78ddee7
Source ID	4d06c175-4ee5-4cda-ad2e-cc1de78b9519
Board ID	4d06c175-4ee5-4cda-ad2e-cc1de78b9519
Provider	greenhouse
Provider Job Key	4277601009
Title	Member of Technical Staff — ML Data Infra
Normalized Title	—
Status	active
Active	yes
Location Text	Seattle, Washington
Department	Engineering
Team	—
Employment Type	—
Workplace Type	—
Remote Policy	—
Country	United States
Region	WA
City	Seattle
Salary Raw	Compensation $200,000 – $300,000 base salary, plus meaningful equity
Salary Min	200,000
Salary Max	300,000
Salary Currency	USD
Salary Period	year
Source URL	https://job-boards.greenhouse.io/nuancelabs/jobs/4277601009
Apply URL	https://job-boards.greenhouse.io/nuancelabs/jobs/4277601009
First Seen At	2026-06-06 07:33:06Z
Last Seen At	2026-06-06 20:12:12Z
Last Checked At	2026-06-06 20:12:12Z
Last Changed At	2026-06-06 07:33:06Z
Inactive At	—
Source Posted At	2026-06-05 21:40:39Z
Source Updated At	2026-06-05 22:20:40Z
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=nuancelabs/date=2026-06-06/2026-06-06T20-12-12-157Z-ffe23d7407e1e5ada742398a50f3dfa98b1687e7d7d3baed3187df60c4819845.json

Event Fields

{
  "content_hash": "a4df4ab58c694d325debb7fb0aa861d756aa0580dd83b70427391bd044340511",
  "source_hash": "3cd5110f3162f4ccdc6eec78b938a4a59f637d2c76fd6ca10fa8bf02440b0ba0",
  "last_changed_at": "2026-06-06T07:33:06.309Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Seattle, Washington",
    "city": "Seattle",
    "region": "WA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.85
  },
  "salary_max": 300000,
  "salary_min": 200000,
  "inferred_at": "2026-06-06T20:12:12.227Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Seattle, Washington",
      "city": "Seattle",
      "region": "WA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.85
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": "year",
  "workplace_type": null,
  "salary_currency": "USD"
}

Extensions

{}

Native Structured

{
  "title": "Member of Technical Staff — ML Data Infra",
  "offices": [
    {
      "id": 4030799009,
      "name": "Seattle",
      "location": null,
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Seattle, Washington"
  },
  "metadata": [],
  "updated_at": "2026-06-05T18:20:40-04:00",
  "departments": [
    {
      "id": 4031248009,
      "name": "Engineering",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "company_name": "Nuance Labs",
  "requisition_id": 4162947009,
  "first_published": "2026-06-05T17:40:39-04:00",
  "application_deadline": null
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/2973e12a1a551b27fdd2fc0e2922f968bd4a27fc?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/b5cad4e8-d3e2-423b-934c-3898f78ddee7JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/4d06c175-4ee5-4cda-ad2e-cc1de78b9519JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/2973e12a1a551b27fdd2fc0e2922f968bd4a27fc/eventsJSON

Docs · Get an API key