bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesNuance LabsMember of Technical Staff — ML Data Infra

Member of Technical Staff — ML Data Infra

Nuance Labs · Seattle, Washington · Active · $200,000–$300,000 / year · Greenhouse

Job facts

FieldValue
CompanyNuance Labs
TitleMember of Technical Staff — ML Data Infra
Normalized title-
Department / teamEngineering
LocationSeattle, WA, United States
Work model-
Employment type-
Salary$200,000–$300,000 / year
Statusactive
ATS providerGreenhouse
Posted / first seen2026-06-05 / 2026-06-06
Changed / last seen2026-06-06 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Nuance Labs.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Greenhouse.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Seattle.Open
Department jobsActive postings in Engineering.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyNuance Labs
Source4d06c175-4ee5-4cda-ad2e-cc1de78b9519
ATS providerGreenhouse

Description

About Nuance Labs Nuance Labs is building photorealistic, real-time AI avatars with emotional intelligence: a full-duplex audiovisual system that can listen, speak, react, interrupt, and respond like a real person. We're a Series A company ($60M raised) backed by Lightspeed, Accel, South Park Commons, NVentures, and Define Ventures, with PhDs from MIT, UW, Oxford, CMU, and Johns Hopkins, and industry experience from Apple, Meta, Amazon AGI, and Discord. The team is small, the work is real, and the problems are unsolved. How Nuance Differentiates Most conversational AI avatars today are hacks — a face slapped on a speech-to-speech pipeline, stuck in the uncanny valley: emotionless, mechanical, one-turn-at-a-time. Current systems take 2–5 seconds to respond; natural conversation requires sub-500ms. That's a 10x improvement, and it demands rethinking the entire stack. That rethinking starts with full-duplex: an AI that listens and speaks simultaneously, perceives emotion in real time, and responds with a face that actually reflects it. It's an extremely hard problem, and we're developing foundation models designed for it from the ground up. About the Role Model quality is ultimately a data problem. The best architecture and the best training run can't outrun bad, slow, or poorly curated data — and at the scale we're operating, the difference between a good data pipeline and a great one shows up directly in the model. We're looking for someone who lives and breathes data at scale. You know how to build pipelines that are fast, reliable, and maintainable — and you're just as comfortable taking a researcher's messy processing script and turning it into something that runs on petabytes as you are designing a new pipeline architecture from scratch. Research moves fast here, and the ability to productionize quickly without losing fidelity is the core skill. Our data is multimodal — video, audio, and text — and the processing requirements are demanding: high throughput, low error rates, and strict quality filters. There's a lot of interesting engineering work here, and the impact is direct and measurable. What You'll Do Design, build, and operate large-scale data pipelines for ingestion, processing, filtering, and curation of multimodal training data (video, audio, text) Take research-grade data processing code and turn it into robust, production-level pipelines — quickly and without losing correctness Optimize pipeline throughput and efficiency at scale; identify and eliminate bottlenecks across compute, I/O, and storage Build and maintain data quality systems — deduplication, filtering, validation, and quality scoring at scale Manage petabyte-scale datasets: storage architecture, versioning, lineage tracking, and cost efficiency Work closely with researchers to understand data requirements and translate them into scalable processing systems Build tooling and infrastructure that makes the research team faster — efficient data access, reproducible processing, and fast iteration loops What We're Looking For Proven experience building and operating large-scale data pipelines in production — you've processed data at a scale where naive approaches break Strong proficiency with distributed data processing frameworks — Spark, Ray, Dask, or similar — and a clear sense of when to use each Solid software engineering fundamentals: you write clean, testable, maintainable code and understand why that matters when pipelines run unattended at scale Experience with multimodal data (video, audio) is a strong plus — understanding of formats, codecs, and processing libraries (FFmpeg, decord, etc.) Familiarity with ML data pipelines specifically — understanding of how data quality and format affect model training Ability to move fast: you can take a prototype script from a researcher and ship a production version in days, not weeks Bonus Points Experience building data pipelines for large-scale model training (pre-training or fine-tuning) Familiarity with data versioning and lineage tools (DVC, Delta Lake, Apache Iceberg, etc.) Experience with streaming data pipelines or online data processing Prior work at an AI lab, video platform, or other data-intensive company Contributions to open-source data tooling Compensation $200,000 – $300,000 base salary, plus meaningful equity. We think long-term ownership matters and structure equity accordingly. Logistics Location: In-person in Seattle, 5 days a week — we believe in the compounding value of working shoulder-to-shoulder Health: HSA plan with ~$2,000 in company contributions — about 2x what most big tech companies offer PTO: 15 days + public holidays, and we close for a full week over the holidays Lunch, beverages, and snacks: On us, every workday — the kind of thing that makes you actually look forward to the workday Commuter benefits 401K: In the works Nuance Labs is an equal opportunity employer. We believe diverse teams build better AI.

Full job record

Job ID2973e12a1a551b27fdd2fc0e2922f968bd4a27fc
Org IDb5cad4e8-d3e2-423b-934c-3898f78ddee7
Source ID4d06c175-4ee5-4cda-ad2e-cc1de78b9519
Board ID4d06c175-4ee5-4cda-ad2e-cc1de78b9519
Providergreenhouse
Provider Job Key4277601009
TitleMember of Technical Staff — ML Data Infra
Normalized Title
Statusactive
Activeyes
Location TextSeattle, Washington
DepartmentEngineering
Team
Employment Type
Workplace Type
Remote Policy
CountryUnited States
RegionWA
CitySeattle
Salary RawCompensation $200,000 – $300,000 base salary, plus meaningful equity
Salary Min200,000
Salary Max300,000
Salary CurrencyUSD
Salary Periodyear
Source URLhttps://job-boards.greenhouse.io/nuancelabs/jobs/4277601009
Apply URLhttps://job-boards.greenhouse.io/nuancelabs/jobs/4277601009
First Seen At2026-06-06 07:33:06Z
Last Seen At2026-06-06 20:12:12Z
Last Checked At2026-06-06 20:12:12Z
Last Changed At2026-06-06 07:33:06Z
Inactive At
Source Posted At2026-06-05 21:40:39Z
Source Updated At2026-06-05 22:20:40Z
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=nuancelabs/date=2026-06-06/2026-06-06T20-12-12-157Z-ffe23d7407e1e5ada742398a50f3dfa98b1687e7d7d3baed3187df60c4819845.json
Event Fields
{
  "content_hash": "a4df4ab58c694d325debb7fb0aa861d756aa0580dd83b70427391bd044340511",
  "source_hash": "3cd5110f3162f4ccdc6eec78b938a4a59f637d2c76fd6ca10fa8bf02440b0ba0",
  "last_changed_at": "2026-06-06T07:33:06.309Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Seattle, Washington",
    "city": "Seattle",
    "region": "WA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.85
  },
  "salary_max": 300000,
  "salary_min": 200000,
  "inferred_at": "2026-06-06T20:12:12.227Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Seattle, Washington",
      "city": "Seattle",
      "region": "WA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.85
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": "year",
  "workplace_type": null,
  "salary_currency": "USD"
}
Extensions
{}
Native Structured
{
  "title": "Member of Technical Staff — ML Data Infra",
  "offices": [
    {
      "id": 4030799009,
      "name": "Seattle",
      "location": null,
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Seattle, Washington"
  },
  "metadata": [],
  "updated_at": "2026-06-05T18:20:40-04:00",
  "departments": [
    {
      "id": 4031248009,
      "name": "Engineering",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "company_name": "Nuance Labs",
  "requisition_id": 4162947009,
  "first_published": "2026-06-05T17:40:39-04:00",
  "application_deadline": null
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/2973e12a1a551b27fdd2fc0e2922f968bd4a27fc?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/b5cad4e8-d3e2-423b-934c-3898f78ddee7JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/4d06c175-4ee5-4cda-ad2e-cc1de78b9519JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/2973e12a1a551b27fdd2fc0e2922f968bd4a27fc/eventsJSON