bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesSpeakMachine Learning Engineer, Assessments

Machine Learning Engineer, Assessments

Speak · San Francisco · Hybrid · Active · Ashby

Job facts

FieldValue
CompanySpeak
TitleMachine Learning Engineer, Assessments
Normalized title-
Department / teamEngineering / Engineering, Machine Learning
LocationSan Francisco, CA, United States
Work modelHybrid / Hybrid
Employment typeFull Time
Salary-
Statusactive
ATS providerAshby
Posted / first seen / 2026-05-29
Changed / last seen2026-05-29 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Speak.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Ashby.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in San Francisco.Open
Department jobsActive postings in Engineering.Open
Work model jobsActive Hybrid postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanySpeak
Source24ca8f88-a6d8-465b-84d9-5a17e80a0a5d
ATS providerAshby

Description

About us Our mission is to reinvent the way people learn, starting with language. Learning a language can change a life by opening doors to new cultures, careers, and communities. Two billion people around the world are actively trying to learn a language, but the best way to learn (one-on-one tutoring) is hard to access at scale and hasn’t been meaningfully improved in decades. Speak is building a human-level, AI-powered tutor in your pocket: a conversation-first experience that lets learners actually speak, get instant feedback, and progress through carefully designed lessons. The result is a complete path from beginner to confident speaker across multiple languages. Speak first launched in South Korea in 2019, where Speak has now become the number one language learning app, and we now serve learners across many markets and 15+ languages. Speak is one of the world’s leading AI companies, with over $150m raised in venture investment from OpenAI, Accel, Founders Fund, Khosla Ventures, and more, with a distributed team across San Francisco, Seoul, Tokyo, Taipei, and Ljubljana. About this role We’re hiring an ML Engineer, Assessments to help build best-in-class assessment systems across multiple products (Speak for Business, B2C, and new surfaces). You will work in a tight loop with our Assessment Design Lead (Content/Learning Design) , Machine Learning, Product, and Engineering to turn assessment constructs and rubrics into reliable, scalable scoring + feedback systems. This role owns the implementation, deployment, and ongoing quality of our assessment algorithms and ML systems. While there is immediate need to improve and expand production assessments, this work is also building a platform capability that can be reused across the app. What you’ll be doing Ship and own assessment ML systems end-to-end Build, deploy, and maintain scoring models/pipelines (feature extraction → model training → inference → feedback generation) Own monitoring, regression tests, and ongoing iteration to maintain accuracy targets Define and operationalize evaluation Implement validation/evaluation frameworks for assessments, including metrics, test sets, and offline/online analysis Translate assessment requirements into measurable acceptance criteria and guardrails Partner deeply with the Assessment Design Lead Co-develop the strategy, together with the Content team, to grow assessments into a core platform at Speak Work in a tight weekly loop to deliver incremental improvement Drive near-term delivery across products Stand up or improve summative assessments (spoken language ability) and bring them reliably to production Prototype and validate formative assessment approaches to measure improvement over weeks/months Support data and labeling strategy Help define data needs for training/evaluation (including psychometric measurement needs) Build or improve pipelines that support label collection and analysis (especially for efficacy studies) What we’re looking for Domain expertise in spoken language proficiency assessment (linguistics, applied linguistics, pedagogy, or equivalent experience) Strong experience designing and running evaluation + validation for assessment/scoring systems, and tailoring approaches to a specific product use case 4+ years building automatic proficiency assessment systems (or equivalent depth in closely related scoring/evaluation domains) PhD is helpful but not required Proven ability to ship ML models to production (not only research), including reliability, monitoring, and iteration Strong generalist ML/analysis skills (statistics, Python, PyTorch/model training) Ability to operate cross-functionally and communicate clearly with non-technical partners (Content/LD, PM, leadership) Nice to have Experience with speech/audio ML Experience with psychometrics concepts (reliability/validity, calibration) How we work (collaboration expectations) This role is designed to be highly collaborative with the Assessment Design Lead. Success depends on a tight loop where constructs/rubrics and model outputs co-evolve — not a sequential handoff. Why work at Speak Join a fantastic, tight-knit team at the right time:  we're growing very quickly, we've most recently raised our Series C from some of the top investors in the valley, and we've achieved product-market fit in our initial markets. You'd join at a magical time when a single person could significantly change the course of the company. Do your life's work with people you’ll love working with:  we care strongly about our craft and want every person at Speak to feel like they're growing every day. We believe in the idea that working with people you both enjoy and have respect for makes everything better. We hire thoughtfully and only work with people we admire deeply. Global in nature:  We're live in over 40 countries and launching in a number of new markets soon. We have dedicated offices in San Francisco, Ljubljana, Seoul, and Tokyo, and you’ll have the opportunity to talk to users in each of these regions on a regular basis as well as travel. Impact people's lives in a major way:  Learning a language is one of the single most life-changing skills one can learn, and right now 99% of people never achieve their goal because the process is broken. We’re helping millions of people achieve their goals and improve their lives. Speak does not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.

Full job record

Job IDa25e7f9899312ea0a9f386fc623a73bd854c4539
Org IDea6fe4dc-e761-4563-afdd-b2140adc8761
Source ID24ca8f88-a6d8-465b-84d9-5a17e80a0a5d
Board ID24ca8f88-a6d8-465b-84d9-5a17e80a0a5d
Providerashby
Provider Job Key5deeef02-1cd1-4a3c-903f-8542d6afce70
TitleMachine Learning Engineer, Assessments
Normalized Title
Statusactive
Activeyes
Location TextSan Francisco
DepartmentEngineering
TeamEngineering, Machine Learning
Employment Typefull_time
Workplace Typehybrid
Remote Policyhybrid
CountryUnited States
RegionCA
CitySan Francisco
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://jobs.ashbyhq.com/speak/5deeef02-1cd1-4a3c-903f-8542d6afce70
Apply URLhttps://jobs.ashbyhq.com/speak/5deeef02-1cd1-4a3c-903f-8542d6afce70/application
First Seen At2026-05-29 05:24:07Z
Last Seen At2026-06-06 19:39:55Z
Last Checked At2026-06-06 19:39:55Z
Last Changed At2026-05-29 05:24:07Z
Inactive At
Source Posted At
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=speak/date=2026-06-06/2026-06-06T19-39-51-654Z-308dc55c442f6888a1f44995043af5fed0e2f8eb7bdb67e5c75d7f22ba06d2f2.json
Event Fields
{
  "content_hash": "b6aa404b7ece1c182c94e503c9486476974ce12085a0bc8c3540966bca90cdc3",
  "source_hash": "864ca80cf82eb87ee2c5980c5191b37639584a6132449bbb3285aaee738e1e54",
  "last_changed_at": "2026-05-29T05:24:07.454Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T19:39:55.599Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": null,
  "workplace_type": "hybrid",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "5deeef02-1cd1-4a3c-903f-8542d6afce70",
  "team": "Engineering, Machine Learning",
  "title": "Machine Learning Engineer, Assessments",
  "jobUrl": "https://jobs.ashbyhq.com/speak/5deeef02-1cd1-4a3c-903f-8542d6afce70",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/speak/5deeef02-1cd1-4a3c-903f-8542d6afce70/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Engineering",
  "publishedAt": null,
  "workplaceType": "Hybrid",
  "employmentType": "FullTime",
  "secondaryLocations": []
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/a25e7f9899312ea0a9f386fc623a73bd854c4539?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/ea6fe4dc-e761-4563-afdd-b2140adc8761JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/24ca8f88-a6d8-465b-84d9-5a17e80a0a5dJSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/a25e7f9899312ea0a9f386fc623a73bd854c4539/eventsJSON