Home › Companies › Plaud › Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco
Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco
Plaud · San Francisco, CA · Hybrid · Active · $200,000–$365,000 / year · Ashby
Job facts
| Field | Value |
|---|---|
| Company | Plaud |
| Title | Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco |
| Normalized title | - |
| Department / team | Global Product R&D Center / Global Product R&D Center, AI R&D |
| Location | San Francisco, CA, United States |
| Work model | Hybrid / Hybrid |
| Employment type | Full Time |
| Salary | $200,000–$365,000 / year |
| Status | active |
| ATS provider | Ashby |
| Posted / first seen | — / 2026-05-29 |
| Changed / last seen | 2026-05-29 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Plaud. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Ashby. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in San Francisco. | Open |
| Department jobs | Active postings in Global Product R&D Center. | Open |
| Work model jobs | Active Hybrid postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Plaud |
| Source | 52f7d15c-b668-4bcb-999b-6fd4ff6cf4f2 |
| ATS provider | Ashby |
Description
About Plaud Inc. Plaud is building the world's most trusted AI work companion for professionals to elevate productivity and performance through note-taking solutions, loved by over 1,500,000 users worldwide since 2023. With a mission to amplify human intelligence, Plaud is building the next-generation intelligence infrastructure and interfaces to capture, extract, and utilize what you say, hear, see, and think.
Plaud Inc. is a Delaware-incorporated, San Francisco-based company pushing the boundary of human–AI intelligence through a hardware–software combination. With SOC 2, HIPAA, GDPR, ISO27001, ISO27701, and EN18031 compliance, Plaud is committed to the highest standards of data security and privacy protection.
To learn more about Plaud, please visit https://www.Plaud.ai and follow along on Instagram , X , Facebook , LinkedIn , and YouTube
Why You Should Join Us
Plaud is building the next generation intelligence infrastructure and interfaces to capture, extract, and utilize intelligence from what people say, hear, see, and think.
Plaud is a bootstrapped, skyrocketing, profitable company with a $250M revenue run rate achieved in just three years.
Define the next-gen paradigm for human-AI interaction.
Gain exposure to cutting-edge AI for Pro tools and play a direct role in our global expansion.
Work with passionate teammates who value innovation, collaboration, and customer success.
Grow your career in a culture that champions continuous learning and fast career development.
Market-competitive compensation, global exposure, and a vibrant, creativity-fueled work atmosphere.
You may be a good fit if you: Have a passion for turning ambiguous, subjective concepts like a voice's naturalness, expressiveness, or conversational cadence into clear, defensible, and automated metrics that researchers and leadership can rely on.
Possess strong software engineering skills (especially in Python) and have experience building reliable distributed systems, data pipelines, or evaluation harnesses that can run at scale against live model checkpoints.
Can deeply partner with ML researchers to define exactly what "good" looks like for a Speech LLM, translating capabilities (like ASR robustness in noisy environments or TTS emotional steerability) into measurable benchmarks.
Are comfortable building and owning dashboards that track model health during training, improving signal-to-noise ratios, reducing evaluation latency, and making performance regressions impossible to miss.
Rapidly debug anomalous mid-training results to determine if a drop in performance stems from the model architecture, corrupted data, or infrastructure.
Communicate complex statistical results and model behaviors clearly to both technical and non-technical stakeholders.
Strong candidates may also have experience with: Speech Metrics: Deep familiarity with both traditional (WER, CER, PESQ, etc) and modern audio evaluation frameworks (automated MOS scoring).
LLM-as-a-Judge: Using frontier models or finetune multi-modal LLMs to evaluate the conversational logic, transcription accuracy, audio quality, and reasoning of audio models.
Human Evaluation: Managing large-scale crowdsourcing operations or preference data collection to support RLHF/DPO efforts.
Observability: A strong background in statistics and experimental design, paired with experience building trusted tracking dashboards (e.g., Weights & Biases, MLflow).
Adversarial Datasets: Curating complex datasets to test edge cases, such as heavy accents, overlapping speech, or highly noisy acoustic environments.
What We Offer Founding Team Initiative: Opportunity to be an early, foundational member of our core SpeechLLM lab, with meaningful ownership and impact on a fast-growing startup.
Competitive Compensation: $200K - $365K base salary + performance bonus + Equity.
Comprehensive Benefits: Top-tier healthcare for employees and dependents, including dental and vision, and a generous employer subsidy.
Retirement Planning: 401(k) plan for full-time employees with company matching.
Paid Time Off: Unlimited PTO, plus 13 paid holidays.
New Parent Leave: 12 weeks of paid time off to spend time with your new family, regardless of gender.
Hybrid Office: Minimum of 3x in-office per week to foster highly collaborative, fast-paced research.
Gear & Perks: Choice of top-of-the-line laptops/workstations, annual offsites, and a fully stocked office.
Plaud is and will continue to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristics.
Full job record
| Job ID | a326eb62bc3ac8ccbbd6f7d2f1d6be2ad22d6de2 |
| Org ID | 06147ebb-2ddf-41ba-9b05-110b64c4ff0b |
| Source ID | 52f7d15c-b668-4bcb-999b-6fd4ff6cf4f2 |
| Board ID | 52f7d15c-b668-4bcb-999b-6fd4ff6cf4f2 |
| Provider | ashby |
| Provider Job Key | 4b0b818e-7ad1-42ef-ba0f-03901856f91f |
| Title | Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | San Francisco, CA |
| Department | Global Product R&D Center |
| Team | Global Product R&D Center, AI R&D |
| Employment Type | full_time |
| Workplace Type | hybrid |
| Remote Policy | hybrid |
| Country | United States |
| Region | CA |
| City | San Francisco |
| Salary Raw | Compensation: $200K - $365K base salary + performance bonus + Equity |
| Salary Min | 200,000 |
| Salary Max | 365,000 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://jobs.ashbyhq.com/Plaud/4b0b818e-7ad1-42ef-ba0f-03901856f91f |
| Apply URL | https://jobs.ashbyhq.com/Plaud/4b0b818e-7ad1-42ef-ba0f-03901856f91f/application |
| First Seen At | 2026-05-29 05:39:15Z |
| Last Seen At | 2026-06-06 20:21:54Z |
| Last Checked At | 2026-06-06 20:21:54Z |
| Last Changed At | 2026-05-29 05:39:15Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=Plaud/date=2026-06-06/2026-06-06T20-21-46-152Z-aa90fb12bb9f9f524cf20df2a0642a78f17f01b6e9c7edd4c3c56e2ad2f291f5.json |
Event Fields
{
"content_hash": "3a9d85c39c54d5d56fde3b85619e3d3b7aa127bb72217127b578b516ea4dc70e",
"source_hash": "1f73288c9e9336f866161b3d51bf8d51d3e755b24def1aa9232414883f01746e",
"last_changed_at": "2026-05-29T05:39:15.345Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "San Francisco, CA",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"salary_max": 365000,
"salary_min": 200000,
"inferred_at": "2026-06-06T20:21:54.564Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "San Francisco, CA",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"countries": [
"United States"
]
},
"remote_policy": "hybrid",
"salary_period": "year",
"workplace_type": "hybrid",
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"id": "4b0b818e-7ad1-42ef-ba0f-03901856f91f",
"team": "Global Product R&D Center, AI R&D",
"title": "Machine Learning Engineer, Model Evaluations (Speech LLM) - San Francisco",
"jobUrl": "https://jobs.ashbyhq.com/Plaud/4b0b818e-7ad1-42ef-ba0f-03901856f91f",
"address": null,
"applyUrl": "https://jobs.ashbyhq.com/Plaud/4b0b818e-7ad1-42ef-ba0f-03901856f91f/application",
"isListed": true,
"isRemote": false,
"location": "San Francisco, CA",
"updatedAt": null,
"apiVersion": "ashby-non-user-graphql-v1",
"department": "Global Product R&D Center",
"publishedAt": null,
"workplaceType": "Hybrid",
"employmentType": "FullTime",
"secondaryLocations": []
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/a326eb62bc3ac8ccbbd6f7d2f1d6be2ad22d6de2?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/06147ebb-2ddf-41ba-9b05-110b64c4ff0bJSONGET https://api.bluedoor.sh/job-postings/v1/sources/52f7d15c-b668-4bcb-999b-6fd4ff6cf4f2JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/a326eb62bc3ac8ccbbd6f7d2f1d6be2ad22d6de2/eventsJSON