Model Evaluation & Data Quality Lead

Twelve Labs · San Francisco · Hybrid · Active · Ashby

Job facts

Field	Value
Company	Twelve Labs
Title	Model Evaluation & Data Quality Lead
Normalized title	-
Department / team	Tech / Tech, ML Data
Location	San Francisco, CA, United States
Work model	Hybrid / Hybrid
Employment type	Full Time
Salary	-
Status	active
ATS provider	Ashby
Posted / first seen	— / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Linked records

Company	Twelve Labs
Source	b2cd6d28-6899-4576-988b-b73d7b1304d7
ATS provider	Ashby

Description

Who We Are:

At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that can comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang, and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

About the Role:

You will be a vital member of our ML Data Team, which leads the full spectrum of video-language data preparation and model evaluation. This role comes with high ownership and includes responsibilities such as defining dataset needs and requirements in consultation with our research and product teams; designing and building data pipelines; and driving our post-training model evaluation strategy. You will also be responsible for automating as much of the repetitive partnership, annotation, and quality evaluation work as possible. A desire to work cross-functionally and to build relationships is critical for success in this position.

You will:

Model Evaluation: Design and build robust model evaluation frameworks, automating repetitive processes and maintaining a balanced approach to efficiency and depth in obtaining evaluation metrics and feedback.
Portfolio Monitoring: Manage resource allocation and timelines, adjusting direction flexibly based on real-time information across all data streams in your product vertical.
External Partner Collaboration: Enhance dataset and process quality through seamless collaboration with vendors and outsourcing partners.
Data Quality & Tooling Advancement: Establish labeling guidelines, monitor data quality, and improve tools and infrastructure to build a sustainable data operations framework.
Internal Collaboration: Partner with Engineering and AI Model teams to align on top-priority data needs, design tools such as analytical reports and dashboards, and clearly communicate project progress.

You may be a good fit if you have:

5+ years of experience working in an AI-focused data operations organization.
A proven track record of designing and executing large-scale data or evaluation projects, including gathering, labeling, and post-processing data.
The ability to analyze messy and complex data, identify overarching patterns, and distill your findings into crisp annotation guidelines or model quality reports.
Proficiency with Python, LLMs, or other popular industry tools for automation.
Excellent communication and project management skills, and the ability to support several projects simultaneously.
A foundational understanding of and interest in LLMs/VLMs and multimodal AI.
Conviction that data is the key ingredient for the performance and assessment of AI models.

You’ll stand out if you have:

Experience in data collection and labeling for multimodal language models.
Experience in red teaming, localization testing, or other evaluation-focused fields.
Experience working with research scientists and engineers.
Expertise or interest in video-centric domains, such as sports, advertising, and content creation.

Tech Stack:

Development & Analysis: Python (primarily pandas, Jupyter, etc.)
Data Management & Visualization: Amazon S3, various data visualization tools (framework-agnostic)
Project Management Tools: Linear, Notion

Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at TwelveLabs.

Benefits and Perks:

🤝 An open and inclusive culture and work environment.
🧑‍💻 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
🦷 Full health, dental, and vision benefits.
✈️ Flexible PTO and parental leave policy. Office closed the week of Christmas and New Year's.

Full job record

Job ID	6fafcf89a3da8e3f44c9b9f216d39ae563d863ed
Org ID	e1334135-ed56-48e2-9b92-52e26c217601
Source ID	b2cd6d28-6899-4576-988b-b73d7b1304d7
Board ID	b2cd6d28-6899-4576-988b-b73d7b1304d7
Provider	ashby
Provider Job Key	8e2ee3a8-e714-4da3-83ee-eb24504c088b
Title	Model Evaluation & Data Quality Lead
Normalized Title	—
Status	active
Active	yes
Location Text	San Francisco
Department	Tech
Team	Tech, ML Data
Employment Type	full_time
Workplace Type	hybrid
Remote Policy	hybrid
Country	United States
Region	CA
City	San Francisco
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b
Apply URL	https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b/application
First Seen At	2026-05-29 06:28:04Z
Last Seen At	2026-06-06 09:38:06Z
Last Checked At	2026-06-06 09:38:06Z
Last Changed At	2026-05-29 06:28:04Z
Inactive At	—
Source Posted At	—
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=twelve-labs/date=2026-06-06/2026-06-06T09-37-49-824Z-293a018c27f1809791f51933588ed12ac9ffaa94ecfa0034c60dac6fe0db090f.json
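
The Raw Payload Uri above appears to follow a `key=value` partition layout (`provider=`, `board=`, `date=`). As a minimal sketch, assuming that layout holds for other records too, the URI can be split into its bucket, partition values, and file name with the Python standard library. The function name `parse_raw_payload_uri` is hypothetical and not part of any bluedoor tooling.

```python
from urllib.parse import urlparse


def parse_raw_payload_uri(uri: str) -> dict:
    """Split an s3:// URI into bucket, key=value partitions, and file name."""
    parsed = urlparse(uri)
    if parsed.scheme != "s3":
        raise ValueError(f"not an s3 URI: {uri}")
    # Path segments after the bucket; the last one is the object file name.
    segments = parsed.path.lstrip("/").split("/")
    # ASSUMPTION: directory segments containing "=" are key=value partitions.
    partitions = dict(seg.split("=", 1) for seg in segments[:-1] if "=" in seg)
    return {
        "bucket": parsed.netloc,
        "partitions": partitions,
        "filename": segments[-1],
    }


uri = (
    "s3://job-postings-prod-raw-590183727216/raw/provider=ashby/"
    "board=twelve-labs/date=2026-06-06/"
    "2026-06-06T09-37-49-824Z-"
    "293a018c27f1809791f51933588ed12ac9ffaa94ecfa0034c60dac6fe0db090f.json"
)
info = parse_raw_payload_uri(uri)
print(info["partitions"])
```

Segments without an `=` (here, the leading `raw` prefix) are ignored rather than treated as partitions.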

Event Fields

{
  "content_hash": "4519a8c53aff7daf9ce70713110f84f6ace288a1412fffae24db904817fdfc23",
  "source_hash": "51f0d1b2c12312c1edbe5428a4a0cb27e142353cf1c5eedac2f238a60fb79db6",
  "last_changed_at": "2026-05-29T06:28:04.523Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:38:06.213Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": null,
  "workplace_type": "hybrid",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "8e2ee3a8-e714-4da3-83ee-eb24504c088b",
  "team": "Tech, ML Data",
  "title": "Model Evaluation & Data Quality Lead",
  "jobUrl": "https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Tech",
  "publishedAt": null,
  "workplaceType": "Hybrid",
  "employmentType": "FullTime",
  "secondaryLocations": []
}
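
The native Ashby payload uses PascalCase values (`"FullTime"`, `"Hybrid"`) while the normalized record shows `full_time` and `hybrid`. How the pipeline actually performs this mapping is not stated on this page; the sketch below is one plausible way to reproduce the two observed values, and the helper name `to_snake_case` is hypothetical.

```python
import re


def to_snake_case(value: str) -> str:
    """Convert a PascalCase or camelCase token to snake_case."""
    # Insert "_" before every uppercase letter except the first character.
    return re.sub(r"(?<!^)(?=[A-Z])", "_", value).lower()


# Two fields copied from the Native Structured payload above.
native = {"workplaceType": "Hybrid", "employmentType": "FullTime"}

# ASSUMPTION: keys and values are normalized the same way; this only
# reproduces the values observed in this record.
normalized = {to_snake_case(k): to_snake_case(v) for k, v in native.items()}
print(normalized)
```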

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/6fafcf89a3da8e3f44c9b9f216d39ae563d863ed?include=description

GET https://api.bluedoor.sh/job-postings/v1/orgs/e1334135-ed56-48e2-9b92-52e26c217601

GET https://api.bluedoor.sh/job-postings/v1/sources/b2cd6d28-6899-4576-988b-b73d7b1304d7

GET https://api.bluedoor.sh/job-postings/v1/jobs/6fafcf89a3da8e3f44c9b9f216d39ae563d863ed/events
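
The four requests above can be assembled from the Job ID, Org ID, and Source ID in the full job record. The following is a minimal sketch using only the Python standard library. The authentication scheme is an assumption: this page does not say how a key is sent, so the bearer-token header in `fetch_json` is hypothetical, as are the function names.

```python
import json
import urllib.request

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def record_urls(job_id: str, org_id: str, source_id: str) -> dict:
    """Build the four endpoint URLs listed for this posting."""
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?include=description",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(url: str, api_key: str) -> dict:
    """GET a URL and decode the JSON body."""
    # ASSUMPTION: the key is sent as a bearer token; the header the API
    # actually expects is not documented on this page.
    request = urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {api_key}", "Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


urls = record_urls(
    job_id="6fafcf89a3da8e3f44c9b9f216d39ae563d863ed",
    org_id="e1334135-ed56-48e2-9b92-52e26c217601",
    source_id="b2cd6d28-6899-4576-988b-b73d7b1304d7",
)
for name, url in urls.items():
    print(name, url)
```

Calling `fetch_json(urls["job"], api_key)` with a valid key would then retrieve the posting; the call is left out here because it needs network access and credentials.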
