bluedoor data·Job Postings API·bluedoor.sh ↗

Model Evaluation & Data Quality Lead

Twelve Labs · San Francisco · Hybrid · Active · Ashby

Job facts

Field | Value
Company | Twelve Labs
Title | Model Evaluation & Data Quality Lead
Normalized title | -
Department / team | Tech / Tech, ML Data
Location | San Francisco, CA, United States
Work model | Hybrid / Hybrid
Employment type | Full Time
Salary | -
Status | active
ATS provider | Ashby
Posted / first seen | - / 2026-05-29
Changed / last seen | 2026-05-29 / 2026-06-06

Related slices

Page | What it contains
Company jobs | Active postings from Twelve Labs.
Company breakdowns | Role, location, ATS, and work model facets for this company.
ATS provider jobs | Active postings observed through Ashby.
Provider filtered search | The same provider as a filtered job collection.
City jobs | Active postings in San Francisco.
Department jobs | Active postings in Tech.
Work model jobs | Active Hybrid postings.
Lifecycle events | Open, update, close, and reopen events for this posting.
Original posting | Canonical source or apply URL captured from the ATS.

Linked records

Company: Twelve Labs
Source: b2cd6d28-6899-4576-988b-b73d7b1304d7
ATS provider: Ashby

Description

Who We Are:

At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

About the Role:

You will be a vital member of our ML Data Team – which leads the full spectrum of video-language data preparation and model evaluation. This role comes with high ownership and includes responsibilities such as defining dataset needs and requirements in consultation with our research and product teams; designing and building data pipelines; and driving our post-training model evaluation strategy. You will also be responsible for automating as much of the repetitive partnership, annotation, and quality evaluation work as possible. A desire to work cross functionally and to build relationships is critical for success in this position.

You will:

- Model Evaluation: Design and build robust model evaluation frameworks, automating repetitive processes and maintaining a balanced approach to efficiency and depth in obtaining evaluation metrics and feedback.
- Portfolio Monitoring: Manage resource allocation and timelines, adjusting direction flexibly based on real-time information across all data streams in your product vertical.
- External Partner Collaboration: Enhance dataset and process quality through seamless collaboration with vendors and outsourcing partners.
- Data Quality & Tooling Advancement: Establish labeling guidelines, monitor data quality, and improve tools and infrastructure to build a sustainable data operations framework.
- Internal Collaboration: Partner with Engineering and AI Model teams to align on top priority data needs, design tools such as analytical reports and dashboards, and clearly communicate project progress.

You may be a good fit if you have:

- 5+ years of experience working in an AI focused data operations organization.
- A proven track record designing and executing large scale data or evaluation projects, including gathering, labeling, and post-processing data.
- The ability to analyze messy and complex data, identify overarching patterns, and distill your findings into crisp annotation guidelines or model quality reports.
- Proficiency with Python, LLMs, or other popular industry tools for automation.
- Excellent communication and project management skills, and the ability to support several projects simultaneously.
- A foundational understanding of and interest in LLMs/VLMs and multimodal AI.
- Conviction that data is the key ingredient for the performance and assessment of AI models.

You’ll stand out if you have:

- Experience in data collection and labeling for multimodal language models.
- Experience in red teaming, localization testing, or other evaluation focused fields.
- Experience working with research scientists and engineers.
- Expertise or interest in video-centric domains, such as sports, advertising, and content creation.

Tech Stack:

- Development & Analysis: Python (primarily pandas, Jupyter, etc.)
- Data Management & Visualization: Amazon S3, various data visualization tools (framework-agnostic)
- Project Management Tools: Linear, Notion

Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply! If you are a 0-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at TwelveLabs.

Benefits and Perks:

- 🤝 An open and inclusive culture and work environment.
- 🧑‍💻 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
- 🦷 Full health, dental, and vision benefits.
- ✈️ Flexible PTO and parental leave policy. Office closed the week of Christmas and New Year’s.

Full job record

Job ID: 6fafcf89a3da8e3f44c9b9f216d39ae563d863ed
Org ID: e1334135-ed56-48e2-9b92-52e26c217601
Source ID: b2cd6d28-6899-4576-988b-b73d7b1304d7
Board ID: b2cd6d28-6899-4576-988b-b73d7b1304d7
Provider: ashby
Provider Job Key: 8e2ee3a8-e714-4da3-83ee-eb24504c088b
Title: Model Evaluation & Data Quality Lead
Normalized Title: -
Status: active
Active: yes
Location Text: San Francisco
Department: Tech
Team: Tech, ML Data
Employment Type: full_time
Workplace Type: hybrid
Remote Policy: hybrid
Country: United States
Region: CA
City: San Francisco
Salary Raw: -
Salary Min: -
Salary Max: -
Salary Currency: -
Salary Period: -
Source URL: https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b
Apply URL: https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b/application
First Seen At: 2026-05-29 06:28:04Z
Last Seen At: 2026-06-06 09:38:06Z
Last Checked At: 2026-06-06 09:38:06Z
Last Changed At: 2026-05-29 06:28:04Z
Inactive At: -
Source Posted At: -
Source Updated At: -
Raw Payload Uri: s3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=twelve-labs/date=2026-06-06/2026-06-06T09-37-49-824Z-293a018c27f1809791f51933588ed12ac9ffaa94ecfa0034c60dac6fe0db090f.json
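The raw payload URI above appears to use key=value path segments (provider, board, date). As a minimal sketch, assuming that layout holds for other records too, the URI can be split into its bucket, partitions, and file name with the Python standard library. The function name `parse_raw_payload_uri` is hypothetical and is not part of the bluedoor API.

```python
from urllib.parse import urlparse


def parse_raw_payload_uri(uri: str) -> dict:
    """Split an s3:// URI into bucket, key=value partitions, and file name.

    Assumption: partitions are path segments of the form key=value,
    as seen in the single example URI on this page.
    """
    parsed = urlparse(uri)
    if parsed.scheme != "s3":
        raise ValueError(f"expected an s3:// URI, got {uri!r}")
    segments = parsed.path.lstrip("/").split("/")
    # Keep only segments that look like key=value pairs.
    partitions = dict(seg.split("=", 1) for seg in segments if "=" in seg)
    return {
        "bucket": parsed.netloc,
        "partitions": partitions,
        "filename": segments[-1],
    }


uri = (
    "s3://job-postings-prod-raw-590183727216/raw/provider=ashby/"
    "board=twelve-labs/date=2026-06-06/"
    "2026-06-06T09-37-49-824Z-"
    "293a018c27f1809791f51933588ed12ac9ffaa94ecfa0034c60dac6fe0db090f.json"
)
info = parse_raw_payload_uri(uri)
print(info["partitions"])
```

For this record the partitions come out as provider `ashby`, board `twelve-labs`, and date `2026-06-06`, which match the Provider and Last Seen At fields listed above.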
Event Fields
{
  "content_hash": "4519a8c53aff7daf9ce70713110f84f6ace288a1412fffae24db904817fdfc23",
  "source_hash": "51f0d1b2c12312c1edbe5428a4a0cb27e142353cf1c5eedac2f238a60fb79db6",
  "last_changed_at": "2026-05-29T06:28:04.523Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "San Francisco",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:38:06.213Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.75
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": null,
  "workplace_type": "hybrid",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "8e2ee3a8-e714-4da3-83ee-eb24504c088b",
  "team": "Tech, ML Data",
  "title": "Model Evaluation & Data Quality Lead",
  "jobUrl": "https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b/application",
  "isListed": true,
  "isRemote": false,
  "location": "San Francisco",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Tech",
  "publishedAt": null,
  "workplaceType": "Hybrid",
  "employmentType": "FullTime",
  "secondaryLocations": []
}
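Comparing the Native Structured payload with the flat record fields suggests a simple normalization step: `employmentType` "FullTime" appears as `full_time`, and `workplaceType` "Hybrid" appears as `hybrid`. The sketch below illustrates one way such a mapping could be written. It is an assumption based only on this single record, not the pipeline's actual logic, and the helper names `to_snake_case` and `normalize_native` are hypothetical.

```python
import re


def to_snake_case(value: str) -> str:
    """Convert a CamelCase enum value such as 'FullTime' to 'full_time'."""
    # Insert an underscore at each lowercase/digit-to-uppercase boundary.
    return re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", value).lower()


def normalize_native(native: dict) -> dict:
    """Map a native Ashby-style payload to flat record field names.

    Assumption: the field correspondence is inferred from the one
    record shown on this page.
    """
    return {
        "provider_job_key": native["id"],
        "title": native["title"],
        "department": native["department"],
        "team": native["team"],
        "location_text": native["location"],
        "employment_type": to_snake_case(native["employmentType"]),
        "workplace_type": to_snake_case(native["workplaceType"]),
        "source_url": native["jobUrl"],
        "apply_url": native["applyUrl"],
    }


native = {
    "id": "8e2ee3a8-e714-4da3-83ee-eb24504c088b",
    "team": "Tech, ML Data",
    "title": "Model Evaluation & Data Quality Lead",
    "jobUrl": "https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b",
    "applyUrl": "https://jobs.ashbyhq.com/twelve-labs/8e2ee3a8-e714-4da3-83ee-eb24504c088b/application",
    "location": "San Francisco",
    "department": "Tech",
    "workplaceType": "Hybrid",
    "employmentType": "FullTime",
}
record = normalize_native(native)
print(record["employment_type"], record["workplace_type"])
```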
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/6fafcf89a3da8e3f44c9b9f216d39ae563d863ed?include=description (JSON)
GET https://api.bluedoor.sh/job-postings/v1/orgs/e1334135-ed56-48e2-9b92-52e26c217601 (JSON)
GET https://api.bluedoor.sh/job-postings/v1/sources/b2cd6d28-6899-4576-988b-b73d7b1304d7 (JSON)
GET https://api.bluedoor.sh/job-postings/v1/jobs/6fafcf89a3da8e3f44c9b9f216d39ae563d863ed/events (JSON)