bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesCohereMember of Technical Staff, Model Efficiency

Member of Technical Staff, Model Efficiency

Cohere · New York · Remote · Active · Ashby

Job facts

FieldValue
CompanyCohere
TitleMember of Technical Staff, Model Efficiency
Normalized title-
Department / teamModeling / Modeling, Modeling
LocationNew York, NY, United States
Work modelRemote / Remote
Employment typeFull Time
Salary-
Statusactive
ATS providerAshby
Posted / first seen / 2026-05-29
Changed / last seen2026-06-03 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Cohere.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Ashby.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in New York.Open
Department jobsActive postings in Modeling.Open
Work model jobsActive Remote postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyCohere
Source9e81ec18-d8a9-42a5-9ba2-4b908e100441
ATS providerAshby

Description

Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers. Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products. Join us on our mission and shape the future! Why this role? Our team is a fast-growing group of researchers and engineers focused on building reliable ML systems and pushing the boundaries of LLM inference efficiency. We develop techniques that improve how models execute in production, driving lower latency, higher throughput, and consistent quality across diverse workloads. As an engineer on this team, you’ll work across the inference stack to improve core performance metrics by diving deep into model execution, identifying bottlenecks, and developing innovative optimizations. You’ll collaborate closely with modeling and systems teams to experiment, measure, and ship improvements that meaningfully accelerate inference. As the team evolves, you’ll have opportunities to build expertise in advanced performance techniques, including GPU/CUDA optimizations, kernel-level improvements, and model execution strategies for MoE and large-scale architectures. Please Note: We have offices in Toronto, Montreal, San Francisco, New York, Paris, Seoul and London. We embrace a remote-friendly environment, and as part of this approach, we strategically distribute teams based on interests, expertise, and time zones to promote collaboration and flexibility. You'll find the Model Efficiency team concentrated in the EST and PST time zones, these are our preferred locations. You may be a good fit for the Model Efficiency team if you have: 5+ years of experience writing high-performance, production-quality code Strong programming skills in C++ or Python (Rust/Go also welcome) Experience working with large language models and familiarity with the LLM inference ecosystem (e.g., vLLM, SGLang, etc.) Ability to diagnose and resolve performance bottlenecks across the model execution stack A strong bias for action — you ship fast, measure impact, and iterate It’s a big plus if you have experience with: GPU programming, CUDA, or low-level systems optimization Language modeling with transformers (MoE, speculative decoding, KV-cache optimizations) Scaling performance-critical distributed systems (e.g., computation, search, storage) If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form , and we will work together to meet your needs. We may use AI-enabled tools to screen and assess applicants against the criteria for this position. This helps our recruiters identify potentially qualified candidates, but it doesn't limit the applications our recruiters may review or consider. Full-Time Employees at Cohere enjoy these Perks: 🤝 An open and inclusive culture and work environment 🧑‍💻 Work closely with a team on the cutting edge of AI research 🍽 Weekly lunch stipend, in-office lunches & snacks 🦷 Full health and dental benefits, including a separate budget to take care of your mental health 🐣 100% Parental Leave top-up for up to 6 months 🎨 Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement 🏙 Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend ✈️ 6 weeks of vacation (30 working days!)

Full job record

Job ID4058df3e54064cde9b6ee6599b2100e458faf422
Org ID9babd07e-e6bc-4a16-a7ac-2dbed3e0a0d6
Source ID9e81ec18-d8a9-42a5-9ba2-4b908e100441
Board ID9e81ec18-d8a9-42a5-9ba2-4b908e100441
Providerashby
Provider Job Key2a989030-6d14-4924-88c1-d878911e26fa
TitleMember of Technical Staff, Model Efficiency
Normalized Title
Statusactive
Activeyes
Location TextNew York
DepartmentModeling
TeamModeling, Modeling
Employment Typefull_time
Workplace Typeremote
Remote Policyremote
CountryUnited States
RegionNY
CityNew York
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://jobs.ashbyhq.com/cohere/2a989030-6d14-4924-88c1-d878911e26fa
Apply URLhttps://jobs.ashbyhq.com/cohere/2a989030-6d14-4924-88c1-d878911e26fa/application
First Seen At2026-05-29 06:40:57Z
Last Seen At2026-06-06 09:27:38Z
Last Checked At2026-06-06 09:27:38Z
Last Changed At2026-06-03 13:37:38Z
Inactive At
Source Posted At
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=ashby/board=cohere/date=2026-06-06/2026-06-06T09-26-21-103Z-ba1870ddcf7f1d50f18d64a517da6e8a0be16c57e1738aba3367650c3fa823df.json
Event Fields
{
  "content_hash": "8751d43cbf4011c7c33df40c7d14340656301436033f6af3c5b7159d2dadfc23",
  "source_hash": "309c926ed90cb4ad549894d0f009a3ab60e33056c325376b718d09cc0fed1c83",
  "last_changed_at": "2026-06-03T13:37:38.587Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "New York",
    "city": "New York",
    "region": "NY",
    "country": "United States",
    "is_remote": true,
    "confidence": 0.75
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T09:27:38.165Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "New York",
      "city": "New York",
      "region": "NY",
      "country": "United States",
      "is_remote": true,
      "confidence": 0.75
    },
    "countries": [
      "United States",
      "Canada"
    ]
  },
  "remote_policy": "remote",
  "salary_period": null,
  "workplace_type": "remote",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "2a989030-6d14-4924-88c1-d878911e26fa",
  "team": "Modeling, Modeling",
  "title": "Member of Technical Staff, Model Efficiency",
  "jobUrl": "https://jobs.ashbyhq.com/cohere/2a989030-6d14-4924-88c1-d878911e26fa",
  "address": null,
  "applyUrl": "https://jobs.ashbyhq.com/cohere/2a989030-6d14-4924-88c1-d878911e26fa/application",
  "isListed": true,
  "isRemote": true,
  "location": "New York",
  "updatedAt": null,
  "apiVersion": "ashby-non-user-graphql-v1",
  "department": "Modeling",
  "publishedAt": null,
  "workplaceType": "Remote",
  "employmentType": "FullTime",
  "secondaryLocations": [
    {
      "location": "Toronto"
    },
    {
      "location": "San Francisco"
    },
    {
      "location": "Montreal"
    }
  ]
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/4058df3e54064cde9b6ee6599b2100e458faf422?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/9babd07e-e6bc-4a16-a7ac-2dbed3e0a0d6JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/9e81ec18-d8a9-42a5-9ba2-4b908e100441JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/4058df3e54064cde9b6ee6599b2100e458faf422/eventsJSON