bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesTriSenior Data Engineer

Senior Data Engineer

Tri · Los Altos, CA · Hybrid · Active · $180,000–$258,750 / year · Lever

Job facts

FieldValue
CompanyTri
TitleSenior Data Engineer
Normalized title-
Department / teamAutomated Driving Advanced Development / Automated Driving Advanced Development
LocationLos Altos, CA, United States
Work modelHybrid / Hybrid
Employment typeFull Time
Salary$180,000–$258,750 / year
Statusactive
ATS providerLever
Posted / first seen2025-08-04 / 2026-05-29
Changed / last seen2026-05-29 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Tri.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Lever.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Los Altos.Open
Department jobsActive postings in Automated Driving Advanced Development.Open
Work model jobsActive Hybrid postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyTri
Sourcea86dbed4-1715-4a6f-9b42-9a7a485b919b
ATS providerLever

Description

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team advancing the state of the art in AI, robotics, driving, and material sciences. The Automated Driving Advanced Development division at TRI will focus on enabling innovation and transformation at Toyota by building a bridge between TRI research and Toyota products, services, and needs. We achieve this through partnership, collaboration, and shared commitment. This new division is leading a new cross-organizational project between TRI and Woven by Toyota to conduct research and develop a fully end-to-end learned driving stack. This cross-org collaborative project is harmonious with TRI’s robotics divisions' efforts in Diffusion Policy and Large Behavior Models. We are looking for a Senior Data Engineer to design and build the foundational data infrastructure and tools that power our autonomy research and development workflows. This includes large-scale ingestion pipelines, structured feature stores, labeling infrastructure, scene search and data discovery tools, and performance diagnostics for machine learning and simulation workflows. Please include links to any relevant open-source contributions or technical project write-ups with your application. The pay range for this position at commencement of employment is expected to be between $180,000 and $258,750/year for California-based roles. Base pay offered will depend on multiple individualized factors, including, but not limited to, a candidate's experience, skills, job-related knowledge, and market location. TRI offers a generous benefits package including medical, dental, and vision insurance, 401(k) eligibility, paid time off benefits (including vacation, sick time, and parental leave), and an annual cash bonus structure. Additional details regarding these benefit plans will be provided if an employee receives an offer of employment. Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information. TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws. It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment. Responsibilities Design and implement scalable, production-grade pipelines for data ingestion, transformation, storage, and retrieval from vehicle fleets and simulation environments. Build internal tools and services for data labeling, curation, indexing, and cataloging across large and diverse datasets. Collaborate with ML researchers, autonomy engineers, and data scientists to design schemas and APIs that power model training, evaluation, and debugging. Develop and maintain feature stores, metadata systems, and versioning infrastructure for structured and unstructured data. Support the generation and integration of synthetic datasets with real-world logs to enable hybrid training and simulation workflows. Optimize pipelines for cost, latency, and traceability, ensuring reproducibility and consistency across environments. Partner with simulation and cloud platform teams to automate workflows for closed-loop testing, scenario mining, and performance analytics. Qualifications Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field. 8+ years of experience building data-intensive software systems, ideally in robotics, autonomous driving, or large-scale ML environments. Proficient in Python, SQL, and familiar with C++. Experience designing ETL pipelines using modern frameworks (e.g., Apache Spark, Flyte, Union). Strong knowledge of cloud-native architectures, including AWS services (e.g., S3, or equivalents (Google Cloud platform) Familiarity with sensor data types (camera, lidar, radar, GPS/IMU) and common data serialization formats (e.g., protobuf. ROS2bag, MCAP). Deep understanding of data quality, observability, and lineage in high-volume systems. Track record of building reliable and performant infrastructure that supports both ad-hoc exploration and repeatable production workflows. Bonus Qualifications Experience in AD/ADAS, robotics, or autonomous systems — especially handling perception or planning datasets. Familiarity with ML pipeline orchestration frameworks (e.g. Kubeflow, SageMaker, etc). Experience working with temporal or spatial data, including geospatial indexing and time-series alignment. Exposure to synthetic data generation, simulation logging, or scenario replay pipelines. Strong software engineering fundamentals, CI/CD, testing, code review, and service deployment best practices. Experience collaborating with cross-functional, distributed teams across research and production orgs.

Full job record

Job ID91b3c744a5b506ee0d35f2be10d9f3591ef22b87
Org ID98ed24bc-1213-4fd4-8b99-e5bf3b99939c
Source IDa86dbed4-1715-4a6f-9b42-9a7a485b919b
Board IDa86dbed4-1715-4a6f-9b42-9a7a485b919b
Providerlever
Provider Job Key5b1a4365-272b-4541-8f7d-46f0bf0d3e4f
TitleSenior Data Engineer
Normalized Title
Statusactive
Activeyes
Location TextLos Altos, CA
DepartmentAutomated Driving Advanced Development
TeamAutomated Driving Advanced Development
Employment TypeFull-time
Workplace Typehybrid
Remote Policyhybrid
CountryUnited States
RegionCA
CityLos Altos
Salary Rawpay range for this position at commencement of employment is expected to be between $180,000 and $258,750/year for California-based roles
Salary Min180,000
Salary Max258,750
Salary CurrencyUSD
Salary Periodyear
Source URLhttps://jobs.lever.co/tri/5b1a4365-272b-4541-8f7d-46f0bf0d3e4f
Apply URLhttps://jobs.lever.co/tri/5b1a4365-272b-4541-8f7d-46f0bf0d3e4f/apply
First Seen At2026-05-29 07:01:10Z
Last Seen At2026-06-06 07:56:13Z
Last Checked At2026-06-06 07:56:13Z
Last Changed At2026-05-29 07:01:10Z
Inactive At
Source Posted At2025-08-04 23:34:35Z
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=lever/board=tri/date=2026-06-06/2026-06-06T07-56-13-141Z-6eb47d9345995bb19af48485e87a9c1ecb73625419546549e0232425da74ff45.json
Event Fields
{
  "content_hash": "1d3d42db383c0c15c586c867c4d04dc027a29019fc337365b88bcbfae9a8a48a",
  "source_hash": "8f867063e131f4d469dd3e0a64406088fa1ea40ac80007425c0f4a398cada84f",
  "last_changed_at": "2026-05-29T07:01:10.652Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Los Altos, CA",
    "city": "Los Altos",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.9
  },
  "salary_max": 258750,
  "salary_min": 180000,
  "inferred_at": "2026-06-06T07:56:13.593Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Los Altos, CA",
      "city": "Los Altos",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": "year",
  "workplace_type": "hybrid",
  "salary_currency": "USD"
}
Extensions
{}
Native Structured
{
  "lists": [
    {
      "text": "Responsibilities ",
      "content": "\n<li>Design and implement scalable, production-grade pipelines for data ingestion, transformation, storage, and retrieval from vehicle fleets and simulation environments.</li>\n<li>Build internal tools and services for data labeling, curation, indexing, and cataloging across large and diverse datasets.</li>\n<li>Collaborate with ML researchers, autonomy engineers, and data scientists to design schemas and APIs that power model training, evaluation, and debugging.</li>\n<li>Develop and maintain feature stores, metadata systems, and versioning infrastructure for structured and unstructured data.</li>\n<li>Support the generation and integration of synthetic datasets with real-world logs to enable hybrid training and simulation workflows.</li>\n<li>Optimize pipelines for cost, latency, and traceability, ensuring reproducibility and consistency across environments.</li>\n<li>Partner with simulation and cloud platform teams to automate workflows for closed-loop testing, scenario mining, and performance analytics.</li>\n"
    },
    {
      "text": "Qualifications",
      "content": "\n<li>Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.</li>\n<li>8+ years of experience building data-intensive software systems, ideally in robotics, autonomous driving, or large-scale ML environments.</li>\n<li>Proficient in Python, SQL, and familiar with C++.</li>\n<li>Experience designing ETL pipelines using modern frameworks (e.g., Apache Spark, Flyte, Union).</li>\n<li>Strong knowledge of cloud-native architectures, including AWS services (e.g., S3, or equivalents (Google Cloud platform)</li>\n<li>Familiarity with sensor data types (camera, lidar, radar, GPS/IMU) and common data serialization formats (e.g., protobuf. ROS2bag, MCAP).</li>\n<li>Deep understanding of data quality, observability, and lineage in high-volume systems.</li>\n<li>Track record of building reliable and performant infrastructure that supports both ad-hoc exploration and repeatable production workflows.</li>\n"
    },
    {
      "text": "Bonus Qualifications",
      "content": "\n<li>Experience in AD/ADAS, robotics, or autonomous systems — especially handling perception or planning datasets.</li>\n<li>Familiarity with ML pipeline orchestration frameworks (e.g. Kubeflow, SageMaker, etc).</li>\n<li>Experience working with temporal or spatial data, including geospatial indexing and time-series alignment.</li>\n<li>Exposure to synthetic data generation, simulation logging, or scenario replay pipelines.</li>\n<li>Strong software engineering fundamentals, CI/CD, testing, code review, and service deployment best practices.</li>\n<li>Experience collaborating with cross-functional, distributed teams across research and production orgs.</li>\n"
    }
  ],
  "country": "US",
  "createdAt": 1754350475592,
  "updatedAt": null,
  "categories": {
    "team": "Automated Driving Advanced Development",
    "location": "Los Altos, CA",
    "commitment": "Full-time",
    "department": "Automated Driving Advanced Development",
    "allLocations": [
      "Los Altos, CA"
    ]
  },
  "salaryRange": null,
  "workplaceType": "hybrid"
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/91b3c744a5b506ee0d35f2be10d9f3591ef22b87?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/98ed24bc-1213-4fd4-8b99-e5bf3b99939cJSON
GET https://api.bluedoor.sh/job-postings/v1/sources/a86dbed4-1715-4a6f-9b42-9a7a485b919bJSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/91b3c744a5b506ee0d35f2be10d9f3591ef22b87/eventsJSON