bluedoor data·Job Postings API·bluedoor.sh ↗


Data Engineer

xAI · Palo Alto, CA · Active · $240,000–$280,000 / year · Greenhouse

Job facts

Field: Value
Company: xAI
Title: Data Engineer
Normalized title: -
Department / team: Engineering
Location: Palo Alto, CA, United States
Work model: -
Employment type: -
Salary: $240,000–$280,000 / year
Status: active
ATS provider: Greenhouse
Posted / first seen: 2026-04-24 / 2026-05-29
Changed / last seen: 2026-06-04 / 2026-06-06

Related slices

Page: What it contains
Company jobs: Active postings from xAI.
Company breakdowns: Role, location, ATS, and work model facets for this company.
ATS provider jobs: Active postings observed through Greenhouse.
Provider filtered search: The same provider as a filtered job collection.
City jobs: Active postings in Palo Alto.
Department jobs: Active postings in Engineering.
Lifecycle events: Open, update, close, and reopen events for this posting.
Original posting: Canonical source or apply URL captured from the ATS.

Linked records

Company: xAI
Source: 7f9435ac-306c-40d6-ab10-f3e34c22fb92
ATS provider: Greenhouse

Description

ABOUT xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

ABOUT THE ROLE:

At xAI, we are building AI systems that push the frontier of human knowledge and scientific discovery. High-quality data is fundamental to every stage of that mission. Our Data team is responsible for ensuring that the models are trained on the right data, in the right form, at the right quality, across every phase of the training lifecycle. This includes partnering closely with acquisition teams to identify where valuable data can be sourced, determining what data is needed to improve model performance, and building the production pipelines and systems that transform raw inputs into high-quality training data at scale. We work at the intersection of data, infrastructure, and machine learning to ensure our models train effectively and reliably.

As a Data Engineer / AI Engineer on xAI’s Data team, you will be responsible for developing the systems, processes, and production code that power data acquisition, preparation, quality evaluation, and delivery for model training. You will work closely with acquisition teams, ML engineers, and software engineers to identify data needs, build scalable data pipelines, and continuously improve the quality of the data that shapes model behavior.

The ideal candidate combines strong software engineering fundamentals and excellent coding practices with deep intuition for statistics, neural networks, and how data quality influences training outcomes.

RESPONSIBILITIES:

- Analyze the performance and impact of data used throughout the model training lifecycle
- Investigate anomalous model behavior and rigorously identify the data issues that drive poor downstream performance
- Design, build, and improve the data cleaning, transformation, and quality-control steps required to produce high-quality training data
- Research, evaluate, and develop frontier methods for improving data quality and effectiveness in AI model development
- Apply statistical techniques and empirical analysis to make informed, data-driven decisions about dataset quality and model outcomes
- Partner across teams to identify where data needs exist and define the highest-impact opportunities for new data acquisition and improvement
- Build and maintain production-grade data pipelines, tooling, and software systems that ingest, process, validate, and deliver data for training
- Develop metrics, evaluation frameworks, and monitoring systems to assess how data quality influences model behavior at scale
- Fuse data from multiple sources into reliable, usable datasets for research and production model training
- Create shared datasets, tooling, and internal data products that enable other teams to analyze, debug, and improve model performance

BASIC QUALIFICATIONS:

- Bachelor’s degree in computer science, data science, physics, mathematics, or a STEM discipline
- 1+ years of data/software engineering experience (internship experience is applicable)
- Experience in implementing or analyzing language models or neural networks

PREFERRED SKILLS AND EXPERIENCE:

- Professional experience in analytics, data science, machine learning, or data engineering
- Experience building and operating production data pipelines for neural network or large-scale machine learning workloads
- Strong experience with Python and the broader ecosystem of libraries and tools used in modern machine learning and data development
- Experience working with Parquet or similar columnar storage formats in large-scale data systems
- Familiarity with Kubernetes and distributed production environments
- Experience developing predictive models and machine learning pipelines, including clustering, forecasting, anomaly detection, or related techniques
- Experience working with very large-scale datasets, including terabyte- to petabyte-scale data systems
- Strong statistical intuition and the ability to use quantitative analysis to guide technical and product decisions, including familiarity with scaling ladder design studies
- Ability to operate effectively in a dynamic environment with evolving priorities, changing requirements, and fast-moving technical challenges
- Demonstrated ability to take ownership of ambiguous problems, drive projects independently, and develop new expertise where needed

COMPENSATION AND BENEFITS

$240,000 - $280,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Full job record

Job ID: f532498e8a42676f6c832b21f8c3e8aea964d940
Org ID: 5e43ffaa-7f1f-4a14-8ca5-9083852229ec
Source ID: 7f9435ac-306c-40d6-ab10-f3e34c22fb92
Board ID: 7f9435ac-306c-40d6-ab10-f3e34c22fb92
Provider: greenhouse
Provider Job Key: 5120884007
Title: Data Engineer
Normalized Title:
Status: active
Active: yes
Location Text: Palo Alto, CA
Department: Engineering
Team:
Employment Type:
Workplace Type:
Remote Policy:
Country: United States
Region: CA
City: Palo Alto
Salary Raw: COMPENSATION AND BENEFITS $240,000 - $280,000 USD Base salary is just one part of our total rewards package at xAI, which als
Salary Min: 240,000
Salary Max: 280,000
Salary Currency: USD
Salary Period: year
Source URL: https://job-boards.greenhouse.io/xai/jobs/5120884007
Apply URL: https://job-boards.greenhouse.io/xai/jobs/5120884007
First Seen At: 2026-05-29 22:41:25Z
Last Seen At: 2026-06-06 07:34:02Z
Last Checked At: 2026-06-06 07:34:02Z
Last Changed At: 2026-06-04 11:13:39Z
Inactive At:
Source Posted At: 2026-04-24 22:46:13Z
Source Updated At: 2026-06-03 18:41:01Z
Raw Payload Uri: s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=xai/date=2026-06-06/2026-06-06T07-34-01-911Z-c6fca2a525586660e71867723d02e6127472f40aaac154b54f82518d777fd1c8.json
Event Fields
{
  "content_hash": "88ffda2bf6ecd6ba1fecc045b156c937e201fd865364bcd566a917c3cfdecfbd",
  "source_hash": "516e38afa424a60e111bf41e871d1fca9647d73322491e10888a93c4c6832801",
  "last_changed_at": "2026-06-04T11:13:39.282Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Palo Alto, CA",
    "city": "Palo Alto",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.9
  },
  "salary_max": 280000,
  "salary_min": 240000,
  "inferred_at": "2026-06-06T07:34:02.213Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Palo Alto, CA",
      "city": "Palo Alto",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": "year",
  "workplace_type": null,
  "salary_currency": "USD"
}
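The Salary Min, Salary Max, and Salary Currency values in this record appear to be derived from the free-text Salary Raw string. As a rough illustration only, the Python sketch below shows one way such a range could be pulled out of that text. The function name `parse_salary_range` and the regular expression are assumptions made for this example; they are not bluedoor's actual parser.

```python
import re
from typing import Optional, Tuple

# Hypothetical pattern for text such as "$240,000 - $280,000 USD":
# two dollar amounts joined by a hyphen or en dash, then an optional
# three-letter currency code.
_RANGE = re.compile(r"\$([\d,]+)\s*[-\u2013]\s*\$([\d,]+)(?:\s+([A-Z]{3})\b)?")


def parse_salary_range(raw: str) -> Optional[Tuple[int, int, Optional[str]]]:
    """Return (min, max, currency) from a raw salary string, or None if no range is found."""
    match = _RANGE.search(raw)
    if match is None:
        return None
    low = int(match.group(1).replace(",", ""))
    high = int(match.group(2).replace(",", ""))
    # Order the bounds defensively in case the text lists them reversed.
    return (min(low, high), max(low, high), match.group(3))


raw = "COMPENSATION AND BENEFITS $240,000 - $280,000 USD Base salary is just one part"
print(parse_salary_range(raw))  # (240000, 280000, 'USD')
```

The salary period ("year") is not stated in the raw text shown here, so a real parser would need another signal to infer it; this sketch does not attempt that.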
Extensions
{}
Native Structured
{
  "title": "Data Engineer",
  "offices": [
    {
      "id": 4035106007,
      "name": "Palo Alto, CA",
      "location": "Palo Alto, California, United States",
      "child_ids": [],
      "parent_id": 4054926007
    }
  ],
  "language": "en",
  "location": {
    "name": "Palo Alto, CA"
  },
  "metadata": [
    {
      "id": 16340689007,
      "name": "Featured Role",
      "value": null,
      "value_type": "yes_no"
    }
  ],
  "updated_at": "2026-06-03T14:41:01-04:00",
  "departments": [
    {
      "id": 4024733007,
      "name": "Engineering",
      "child_ids": [
        4064292007,
        4064383007,
        4064385007,
        4046294007,
        4052172007,
        4046295007
      ],
      "parent_id": null
    }
  ],
  "company_name": "xAI",
  "requisition_id": 4624267007,
  "first_published": "2026-04-24T18:46:13-04:00",
  "application_deadline": null
}
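The native Greenhouse timestamps above carry a -04:00 offset, while the Source Posted At and Source Updated At fields in the full job record are shown in UTC. The short Python sketch below illustrates that conversion with the standard library. The helper name `to_utc_string` is hypothetical and the output format is simply chosen to match how this page displays timestamps.

```python
from datetime import datetime, timezone


def to_utc_string(iso_timestamp: str) -> str:
    """Convert an ISO 8601 timestamp with a UTC offset to 'YYYY-MM-DD HH:MM:SSZ'."""
    parsed = datetime.fromisoformat(iso_timestamp)
    if parsed.tzinfo is None:
        # Without an offset the instant is ambiguous, so refuse to guess.
        raise ValueError("timestamp has no UTC offset")
    return parsed.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M:%SZ")


print(to_utc_string("2026-06-03T14:41:01-04:00"))  # 2026-06-03 18:41:01Z
print(to_utc_string("2026-04-24T18:46:13-04:00"))  # 2026-04-24 22:46:13Z
```

The two results line up with the Source Updated At and Source Posted At values in the record, which suggests those fields are the native `updated_at` and `first_published` values normalized to UTC.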
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/f532498e8a42676f6c832b21f8c3e8aea964d940?include=description (JSON)
GET https://api.bluedoor.sh/job-postings/v1/orgs/5e43ffaa-7f1f-4a14-8ca5-9083852229ec (JSON)
GET https://api.bluedoor.sh/job-postings/v1/sources/7f9435ac-306c-40d6-ab10-f3e34c22fb92 (JSON)
GET https://api.bluedoor.sh/job-postings/v1/jobs/f532498e8a42676f6c832b21f8c3e8aea964d940/events (JSON)
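The four requests listed here could be assembled and issued from Python roughly as follows. This is a minimal sketch using only the standard library. The helper names `build_urls` and `fetch_json` are hypothetical, and the sketch assumes the endpoints return JSON and need no authentication header, which this page does not confirm; a real client may have to supply an API key.

```python
import json
import urllib.request
from urllib.parse import urlencode

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def build_urls(job_id: str, org_id: str, source_id: str) -> dict:
    """Build the four endpoint URLs shown on this page for one job record."""
    query = urlencode({"include": "description"})
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?{query}",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(url: str, timeout: float = 10.0):
    """GET a URL and decode the body as JSON (assumes no auth is required)."""
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


urls = build_urls(
    job_id="f532498e8a42676f6c832b21f8c3e8aea964d940",
    org_id="5e43ffaa-7f1f-4a14-8ca5-9083852229ec",
    source_id="7f9435ac-306c-40d6-ab10-f3e34c22fb92",
)
for name, url in urls.items():
    print(name, url)
```

Calling `fetch_json(urls["job"])` would then perform the first request; it is left out of the script body so the example runs without network access.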