Home › Companies › Terawattinfrastructure › Senior Data Engineer

Senior Data Engineer

Terawattinfrastructure · San Francisco, California · Hybrid · Deleted · $150,000–$180,000 / year · Lever

Job facts

Field	Value
Company	Terawattinfrastructure
Title	Senior Data Engineer
Normalized title	-
Department / team	Technology / Software
Location	San Francisco, CA, United States
Work model	Hybrid / Hybrid
Employment type	Full Time
Salary	$150,000–$180,000 / year
Status	deleted
ATS provider	Lever
Posted / first seen	2026-03-23 / 2026-05-29
Changed / last seen	2026-06-03 / 2026-06-01

Related slices

Page	What it contains	Open
Company jobs	Active postings from Terawattinfrastructure.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Lever.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in San Francisco.	Open
Department jobs	Active postings in Technology.	Open
Work model jobs	Active Hybrid postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Terawattinfrastructure
Source	eb305609-d2cf-4232-953d-34249bc6fa1b
ATS provider	Lever

Description

About Terawatt Infrastructure The once in a century transition to autonomous and electric vehicles is underway and will require a multi-trillion-dollar investment in energy and charging infrastructure, and the real estate to site it on. Terawatt is the leader in delivering large scale, turnkey charging solutions for companies rapidly deploying AV and EV fleets. Whether it’s an urban mobility hub, or a carefully located multi-fleet hub for semi-trucks, Terawatt brings the talent, capabilities, and capital to create reliable, cost-effective solutions for customers on the leading edge of the transition to the next generation of transport. With a growing portfolio of sites across the US in urban hubs and along key logistics and transportation corridors and logistics hubs, Terawatt is building the permanent transportation and logistics infrastructure of tomorrow through a robust combination of capital, real estate, development, and site operations solutions. The company develops, finances, owns, and operates charging solutions that take the cost and complexity out of electrifying fleets. At Terawatt, we execute humbly and with urgency to provide tailored solutions for fleets that delight our clients and support the transition of transportation. We are building a team that represents a variety of backgrounds, perspectives, and skills. At Terawatt, we continuously strive to foster inclusion, humility, energizing relationships, and belonging, and welcome new ideas. We're growing and want you to grow with us. We encourage people from all backgrounds to apply. If a reasonable accommodation is required to fully participate in the job application or interview process, or to perform the essential functions of the position, please contact [email protected]. Terawatt Infrastructure is an equal-opportunity employer. Role Description We are seeking a highly skilled Senior Data Engineer to join our growing team. In this role, you will design and implement scalable and efficient data architectures to support our business needs. You will collaborate closely with data scientists, analysts, and other cross-functional teams to build and optimize data pipelines, ensuring that data is accessible, secure, and well-structured for analytics and reporting. A key part of this role involves developing and maintaining data models, databases, and data lakes, while implementing robust data governance and quality assurance practices. You will drive the development of scalable data infrastructure aligned with company architecture standards and best practices. This role also requires curiosity and a commitment to building and maintaining production data lake pipelines that transform raw time-series data into ML-ready features, training datasets, and batch predictions. This includes ensuring data quality, reproducibility, and reliable retraining so ML outputs—such as forecasts and risk scores—can be trusted by downstream systems. Problems You Will Solve Turning messy operational data into reliable signals by building pipelines that transform noisy, incomplete, and high-volume time-series data into trusted datasets for analytics, product features, and ML workflows Design a resilient lakehouse platform by architecting a scalable Databricks-based platform that support both streaming and batch workloads while ensuring governance, observability, and reliability Enable production-ready ML pipelines by creating reproducible workflows, reliable feature datasets, and batch prediction pipelines that downstream systems can depend on Enable self-service analytics and ML by building infrastructure and abstractions that allow analysts, engineers, and data scientists to independently explore and use data Scale a platform for product and analytics by designing systems that support operational product features, internal reporting, and ML use cases without compromising performance or data quality Core Responsibilities Architect and evolve a Databricks-based data platform that serves as the scalable foundation for product features, internal reporting, and ML workflows. Set technical standards for modeling raw data into clean, reliable datasets, ensuring high integrity and point-in-time accuracy for both BI and ML applications. Build and maintain self-service tooling and infrastructure abstractions that improve the developer experience for data producers, analysts, and data scientists. Design and optimize high-performance ETL/ELT pipelines using Delta Live Tables and Structured Streaming to handle seamless ingestion from diverse data sources. Own platform observability, testing, and proactive monitoring to ensure the performance and reliability of critical data delivery and pipeline health. Architect and enforce data security, compliance, and access controls by implementing Unity Catalog and IAM (Identity and Access Management) best practices across the enterprise. Build and maintain production-grade pipelines that transform raw data into ML-ready features, training datasets, and reliable batch predictions. Lead Infrastructure as Code (IaC) initiatives using Terraform and improve team productivity by identifying technical debt and automating complex deployment workflows. Partner with Engineering, Product, and Business teams to resolve ambiguities and ensure shipped data features are impactful, reliable, and aligned with business outcomes. Build and maintain a self-service data lake environment, empowering non-data engineers and stakeholders to discover, explore, and analyze data independently. Promote engineering excellence through code reviews, documentation, and technical standards for orchestration and testing. Minimum Qualifications Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field. 6+ years in data engineering, platform development, or large-scale data systems. Hands-on experience with Databricks or modern lakehouse platforms and cloud platforms (AWS, GCP, or Azure). Experience building scalable ETL/ELT pipelines using Spark and SQL. Proficiency in SQL and experience with NoSQL databases (e.g., MongoDB, Cassandra, DynamoDB). Strong understanding of data modeling, schema design, and performance optimization. Experience building reliable, production-grade data pipelines with a focus on data quality and observability. Experience supporting analytics and/or ML workflows, including preparing ML-ready datasets. Working knowledge of data governance, security, and access control frameworks. Familiarity with Infrastructure as Code (IaC) and automated deployment workflows (e.g., Terraform). Proven ability to collaborate across teams and contribute to technical direction. Preferred Qualifications Experience working with time-series, IoT, or high-volume telemetry data systems. Familiarity with EV charging ecosystems, including OCPP (Open Charge Point Protocol). Domain experience in electric vehicles (EV), energy systems, or distributed energy resources (DERs). Experience building ML feature pipelines, training datasets, or batch inference workflows. Experience designing self-service data platforms for analysts and data scientists. Background in event-driven or real-time data architectures. Solid software engineering experience, including writing maintainable production code, testing, and applying engineering best practices to data systems. Solid software engineering experience, including writing maintainable production code, testing, and applying engineering best practices to data systems. Proven ability to influence technical direction and collaborate across teams.

Full job record

Job ID	3a1a26b4a33a30028991e32de27c2a9fee99f1c7
Org ID	c0fa6c36-b251-4e0e-a3de-50e12aa19d4d
Source ID	eb305609-d2cf-4232-953d-34249bc6fa1b
Board ID	eb305609-d2cf-4232-953d-34249bc6fa1b
Provider	lever
Provider Job Key	13f3e00c-f976-4d05-9376-a47f95f677aa
Title	Senior Data Engineer
Normalized Title	—
Status	deleted
Active	no
Location Text	San Francisco, California
Department	Technology
Team	Software
Employment Type	Full-time
Workplace Type	hybrid
Remote Policy	hybrid
Country	United States
Region	CA
City	San Francisco
Salary Raw	USD 150000-180000 per-year-salary
Salary Min	150,000
Salary Max	180,000
Salary Currency	USD
Salary Period	year
Source URL	https://jobs.lever.co/terawattinfrastructure/13f3e00c-f976-4d05-9376-a47f95f677aa
Apply URL	https://jobs.lever.co/terawattinfrastructure/13f3e00c-f976-4d05-9376-a47f95f677aa/apply
First Seen At	2026-05-29 07:03:08Z
Last Seen At	2026-06-01 11:05:35Z
Last Checked At	2026-06-03 12:30:29Z
Last Changed At	2026-06-03 12:30:29Z
Inactive At	2026-06-03 12:30:29Z
Source Posted At	2026-03-23 01:10:50Z
Source Updated At	—
Raw Payload Uri	s3://bluework-jobs-prod-raw-590183727216/raw/provider=lever/board=terawattinfrastructure/date=2026-06-01/2026-06-01T11-05-34-877Z-2e46f0fe3a227b1a2f533f03fdb8687ae1d2088bf9e53f33891017797303d437.json

Event Fields

{
  "content_hash": "ebd8e4fd179e32d108ac230e31f0f510c38aa0cc6d1df8ef883da6a3b1bd8043",
  "source_hash": "63655c33550dc18f394afd3365e2c5b0d36177cd30c67bbf4b26e56f151a9038",
  "last_changed_at": "2026-06-03T12:30:29.573Z",
  "active_status": "deleted"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "San Francisco, California",
    "city": "San Francisco",
    "region": "CA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.85
  },
  "salary_max": 180000,
  "salary_min": 150000,
  "inferred_at": "2026-06-01T11:05:35.268Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "San Francisco, California",
      "city": "San Francisco",
      "region": "CA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.85
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "hybrid",
  "salary_period": "year",
  "workplace_type": "hybrid",
  "salary_currency": "USD"
}

Extensions

{}

Native Structured

{
  "lists": [
    {
      "text": "Role Description",
      "content": "<div>\n<p style=\"margin-top: 12pt; margin-bottom: 12pt;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">We are seeking a highly skilled Senior Data Engineer to join our growing team. In this role, you will design and implement scalable and efficient data architectures to support our business needs. You will collaborate closely with data scientists, analysts, and other cross-functional teams to build and optimize data pipelines, ensuring that data is accessible, secure, and well-structured for analytics and reporting.</span></p>\n<p style=\"margin-top: 12pt; margin-bottom: 12pt;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">A key part of this role involves developing and maintaining data models, databases, and data lakes, while implementing robust data governance and quality assurance practices. You will drive the development of scalable data infrastructure aligned with company architecture standards and best practices.</span></p>\n<p style=\"margin-top: 12pt; margin-bottom: 12pt;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">This role also requires curiosity and a commitment to building and maintaining production data lake pipelines that transform raw time-series data into ML-ready features, training datasets, and batch predictions. This includes ensuring data quality, reproducibility, and reliable retraining so ML outputs—such as forecasts and risk scores—can be trusted by downstream systems.</span></p>\n</div>"
    },
    {
      "text": "Problems You Will Solve",
      "content": "<div>\n\n<li><span style=\"font-size: 11pt;\">Turning messy operational data into reliable signals by building pipelines that transform noisy, incomplete, and high-volume time-series data into trusted datasets for analytics, product features, and ML workflows</span></li>\n<li style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">\n<p style=\"margin-top: 0pt; margin-bottom: 0pt;\"><span style=\"font-size: 11pt;\">Design a resilient lakehouse platform by architecting a scalable Databricks-based platform that support both streaming and batch workloads while ensuring governance, observability, and reliability</span></p>\n</li>\n<li style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">\n<p style=\"margin-top: 0pt; margin-bottom: 0pt;\"><span style=\"font-size: 11pt;\">Enable production-ready ML pipelines by creating reproducible workflows, reliable feature datasets, and batch prediction pipelines that downstream systems can depend on</span></p>\n</li>\n<li style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">\n<p style=\"margin-top: 0pt; margin-bottom: 0pt;\"><span style=\"font-size: 11pt;\">Enable self-service analytics and ML by building infrastructure and abstractions that allow analysts, engineers, and data scientists to independently explore and use data</span></p>\n</li>\n<li style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">\n<p style=\"margin-top: 0pt; margin-bottom: 12pt;\"><span style=\"font-size: 11pt;\">Scale a platform for product and analytics by designing systems that support operational product features, internal reporting, and ML use cases without compromising performance or data quality</span></p>\n</li>\n\n</div>"
    },
    {
      "text": "Core Responsibilities",
      "content": "<div>\n\n<li><span style=\"font-size: 11pt;\">Architect and evolve a Databricks-based data platform that serves as the scalable foundation for product features, internal reporting, and ML workflows.</span></li>\n<li><span style=\"font-size: 11pt;\">Set technical standards for modeling raw data into clean, reliable datasets, ensuring high integrity and point-in-time accuracy for both BI and ML applications.</span></li>\n<li><span style=\"font-size: 11pt;\">Build and maintain self-service tooling and infrastructure abstractions that improve the developer experience for data producers, analysts, and data scientists.</span></li>\n<li><span style=\"font-size: 11pt;\">Design and optimize high-performance ETL/ELT pipelines using Delta Live Tables and Structured Streaming to handle seamless ingestion from diverse data sources.</span></li>\n<li><span style=\"font-size: 11pt;\">Own platform observability, testing, and proactive monitoring to ensure the performance and reliability of critical data delivery and pipeline health.</span></li>\n<li><span style=\"font-size: 11pt;\">Architect and enforce data security, compliance, and access controls by implementing Unity Catalog and IAM (Identity and Access Management) best practices across the enterprise.</span></li>\n<li><span style=\"font-size: 11pt;\">Build and maintain production-grade pipelines that transform raw data into ML-ready features, training datasets, and reliable batch predictions.</span></li>\n<li><span style=\"font-size: 11pt;\">Lead Infrastructure as Code (IaC) initiatives using Terraform and improve team productivity by identifying technical debt and automating complex deployment workflows.</span></li>\n<li><span style=\"font-size: 11pt;\">Partner with Engineering, Product, and Business teams to resolve ambiguities and ensure shipped data features are impactful, reliable, and aligned with business outcomes.</span></li>\n<li><span style=\"font-size: 11pt;\">Build and maintain a self-service data lake environment, empowering non-data engineers and stakeholders to discover, explore, and analyze data independently.</span></li>\n<li><span style=\"font-size: 11pt;\">Promote engineering excellence through code reviews, documentation, and technical standards for orchestration and testing.</span></li>\n\n</div>"
    },
    {
      "text": "Minimum Qualifications",
      "content": "\n<li><span style=\"font-size: 11pt;\">Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.</span></li>\n<li><span style=\"font-size: 11pt;\">6+ years in data engineering, platform development, or large-scale data systems.</span></li>\n<li><span style=\"font-size: 11pt;\">Hands-on experience with Databricks or modern lakehouse platforms and cloud platforms (AWS, GCP, or Azure).</span></li>\n<li><span style=\"font-size: 11pt;\">Experience building scalable ETL/ELT pipelines using Spark and SQL.</span></li>\n<li><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Proficiency in SQL and experience with NoSQL databases (e.g., MongoDB, Cassandra, DynamoDB).</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Strong understanding of data modeling, schema design, and performance optimization.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Experience building reliable, production-grade data pipelines with a focus on data quality and observability.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Experience supporting analytics and/or ML workflows, including preparing ML-ready datasets.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Working knowledge of data governance, security, and access control frameworks.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Familiarity with Infrastructure as Code (IaC) and automated deployment workflows (e.g., Terraform).</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Proven ability to collaborate across teams and contribute to technical direction.</span></li>\n"
    },
    {
      "text": "Preferred Qualifications",
      "content": "<div>\n\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Experience working with time-series, IoT, or high-volume telemetry data systems.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Familiarity with EV charging ecosystems, including OCPP (Open Charge Point Protocol).</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Domain experience in electric vehicles (EV), energy systems, or distributed energy resources (DERs).</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Experience building ML feature pipelines, training datasets, or batch inference workflows.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Experience designing self-service data platforms for analysts and data scientists.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Background in event-driven or real-time data architectures.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Solid software engineering experience, including writing maintainable production code, testing, and applying engineering best practices to data systems.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt; font-family: 'Proxima Nova', sans-serif;\">Solid software engineering experience, including writing maintainable production code, testing, and applying engineering best practices to data systems.</span></li>\n<li style=\"line-height: 1.2;\"><span style=\"font-size: 11pt;\">Proven ability to influence technical direction and collaborate across teams.</span></li>\n\n</div>"
    }
  ],
  "country": "US",
  "createdAt": 1774228250136,
  "updatedAt": null,
  "categories": {
    "team": "Software",
    "location": "San Francisco, California",
    "commitment": "Full-time",
    "department": "Technology",
    "allLocations": [
      "San Francisco, California",
      "Remote - Toronto, Ontario, Canada"
    ]
  },
  "salaryRange": {
    "max": 180000,
    "min": 150000,
    "currency": "USD",
    "interval": "per-year-salary"
  },
  "workplaceType": "hybrid"
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/3a1a26b4a33a30028991e32de27c2a9fee99f1c7?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/c0fa6c36-b251-4e0e-a3de-50e12aa19d4dJSON

GET https://api.bluedoor.sh/job-postings/v1/sources/eb305609-d2cf-4232-953d-34249bc6fa1bJSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/3a1a26b4a33a30028991e32de27c2a9fee99f1c7/eventsJSON

Docs · Get an API key