Home › Companies › Veeva › Data Engineer

Data Engineer

Veeva · Massachusetts - Boston · Remote · Active · $75,000–$130,000 / year · Lever

Job facts

Field	Value
Company	Veeva
Title	Data Engineer
Normalized title	-
Department / team	Engineering / Engineering - NA
Location	Massachusetts - Boston, United States
Work model	Remote / Remote
Employment type	Full Time
Salary	$75,000–$130,000 / year
Status	active
ATS provider	Lever
Posted / first seen	2026-04-15 / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Veeva.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Lever.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in Massachusetts - Boston.	Open
Department jobs	Active postings in Engineering.	Open
Work model jobs	Active Remote postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Veeva
Source	6fce17dd-4220-4c57-8376-26c5afb1aaa5
ATS provider	Lever

Description

Veeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $3B in revenue in our last fiscal year with extensive growth potential ahead. At the heart of Veeva are our values: Do the Right Thing, Customer Success, Employee Success, and Speed. We're not just any public company – we made history in 2021 by becoming a public benefit corporation (PBC), legally bound to balancing the interests of customers, employees, society, and investors. As a Work Anywhere company, we support your flexibility to work from home or in the office, so you can thrive in your ideal environment. Join us in transforming the life sciences industry, committed to making a positive impact on its customers, employees, and communities. The Role Veeva OpenData supports the industry by providing real-time reference data across the complete healthcare ecosystem, to support commercial sales execution, compliance, and business analytics. We drive value to our customers through constant innovation, using cloud-based solutions and state-of-the-art technologies to deliver product excellence and customer success. As a Data Engineer, you will own the end-to-end development lifecycle, collaborating with a high-performing engineering team to design, build, and deploy high-impact features. Operating within a fast-paced Agile environment, you will have a direct hand in engineering the data foundation for Veeva’s life sciences customers. #LI-RemoteUS #LI-Associate Veeva’s headquarters is located in the San Francisco Bay Area with offices in more than 15 countries around the world. Veeva is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity or expression, religion, national origin or ancestry, age, disability, marital status, pregnancy, protected veteran status, protected genetic information, political affiliation, or any other characteristics protected by local laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us at [email protected]. What You'll Do Architect and build resilient, distributed data processing systems using Python and Spark on AWS Design and implement end-to-end ETL/ELT workflows that ingest and unify data from diverse sources —ranging from modern table formats like Iceberg and Delta to legacy business files such as Excel and CSV —ensuring a scalable and consistent single source of truth for the organization Lead the implementation of the Medallion Architecture , managing data maturity through Bronze, Silver, and Gold layers. You will define how data is structured, classified, and stored to maximize business value while ensuring scalability and high availability. Build reusable libraries and frameworks for data quality validation, metadata tracking, and pipeline monitoring Build CI/CD process, to automate deployment and testing to maintain a high bar for engineering excellence Enforce data governance standards, including security, privacy, and regulatory compliance Proactively monitor system health, implement automated observability, and resolve complex bottlenecks in distributed systems to ensure peak resource efficiency and cost-effectiveness Partner directly with Product Managers and Data Scientists to translate business requirements into innovative solutions Own the full feature lifecycle —from initial whiteboarding to production deployment and long-term maintenance Requirements 4+ years of professional data engineering experience with a demonstrated ability to architect and deploy production-grade data platforms from scratch Expert-level proficiency in Python and Apache Spark , with specific experience in JVM tuning , memory management, and optimizing execution plans for large-scale distributed workloads Deep expertise in modern data architecture, software design patterns, and various data modeling techniques designed for scalability and performance Proven track record of building on AWS (primary) or GCP, including hands-on experience with managed services like EMR or Databricks Extensive experience designing and managing complex data lifecycles using orchestration tools such as Airflow, AWS Step Functions , or Prefect Deep understanding of data cleansing, curation, and transformation strategies, coupled with experience implementing data governance, security, and lifecycle management policies Strong background in building reusable libraries, frameworks, and internal tools that standardize data ingestion and automate ETL/ELT workflows Exceptional debugging skills for distributed systems and resolving performance bottlenecks at scale Proficiency with CI/CD tools and processes (e.g. Codefresh, Jenkins) Excellent verbal and written communication skills in English, with the ability to translate complex technical architectures into actionable insights for stakeholders and cross-functional teams Must be located in EST or CST Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time Nice to Have Relevant certifications (e.g., AWS, Spark, or similar) Familiarity with streaming and distributed technologies such as Spark Streaming, EKS, Kinesis, or Apache Kafka Experience implementing or managing modern cloud data warehouses or lakehouse architectures Prior experience working in the Life Sciences industry Perks & Benefits Medical, dental, vision, and basic life insurance Flexible PTO and company paid holidays Retirement programs 1% charitable giving program Compensation Base pay: $ 75,000 - $130,000 The salary range listed here has been provided to comply with local regulations and represents a potential base salary range for this role. Please note that actual salaries may vary within the range above or below, depending on experience and location. We look at compensation for each individual and base our offer on your unique qualifications, experience, and expected contributions. This position may also be eligible for other types of compensation in addition to base salary, such as variable bonus and/or stock bonus.

Full job record

Job ID	a03ff1db2a97dc0de244c2642115e97b4e670840
Org ID	4c200caa-06e8-4cf8-9e9b-bc619d58e153
Source ID	6fce17dd-4220-4c57-8376-26c5afb1aaa5
Board ID	6fce17dd-4220-4c57-8376-26c5afb1aaa5
Provider	lever
Provider Job Key	bef9af9f-7189-4bc3-8171-d423da8488f5
Title	Data Engineer
Normalized Title	—
Status	active
Active	yes
Location Text	Massachusetts - Boston
Department	Engineering
Team	Engineering - NA
Employment Type	Full-Time
Workplace Type	remote
Remote Policy	remote
Country	United States
Region	—
City	Massachusetts - Boston
Salary Raw	Base pay: $ 75,000 - $130,000 The salary range listed here has been provided to comply with local regulations
Salary Min	75,000
Salary Max	130,000
Salary Currency	USD
Salary Period	year
Source URL	https://jobs.lever.co/veeva/bef9af9f-7189-4bc3-8171-d423da8488f5
Apply URL	https://jobs.lever.co/veeva/bef9af9f-7189-4bc3-8171-d423da8488f5/apply
First Seen At	2026-05-29 07:00:41Z
Last Seen At	2026-06-06 07:56:17Z
Last Checked At	2026-06-06 07:56:17Z
Last Changed At	2026-05-29 07:00:41Z
Inactive At	—
Source Posted At	2026-04-15 21:32:51Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=lever/board=veeva/date=2026-06-06/2026-06-06T07-56-12-755Z-d8b56e04cea2017ece19014d0050a0f7ca5f16a9d4b07b8535238de2cbeeb64d.json

Event Fields

{
  "content_hash": "7cfd561699aec8e1a7f131a43dc9dd60fcb58a182639d6192e14ecda95e53196",
  "source_hash": "084f3cc0bb68158a0e69fecb1dfe3da5ca94e9a52fcc3c472e43dc7bf3c3eb08",
  "last_changed_at": "2026-05-29T07:00:41.937Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Massachusetts - Boston",
    "city": "Massachusetts - Boston",
    "region": null,
    "country": "United States",
    "is_remote": true,
    "confidence": 0.9
  },
  "salary_max": 130000,
  "salary_min": 75000,
  "inferred_at": "2026-06-06T07:56:16.050Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Massachusetts - Boston",
      "city": "Massachusetts - Boston",
      "region": null,
      "country": "United States",
      "is_remote": true,
      "confidence": 0.9
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": "year",
  "workplace_type": "remote",
  "salary_currency": "USD"
}

Extensions

{}

Native Structured

{
  "lists": [
    {
      "text": "What You'll Do",
      "content": "<div>\n\n<li>Architect and build resilient, distributed data processing systems using <strong>Python and Spark on AWS</strong></li>\n<li>Design and implement end-to-end ETL/ELT workflows that ingest and unify data from diverse sources —ranging from modern table formats like Iceberg and Delta to legacy business files such as Excel and CSV —ensuring a scalable and consistent single source of truth for the organization</li>\n<li>Lead the implementation of the <strong>Medallion Architecture</strong>, managing data maturity through Bronze, Silver, and Gold layers. You will define how data is structured, classified, and stored to maximize business value while ensuring scalability and high availability.</li>\n<li>Build <strong>reusable libraries and frameworks</strong> for data quality validation, metadata tracking, and pipeline monitoring</li>\n<li>Build CI/CD process, to automate deployment and testing to maintain a high bar for engineering excellence</li>\n<li>Enforce data governance standards, including security, privacy, and regulatory compliance</li>\n<li>Proactively monitor system health, implement automated observability, and resolve complex bottlenecks in distributed systems to ensure peak resource efficiency and cost-effectiveness</li>\n<li>Partner directly with Product Managers and Data Scientists to translate business requirements into innovative solutions</li>\n<li><strong>Own the full feature lifecycle</strong>—from initial whiteboarding to production deployment and long-term maintenance</li>\n\n</div>"
    },
    {
      "text": "Requirements",
      "content": "<div>\n\n<li>4+ years of professional data engineering experience with a demonstrated ability to architect and deploy production-grade data platforms from scratch</li>\n<li>Expert-level proficiency in <strong>Python</strong> and <strong>Apache Spark</strong>, with specific experience in <strong>JVM tuning</strong>, memory management, and optimizing execution plans for large-scale distributed workloads</li>\n<li>Deep expertise in modern data architecture, software design patterns, and various data modeling techniques designed for scalability and performance</li>\n<li>Proven track record of building on <strong>AWS</strong> (primary) or GCP, including hands-on experience with managed services like <strong>EMR</strong> or Databricks</li>\n<li>Extensive experience designing and managing complex data lifecycles using orchestration tools such as <strong>Airflow, AWS Step Functions</strong>, or Prefect</li>\n<li>Deep understanding of data cleansing, curation, and transformation strategies, coupled with experience implementing data governance, security, and lifecycle management policies</li>\n<li>Strong background in building reusable libraries, frameworks, and internal tools that standardize data ingestion and automate ETL/ELT workflows</li>\n<li>Exceptional debugging skills for distributed systems and resolving performance bottlenecks at scale</li>\n<li>Proficiency with CI/CD tools and processes (e.g. Codefresh, Jenkins)</li>\n<li>Excellent verbal and written communication skills in English, with the ability to translate complex technical architectures into actionable insights for stakeholders and cross-functional teams</li>\n<li>Must be located in EST or CST</li>\n<li>Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time</li>\n\n</div>"
    },
    {
      "text": "Nice to Have",
      "content": "<div>\n\n<li>Relevant certifications (e.g., AWS, Spark, or similar)</li>\n<li>Familiarity with streaming and distributed technologies such as Spark Streaming, EKS, Kinesis, or Apache Kafka</li>\n<li>Experience implementing or managing modern cloud data warehouses or lakehouse architectures</li>\n<li>Prior experience working in the Life Sciences industry</li>\n\n</div>"
    },
    {
      "text": "Perks & Benefits",
      "content": "<div>\n\n<li>Medical, dental, vision, and basic life insurance</li>\n<li>Flexible PTO and company paid holidays</li>\n<li>Retirement programs</li>\n<li>1% charitable giving program</li>\n\n</div>"
    },
    {
      "text": "Compensation",
      "content": "<div>\n\n<li>Base pay: $<span data-sheets-root=\"1\">75,000 - $130,000</span></li>\n<li>The salary range listed here has been provided to comply with local regulations and represents a potential base salary range for this role. Please note that actual salaries may vary within the range above or below, depending on experience and location. We look at compensation for each individual and base our offer on your unique qualifications, experience, and expected contributions. This position may also be eligible for other types of compensation in addition to base salary, such as variable bonus and/or stock bonus.</li>\n\n</div>"
    }
  ],
  "country": "US",
  "createdAt": 1776288771215,
  "updatedAt": null,
  "categories": {
    "team": "Engineering - NA",
    "location": "Massachusetts - Boston",
    "commitment": "Full-Time",
    "department": "Engineering",
    "allLocations": [
      "Massachusetts - Boston"
    ]
  },
  "salaryRange": null,
  "workplaceType": "remote"
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/a03ff1db2a97dc0de244c2642115e97b4e670840?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/4c200caa-06e8-4cf8-9e9b-bc619d58e153JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/6fce17dd-4220-4c57-8376-26c5afb1aaa5JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/a03ff1db2a97dc0de244c2642115e97b4e670840/eventsJSON

Docs · Get an API key