bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesDatapelagoPrincipal Data Processing Engineer-OSS

Principal Data Processing Engineer-OSS

Datapelago · Mountain View, California, 94041, United States · Remote · Active · BambooHR

Job facts

FieldValue
CompanyDatapelago
TitlePrincipal Data Processing Engineer-OSS
Normalized title-
Department / team-
LocationMountain View, United States
Work modelRemote / Remote
Employment typeFull Time
Salary-
Statusactive
ATS providerBambooHR
Posted / first seen2024-09-25 / 2026-05-30
Changed / last seen2026-05-30 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Datapelago.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through BambooHR.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Mountain View.Open
Work model jobsActive Remote postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyDatapelago
Sourceeaa6c882-d521-4b9b-a50a-5db329d4eb72
ATS providerBambooHR

Description

Principal Data Processing Engineer - OSS Mountain View, CA About DataPelago: DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. We are building an innovative data processing engine that is transforming how Apache Spark, Apache Flink, Ray, and others operate on diverse, large-scale data. Our team of engineers drive and adopt advances in hardware-accelerated computing, parallel processing of large-scale data, query optimization, distributed systems, compilers, machine learning, and cloud-native computing. We are looking for world-class engineers to join our team and shape the future of accelerated data processing. The Role: As a Principal Data Processing Engineer (OSS), you will be a key individual contributor in adopting and advancing the capabilities of open-source software (OSS) platforms such as Apache Gluten, Velox, Apache Spark, and Apache Flink in the context of DataPelago’s data processing engine. You will enhance the functional breadth, performance, scale, and reliability of the DataPelago engine through downstream and upstream contributions. You will have the opportunity to engage with the community working on these platforms. This is a unique opportunity to make a significant impact on a category-defining product and work with a talented team of engineers. What You'll Do: Influence the architecture of how our data processing engine interfaces with open-source platforms and engines. Lead the design of functional and performance enhancements to open source platforms such as Apache Gluten and Velox, and their integration with our data processing engine. Individually design, implement, test, optimize, and maintain components of the data processing engine. Analyze the technology roadmap of Apache Gluten, Velox, and equivalent platforms and identify opportunities for our engine to enhance technology and product leadership. Collaboration: Partner with engineering, product management, the open-source community and customer success teams. Foster best practices in design and code reviews, testing, CI/CD, and issue resolution to maintain the highest product quality, security, efficiency, and productivity. What You'll Bring: BS/MS in  Computer Science (or a related field) with 6+ years of relevant experience 3+ years of deep technical experience in instrumenting, analyzing, and optimizing the performance of data processing engine components on benchmark and customer workloads. Sound knowledge of the architecture and internal operation of one or more of Apache Spark, Apache Flink, Presto/Trino. Demonstrated experience in the design, development, and successful release of high-performance data processing engines for large production deployments. Exceptional programming skills in C, C++, and Java. Extensive development experience in Linux environments. Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences. Strong analytical and problem-solving skills with a passion for performance optimization. Location Considerations: We value face-to-face collaboration, but recognize that talent can be found anywhere. Our engineering team works at our headquarters in Mountain View, CA, at our India office in Hyderabad, and at remote locations. Why Join DataPelago? Technical Leadership: Take a leadership role in shaping the architecture and development of how our core engine works with open source data processing platforms Cutting-Edge Innovation: Work on challenging problems at the forefront of accelerated computing and data processing. Significant Impact: Your contributions will directly impact the performance and scalability of our mission-critical platform. Mentorship and Growth: Mentor and guide other talented engineers while expanding your own technical expertise. Competitive compensation, stock options, comprehensive benefits package, and leadership development opportunities

Full job record

Job ID0b2e61da5f6946ecdfe2f76579ce37f42a150885
Org IDfad8cd3e-2f04-4a77-a77c-aeb512439968
Source IDeaa6c882-d521-4b9b-a50a-5db329d4eb72
Board IDeaa6c882-d521-4b9b-a50a-5db329d4eb72
Providerbamboohr
Provider Job Key26
TitlePrincipal Data Processing Engineer-OSS
Normalized Title
Statusactive
Activeyes
Location TextMountain View, California, 94041, United States
Department
Team
Employment Typefull_time
Workplace Typeremote
Remote Policyremote
CountryUnited States
Region
CityMountain View
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://datapelago.bamboohr.com/careers/26
Apply URLhttps://datapelago.bamboohr.com/careers/26
First Seen At2026-05-30 06:11:22Z
Last Seen At2026-06-06 10:26:05Z
Last Checked At2026-06-06 10:26:05Z
Last Changed At2026-05-30 06:11:22Z
Inactive At
Source Posted At2024-09-25 00:00:00Z
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=bamboohr/board=datapelago/date=2026-06-06/2026-06-06T10-26-04-697Z-7c727efe35d0e861fa42a8f6f2f986697f92d80acc804a198a3eef676d0d16da.json
Event Fields
{
  "content_hash": "87393b679c31a8c360fe779bbda362fa87b6e9084a3be36a487b36126814df6a",
  "source_hash": "2baaa523da2a9b40c514efa590b0ef8961987ec69ff20feb4857b9940939f30e",
  "last_changed_at": "2026-05-30T06:11:22.133Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Mountain View, California, 94041, United States",
    "city": "Mountain View",
    "region": null,
    "country": "United States",
    "is_remote": true,
    "confidence": 0.95
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T10:26:05.936Z",
  "launch_scope": {
    "reason": "bamboohr_production_catalog",
    "included": true,
    "location": {
      "raw": "Mountain View, California, 94041, United States",
      "city": "Mountain View",
      "region": null,
      "country": "United States",
      "is_remote": true,
      "confidence": 0.95
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": null,
  "workplace_type": "remote",
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "list_job": {
    "id": "26",
    "isRemote": null,
    "location": {
      "city": "Mountain View",
      "state": "California"
    },
    "atsLocation": {
      "city": null,
      "state": null,
      "country": null,
      "province": null
    },
    "departmentId": null,
    "locationType": "0",
    "jobOpeningName": "Principal Data Processing Engineer-OSS",
    "departmentLabel": null,
    "employmentStatusLabel": "Full-Time"
  },
  "detail_errors": [],
  "detail_job_opening": {
    "location": {
      "city": "Mountain View",
      "state": "California",
      "postalCode": "94041",
      "addressCountry": "United States"
    },
    "datePosted": "2024-09-25",
    "atsLocation": {
      "city": null,
      "state": null,
      "country": null,
      "countryId": null
    },
    "description": "<p><span style=\"font-family: arial, helvetica, sans-serif; font-weight: bold\">Principal Data Processing Engineer - OSS</span><br><span style=\"font-family: arial, helvetica, sans-serif\">Mountain View, CA </span></p>\n<p><br></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">About DataPelago:</span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. We are building an innovative data processing engine that is transforming how Apache Spark, Apache Flink, Ray, and others operate on diverse, large-scale data. Our team of engineers drive and adopt advances in hardware-accelerated computing, parallel processing of large-scale data, query optimization, distributed systems, compilers, machine learning, and cloud-native computing. We are looking for world-class engineers to join our team and shape the future of accelerated data processing.</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br><br></span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">The Role:</span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">As a Principal Data Processing Engineer (OSS), you will be a key individual contributor in</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br></span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">adopting and advancing the capabilities of open-source software (OSS) platforms such as Apache</span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Gluten, Velox, Apache Spark, and Apache Flink in the context of DataPelago’s data processing engine. You will enhance the functional breadth, performance, scale, and reliability of the DataPelago engine through downstream and upstream contributions. </span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">You will have the opportunity to engage with the community working on these platforms.</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"> This is a unique opportunity to make a significant impact on a category-defining product and work with a talented team of engineers.</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br><br></span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">What You'll Do:</span></p>\n<ul>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Influence the architecture of how our data processing engine interfaces with open-source platforms and engines.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Lead the design of functional and performance enhancements to open source platforms such as Apache Gluten and Velox, and their integration with our data processing engine.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Individually design, implement, test, optimize, and maintain </span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">components</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"> of the data processing engine.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Analyze the technology roadmap of Apache Gluten, Velox, and equivalent platforms and identify opportunities for our engine to enhance technology and product leadership.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Collaboration: Partner with engineering, product management, the open-source community and customer success teams.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Foster best practices in design and code reviews, testing, CI/CD, and issue resolution to maintain the highest product quality, security, efficiency, and productivity.</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br><br></span></li>\n</ul>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">What You'll Bring:</span></p>\n<ul>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">BS/MS in  Computer Science (or a related field) with 6+ years of relevant experience </span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">3+ years of deep technical experience in instrumenting, analyzing, and optimizing the </span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">performance</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"> of data processing engine components on benchmark and customer workloads.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Sound knowledge of the architecture and internal operation of one or more of Apache Spark,</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br></span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Apache Flink, Presto/Trino.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Demonstrated experience in the design, development, and successful release of </span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">high-performance</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"> data processing engines for large production deployments.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Exceptional programming skills in C, C++, and Java.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Extensive development experience in Linux environments.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Excellent communication and collaboration skills, with the ability to articulate complex technical concepts to both technical and non-technical audiences.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Strong analytical and problem-solving skills with a passion for performance optimization.</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br><br></span></li>\n</ul>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Location Considerations:</span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">We value face-to-face collaboration, but recognize that talent can be found anywhere. Our engineering team works at our headquarters in Mountain View, CA, at our India office in Hyderabad, and at remote locations.</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br><br></span></p>\n<p><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Why Join DataPelago?</span></p>\n<ul>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Technical Leadership: Take a leadership role in shaping the architecture and development of how our core engine works with open source data processing platforms</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Cutting-Edge Innovation: Work on challenging problems at the forefront of accelerated</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"><br></span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">computing and data processing.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Significant Impact: Your contributions will directly impact the performance and scalability of our mission-critical platform.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Mentorship and Growth: Mentor and guide other talented engineers while expanding your own technical expertise.</span></li>\n<li><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">Competitive compensation, stock options, comprehensive benefits package, and leadership </span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\">development</span><span style=\"color: rgb(72, 65, 63); font-family: Arial, sans-serif; font-size: 10pt\"> opportunities</span></li>\n</ul>",
    "compensation": null,
    "departmentId": null,
    "locationType": "0",
    "seekPromoted": false,
    "jobCategoryId": null,
    "jobOpeningName": "Principal Data Processing Engineer-OSS",
    "departmentLabel": "",
    "jobOpeningStatus": "Open",
    "minimumExperience": "Experienced",
    "jobOpeningShareUrl": "https://datapelago.bamboohr.com/careers/26",
    "employmentStatusLabel": "Full-Time"
  }
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/0b2e61da5f6946ecdfe2f76579ce37f42a150885?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/fad8cd3e-2f04-4a77-a77c-aeb512439968JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/eaa6c882-d521-4b9b-a50a-5db329d4eb72JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/0b2e61da5f6946ecdfe2f76579ce37f42a150885/eventsJSON