bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesStellusrxSenior Data Engineer

Senior Data Engineer

Stellusrx · Active · BambooHR

Job facts

FieldValue
CompanyStellusrx
TitleSenior Data Engineer
Normalized title-
Department / team1420 - Decision Science
LocationLima, Lima, Peru
Work model-
Employment typeFull Time
Salary-
Statusactive
ATS providerBambooHR
Posted / first seen2026-04-07 / 2026-05-30
Changed / last seen2026-05-30 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Stellusrx.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through BambooHR.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Lima.Open
Department jobsActive postings in 1420 - Decision Science.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyStellusrx
Source59d126cb-05fc-457f-9ee8-fb61c5d59b52
ATS providerBambooHR

Description

Job Summary We're opening eyes, hearts and minds to the impact that a pharmacy team can have in changing lives. Join our group of talented, committed team members-pharmacists, pharmacy care coordinators, technologists, product strategists and more-to create and expand the delivery of personalized health support that people didn't even know could be possible. The Senior Data Engineer for Stellus Rx will be a key member of our Technology Team , working closely with Stellus Rx leaders and across the organization to unlock the health of millions of Americans. We are a culture that is unabashedly driven by purpose — making a difference to patients and team members while growing at an accelerated rate. This role is built for a data engineer who uses AI as an active part of their workflow — accelerating pipeline development, automating data quality processes, and enabling richer, faster insights across our Cloud Analytics Data Platform rather than relying on manual, repetitive engineering approaches. Role and Responsibilities: AI-Augmented Pipeline Development & Automation Develop, construct, and maintain large-scale data processing systems that collect data from a variety of structured and unstructured sources — using AI code generation tools to accelerate pipeline authoring, reduce boilerplate, and improve code quality. Build and optimize ELT pipelines using AI-assisted tooling to identify bottlenecks, suggest optimizations, and automate routine pipeline maintenance tasks. Identify, design, and implement internal process improvements: use AI to automate manual processes, optimize data delivery, and re-design infrastructure for greater scalability — replacing manual analysis with AI-driven discovery of improvement opportunities. Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources; use AI to accelerate infrastructure-as-code authoring and configuration. AI-Ready Data Preparation & ML Enablement Prepare data for data scientist exploration and discovery using AI-assisted data profiling and quality assessment tools — surfacing anomalies, schema drift, and data gaps faster than manual inspection allows. Perform data wrangling and munging for downstream analytics and machine learning; leverage AI tools to generate and validate transformation logic against business rules. Assemble large, complex datasets that meet functional and non-functional business requirements; use AI to rapidly evaluate dimensional modeling approaches and ontology alignment strategies. Enable large-scale machine learning by designing and maintaining annotated datasets, elastic search approaches, and scalable data lake structures that support AI/ML workloads. Analytics Pipeline & Insight Generation Create and maintain analytics pipelines that generate data and insight to power business decision-making; use AI-assisted analysis to proactively surface trends, anomalies, and opportunities within pipeline outputs. Collaborate with data scientists, analysts, and business stakeholders on requirements for dimensional modeling, distributed ETL pipelines, and cross-repository data migration. Evaluate, compare, and improve design patterns, data lifecycle approaches, and data ontology alignment — using AI to model trade-offs and accelerate proof-of-concept validation. Work with data and analytics experts to continuously improve the functionality, reliability, and intelligence of data systems. Root Cause Analysis & Quality Management Perform root cause analysis on internal and external data and processes using AI-assisted investigation tools — replacing slow, manual log and lineage review with faster, AI-accelerated diagnostics. Develop and maintain data quality frameworks; use AI to automate anomaly detection, schema validation, and data contract enforcement across pipelines. Develop a strong understanding of company domains, strategic direction, and user needs to ensure data systems are aligned to business outcomes, not just technical requirements. Qualifications and Requirements: 4+ years of experience in a Data Engineer role. Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field. Advanced SQL knowledge and experience with relational databases and query authoring. Required: Demonstrated, hands-on experience using AI tools to accelerate data engineering tasks — pipeline development, data quality automation, code generation, or root cause analysis — with specific examples you can speak to. Experience building and optimizing data pipelines, architectures, and datasets. Strong analytic skills working with unstructured and disconnected datasets. Experience with big data tools: Hadoop, Spark, Kafka, etc. Experience with relational and NoSQL databases including Postgres and Cassandra. Experience with pipeline and workflow management tools: Airflow, Luigi, Azkaban, or similar. Experience with AWS cloud services: EC2, EMR, RDS, Redshift. Experience with stream-processing systems: Storm, Spark Streaming, or similar. Working knowledge of message queuing, stream processing, and highly scalable data stores. Proficiency in object-oriented/scripting languages: Python, Java, Scala, C++, or similar. Experience supporting cross-functional teams in dynamic, agile environments. Preferred Experience: Experience designing or supporting data infrastructure for AI/ML model training, including annotated datasets and feature stores. Familiarity with AI-assisted data quality or observability platforms (e.g., Monte Carlo, Soda, or similar). Experience with LLM-based data processing pipelines or retrieval-augmented generation (RAG) architectures. Healthcare data experience; familiarity with FHIR/HL7 standards a plus. High English proficiency

Full job record

Job IDf0c9765af9828ff9c31f793e42d8236964c84e18
Org ID4cd4a191-0968-42e8-9a8f-e4a9a35b49fd
Source ID59d126cb-05fc-457f-9ee8-fb61c5d59b52
Board ID59d126cb-05fc-457f-9ee8-fb61c5d59b52
Providerbamboohr
Provider Job Key169
TitleSenior Data Engineer
Normalized Title
Statusactive
Activeyes
Location Text
Department1420 - Decision Science
Team
Employment Typefull_time
Workplace Type
Remote Policy
CountryPeru
RegionLima
CityLima
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://stellusrx.bamboohr.com/careers/169
Apply URLhttps://stellusrx.bamboohr.com/careers/169
First Seen At2026-05-30 05:50:36Z
Last Seen At2026-06-06 10:24:49Z
Last Checked At2026-06-06 10:24:49Z
Last Changed At2026-05-30 05:50:36Z
Inactive At
Source Posted At2026-04-07 00:00:00Z
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=bamboohr/board=stellusrx/date=2026-06-06/2026-06-06T10-24-47-593Z-e43bfe07c219220fca6f79d22abbf1c35cce9ea01b0ca8bc0a5d41ff2363fe0d.json
Event Fields
{
  "content_hash": "3d4baab8dacac257e50344df96ccafe4b137cddf84d66f52313662818aba08e1",
  "source_hash": "f8708fe4bc9b2c423a0eac6c2cb62c2ff7efde2f6d9c9271caf89d1016229b64",
  "last_changed_at": "2026-05-30T05:50:36.717Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Lima, Lima, Peru",
    "city": "Lima",
    "region": "Lima",
    "country": "Peru",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T10:24:49.190Z",
  "launch_scope": {
    "reason": "bamboohr_production_catalog",
    "included": true,
    "location": {
      "raw": "Lima, Lima, Peru",
      "city": "Lima",
      "region": "Lima",
      "country": "Peru",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Peru"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "list_job": {
    "id": "169",
    "isRemote": null,
    "location": {
      "city": null,
      "state": null
    },
    "atsLocation": {
      "city": "Lima",
      "state": null,
      "country": "Peru",
      "province": "Lima"
    },
    "departmentId": "18660",
    "locationType": "1",
    "jobOpeningName": "Senior Data Engineer",
    "departmentLabel": "1420 - Decision Science",
    "employmentStatusLabel": "Full Time"
  },
  "detail_errors": [],
  "detail_job_opening": {
    "location": {
      "city": null,
      "state": null,
      "postalCode": null,
      "addressCountry": null
    },
    "datePosted": "2026-04-07",
    "atsLocation": {
      "city": "Lima",
      "state": "Lima",
      "country": "Peru",
      "countryId": "168"
    },
    "description": "<p><span style=\"font-weight: bold\">Job Summary</span></p>\n<p>We're opening eyes, hearts and minds to the impact that a pharmacy team can have in changing lives.</p>\n<p> </p>\n<p>Join our group of talented, committed team members-pharmacists, pharmacy care coordinators, technologists, product strategists and more-to create and expand the delivery of personalized health support that people didn't even know could be possible.</p>\n<p><br></p>\n<p>The <span style=\"font-weight: bold\">Senior Data Engineer</span> for Stellus Rx will be a key member of our <span style=\"font-weight: bold\">Technology Team</span>, working closely with Stellus Rx leaders and across the organization to unlock the health of millions of Americans. We are a culture that is unabashedly driven by purpose — making a difference to patients and team members while growing at an accelerated rate.</p>\n<p><br></p>\n<p>This role is built for a data engineer who uses AI as an active part of their workflow — accelerating pipeline development, automating data quality processes, and enabling richer, faster insights across our Cloud Analytics Data Platform rather than relying on manual, repetitive engineering approaches.</p>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Role and Responsibilities:</span></p>\n<p><span style=\"font-weight: bold\">AI-Augmented Pipeline Development &amp; Automation</span></p>\n<ul>\n<li>Develop, construct, and maintain large-scale data processing systems that collect data from a variety of structured and unstructured sources — using AI code generation tools to accelerate pipeline authoring, reduce boilerplate, and improve code quality.</li>\n<li>Build and optimize ELT pipelines using AI-assisted tooling to identify bottlenecks, suggest optimizations, and automate routine pipeline maintenance tasks.</li>\n<li>Identify, design, and implement internal process improvements: use AI to automate manual processes, optimize data delivery, and re-design infrastructure for greater scalability — replacing manual analysis with AI-driven discovery of improvement opportunities.</li>\n<li>Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources; use AI to accelerate infrastructure-as-code authoring and configuration.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">AI-Ready Data Preparation &amp; ML Enablement</span></p>\n<ul>\n<li>Prepare data for data scientist exploration and discovery using AI-assisted data profiling and quality assessment tools — surfacing anomalies, schema drift, and data gaps faster than manual inspection allows.</li>\n<li>Perform data wrangling and munging for downstream analytics and machine learning; leverage AI tools to generate and validate transformation logic against business rules.</li>\n<li>Assemble large, complex datasets that meet functional and non-functional business requirements; use AI to rapidly evaluate dimensional modeling approaches and ontology alignment strategies.</li>\n<li>Enable large-scale machine learning by designing and maintaining annotated datasets, elastic search approaches, and scalable data lake structures that support AI/ML workloads.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Analytics Pipeline &amp; Insight Generation</span></p>\n<ul>\n<li>Create and maintain analytics pipelines that generate data and insight to power business decision-making; use AI-assisted analysis to proactively surface trends, anomalies, and opportunities within pipeline outputs.</li>\n<li>Collaborate with data scientists, analysts, and business stakeholders on requirements for dimensional modeling, distributed ETL pipelines, and cross-repository data migration.</li>\n<li>Evaluate, compare, and improve design patterns, data lifecycle approaches, and data ontology alignment — using AI to model trade-offs and accelerate proof-of-concept validation.</li>\n<li>Work with data and analytics experts to continuously improve the functionality, reliability, and intelligence of data systems.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Root Cause Analysis &amp; Quality Management</span></p>\n<ul>\n<li>Perform root cause analysis on internal and external data and processes using AI-assisted investigation tools — replacing slow, manual log and lineage review with faster, AI-accelerated diagnostics.</li>\n<li>Develop and maintain data quality frameworks; use AI to automate anomaly detection, schema validation, and data contract enforcement across pipelines.</li>\n<li>Develop a strong understanding of company domains, strategic direction, and user needs to ensure data systems are aligned to business outcomes, not just technical requirements.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Qualifications and Requirements:</span></p>\n<ul>\n<li>4+ years of experience in a Data Engineer role.</li>\n<li>Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field.</li>\n<li>Advanced SQL knowledge and experience with relational databases and query authoring.</li>\n<li>Required: Demonstrated, hands-on experience using AI tools to accelerate data engineering tasks — pipeline development, data quality automation, code generation, or root cause analysis — with specific examples you can speak to.</li>\n<li>Experience building and optimizing data pipelines, architectures, and datasets.</li>\n<li>Strong analytic skills working with unstructured and disconnected datasets.</li>\n<li>Experience with big data tools: Hadoop, Spark, Kafka, etc.</li>\n<li>Experience with relational and NoSQL databases including Postgres and Cassandra.</li>\n<li>Experience with pipeline and workflow management tools: Airflow, Luigi, Azkaban, or similar.</li>\n<li>Experience with AWS cloud services: EC2, EMR, RDS, Redshift.</li>\n<li>Experience with stream-processing systems: Storm, Spark Streaming, or similar.</li>\n<li>Working knowledge of message queuing, stream processing, and highly scalable data stores.</li>\n<li>Proficiency in object-oriented/scripting languages: Python, Java, Scala, C++, or similar.</li>\n<li>Experience supporting cross-functional teams in dynamic, agile environments.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Preferred Experience:</span></p>\n<ul>\n<li>Experience designing or supporting data infrastructure for AI/ML model training, including annotated datasets and feature stores.</li>\n<li>Familiarity with AI-assisted data quality or observability platforms (e.g., Monte Carlo, Soda, or similar).</li>\n<li>Experience with LLM-based data processing pipelines or retrieval-augmented generation (RAG) architectures.</li>\n<li>Healthcare data experience; familiarity with FHIR/HL7 standards a plus.</li>\n<li>High English proficiency<br></li>\n</ul>",
    "compensation": null,
    "departmentId": "18660",
    "locationType": "1",
    "seekPromoted": false,
    "jobCategoryId": null,
    "jobOpeningName": "Senior Data Engineer",
    "departmentLabel": "1420 - Decision Science",
    "jobOpeningStatus": "Open",
    "minimumExperience": "Experienced",
    "jobOpeningShareUrl": "https://stellusrx.bamboohr.com/careers/169",
    "employmentStatusLabel": "Full Time"
  }
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/f0c9765af9828ff9c31f793e42d8236964c84e18?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/4cd4a191-0968-42e8-9a8f-e4a9a35b49fdJSON
GET https://api.bluedoor.sh/job-postings/v1/sources/59d126cb-05fc-457f-9ee8-fb61c5d59b52JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/f0c9765af9828ff9c31f793e42d8236964c84e18/eventsJSON