Home › Companies › Stellusrx › Senior Data Engineer

Senior Data Engineer

Job facts

Field	Value
Company	Stellusrx
Title	Senior Data Engineer
Normalized title	-
Department / team	1420 - Decision Science
Location	Lima, Lima, Peru
Work model	-
Employment type	Full Time
Salary	-
Status	active
ATS provider	BambooHR
Posted / first seen	2026-04-07 / 2026-05-30
Changed / last seen	2026-05-30 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Stellusrx.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through BambooHR.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
City jobs	Active postings in Lima.	Open
Department jobs	Active postings in 1420 - Decision Science.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Stellusrx
Source	59d126cb-05fc-457f-9ee8-fb61c5d59b52
ATS provider	BambooHR

Description

Job Summary We're opening eyes, hearts and minds to the impact that a pharmacy team can have in changing lives. Join our group of talented, committed team members-pharmacists, pharmacy care coordinators, technologists, product strategists and more-to create and expand the delivery of personalized health support that people didn't even know could be possible. The Senior Data Engineer for Stellus Rx will be a key member of our Technology Team , working closely with Stellus Rx leaders and across the organization to unlock the health of millions of Americans. We are a culture that is unabashedly driven by purpose — making a difference to patients and team members while growing at an accelerated rate. This role is built for a data engineer who uses AI as an active part of their workflow — accelerating pipeline development, automating data quality processes, and enabling richer, faster insights across our Cloud Analytics Data Platform rather than relying on manual, repetitive engineering approaches. Role and Responsibilities: AI-Augmented Pipeline Development & Automation Develop, construct, and maintain large-scale data processing systems that collect data from a variety of structured and unstructured sources — using AI code generation tools to accelerate pipeline authoring, reduce boilerplate, and improve code quality. Build and optimize ELT pipelines using AI-assisted tooling to identify bottlenecks, suggest optimizations, and automate routine pipeline maintenance tasks. Identify, design, and implement internal process improvements: use AI to automate manual processes, optimize data delivery, and re-design infrastructure for greater scalability — replacing manual analysis with AI-driven discovery of improvement opportunities. Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources; use AI to accelerate infrastructure-as-code authoring and configuration. AI-Ready Data Preparation & ML Enablement Prepare data for data scientist exploration and discovery using AI-assisted data profiling and quality assessment tools — surfacing anomalies, schema drift, and data gaps faster than manual inspection allows. Perform data wrangling and munging for downstream analytics and machine learning; leverage AI tools to generate and validate transformation logic against business rules. Assemble large, complex datasets that meet functional and non-functional business requirements; use AI to rapidly evaluate dimensional modeling approaches and ontology alignment strategies. Enable large-scale machine learning by designing and maintaining annotated datasets, elastic search approaches, and scalable data lake structures that support AI/ML workloads. Analytics Pipeline & Insight Generation Create and maintain analytics pipelines that generate data and insight to power business decision-making; use AI-assisted analysis to proactively surface trends, anomalies, and opportunities within pipeline outputs. Collaborate with data scientists, analysts, and business stakeholders on requirements for dimensional modeling, distributed ETL pipelines, and cross-repository data migration. Evaluate, compare, and improve design patterns, data lifecycle approaches, and data ontology alignment — using AI to model trade-offs and accelerate proof-of-concept validation. Work with data and analytics experts to continuously improve the functionality, reliability, and intelligence of data systems. Root Cause Analysis & Quality Management Perform root cause analysis on internal and external data and processes using AI-assisted investigation tools — replacing slow, manual log and lineage review with faster, AI-accelerated diagnostics. Develop and maintain data quality frameworks; use AI to automate anomaly detection, schema validation, and data contract enforcement across pipelines. Develop a strong understanding of company domains, strategic direction, and user needs to ensure data systems are aligned to business outcomes, not just technical requirements. Qualifications and Requirements: 4+ years of experience in a Data Engineer role. Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field. Advanced SQL knowledge and experience with relational databases and query authoring. Required: Demonstrated, hands-on experience using AI tools to accelerate data engineering tasks — pipeline development, data quality automation, code generation, or root cause analysis — with specific examples you can speak to. Experience building and optimizing data pipelines, architectures, and datasets. Strong analytic skills working with unstructured and disconnected datasets. Experience with big data tools: Hadoop, Spark, Kafka, etc. Experience with relational and NoSQL databases including Postgres and Cassandra. Experience with pipeline and workflow management tools: Airflow, Luigi, Azkaban, or similar. Experience with AWS cloud services: EC2, EMR, RDS, Redshift. Experience with stream-processing systems: Storm, Spark Streaming, or similar. Working knowledge of message queuing, stream processing, and highly scalable data stores. Proficiency in object-oriented/scripting languages: Python, Java, Scala, C++, or similar. Experience supporting cross-functional teams in dynamic, agile environments. Preferred Experience: Experience designing or supporting data infrastructure for AI/ML model training, including annotated datasets and feature stores. Familiarity with AI-assisted data quality or observability platforms (e.g., Monte Carlo, Soda, or similar). Experience with LLM-based data processing pipelines or retrieval-augmented generation (RAG) architectures. Healthcare data experience; familiarity with FHIR/HL7 standards a plus. High English proficiency

Full job record

Job ID	f0c9765af9828ff9c31f793e42d8236964c84e18
Org ID	4cd4a191-0968-42e8-9a8f-e4a9a35b49fd
Source ID	59d126cb-05fc-457f-9ee8-fb61c5d59b52
Board ID	59d126cb-05fc-457f-9ee8-fb61c5d59b52
Provider	bamboohr
Provider Job Key	169
Title	Senior Data Engineer
Normalized Title	—
Status	active
Active	yes
Location Text	—
Department	1420 - Decision Science
Team	—
Employment Type	full_time
Workplace Type	—
Remote Policy	—
Country	Peru
Region	Lima
City	Lima
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://stellusrx.bamboohr.com/careers/169
Apply URL	https://stellusrx.bamboohr.com/careers/169
First Seen At	2026-05-30 05:50:36Z
Last Seen At	2026-06-06 10:24:49Z
Last Checked At	2026-06-06 10:24:49Z
Last Changed At	2026-05-30 05:50:36Z
Inactive At	—
Source Posted At	2026-04-07 00:00:00Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=bamboohr/board=stellusrx/date=2026-06-06/2026-06-06T10-24-47-593Z-e43bfe07c219220fca6f79d22abbf1c35cce9ea01b0ca8bc0a5d41ff2363fe0d.json

Event Fields

{
  "content_hash": "3d4baab8dacac257e50344df96ccafe4b137cddf84d66f52313662818aba08e1",
  "source_hash": "f8708fe4bc9b2c423a0eac6c2cb62c2ff7efde2f6d9c9271caf89d1016229b64",
  "last_changed_at": "2026-05-30T05:50:36.717Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Lima, Lima, Peru",
    "city": "Lima",
    "region": "Lima",
    "country": "Peru",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T10:24:49.190Z",
  "launch_scope": {
    "reason": "bamboohr_production_catalog",
    "included": true,
    "location": {
      "raw": "Lima, Lima, Peru",
      "city": "Lima",
      "region": "Lima",
      "country": "Peru",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Peru"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "list_job": {
    "id": "169",
    "isRemote": null,
    "location": {
      "city": null,
      "state": null
    },
    "atsLocation": {
      "city": "Lima",
      "state": null,
      "country": "Peru",
      "province": "Lima"
    },
    "departmentId": "18660",
    "locationType": "1",
    "jobOpeningName": "Senior Data Engineer",
    "departmentLabel": "1420 - Decision Science",
    "employmentStatusLabel": "Full Time"
  },
  "detail_errors": [],
  "detail_job_opening": {
    "location": {
      "city": null,
      "state": null,
      "postalCode": null,
      "addressCountry": null
    },
    "datePosted": "2026-04-07",
    "atsLocation": {
      "city": "Lima",
      "state": "Lima",
      "country": "Peru",
      "countryId": "168"
    },
    "description": "<p><span style=\"font-weight: bold\">Job Summary</span></p>\n<p>We're opening eyes, hearts and minds to the impact that a pharmacy team can have in changing lives.</p>\n<p> </p>\n<p>Join our group of talented, committed team members-pharmacists, pharmacy care coordinators, technologists, product strategists and more-to create and expand the delivery of personalized health support that people didn't even know could be possible.</p>\n<p><br></p>\n<p>The <span style=\"font-weight: bold\">Senior Data Engineer</span> for Stellus Rx will be a key member of our <span style=\"font-weight: bold\">Technology Team</span>, working closely with Stellus Rx leaders and across the organization to unlock the health of millions of Americans. We are a culture that is unabashedly driven by purpose — making a difference to patients and team members while growing at an accelerated rate.</p>\n<p><br></p>\n<p>This role is built for a data engineer who uses AI as an active part of their workflow — accelerating pipeline development, automating data quality processes, and enabling richer, faster insights across our Cloud Analytics Data Platform rather than relying on manual, repetitive engineering approaches.</p>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Role and Responsibilities:</span></p>\n<p><span style=\"font-weight: bold\">AI-Augmented Pipeline Development &amp; Automation</span></p>\n<ul>\n<li>Develop, construct, and maintain large-scale data processing systems that collect data from a variety of structured and unstructured sources — using AI code generation tools to accelerate pipeline authoring, reduce boilerplate, and improve code quality.</li>\n<li>Build and optimize ELT pipelines using AI-assisted tooling to identify bottlenecks, suggest optimizations, and automate routine pipeline maintenance tasks.</li>\n<li>Identify, design, and implement internal process improvements: use AI to automate manual processes, optimize data delivery, and re-design infrastructure for greater scalability — replacing manual analysis with AI-driven discovery of improvement opportunities.</li>\n<li>Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources; use AI to accelerate infrastructure-as-code authoring and configuration.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">AI-Ready Data Preparation &amp; ML Enablement</span></p>\n<ul>\n<li>Prepare data for data scientist exploration and discovery using AI-assisted data profiling and quality assessment tools — surfacing anomalies, schema drift, and data gaps faster than manual inspection allows.</li>\n<li>Perform data wrangling and munging for downstream analytics and machine learning; leverage AI tools to generate and validate transformation logic against business rules.</li>\n<li>Assemble large, complex datasets that meet functional and non-functional business requirements; use AI to rapidly evaluate dimensional modeling approaches and ontology alignment strategies.</li>\n<li>Enable large-scale machine learning by designing and maintaining annotated datasets, elastic search approaches, and scalable data lake structures that support AI/ML workloads.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Analytics Pipeline &amp; Insight Generation</span></p>\n<ul>\n<li>Create and maintain analytics pipelines that generate data and insight to power business decision-making; use AI-assisted analysis to proactively surface trends, anomalies, and opportunities within pipeline outputs.</li>\n<li>Collaborate with data scientists, analysts, and business stakeholders on requirements for dimensional modeling, distributed ETL pipelines, and cross-repository data migration.</li>\n<li>Evaluate, compare, and improve design patterns, data lifecycle approaches, and data ontology alignment — using AI to model trade-offs and accelerate proof-of-concept validation.</li>\n<li>Work with data and analytics experts to continuously improve the functionality, reliability, and intelligence of data systems.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Root Cause Analysis &amp; Quality Management</span></p>\n<ul>\n<li>Perform root cause analysis on internal and external data and processes using AI-assisted investigation tools — replacing slow, manual log and lineage review with faster, AI-accelerated diagnostics.</li>\n<li>Develop and maintain data quality frameworks; use AI to automate anomaly detection, schema validation, and data contract enforcement across pipelines.</li>\n<li>Develop a strong understanding of company domains, strategic direction, and user needs to ensure data systems are aligned to business outcomes, not just technical requirements.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Qualifications and Requirements:</span></p>\n<ul>\n<li>4+ years of experience in a Data Engineer role.</li>\n<li>Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field.</li>\n<li>Advanced SQL knowledge and experience with relational databases and query authoring.</li>\n<li>Required: Demonstrated, hands-on experience using AI tools to accelerate data engineering tasks — pipeline development, data quality automation, code generation, or root cause analysis — with specific examples you can speak to.</li>\n<li>Experience building and optimizing data pipelines, architectures, and datasets.</li>\n<li>Strong analytic skills working with unstructured and disconnected datasets.</li>\n<li>Experience with big data tools: Hadoop, Spark, Kafka, etc.</li>\n<li>Experience with relational and NoSQL databases including Postgres and Cassandra.</li>\n<li>Experience with pipeline and workflow management tools: Airflow, Luigi, Azkaban, or similar.</li>\n<li>Experience with AWS cloud services: EC2, EMR, RDS, Redshift.</li>\n<li>Experience with stream-processing systems: Storm, Spark Streaming, or similar.</li>\n<li>Working knowledge of message queuing, stream processing, and highly scalable data stores.</li>\n<li>Proficiency in object-oriented/scripting languages: Python, Java, Scala, C++, or similar.</li>\n<li>Experience supporting cross-functional teams in dynamic, agile environments.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Preferred Experience:</span></p>\n<ul>\n<li>Experience designing or supporting data infrastructure for AI/ML model training, including annotated datasets and feature stores.</li>\n<li>Familiarity with AI-assisted data quality or observability platforms (e.g., Monte Carlo, Soda, or similar).</li>\n<li>Experience with LLM-based data processing pipelines or retrieval-augmented generation (RAG) architectures.</li>\n<li>Healthcare data experience; familiarity with FHIR/HL7 standards a plus.</li>\n<li>High English proficiency<br></li>\n</ul>",
    "compensation": null,
    "departmentId": "18660",
    "locationType": "1",
    "seekPromoted": false,
    "jobCategoryId": null,
    "jobOpeningName": "Senior Data Engineer",
    "departmentLabel": "1420 - Decision Science",
    "jobOpeningStatus": "Open",
    "minimumExperience": "Experienced",
    "jobOpeningShareUrl": "https://stellusrx.bamboohr.com/careers/169",
    "employmentStatusLabel": "Full Time"
  }
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/f0c9765af9828ff9c31f793e42d8236964c84e18?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/4cd4a191-0968-42e8-9a8f-e4a9a35b49fdJSON

GET https://api.bluedoor.sh/job-postings/v1/sources/59d126cb-05fc-457f-9ee8-fb61c5d59b52JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/f0c9765af9828ff9c31f793e42d8236964c84e18/eventsJSON

Docs · Get an API key