Home › Companies › Stellusrx › Senior Data Engineer
Senior Data Engineer
Job facts
| Field | Value |
|---|---|
| Company | Stellusrx |
| Title | Senior Data Engineer |
| Normalized title | - |
| Department / team | 1420 - Decision Science |
| Location | Lima, Lima, Peru |
| Work model | - |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | BambooHR |
| Posted / first seen | 2026-04-07 / 2026-05-30 |
| Changed / last seen | 2026-05-30 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Stellusrx. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through BambooHR. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Lima. | Open |
| Department jobs | Active postings in 1420 - Decision Science. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Stellusrx |
| Source | 59d126cb-05fc-457f-9ee8-fb61c5d59b52 |
| ATS provider | BambooHR |
Description
Job Summary
We're opening eyes, hearts and minds to the impact that a pharmacy team can have in changing lives.
Join our group of talented, committed team members-pharmacists, pharmacy care coordinators, technologists, product strategists and more-to create and expand the delivery of personalized health support that people didn't even know could be possible.
The Senior Data Engineer for Stellus Rx will be a key member of our Technology Team , working closely with Stellus Rx leaders and across the organization to unlock the health of millions of Americans. We are a culture that is unabashedly driven by purpose — making a difference to patients and team members while growing at an accelerated rate.
This role is built for a data engineer who uses AI as an active part of their workflow — accelerating pipeline development, automating data quality processes, and enabling richer, faster insights across our Cloud Analytics Data Platform rather than relying on manual, repetitive engineering approaches.
Role and Responsibilities:
AI-Augmented Pipeline Development & Automation
Develop, construct, and maintain large-scale data processing systems that collect data from a variety of structured and unstructured sources — using AI code generation tools to accelerate pipeline authoring, reduce boilerplate, and improve code quality.
Build and optimize ELT pipelines using AI-assisted tooling to identify bottlenecks, suggest optimizations, and automate routine pipeline maintenance tasks.
Identify, design, and implement internal process improvements: use AI to automate manual processes, optimize data delivery, and re-design infrastructure for greater scalability — replacing manual analysis with AI-driven discovery of improvement opportunities.
Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources; use AI to accelerate infrastructure-as-code authoring and configuration.
AI-Ready Data Preparation & ML Enablement
Prepare data for data scientist exploration and discovery using AI-assisted data profiling and quality assessment tools — surfacing anomalies, schema drift, and data gaps faster than manual inspection allows.
Perform data wrangling and munging for downstream analytics and machine learning; leverage AI tools to generate and validate transformation logic against business rules.
Assemble large, complex datasets that meet functional and non-functional business requirements; use AI to rapidly evaluate dimensional modeling approaches and ontology alignment strategies.
Enable large-scale machine learning by designing and maintaining annotated datasets, elastic search approaches, and scalable data lake structures that support AI/ML workloads.
Analytics Pipeline & Insight Generation
Create and maintain analytics pipelines that generate data and insight to power business decision-making; use AI-assisted analysis to proactively surface trends, anomalies, and opportunities within pipeline outputs.
Collaborate with data scientists, analysts, and business stakeholders on requirements for dimensional modeling, distributed ETL pipelines, and cross-repository data migration.
Evaluate, compare, and improve design patterns, data lifecycle approaches, and data ontology alignment — using AI to model trade-offs and accelerate proof-of-concept validation.
Work with data and analytics experts to continuously improve the functionality, reliability, and intelligence of data systems.
Root Cause Analysis & Quality Management
Perform root cause analysis on internal and external data and processes using AI-assisted investigation tools — replacing slow, manual log and lineage review with faster, AI-accelerated diagnostics.
Develop and maintain data quality frameworks; use AI to automate anomaly detection, schema validation, and data contract enforcement across pipelines.
Develop a strong understanding of company domains, strategic direction, and user needs to ensure data systems are aligned to business outcomes, not just technical requirements.
Qualifications and Requirements:
4+ years of experience in a Data Engineer role.
Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field.
Advanced SQL knowledge and experience with relational databases and query authoring.
Required: Demonstrated, hands-on experience using AI tools to accelerate data engineering tasks — pipeline development, data quality automation, code generation, or root cause analysis — with specific examples you can speak to.
Experience building and optimizing data pipelines, architectures, and datasets.
Strong analytic skills working with unstructured and disconnected datasets.
Experience with big data tools: Hadoop, Spark, Kafka, etc.
Experience with relational and NoSQL databases including Postgres and Cassandra.
Experience with pipeline and workflow management tools: Airflow, Luigi, Azkaban, or similar.
Experience with AWS cloud services: EC2, EMR, RDS, Redshift.
Experience with stream-processing systems: Storm, Spark Streaming, or similar.
Working knowledge of message queuing, stream processing, and highly scalable data stores.
Proficiency in object-oriented/scripting languages: Python, Java, Scala, C++, or similar.
Experience supporting cross-functional teams in dynamic, agile environments.
Preferred Experience:
Experience designing or supporting data infrastructure for AI/ML model training, including annotated datasets and feature stores.
Familiarity with AI-assisted data quality or observability platforms (e.g., Monte Carlo, Soda, or similar).
Experience with LLM-based data processing pipelines or retrieval-augmented generation (RAG) architectures.
Healthcare data experience; familiarity with FHIR/HL7 standards a plus.
High English proficiency
Full job record
| Job ID | f0c9765af9828ff9c31f793e42d8236964c84e18 |
| Org ID | 4cd4a191-0968-42e8-9a8f-e4a9a35b49fd |
| Source ID | 59d126cb-05fc-457f-9ee8-fb61c5d59b52 |
| Board ID | 59d126cb-05fc-457f-9ee8-fb61c5d59b52 |
| Provider | bamboohr |
| Provider Job Key | 169 |
| Title | Senior Data Engineer |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | — |
| Department | 1420 - Decision Science |
| Team | — |
| Employment Type | full_time |
| Workplace Type | — |
| Remote Policy | — |
| Country | Peru |
| Region | Lima |
| City | Lima |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://stellusrx.bamboohr.com/careers/169 |
| Apply URL | https://stellusrx.bamboohr.com/careers/169 |
| First Seen At | 2026-05-30 05:50:36Z |
| Last Seen At | 2026-06-06 10:24:49Z |
| Last Checked At | 2026-06-06 10:24:49Z |
| Last Changed At | 2026-05-30 05:50:36Z |
| Inactive At | — |
| Source Posted At | 2026-04-07 00:00:00Z |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=bamboohr/board=stellusrx/date=2026-06-06/2026-06-06T10-24-47-593Z-e43bfe07c219220fca6f79d22abbf1c35cce9ea01b0ca8bc0a5d41ff2363fe0d.json |
Event Fields
{
"content_hash": "3d4baab8dacac257e50344df96ccafe4b137cddf84d66f52313662818aba08e1",
"source_hash": "f8708fe4bc9b2c423a0eac6c2cb62c2ff7efde2f6d9c9271caf89d1016229b64",
"last_changed_at": "2026-05-30T05:50:36.717Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "Lima, Lima, Peru",
"city": "Lima",
"region": "Lima",
"country": "Peru",
"is_remote": false,
"confidence": 0.8
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T10:24:49.190Z",
"launch_scope": {
"reason": "bamboohr_production_catalog",
"included": true,
"location": {
"raw": "Lima, Lima, Peru",
"city": "Lima",
"region": "Lima",
"country": "Peru",
"is_remote": false,
"confidence": 0.8
},
"countries": [
"Peru"
]
},
"remote_policy": null,
"salary_period": null,
"workplace_type": null,
"salary_currency": null
}Extensions
{}Native Structured
{
"list_job": {
"id": "169",
"isRemote": null,
"location": {
"city": null,
"state": null
},
"atsLocation": {
"city": "Lima",
"state": null,
"country": "Peru",
"province": "Lima"
},
"departmentId": "18660",
"locationType": "1",
"jobOpeningName": "Senior Data Engineer",
"departmentLabel": "1420 - Decision Science",
"employmentStatusLabel": "Full Time"
},
"detail_errors": [],
"detail_job_opening": {
"location": {
"city": null,
"state": null,
"postalCode": null,
"addressCountry": null
},
"datePosted": "2026-04-07",
"atsLocation": {
"city": "Lima",
"state": "Lima",
"country": "Peru",
"countryId": "168"
},
"description": "<p><span style=\"font-weight: bold\">Job Summary</span></p>\n<p>We're opening eyes, hearts and minds to the impact that a pharmacy team can have in changing lives.</p>\n<p> </p>\n<p>Join our group of talented, committed team members-pharmacists, pharmacy care coordinators, technologists, product strategists and more-to create and expand the delivery of personalized health support that people didn't even know could be possible.</p>\n<p><br></p>\n<p>The <span style=\"font-weight: bold\">Senior Data Engineer</span> for Stellus Rx will be a key member of our <span style=\"font-weight: bold\">Technology Team</span>, working closely with Stellus Rx leaders and across the organization to unlock the health of millions of Americans. We are a culture that is unabashedly driven by purpose — making a difference to patients and team members while growing at an accelerated rate.</p>\n<p><br></p>\n<p>This role is built for a data engineer who uses AI as an active part of their workflow — accelerating pipeline development, automating data quality processes, and enabling richer, faster insights across our Cloud Analytics Data Platform rather than relying on manual, repetitive engineering approaches.</p>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Role and Responsibilities:</span></p>\n<p><span style=\"font-weight: bold\">AI-Augmented Pipeline Development & Automation</span></p>\n<ul>\n<li>Develop, construct, and maintain large-scale data processing systems that collect data from a variety of structured and unstructured sources — using AI code generation tools to accelerate pipeline authoring, reduce boilerplate, and improve code quality.</li>\n<li>Build and optimize ELT pipelines using AI-assisted tooling to identify bottlenecks, suggest optimizations, and automate routine pipeline maintenance tasks.</li>\n<li>Identify, design, and implement internal process improvements: use AI to automate manual processes, optimize data delivery, and re-design infrastructure for greater scalability — replacing manual analysis with AI-driven discovery of improvement opportunities.</li>\n<li>Build the infrastructure required for optimal extraction, transformation, and loading of data from various sources; use AI to accelerate infrastructure-as-code authoring and configuration.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">AI-Ready Data Preparation & ML Enablement</span></p>\n<ul>\n<li>Prepare data for data scientist exploration and discovery using AI-assisted data profiling and quality assessment tools — surfacing anomalies, schema drift, and data gaps faster than manual inspection allows.</li>\n<li>Perform data wrangling and munging for downstream analytics and machine learning; leverage AI tools to generate and validate transformation logic against business rules.</li>\n<li>Assemble large, complex datasets that meet functional and non-functional business requirements; use AI to rapidly evaluate dimensional modeling approaches and ontology alignment strategies.</li>\n<li>Enable large-scale machine learning by designing and maintaining annotated datasets, elastic search approaches, and scalable data lake structures that support AI/ML workloads.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Analytics Pipeline & Insight Generation</span></p>\n<ul>\n<li>Create and maintain analytics pipelines that generate data and insight to power business decision-making; use AI-assisted analysis to proactively surface trends, anomalies, and opportunities within pipeline outputs.</li>\n<li>Collaborate with data scientists, analysts, and business stakeholders on requirements for dimensional modeling, distributed ETL pipelines, and cross-repository data migration.</li>\n<li>Evaluate, compare, and improve design patterns, data lifecycle approaches, and data ontology alignment — using AI to model trade-offs and accelerate proof-of-concept validation.</li>\n<li>Work with data and analytics experts to continuously improve the functionality, reliability, and intelligence of data systems.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Root Cause Analysis & Quality Management</span></p>\n<ul>\n<li>Perform root cause analysis on internal and external data and processes using AI-assisted investigation tools — replacing slow, manual log and lineage review with faster, AI-accelerated diagnostics.</li>\n<li>Develop and maintain data quality frameworks; use AI to automate anomaly detection, schema validation, and data contract enforcement across pipelines.</li>\n<li>Develop a strong understanding of company domains, strategic direction, and user needs to ensure data systems are aligned to business outcomes, not just technical requirements.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Qualifications and Requirements:</span></p>\n<ul>\n<li>4+ years of experience in a Data Engineer role.</li>\n<li>Graduate degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field.</li>\n<li>Advanced SQL knowledge and experience with relational databases and query authoring.</li>\n<li>Required: Demonstrated, hands-on experience using AI tools to accelerate data engineering tasks — pipeline development, data quality automation, code generation, or root cause analysis — with specific examples you can speak to.</li>\n<li>Experience building and optimizing data pipelines, architectures, and datasets.</li>\n<li>Strong analytic skills working with unstructured and disconnected datasets.</li>\n<li>Experience with big data tools: Hadoop, Spark, Kafka, etc.</li>\n<li>Experience with relational and NoSQL databases including Postgres and Cassandra.</li>\n<li>Experience with pipeline and workflow management tools: Airflow, Luigi, Azkaban, or similar.</li>\n<li>Experience with AWS cloud services: EC2, EMR, RDS, Redshift.</li>\n<li>Experience with stream-processing systems: Storm, Spark Streaming, or similar.</li>\n<li>Working knowledge of message queuing, stream processing, and highly scalable data stores.</li>\n<li>Proficiency in object-oriented/scripting languages: Python, Java, Scala, C++, or similar.</li>\n<li>Experience supporting cross-functional teams in dynamic, agile environments.</li>\n</ul>\n<p><br></p>\n<p><span style=\"font-weight: bold\">Preferred Experience:</span></p>\n<ul>\n<li>Experience designing or supporting data infrastructure for AI/ML model training, including annotated datasets and feature stores.</li>\n<li>Familiarity with AI-assisted data quality or observability platforms (e.g., Monte Carlo, Soda, or similar).</li>\n<li>Experience with LLM-based data processing pipelines or retrieval-augmented generation (RAG) architectures.</li>\n<li>Healthcare data experience; familiarity with FHIR/HL7 standards a plus.</li>\n<li>High English proficiency<br></li>\n</ul>",
"compensation": null,
"departmentId": "18660",
"locationType": "1",
"seekPromoted": false,
"jobCategoryId": null,
"jobOpeningName": "Senior Data Engineer",
"departmentLabel": "1420 - Decision Science",
"jobOpeningStatus": "Open",
"minimumExperience": "Experienced",
"jobOpeningShareUrl": "https://stellusrx.bamboohr.com/careers/169",
"employmentStatusLabel": "Full Time"
}
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/f0c9765af9828ff9c31f793e42d8236964c84e18?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/4cd4a191-0968-42e8-9a8f-e4a9a35b49fdJSONGET https://api.bluedoor.sh/job-postings/v1/sources/59d126cb-05fc-457f-9ee8-fb61c5d59b52JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/f0c9765af9828ff9c31f793e42d8236964c84e18/eventsJSON