Home › Companies › Mamahealth › Senior Data Engineer – Berlin (On-Site)
Senior Data Engineer – Berlin (On-Site)
Mamahealth · Berlin · On Site · Active · Personio
Job facts
| Field | Value |
|---|---|
| Company | Mamahealth |
| Title | Senior Data Engineer – Berlin (On-Site) |
| Normalized title | - |
| Department / team | AI and Data Science / Standard process |
| Location | Berlin |
| Work model | On Site |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Personio |
| Posted / first seen | 2026-01-05 / 2026-05-30 |
| Changed / last seen | 2026-05-30 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Mamahealth. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Personio. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| Department jobs | Active postings in AI and Data Science. | Open |
| Work model jobs | Active On Site postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Mamahealth |
| Source | 162afd2b-34e8-4e36-a774-066e10fc0709 |
| ATS provider | Personio |
Description
Your mission
As a Senior Data Engineer , you will play a key role in building the data backbone of mama health’s real-world evidence platform. We use data mining and analytics to systematically map the end-to-end experiences of chronic patients—because today, researchers, pharmaceutical companies, and governments often can’t reliably observe what happens to patients along their real care path. They rely on theoretical protocols (what should happen), but lack tools to monitor and understand the reality (what actually happens). We bridge this gap by transforming patient-generated data into structured, high-quality evidence that can improve decision-making and, ultimately, patient outcomes.
In this role, you will work hands-on with data from multiple sources and formats, including event data, operational systems, partner datasets, and unstructured inputs such as free text . You will help turn these raw inputs into reliable, analytics-ready datasets that power patient journey analysis, product features, and reporting for pharmaceutical partners. This includes building and maintaining pipelines, defining consistent data models, and enabling fast and trustworthy access to data for analytics, product, and AI teams.
You’ll also contribute to the operational excellence of our data ecosystem: implementing monitoring and quality checks, improving pipeline reliability and performance, and helping us evolve the platform as our product and data volume grow. Working closely with cross-functional stakeholders (product, AI, data science, and healthcare experts), you will translate real-world needs into pragmatic data solutions—balancing speed and iteration with a high bar for privacy, security, and correctness in a sensitive healthcare context.
Your work will directly support our mission to transform healthcare by leveraging the collective wisdom of patients and generating real-world insights that accelerate research and improve lives.
Your profile
Bachelor’s or Master’s degree in Computer Science, Data Engineering, Data Science , or a related field.
3+ years (or equivalent) experience in a Data Engineering / Analytics Engineering / Backend-leaning data role.
Strong programming skills in Python and solid SQL skills (joins, window functions, performance basics).
Solid understanding of ETL/ELT concepts and experience building or maintaining data pipelines (batch and/or streaming).
Experience working with relational databases (e.g. PostgreSQL/MySQL) and familiarity with analytical storage (e.g. BigQuery/Snowflake/Redshift) is a plus.
Familiarity with orchestration or transformation tooling (e.g. Airflow, dbt, Dagster, Prefect ).
Ability to clean, transform, and validate large datasets , and to implement practical data quality checks .
Interest or experience in extracting structured signals from unstructured data (e.g. free text / conversations) to support analytics.
Basic experience with monitoring/observability for pipelines (logs, alerts, SLAs) and willingness to own reliability.
Familiarity with cloud platforms (preferably AWS ; GCP/Azure also fine) and containerization (e.g. Docker ).
Knowledge Graph is a plus but highly valued.
Good engineering hygiene: Git, code reviews, small frequent commits, tests, CI/CD (to the level appropriate for data systems).
Strong communication skills and willingness to collaborate with product, analytics, and AI teams.
Comfortable working in a fast-paced startup environment with iterative delivery.
This role requires you to be based in Berlin.
Why us?
A mission that matters: help shape the future of healthcare at one of the fastest-growing AI healthtech startups in Europe Competitive compensation package, including equity Regular team events and off-sites with an exceptional team Wolt dinner + a ride home when working late Being at the forefront of technology, building cutting-edge AI products for healthcare Great office space and work environment in Berlin
Full job record
| Job ID | fe132299840ed541e889f8bd6a072f5a99bc3f48 |
| Org ID | c5059808-2325-4e5a-8a7a-1d7902aa189e |
| Source ID | 162afd2b-34e8-4e36-a774-066e10fc0709 |
| Board ID | 162afd2b-34e8-4e36-a774-066e10fc0709 |
| Provider | personio |
| Provider Job Key | 2476886 |
| Title | Senior Data Engineer – Berlin (On-Site) |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Berlin |
| Department | AI and Data Science |
| Team | Standard process |
| Employment Type | full_time |
| Workplace Type | on_site |
| Remote Policy | — |
| Country | Berlin |
| Region | — |
| City | — |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://mamahealth.jobs.personio.com/job/2476886?language=en |
| Apply URL | https://mamahealth.jobs.personio.com/job/2476886?language=en |
| First Seen At | 2026-05-30 05:42:07Z |
| Last Seen At | 2026-06-06 07:49:43Z |
| Last Checked At | 2026-06-06 07:49:43Z |
| Last Changed At | 2026-05-30 05:42:07Z |
| Inactive At | — |
| Source Posted At | 2026-01-05 16:29:28Z |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=personio/board=mamahealth.com/date=2026-06-06/2026-06-06T07-49-43-331Z-b56607a4f2e026dc0c627ae899efee1e04589bfbdde26bf55fdd8a6f6f33e3fd.json |
Event Fields
{
"content_hash": "cf87b25a28ab2b05cdf53638de932451eadc832ffdd6953e9fc0eefcf046ea90",
"source_hash": "b4ae74aec30c610ca0aa3a0d638b9d4094524502745f0636fb450e0b8f0cee98",
"last_changed_at": "2026-05-30T05:42:07.755Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "Berlin",
"city": null,
"region": null,
"country": "Berlin",
"is_remote": false,
"confidence": 0.8
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T07:49:43.989Z",
"launch_scope": {
"reason": "personio_production_catalog",
"included": true,
"location": {
"raw": "Berlin",
"city": null,
"region": null,
"country": "Berlin",
"is_remote": false,
"confidence": 0.8
},
"countries": [
"Berlin"
]
},
"remote_policy": null,
"salary_period": null,
"workplace_type": "on_site",
"salary_currency": null
}Extensions
{}Native Structured
{
"id": "2476886",
"name": "Senior Data Engineer – Berlin (On-Site)",
"office": "Berlin",
"keywords": [],
"schedule": "full-time",
"createdAt": "2026-01-05T16:29:28+00:00",
"seniority": "experienced",
"department": "AI and Data Science",
"occupation": "database_development_and_administration",
"subcompany": null,
"employmentType": "permanent",
"jobDescriptions": [
{
"name": "Your mission",
"value": "<p>As a Senior <span>Data Engineer</span>, you will play a key role in building the data backbone of mama health’s real-world evidence platform. We use data mining and analytics to systematically map the end-to-end experiences of chronic patients—because today, researchers, pharmaceutical companies, and governments often can’t reliably observe what happens to patients along their real care path. They rely on theoretical protocols (what should happen), but lack tools to monitor and understand the reality (what actually happens). We bridge this gap by transforming patient-generated data into structured, high-quality evidence that can improve decision-making and, ultimately, patient outcomes.</p><p>In this role, you will work hands-on with data from multiple sources and formats, including event data, operational systems, partner datasets, and <span>unstructured inputs such as free text</span>. You will help turn these raw inputs into reliable, analytics-ready datasets that power patient journey analysis, product features, and reporting for pharmaceutical partners. This includes building and maintaining pipelines, defining consistent data models, and enabling fast and trustworthy access to data for analytics, product, and AI teams.</p><p>You’ll also contribute to the operational excellence of our data ecosystem: implementing monitoring and quality checks, improving pipeline reliability and performance, and helping us evolve the platform as our product and data volume grow. Working closely with cross-functional stakeholders (product, AI, data science, and healthcare experts), you will translate real-world needs into pragmatic data solutions—balancing speed and iteration with a high bar for privacy, security, and correctness in a sensitive healthcare context.</p><p>Your work will directly support our mission to transform healthcare by leveraging the collective wisdom of patients and generating real-world insights that accelerate research and improve lives.</p>"
},
{
"name": "Your profile",
"value": "<ul><li><p>Bachelor’s or Master’s degree in <span>Computer Science, Data Engineering, Data Science</span>, or a related field.</p></li><li><p><span>3+ years</span> (or equivalent) experience in a Data Engineering / Analytics Engineering / Backend-leaning data role.</p></li><li><p>Strong programming skills in <span>Python</span> and solid <span>SQL</span> skills (joins, window functions, performance basics).</p></li><li><p>Solid understanding of <span>ETL/ELT concepts</span> and experience building or maintaining data pipelines (batch and/or streaming).</p></li><li><p>Experience working with <span>relational databases</span> (e.g. PostgreSQL/MySQL) and familiarity with analytical storage (e.g. BigQuery/Snowflake/Redshift) is a plus.</p></li><li><p>Familiarity with orchestration or transformation tooling (e.g. <span>Airflow, dbt, Dagster, Prefect</span>).</p></li><li><p>Ability to clean, transform, and validate <span>large datasets</span>, and to implement practical <span>data quality checks</span>.</p></li><li><p>Interest or experience in extracting structured signals from <span>unstructured data</span> (e.g. free text / conversations) to support analytics.</p></li><li><p>Basic experience with <span>monitoring/observability</span> for pipelines (logs, alerts, SLAs) and willingness to own reliability.</p></li><li><p>Familiarity with cloud platforms (preferably <span>AWS</span>; GCP/Azure also fine) and containerization (e.g. <span>Docker</span>).</p></li><li><p>Knowledge Graph is a plus but highly valued.</p></li><li><p>Good engineering hygiene: <span>Git, code reviews, small frequent commits, tests, CI/CD</span> (to the level appropriate for data systems).</p></li><li><p>Strong communication skills and willingness to collaborate with product, analytics, and AI teams.</p></li><li><p>Comfortable working in a fast-paced startup environment with iterative delivery.</p></li><li><p>This role requires you to be based in Berlin.</p></li></ul>"
},
{
"name": "Why us?",
"value": "<ul><li style=\"font-weight:bold;\"><strong>A mission that matters:</strong><strong> help shape the future of healthcare </strong><strong>at one of the fastest-growing AI healthtech startups in Europe</strong></li><li><strong>Competitive compensation package, including equity</strong></li><li><strong>Regular team events and off-sites with an exceptional team</strong></li><li><strong>Wolt dinner + a ride home when working late</strong></li><li><strong>Being at the forefront of technology, building cutting-edge AI products for healthcare</strong></li><li><strong>Great office space and work environment in Berlin</strong></li></ul>"
}
],
"occupationCategory": "it_software",
"recruitingCategory": "Standard process"
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/fe132299840ed541e889f8bd6a072f5a99bc3f48?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/c5059808-2325-4e5a-8a7a-1d7902aa189eJSONGET https://api.bluedoor.sh/job-postings/v1/sources/162afd2b-34e8-4e36-a774-066e10fc0709JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/fe132299840ed541e889f8bd6a072f5a99bc3f48/eventsJSON