Senior Data Engineer – Berlin (On-Site)

Mamahealth · Berlin · On Site · Active · Personio

Job facts

Field	Value
Company	Mamahealth
Title	Senior Data Engineer – Berlin (On-Site)
Normalized title	-
Department / team	AI and Data Science / Standard process
Location	Berlin
Work model	On Site
Employment type	Full Time
Salary	-
Status	active
ATS provider	Personio
Posted / first seen	2026-01-05 / 2026-05-30
Changed / last seen	2026-05-30 / 2026-06-06

Related slices

Page	What it contains
Company jobs	Active postings from Mamahealth.
Company breakdowns	Role, location, ATS, and work model facets for this company.
ATS provider jobs	Active postings observed through Personio.
Provider filtered search	The same provider as a filtered job collection.
Department jobs	Active postings in AI and Data Science.
Work model jobs	Active On Site postings.
Lifecycle events	Open, update, close, and reopen events for this posting.
Original posting	Canonical source or apply URL captured from the ATS.

Linked records

Company	Mamahealth
Source	162afd2b-34e8-4e36-a774-066e10fc0709
ATS provider	Personio

Description

Your mission

As a Senior Data Engineer, you will play a key role in building the data backbone of mama health’s real-world evidence platform. We use data mining and analytics to systematically map the end-to-end experiences of chronic patients, because today researchers, pharmaceutical companies, and governments often can’t reliably observe what happens to patients along their real care path. They rely on theoretical protocols (what should happen), but lack tools to monitor and understand the reality (what actually happens). We bridge this gap by transforming patient-generated data into structured, high-quality evidence that can improve decision-making and, ultimately, patient outcomes.

In this role, you will work hands-on with data from multiple sources and formats, including event data, operational systems, partner datasets, and unstructured inputs such as free text. You will help turn these raw inputs into reliable, analytics-ready datasets that power patient journey analysis, product features, and reporting for pharmaceutical partners. This includes building and maintaining pipelines, defining consistent data models, and enabling fast and trustworthy access to data for analytics, product, and AI teams.

You’ll also contribute to the operational excellence of our data ecosystem: implementing monitoring and quality checks, improving pipeline reliability and performance, and helping us evolve the platform as our product and data volume grow. Working closely with cross-functional stakeholders (product, AI, data science, and healthcare experts), you will translate real-world needs into pragmatic data solutions, balancing speed and iteration with a high bar for privacy, security, and correctness in a sensitive healthcare context.

Your work will directly support our mission to transform healthcare by leveraging the collective wisdom of patients and generating real-world insights that accelerate research and improve lives.

Your profile

- Bachelor’s or Master’s degree in Computer Science, Data Engineering, Data Science, or a related field.
- 3+ years (or equivalent) experience in a Data Engineering / Analytics Engineering / Backend-leaning data role.
- Strong programming skills in Python and solid SQL skills (joins, window functions, performance basics).
- Solid understanding of ETL/ELT concepts and experience building or maintaining data pipelines (batch and/or streaming).
- Experience working with relational databases (e.g. PostgreSQL/MySQL); familiarity with analytical storage (e.g. BigQuery/Snowflake/Redshift) is a plus.
- Familiarity with orchestration or transformation tooling (e.g. Airflow, dbt, Dagster, Prefect).
- Ability to clean, transform, and validate large datasets, and to implement practical data quality checks.
- Interest or experience in extracting structured signals from unstructured data (e.g. free text / conversations) to support analytics.
- Basic experience with monitoring/observability for pipelines (logs, alerts, SLAs) and willingness to own reliability.
- Familiarity with cloud platforms (preferably AWS; GCP/Azure also fine) and containerization (e.g. Docker).
- Knowledge graph experience is a plus and highly valued.
- Good engineering hygiene: Git, code reviews, small frequent commits, tests, CI/CD (to the level appropriate for data systems).
- Strong communication skills and willingness to collaborate with product, analytics, and AI teams.
- Comfortable working in a fast-paced startup environment with iterative delivery.
- This role requires you to be based in Berlin.

Why us?

- A mission that matters: help shape the future of healthcare at one of the fastest-growing AI healthtech startups in Europe
- Competitive compensation package, including equity
- Regular team events and off-sites with an exceptional team
- Wolt dinner + a ride home when working late
- Being at the forefront of technology, building cutting-edge AI products for healthcare
- Great office space and work environment in Berlin

Full job record

Job ID	fe132299840ed541e889f8bd6a072f5a99bc3f48
Org ID	c5059808-2325-4e5a-8a7a-1d7902aa189e
Source ID	162afd2b-34e8-4e36-a774-066e10fc0709
Board ID	162afd2b-34e8-4e36-a774-066e10fc0709
Provider	personio
Provider Job Key	2476886
Title	Senior Data Engineer – Berlin (On-Site)
Normalized Title	—
Status	active
Active	yes
Location Text	Berlin
Department	AI and Data Science
Team	Standard process
Employment Type	full_time
Workplace Type	on_site
Remote Policy	—
Country	Berlin
Region	—
City	—
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://mamahealth.jobs.personio.com/job/2476886?language=en
Apply URL	https://mamahealth.jobs.personio.com/job/2476886?language=en
First Seen At	2026-05-30 05:42:07Z
Last Seen At	2026-06-06 07:49:43Z
Last Checked At	2026-06-06 07:49:43Z
Last Changed At	2026-05-30 05:42:07Z
Inactive At	—
Source Posted At	2026-01-05 16:29:28Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=personio/board=mamahealth.com/date=2026-06-06/2026-06-06T07-49-43-331Z-b56607a4f2e026dc0c627ae899efee1e04589bfbdde26bf55fdd8a6f6f33e3fd.json

Event Fields

{
  "content_hash": "cf87b25a28ab2b05cdf53638de932451eadc832ffdd6953e9fc0eefcf046ea90",
  "source_hash": "b4ae74aec30c610ca0aa3a0d638b9d4094524502745f0636fb450e0b8f0cee98",
  "last_changed_at": "2026-05-30T05:42:07.755Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Berlin",
    "city": null,
    "region": null,
    "country": "Berlin",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T07:49:43.989Z",
  "launch_scope": {
    "reason": "personio_production_catalog",
    "included": true,
    "location": {
      "raw": "Berlin",
      "city": null,
      "region": null,
      "country": "Berlin",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Berlin"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "2476886",
  "name": "Senior Data Engineer – Berlin (On-Site)",
  "office": "Berlin",
  "keywords": [],
  "schedule": "full-time",
  "createdAt": "2026-01-05T16:29:28+00:00",
  "seniority": "experienced",
  "department": "AI and Data Science",
  "occupation": "database_development_and_administration",
  "subcompany": null,
  "employmentType": "permanent",
  "jobDescriptions": [
    {
      "name": "Your mission",
      "value": "<p>As a Senior <span>Data Engineer</span>, you will play a key role in building the data backbone of mama health’s real-world evidence platform. We use data mining and analytics to systematically map the end-to-end experiences of chronic patients—because today, researchers, pharmaceutical companies, and governments often can’t reliably observe what happens to patients along their real care path. They rely on theoretical protocols (what should happen), but lack tools to monitor and understand the reality (what actually happens). We bridge this gap by transforming patient-generated data into structured, high-quality evidence that can improve decision-making and, ultimately, patient outcomes.</p><p>In this role, you will work hands-on with data from multiple sources and formats, including event data, operational systems, partner datasets, and <span>unstructured inputs such as free text</span>. You will help turn these raw inputs into reliable, analytics-ready datasets that power patient journey analysis, product features, and reporting for pharmaceutical partners. This includes building and maintaining pipelines, defining consistent data models, and enabling fast and trustworthy access to data for analytics, product, and AI teams.</p><p>You’ll also contribute to the operational excellence of our data ecosystem: implementing monitoring and quality checks, improving pipeline reliability and performance, and helping us evolve the platform as our product and data volume grow. Working closely with cross-functional stakeholders (product, AI, data science, and healthcare experts), you will translate real-world needs into pragmatic data solutions—balancing speed and iteration with a high bar for privacy, security, and correctness in a sensitive healthcare context.</p><p>Your work will directly support our mission to transform healthcare by leveraging the collective wisdom of patients and generating real-world insights that accelerate research and improve lives.</p>"
    },
    {
      "name": "Your profile",
      "value": "<ul><li><p>Bachelor’s or Master’s degree in <span>Computer Science, Data Engineering, Data Science</span>, or a related field.</p></li><li><p><span>3+ years</span> (or equivalent) experience in a Data Engineering / Analytics Engineering / Backend-leaning data role.</p></li><li><p>Strong programming skills in <span>Python</span> and solid <span>SQL</span> skills (joins, window functions, performance basics).</p></li><li><p>Solid understanding of <span>ETL/ELT concepts</span> and experience building or maintaining data pipelines (batch and/or streaming).</p></li><li><p>Experience working with <span>relational databases</span> (e.g. PostgreSQL/MySQL) and familiarity with analytical storage (e.g. BigQuery/Snowflake/Redshift) is a plus.</p></li><li><p>Familiarity with orchestration or transformation tooling (e.g. <span>Airflow, dbt, Dagster, Prefect</span>).</p></li><li><p>Ability to clean, transform, and validate <span>large datasets</span>, and to implement practical <span>data quality checks</span>.</p></li><li><p>Interest or experience in extracting structured signals from <span>unstructured data</span> (e.g. free text / conversations) to support analytics.</p></li><li><p>Basic experience with <span>monitoring/observability</span> for pipelines (logs, alerts, SLAs) and willingness to own reliability.</p></li><li><p>Familiarity with cloud platforms (preferably <span>AWS</span>; GCP/Azure also fine) and containerization (e.g. <span>Docker</span>).</p></li><li><p>Knowledge Graph is a plus but highly valued.</p></li><li><p>Good engineering hygiene: <span>Git, code reviews, small frequent commits, tests, CI/CD</span> (to the level appropriate for data systems).</p></li><li><p>Strong communication skills and willingness to collaborate with product, analytics, and AI teams.</p></li><li><p>Comfortable working in a fast-paced startup environment with iterative delivery.</p></li><li><p>This role requires you to be based in Berlin.</p></li></ul>"
    },
    {
      "name": "Why us?",
      "value": "<ul><li style=\"font-weight:bold;\"><strong>A mission that matters:</strong><strong> help shape the future of healthcare </strong><strong>at one of the fastest-growing AI healthtech startups in Europe</strong></li><li><strong>Competitive compensation package, including equity</strong></li><li><strong>Regular team events and off-sites with an exceptional team</strong></li><li><strong>Wolt dinner + a ride home when working late</strong></li><li><strong>Being at the forefront of technology, building cutting-edge AI products for healthcare</strong></li><li><strong>Great office space and work environment in Berlin</strong></li></ul>"
    }
  ],
  "occupationCategory": "it_software",
  "recruitingCategory": "Standard process"
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/fe132299840ed541e889f8bd6a072f5a99bc3f48?include=description

GET https://api.bluedoor.sh/job-postings/v1/orgs/c5059808-2325-4e5a-8a7a-1d7902aa189e

GET https://api.bluedoor.sh/job-postings/v1/sources/162afd2b-34e8-4e36-a774-066e10fc0709

GET https://api.bluedoor.sh/job-postings/v1/jobs/fe132299840ed541e889f8bd6a072f5a99bc3f48/events
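
The four requests can be assembled from the IDs in the full job record. The following is a minimal Python sketch, not an official client: the base URL, paths, and IDs are taken from this page, while the helper names `build_urls` and `fetch_json` are hypothetical, and the sketch assumes the endpoints return JSON. The page mentions an API key but does not say how it is sent, so `fetch_json` leaves any authentication headers to the caller.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Base URL as shown in the requests on this page.
BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def build_urls(job_id, org_id, source_id):
    """Return the four GET URLs for one posting, keyed by a short label."""
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?" + urlencode({"include": "description"}),
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(url, headers=None, timeout=10):
    """GET a URL and decode the body as JSON (assumed response format).

    How the API key is passed is not documented on this page, so the
    caller supplies whatever headers the API docs require.
    """
    request = urllib.request.Request(url, headers=headers or {}, method="GET")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# IDs copied from the "Full job record" section of this page.
URLS = build_urls(
    job_id="fe132299840ed541e889f8bd6a072f5a99bc3f48",
    org_id="c5059808-2325-4e5a-8a7a-1d7902aa189e",
    source_id="162afd2b-34e8-4e36-a774-066e10fc0709",
)
```

Calling `fetch_json(URLS["events"], headers=...)` would then retrieve the lifecycle events for this posting, provided the headers match what the API documentation specifies.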
