bluedoor data · Job Postings API · bluedoor.sh

Senior Data Engineer – Berlin (On-Site)

Mamahealth · Berlin · On Site · Active · Personio

Job facts

Field: Value
Company: Mamahealth
Title: Senior Data Engineer – Berlin (On-Site)
Normalized title: -
Department / team: AI and Data Science / Standard process
Location: Berlin
Work model: On Site
Employment type: Full Time
Salary: -
Status: active
ATS provider: Personio
Posted / first seen: 2026-01-05 / 2026-05-30
Changed / last seen: 2026-05-30 / 2026-06-06

Related slices

Page: What it contains
Company jobs: Active postings from Mamahealth.
Company breakdowns: Role, location, ATS, and work model facets for this company.
ATS provider jobs: Active postings observed through Personio.
Provider filtered search: The same provider as a filtered job collection.
Department jobs: Active postings in AI and Data Science.
Work model jobs: Active On Site postings.
Lifecycle events: Open, update, close, and reopen events for this posting.
Original posting: Canonical source or apply URL captured from the ATS.

Linked records

Company: Mamahealth
Source: 162afd2b-34e8-4e36-a774-066e10fc0709
ATS provider: Personio

Description

Your mission

As a Senior Data Engineer, you will play a key role in building the data backbone of mama health’s real-world evidence platform. We use data mining and analytics to systematically map the end-to-end experiences of chronic patients, because today researchers, pharmaceutical companies, and governments often can’t reliably observe what happens to patients along their real care path. They rely on theoretical protocols (what should happen), but lack tools to monitor and understand the reality (what actually happens). We bridge this gap by transforming patient-generated data into structured, high-quality evidence that can improve decision-making and, ultimately, patient outcomes.

In this role, you will work hands-on with data from multiple sources and formats, including event data, operational systems, partner datasets, and unstructured inputs such as free text. You will help turn these raw inputs into reliable, analytics-ready datasets that power patient journey analysis, product features, and reporting for pharmaceutical partners. This includes building and maintaining pipelines, defining consistent data models, and enabling fast and trustworthy access to data for analytics, product, and AI teams.

You’ll also contribute to the operational excellence of our data ecosystem: implementing monitoring and quality checks, improving pipeline reliability and performance, and helping us evolve the platform as our product and data volume grow. Working closely with cross-functional stakeholders (product, AI, data science, and healthcare experts), you will translate real-world needs into pragmatic data solutions, balancing speed and iteration with a high bar for privacy, security, and correctness in a sensitive healthcare context.

Your work will directly support our mission to transform healthcare by leveraging the collective wisdom of patients and generating real-world insights that accelerate research and improve lives.

Your profile

- Bachelor’s or Master’s degree in Computer Science, Data Engineering, Data Science, or a related field.
- 3+ years (or equivalent) experience in a Data Engineering / Analytics Engineering / Backend-leaning data role.
- Strong programming skills in Python and solid SQL skills (joins, window functions, performance basics).
- Solid understanding of ETL/ELT concepts and experience building or maintaining data pipelines (batch and/or streaming).
- Experience working with relational databases (e.g. PostgreSQL/MySQL); familiarity with analytical storage (e.g. BigQuery/Snowflake/Redshift) is a plus.
- Familiarity with orchestration or transformation tooling (e.g. Airflow, dbt, Dagster, Prefect).
- Ability to clean, transform, and validate large datasets, and to implement practical data quality checks.
- Interest or experience in extracting structured signals from unstructured data (e.g. free text / conversations) to support analytics.
- Basic experience with monitoring/observability for pipelines (logs, alerts, SLAs) and willingness to own reliability.
- Familiarity with cloud platforms (preferably AWS; GCP/Azure also fine) and containerization (e.g. Docker).
- Knowledge graph experience is a plus and highly valued.
- Good engineering hygiene: Git, code reviews, small frequent commits, tests, CI/CD (to the level appropriate for data systems).
- Strong communication skills and willingness to collaborate with product, analytics, and AI teams.
- Comfortable working in a fast-paced startup environment with iterative delivery.
- This role requires you to be based in Berlin.

Why us?

- A mission that matters: help shape the future of healthcare at one of the fastest-growing AI healthtech startups in Europe
- Competitive compensation package, including equity
- Regular team events and off-sites with an exceptional team
- Wolt dinner + a ride home when working late
- Being at the forefront of technology, building cutting-edge AI products for healthcare
- Great office space and work environment in Berlin

Full job record

Job ID: fe132299840ed541e889f8bd6a072f5a99bc3f48
Org ID: c5059808-2325-4e5a-8a7a-1d7902aa189e
Source ID: 162afd2b-34e8-4e36-a774-066e10fc0709
Board ID: 162afd2b-34e8-4e36-a774-066e10fc0709
Provider: personio
Provider Job Key: 2476886
Title: Senior Data Engineer – Berlin (On-Site)
Normalized Title:
Status: active
Active: yes
Location Text: Berlin
Department: AI and Data Science
Team: Standard process
Employment Type: full_time
Workplace Type: on_site
Remote Policy:
Country: Berlin
Region:
City:
Salary Raw:
Salary Min:
Salary Max:
Salary Currency:
Salary Period:
Source URL: https://mamahealth.jobs.personio.com/job/2476886?language=en
Apply URL: https://mamahealth.jobs.personio.com/job/2476886?language=en
First Seen At: 2026-05-30 05:42:07Z
Last Seen At: 2026-06-06 07:49:43Z
Last Checked At: 2026-06-06 07:49:43Z
Last Changed At: 2026-05-30 05:42:07Z
Inactive At:
Source Posted At: 2026-01-05 16:29:28Z
Source Updated At:
Raw Payload Uri: s3://job-postings-prod-raw-590183727216/raw/provider=personio/board=mamahealth.com/date=2026-06-06/2026-06-06T07-49-43-331Z-b56607a4f2e026dc0c627ae899efee1e04589bfbdde26bf55fdd8a6f6f33e3fd.json
Event Fields
{
  "content_hash": "cf87b25a28ab2b05cdf53638de932451eadc832ffdd6953e9fc0eefcf046ea90",
  "source_hash": "b4ae74aec30c610ca0aa3a0d638b9d4094524502745f0636fb450e0b8f0cee98",
  "last_changed_at": "2026-05-30T05:42:07.755Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Berlin",
    "city": null,
    "region": null,
    "country": "Berlin",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T07:49:43.989Z",
  "launch_scope": {
    "reason": "personio_production_catalog",
    "included": true,
    "location": {
      "raw": "Berlin",
      "city": null,
      "region": null,
      "country": "Berlin",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Berlin"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}
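The parsed location above carries a confidence score next to the raw text, and here the structured fields are sparse (no city or region). As a minimal sketch, a consumer might fall back to the raw string whenever the parse looks doubtful. The `location_label` helper and the 0.9 threshold are hypothetical choices for illustration, not part of the bluedoor API.

```python
import json

# Subset of the "Parsed Structured" payload shown above.
parsed = json.loads("""
{
  "location": {
    "raw": "Berlin",
    "city": null,
    "region": null,
    "country": "Berlin",
    "is_remote": false,
    "confidence": 0.8
  },
  "workplace_type": "on_site"
}
""")


def location_label(location: dict, min_confidence: float = 0.9) -> str:
    """Return a display string for a parsed location.

    Uses the structured fields only when the confidence reaches the
    (hypothetical) threshold; otherwise falls back to the raw text.
    """
    parts = [location.get(key) for key in ("city", "region", "country")]
    structured = ", ".join(part for part in parts if part)
    if structured and location.get("confidence", 0.0) >= min_confidence:
        return structured
    return location.get("raw") or structured


print(location_label(parsed["location"]))
```

With this record the confidence of 0.8 is below the assumed threshold, so the helper returns the raw location text.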
Extensions
{}
Native Structured
{
  "id": "2476886",
  "name": "Senior Data Engineer – Berlin (On-Site)",
  "office": "Berlin",
  "keywords": [],
  "schedule": "full-time",
  "createdAt": "2026-01-05T16:29:28+00:00",
  "seniority": "experienced",
  "department": "AI and Data Science",
  "occupation": "database_development_and_administration",
  "subcompany": null,
  "employmentType": "permanent",
  "jobDescriptions": [
    {
      "name": "Your mission",
      "value": "<p>As a Senior <span>Data Engineer</span>, you will play a key role in building the data backbone of mama health’s real-world evidence platform. We use data mining and analytics to systematically map the end-to-end experiences of chronic patients—because today, researchers, pharmaceutical companies, and governments often can’t reliably observe what happens to patients along their real care path. They rely on theoretical protocols (what should happen), but lack tools to monitor and understand the reality (what actually happens). We bridge this gap by transforming patient-generated data into structured, high-quality evidence that can improve decision-making and, ultimately, patient outcomes.</p><p>In this role, you will work hands-on with data from multiple sources and formats, including event data, operational systems, partner datasets, and <span>unstructured inputs such as free text</span>. You will help turn these raw inputs into reliable, analytics-ready datasets that power patient journey analysis, product features, and reporting for pharmaceutical partners. This includes building and maintaining pipelines, defining consistent data models, and enabling fast and trustworthy access to data for analytics, product, and AI teams.</p><p>You’ll also contribute to the operational excellence of our data ecosystem: implementing monitoring and quality checks, improving pipeline reliability and performance, and helping us evolve the platform as our product and data volume grow. Working closely with cross-functional stakeholders (product, AI, data science, and healthcare experts), you will translate real-world needs into pragmatic data solutions—balancing speed and iteration with a high bar for privacy, security, and correctness in a sensitive healthcare context.</p><p>Your work will directly support our mission to transform healthcare by leveraging the collective wisdom of patients and generating real-world insights that accelerate research and improve lives.</p>"
    },
    {
      "name": "Your profile",
      "value": "<ul><li><p>Bachelor’s or Master’s degree in <span>Computer Science, Data Engineering, Data Science</span>, or a related field.</p></li><li><p><span>3+ years</span> (or equivalent) experience in a Data Engineering / Analytics Engineering / Backend-leaning data role.</p></li><li><p>Strong programming skills in <span>Python</span> and solid <span>SQL</span> skills (joins, window functions, performance basics).</p></li><li><p>Solid understanding of <span>ETL/ELT concepts</span> and experience building or maintaining data pipelines (batch and/or streaming).</p></li><li><p>Experience working with <span>relational databases</span> (e.g. PostgreSQL/MySQL) and familiarity with analytical storage (e.g. BigQuery/Snowflake/Redshift) is a plus.</p></li><li><p>Familiarity with orchestration or transformation tooling (e.g. <span>Airflow, dbt, Dagster, Prefect</span>).</p></li><li><p>Ability to clean, transform, and validate <span>large datasets</span>, and to implement practical <span>data quality checks</span>.</p></li><li><p>Interest or experience in extracting structured signals from <span>unstructured data</span> (e.g. free text / conversations) to support analytics.</p></li><li><p>Basic experience with <span>monitoring/observability</span> for pipelines (logs, alerts, SLAs) and willingness to own reliability.</p></li><li><p>Familiarity with cloud platforms (preferably <span>AWS</span>; GCP/Azure also fine) and containerization (e.g. <span>Docker</span>).</p></li><li><p>Knowledge Graph is a plus but highly valued.</p></li><li><p>Good engineering hygiene: <span>Git, code reviews, small frequent commits, tests, CI/CD</span> (to the level appropriate for data systems).</p></li><li><p>Strong communication skills and willingness to collaborate with product, analytics, and AI teams.</p></li><li><p>Comfortable working in a fast-paced startup environment with iterative delivery.</p></li><li><p>This role requires you to be based in Berlin.</p></li></ul>"
    },
    {
      "name": "Why us?",
      "value": "<ul><li style=\"font-weight:bold;\"><strong>A mission that matters:</strong><strong> help shape the future of healthcare </strong><strong>at one of the fastest-growing AI healthtech startups in Europe</strong></li><li><strong>Competitive compensation package, including equity</strong></li><li><strong>Regular team events and off-sites with an exceptional team</strong></li><li><strong>Wolt dinner + a ride home when working late</strong></li><li><strong>Being at the forefront of technology, building cutting-edge AI products for healthcare</strong></li><li><strong>Great office space and work environment in Berlin</strong></li></ul>"
    }
  ],
  "occupationCategory": "it_software",
  "recruitingCategory": "Standard process"
}
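The Description section of this record reads like the `jobDescriptions` entries above with their HTML markup removed. The following is a minimal sketch of that kind of flattening using only the Python standard library. It assumes that treating `p`, `li`, `ul`, and `br` as word boundaries is good enough, and `html_to_text` and `flatten_descriptions` are hypothetical helper names, not bluedoor functions.

```python
from html.parser import HTMLParser

# Tags after which a space is emitted so adjacent blocks do not fuse.
BLOCK_TAGS = {"p", "li", "ul", "br"}


class _TextExtractor(HTMLParser):
    """Collects text nodes and inserts a space at block boundaries."""

    def __init__(self) -> None:
        super().__init__()
        self.parts: list[str] = []

    def handle_data(self, data: str) -> None:
        self.parts.append(data)

    def handle_endtag(self, tag: str) -> None:
        if tag in BLOCK_TAGS:
            self.parts.append(" ")


def html_to_text(html: str) -> str:
    """Strip tags from an HTML fragment and collapse runs of whitespace."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join("".join(parser.parts).split())


def flatten_descriptions(job_descriptions: list[dict]) -> str:
    """Join each section name with its plain-text body."""
    return " ".join(
        f"{section['name']} {html_to_text(section['value'])}"
        for section in job_descriptions
    )


sample = [
    {"name": "Your mission", "value": "<p>Build <span>pipelines</span>.</p>"},
    {"name": "Your profile", "value": "<ul><li><p>Python</p></li><li><p>SQL</p></li></ul>"},
]
print(flatten_descriptions(sample))
```

Inline tags such as `span` add no space here, so punctuation that follows a highlighted term stays attached to it.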
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/fe132299840ed541e889f8bd6a072f5a99bc3f48?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/c5059808-2325-4e5a-8a7a-1d7902aa189e
GET https://api.bluedoor.sh/job-postings/v1/sources/162afd2b-34e8-4e36-a774-066e10fc0709
GET https://api.bluedoor.sh/job-postings/v1/jobs/fe132299840ed541e889f8bd6a072f5a99bc3f48/events
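The four requests listed above can be assembled from the Job ID, Org ID, and Source ID in the full job record. The sketch below builds those URLs and includes an optional fetch helper. The helper names are hypothetical, and this page does not say whether the API requires authentication, so the fetch helper sends no credentials and is not called here.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def record_urls(job_id: str, org_id: str, source_id: str) -> dict[str, str]:
    """Build the four GET URLs that reproduce this page."""
    job_url = f"{BASE_URL}/jobs/{job_id}"
    return {
        "job": f"{job_url}?{urlencode({'include': 'description'})}",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{job_url}/events",
    }


def fetch_json(url: str, timeout: float = 10.0):
    """Fetch and decode one URL.

    Assumption: no auth header is sent, because the page documents none.
    Add whatever credentials the API actually requires.
    """
    request = Request(url, headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)


urls = record_urls(
    job_id="fe132299840ed541e889f8bd6a072f5a99bc3f48",
    org_id="c5059808-2325-4e5a-8a7a-1d7902aa189e",
    source_id="162afd2b-34e8-4e36-a774-066e10fc0709",
)
for url in urls.values():
    print(f"GET {url}")
```

Calling `fetch_json(urls["job"])` would then issue the first request, subject to whatever access rules the API enforces.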