bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesStatworxData Engineer NLP (m/w/d)

Data Engineer NLP (m/w/d)

Statworx · Frankfurt am Main · Active · Personio

Job facts

FieldValue
CompanyStatworx
TitleData Engineer NLP (m/w/d)
Normalized title-
Department / teamAI Development / Full Time
LocationFrankfurt am Main
Work model-
Employment typeFull Time
Salary-
Statusactive
ATS providerPersonio
Posted / first seen2025-11-13 / 2026-05-30
Changed / last seen2026-05-30 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Statworx.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Personio.Open
Provider filtered searchThe same provider as a filtered job collection.Open
Department jobsActive postings in AI Development.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyStatworx
Source04cb15d2-3af6-4f08-b78f-47aad1066cc3
ATS providerPersonio

Description

About us With deep expertise in Data Engineering, Data Science, and Machine Learning, we help our clients unlock the full potential of their data.statworx is a leading consulting and development company for data and AI, based in Frankfurt am Main. We offer strategic consulting for medium-sized businesses and global corporations. We develop innovative data and AI solutions across all business areas and corporate functions. We empower people at all levels of expertise with our data and AI education formats. In short: We support companies in all aspects of digital transformation – for over 10 years, in more than 1000 data and AI projects, and for over 100 clients from almost all industries. Our AI Development department acts as a catalyst for data and AI transformation. We take a holistic approach that spans the entire journey — from assessing AI maturity to designing, developing, and scaling end-to-end data and AI solutions. Your tasks Focus: Data pipelines and provisioning for NLP and LLM-based applications Combine classical data engineering with modern NLP approaches – particularly in the context of Large Language Models (LLMs), embeddings, knowledge graphs, Retrieval-Augmented Generation (RAG), and text-to-SQL applications Design, develop, and operate modern data architectures that form the foundation for advanced NLP applications – from knowledge management systems and semantic search solutions to RAG use cases Work closely with our clients to understand their business requirements and data processes, and translate them into tailored, scalable data and AI solutions Implement scalable data pipelines and infrastructures to efficiently provide, transform, and version large volumes of structured and unstructured data Ensure data quality, security, and governance along the entire value chain, and establish best practices for handling sensitive data in AI projects Build and operate scalable data infrastructures in cloud environments, and automate deployments and monitoring systems to ensure reliability and availability Provide strategic advice to clients and internal teams on data architecture, technologies, tools, and best practices, acting as a trusted advisor Support and mentor junior colleagues, share your knowledge within the team, and contribute to the development of statworx’s data engineering community through workshops, blog posts, or internal talks Your profile You hold a Master’s degree in (Business) Informatics, Computer Science, or a related field You have at least five years of relevant professional experience in data engineering or data architecture You have a strong understanding of modern data architectures (Data Lakes, Lakehouses, Data Warehouses) and are experienced in ETL/ELT processes and data modeling Ideally, you have experience building data infrastructures for NLP applications – especially in the context of LLMs, Retrieval-Augmented Generation (RAG), semantic layers, and knowledge graphs Hands-on experience with text-to-SQL systems or developing interfaces between natural language and databases is a plus You are experienced with cloud platforms (Azure, AWS, or GCP) and data platforms such as Databricks or Snowflake You are familiar with Infrastructure-as-Code (e.g., Terraform, Pulumi) and CI/CD workflows (e.g., GitHub Actions, GitLab CI, Azure DevOps) You have excellent programming skills in Python, SQL, and Bash/Shell, and you write clean, efficient, and maintainable code You understand the importance of data governance, security, and privacy (e.g., GDPR) and incorporate these principles into your architectural design You combine strong analytical thinking with the ability to translate business requirements into technical solutions and communicate effectively with stakeholders at all levels You are fluent in English (written and spoken) and have advanced German skills — or are willing to actively improve them Our offer Data & AI consulting as our core business: Work on exciting projects with leading clients – from cutting-edge NLP use cases to complex data science and machine learning solutions Depth and diversity: Engage with challenging, multifaceted problems and continuously expand your expertise in data science, machine learning, and AI Continuous development: We support your professional and personal growth through regular feedback, tailored learning opportunities, and our mentoring program Culture and collaboration: Experience an open, inclusive, and respectful working environment with flat hierarchies, short decision-making paths, and a strong sense of team spirit Agile mindset: We embrace a modern, iterative way of working characterized by transparency, autonomy, and space for new ideas Transparent compensation: Enjoy fair, structured salary bands that are regularly reviewed and adjusted to market developments Flexible work setup:   Our modern Frankfurt office is your main place of work – at the same time, you have the flexibility to work remotely on a regular basis and up to four weeks per year from within the EU Mobility & well-being: Benefit from a subsidized Germany Ticket and discounted access to sports and wellness programs via Wellpass Equipment & extras: High-quality IT equipment (e.g., MacBook Pro), regular team events, childcare support, and attractive employee discounts complete your package Your application Simply apply via the application form and attach your current CV including a description of your methodological skills. We will contact you as soon as possible. If you have any questions on your application, you can reach us at [email protected]. What is particularly important to us is that we value the uniqueness of each person and always treat each other as equals. Different backgrounds, attitudes and ideas enrich us and form the basis of our success. That's why we welcome every application - regardless of gender, nationality, ethnic and social origin, religion, ideology, disability, age, or sexual orientation and identity.

Full job record

Job IDbcded71d9f7a50d4bd0a6ca3e3961c09b3f9a82c
Org IDaa4d9607-2425-49b3-b45c-61d78cbaf6c8
Source ID04cb15d2-3af6-4f08-b78f-47aad1066cc3
Board ID04cb15d2-3af6-4f08-b78f-47aad1066cc3
Providerpersonio
Provider Job Key2426002
TitleData Engineer NLP (m/w/d)
Normalized Title
Statusactive
Activeyes
Location TextFrankfurt am Main
DepartmentAI Development
TeamFull Time
Employment Typefull_time
Workplace Type
Remote Policy
CountryFrankfurt am Main
Region
City
Salary Raw
Salary Min
Salary Max
Salary Currency
Salary Period
Source URLhttps://statworx.jobs.personio.de/job/2426002?language=en
Apply URLhttps://statworx.jobs.personio.de/job/2426002?language=en
First Seen At2026-05-30 05:39:34Z
Last Seen At2026-06-06 07:45:55Z
Last Checked At2026-06-06 07:45:55Z
Last Changed At2026-05-30 05:39:34Z
Inactive At
Source Posted At2025-11-13 10:24:32Z
Source Updated At
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=personio/board=statworx.de/date=2026-06-06/2026-06-06T07-45-54-672Z-03e067361893c78a106c5b153c898474accda661749543d2b84035fc81a87b60.json
Event Fields
{
  "content_hash": "3d48b1a260a181e60464948330d90cf678f45642dd75271dc2681cc9cfd38e3e",
  "source_hash": "fef08790105d5f1565e9f0e6f03bb5c82f25501831c436c0c386dae9e3ee226e",
  "last_changed_at": "2026-05-30T05:39:34.202Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Frankfurt am Main",
    "city": null,
    "region": null,
    "country": "Frankfurt am Main",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T07:45:55.509Z",
  "launch_scope": {
    "reason": "personio_production_catalog",
    "included": true,
    "location": {
      "raw": "Frankfurt am Main",
      "city": null,
      "region": null,
      "country": "Frankfurt am Main",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Frankfurt am Main"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}
Extensions
{}
Native Structured
{
  "id": "2426002",
  "name": "Data Engineer NLP (m/w/d)",
  "office": "Frankfurt am Main",
  "keywords": [],
  "schedule": "full-time",
  "createdAt": "2025-11-13T10:24:32+00:00",
  "seniority": "experienced",
  "department": "AI Development",
  "occupation": "database_development_and_administration",
  "subcompany": "statworx GmbH (DE)",
  "employmentType": "permanent",
  "jobDescriptions": [
    {
      "name": "About us",
      "value": "<span style=\"font-size:14px;\">With deep expertise in Data Engineering, Data Science, and Machine Learning, we help our clients unlock the full potential of their data.statworx is a leading consulting and development company for data and AI, based in Frankfurt am Main. We offer strategic consulting for medium-sized businesses and global corporations. We develop innovative data and AI solutions across all business areas and corporate functions. We empower people at all levels of expertise with our data and AI education formats. In short: We support companies in all aspects of digital transformation – for over 10 years, in more than 1000 data and AI projects, and for over 100 clients from almost all industries.</span><br><br><span style=\"font-size:14px;\"><span style=\"font-size:14px;\">Our AI Development department acts as a catalyst for data and AI transformation. We take a holistic approach that spans the entire journey — from assessing AI maturity to designing, developing, and scaling end-to-end data and AI solutions.</span></span>"
    },
    {
      "name": "Your tasks",
      "value": "<div><strong>Focus: Data pipelines and provisioning for NLP and LLM-based applications</strong></div><ul><li>Combine classical data engineering with modern NLP approaches – particularly in the context of Large Language Models (LLMs), embeddings, knowledge graphs, Retrieval-Augmented Generation (RAG), and text-to-SQL applications</li><li>Design, develop, and operate modern data architectures that form the foundation for advanced NLP applications – from knowledge management systems and semantic search solutions to RAG use cases</li><li>Work closely with our clients to understand their business requirements and data processes, and translate them into tailored, scalable data and AI solutions</li><li>Implement scalable data pipelines and infrastructures to efficiently provide, transform, and version large volumes of structured and unstructured data</li><li>Ensure data quality, security, and governance along the entire value chain, and establish best practices for handling sensitive data in AI projects</li><li>Build and operate scalable data infrastructures in cloud environments, and automate deployments and monitoring systems to ensure reliability and availability</li><li>Provide strategic advice to clients and internal teams on data architecture, technologies, tools, and best practices, acting as a trusted advisor</li><li>Support and mentor junior colleagues, share your knowledge within the team, and contribute to the development of statworx’s data engineering community through workshops, blog posts, or internal talks</li></ul><br>"
    },
    {
      "name": "Your profile",
      "value": "<ul><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You hold a Master’s degree in (Business) Informatics, Computer Science, or a related field</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You have at least five years of relevant professional experience in data engineering or data architecture</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You have a strong understanding of modern data architectures (Data Lakes, Lakehouses, Data Warehouses) and are experienced in ETL/ELT processes and data modeling</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">Ideally, you have experience building data infrastructures for NLP applications – especially in the context of LLMs, Retrieval-Augmented Generation (RAG), semantic layers, and knowledge graphs</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">Hands-on experience with text-to-SQL systems or developing interfaces between natural language and databases is a plus</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You are experienced with cloud platforms (Azure, AWS, or GCP) and data platforms such as Databricks or Snowflake</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You are familiar with Infrastructure-as-Code (e.g., Terraform, Pulumi) and CI/CD workflows (e.g., GitHub Actions, GitLab CI, Azure DevOps)</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You have excellent programming skills in Python, SQL, and Bash/Shell, and you write clean, efficient, and maintainable code</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You understand the importance of data governance, security, and privacy (e.g., GDPR) and incorporate these principles into your architectural design</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You combine strong analytical thinking with the ability to translate business requirements into technical solutions and communicate effectively with stakeholders at all levels</li><li style=\"font-family:Arial, Helvetica, sans-serif;font-size:14px;\">You are fluent in English (written and spoken) and have advanced German skills — or are willing to actively improve them</li></ul>"
    },
    {
      "name": "Our offer",
      "value": "<ul><li>Data & AI consulting as our core business: Work on exciting projects with leading clients – from cutting-edge NLP use cases to complex data science and machine learning solutions</li><li>Depth and diversity: Engage with challenging, multifaceted problems and continuously expand your expertise in data science, machine learning, and AI</li><li>Continuous development: We support your professional and personal growth through regular feedback, tailored learning opportunities, and our mentoring program</li><li>Culture and collaboration: Experience an open, inclusive, and respectful working environment with flat hierarchies, short decision-making paths, and a strong sense of team spirit</li><li>Agile mindset: We embrace a modern, iterative way of working characterized by transparency, autonomy, and space for new ideas</li><li>Transparent compensation: Enjoy fair, structured salary bands that are regularly reviewed and adjusted to market developments</li><li>Flexible work setup:<span> </span>Our modern Frankfurt office is your main place of work – at the same time, you have the flexibility to work remotely on a regular basis and up to four weeks per year from within the EU</li><li>Mobility & well-being: Benefit from a subsidized Germany Ticket and discounted access to sports and wellness programs via Wellpass</li><li>Equipment & extras: High-quality IT equipment (e.g., MacBook Pro), regular team events, childcare support, and attractive employee discounts complete your package</li></ul><br>"
    },
    {
      "name": "Your application",
      "value": "<p>Simply apply via the application form and attach your current CV including a description of your methodological skills. We will contact you as soon as possible. If you have any questions on your application, you can reach us at [email protected].<br><br>What is particularly important to us is that we value the uniqueness of each person and always treat each other as equals. Different backgrounds, attitudes and ideas enrich us and form the basis of our success. That's why we welcome every application - regardless of gender, nationality, ethnic and social origin, religion, ideology, disability, age, or sexual orientation and identity.</p>"
    }
  ],
  "occupationCategory": "it_software",
  "recruitingCategory": "Full Time"
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/bcded71d9f7a50d4bd0a6ca3e3961c09b3f9a82c?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/aa4d9607-2425-49b3-b45c-61d78cbaf6c8JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/04cb15d2-3af6-4f08-b78f-47aad1066cc3JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/bcded71d9f7a50d4bd0a6ca3e3961c09b3f9a82c/eventsJSON