AI Data Engineer

C The Signs · Boston, United States (Remote) · Remote · Active · Workable

Job facts

Field | Value
Company | C The Signs
Title | AI Data Engineer
Normalized title | -
Department / team | Other
Location | Boston, United States
Work model | Remote / Remote
Employment type | -
Salary | -
Status | active
ATS provider | Workable
Posted / first seen | 2026-04-28 / 2026-05-31
Changed / last seen | 2026-05-31 / 2026-06-06

Related slices

Page | What it contains
Company jobs | Active postings from C The Signs.
Company breakdowns | Role, location, ATS, and work model facets for this company.
ATS provider jobs | Active postings observed through Workable.
Provider filtered search | The same provider as a filtered job collection.
City jobs | Active postings in Boston.
Department jobs | Active postings in Other.
Work model jobs | Active Remote postings.
Lifecycle events | Open, update, close, and reopen events for this posting.
Original posting | Canonical source or apply URL captured from the ATS.

Linked records

Company: C The Signs
Source: c0cdce61-3e3d-481f-ac34-799da9c624b4
ATS provider: Workable

Description

Position Summary

The Data Engineer will play a crucial role in developing and fine-tuning data specifically for our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.

You will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high-quality, high-volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement.

Key Responsibilities

- Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine-tuning.
- Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.
- Implement robust data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.
- Implement robust data cleaning, validation, and transformation processes to ensure data quality and integrity.
- Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.
- Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).
- Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.
- Document data engineering processes, data models, and data dictionaries.
- Stay up to date with the latest advancements in data engineering, big data technologies, and machine learning.

Requirements

Required

- Bachelor's degree in Computer Science, Engineering, or a related field.
- Proven experience as a Data Engineer, with a focus on big data technologies.
- Strong proficiency in programming languages such as Python, Scala, or Java.
- Extensive experience with data warehousing, ETL processes, and data modeling.
- Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.
- Hands-on experience with big data frameworks like Apache Spark for distributed processing.
- Excellent problem-solving skills and the ability to work independently and as part of a team.
- Strong communication and interpersonal skills.

Preferred

- Master's degree in a related field.
- Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).
- Familiarity with machine learning concepts and LLM fine-tuning processes.
- Experience with data orchestration tools (e.g., Apache Airflow).

Work Authorization:

Must be a US Citizen, Green Card holder, or currently in the US with a valid H1B visa.

Benefits

Why Join Us?

Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.

Benefits:

- Competitive salary and benefits package.
- Flexible working arrangements (remote or hybrid options available).
- The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
- Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
- Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.

Full job record

Job ID: 15af990f294dc2a44c051fe7cf7a9a8dd2a0bd96
Org ID: 80a156fe-fe28-40ac-8f76-79f62cd700cf
Source ID: c0cdce61-3e3d-481f-ac34-799da9c624b4
Board ID: c0cdce61-3e3d-481f-ac34-799da9c624b4
Provider: workable
Provider Job Key: 0BC90FBDB6
Title: AI Data Engineer
Normalized Title:
Status: active
Active: yes
Location Text: Boston, United States (Remote)
Department: Other
Team:
Employment Type:
Workplace Type: remote
Remote Policy: remote
Country: United States
Region:
City: Boston
Salary Raw: (duplicates the full Description text above; no salary figure is present)
Salary Min:
Salary Max:
Salary Currency:
Salary Period:
Source URL: https://apply.workable.com/c-the-signs/jobs/view/0BC90FBDB6
Apply URL: https://apply.workable.com/c-the-signs/j/0BC90FBDB6/apply
First Seen At: 2026-05-31 17:47:30Z
Last Seen At: 2026-06-06 13:32:14Z
Last Checked At: 2026-06-06 13:32:14Z
Last Changed At: 2026-05-31 17:47:30Z
Inactive At:
Source Posted At: 2026-04-28 00:00:00Z
Source Updated At:
Raw Payload Uri: s3://job-postings-prod-raw-590183727216/raw/provider=workable/board=c-the-signs/date=2026-06-06/2026-06-06T13-32-14-201Z-1d7e9089a1f44896e8046ec330c66acb780333ce7f00506c73e19318243ac2e3.json
Event Fields
{
  "content_hash": "647b6ba872e864757a98ddeb2bd69b2332725d1dfae788e6d4643905a8bdf16a",
  "source_hash": "9365aab644c48ac3704ff1448cf61406538058cf7842190a026346f4107c2f5b",
  "last_changed_at": "2026-05-31T17:47:30.512Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Boston, United States (Remote)",
    "city": "Boston",
    "region": null,
    "country": "United States",
    "is_remote": true,
    "confidence": 0.95
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T13:32:14.640Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Boston, United States (Remote)",
      "city": "Boston",
      "region": null,
      "country": "United States",
      "is_remote": true,
      "confidence": 0.95
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": null,
  "workplace_type": "remote",
  "salary_currency": null
}
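The `location` object above pairs the raw string with its parsed fields. As a rough illustration of how such a string could be split, here is a minimal sketch. The function name `parse_location` and the suffix pattern are hypothetical; the actual bluedoor parser is not described on this page, and the `confidence` score is not reproduced.

```python
import re

# Hypothetical pattern: a trailing "(Remote)", "(Hybrid)", or "(On-site)" marker.
WORKPLACE_SUFFIX = re.compile(r"\s*\((remote|hybrid|on-?site)\)\s*$", re.IGNORECASE)


def parse_location(raw):
    """Split a string like 'Boston, United States (Remote)' into parts.

    Assumes a comma-separated 'City, [Region,] Country' layout, which is
    only what this single record shows.
    """
    match = WORKPLACE_SUFFIX.search(raw)
    workplace = match.group(1).lower() if match else None

    # Drop the workplace marker, then split the remainder on commas.
    place = WORKPLACE_SUFFIX.sub("", raw).strip()
    parts = [part.strip() for part in place.split(",") if part.strip()]

    return {
        "raw": raw,
        "city": parts[0] if len(parts) >= 2 else None,
        "region": parts[1] if len(parts) >= 3 else None,
        "country": parts[-1] if parts else None,
        "is_remote": workplace == "remote",
    }


if __name__ == "__main__":
    print(parse_location("Boston, United States (Remote)"))
```

For the raw value in this record, the sketch yields the same `city`, `region`, `country`, and `is_remote` values as the parsed object shown above.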
Extensions
{}
Native Structured
{
  "detail": {
    "type": "—",
    "title": "AI Data Engineer",
    "posted": "2026-04-28",
    "company": "C the Signs",
    "applyUrl": "https://apply.workable.com/c-the-signs/j/0BC90FBDB6/apply",
    "location": "Boston, United States (Remote)",
    "workplace": "remote",
    "department": null,
    "descriptionText": "Description\n\n Position Summary\n\nThe Data Engineer will play a crucial role in developing and fine tuning data specifically for our LLMs and machine learning models. This individual will be responsible for the entire data lifecycle, including gathering, cleaning, structuring, and optimizing large, diverse healthcare datasets. The ideal candidate will have a strong background in data engineering principles, experience with big data technologies, and a keen understanding of the unique challenges and requirements of healthcare data.\n\nYou will design, build, and maintain scalable data pipelines that source, preprocess, and deliver high quality, high volume datasets to our machine learning engineers. This role requires a deep understanding of data engineering best practices coupled with specific knowledge of the data requirements for LLM training and refinement\n\n Key Responsibilities\n\n Collaborate with data scientists and machine learning engineers to understand data requirements for LLM and machine learning model fine tuning.\n Design, build, and maintain scalable data pipelines to ingest, process, and store massive and diverse healthcare datasets.\n Implement robust data validation and monitoring to ensure the integrity, accuracy, and consistency of all training datasets.\n Implement robust data cleaning, validation, and transformation processes to ensure data quality and integrity.\n Develop and optimize data structures and schemas for efficient access and utilization by LLMs and machine learning models.\n Work with the team to identify and acquire new data sources, ensuring compliance with relevant healthcare regulations (e.g., HIPAA).\n Monitor data pipeline performance, troubleshoot issues, and implement optimizations to improve efficiency and reliability.\n Document data engineering processes, data models, and data dictionaries.\n Stay up to date with the latest advancements in data engineering, big data technologies, and machine learning.\n\n Requirements\n\n Required\n\n Bachelor's degree in Computer Science, Engineering, or a related field.\n Proven experience as a Data Engineer, with a focus on big data technologies.\n Strong proficiency in programming languages such as Python, Scala, or Java.\n Extensive experience with data warehousing, ETL processes, and data modeling.\n Experience with major cloud providers (e.g., AWS, GCP, Azure) and their data storage and processing services.\n Hands on experience with big data frameworks like Apache Spark for distributed processing.\n Excellent problem solving skills and the ability to work independently and as part of a team.\n Strong communication and interpersonal skills.\n\n Preferred\n\n Master's degree in a related field.\n Experience with healthcare data and a good understanding of healthcare data standards (e.g., FHIR, HL7).\n Familiarity with machine learning concepts and LLM fine tuning processes.\n Experience with data orchestration tools (e.g., Apache Airflow).\n\n Work Authorization:\n\n Must be a US Citizen, Green Card holder, or currently in the US have valid H1B visa\n\n Benefits\n\n Why Join Us? \n\nJoining  C the Signs  is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.\n\n Benefits: \n\n Competitive salary and benefits package.\n Flexible working arrangements (remote or hybrid options available).\n The opportunity to work on life changing AI technology that directly impacts patient outcomes.\n Join a team that combines cutting edge innovation with a mission to save lives and improve health equity.\n Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare."
  },
  "list_job": {
    "id": "0BC90FBDB6",
    "type": null,
    "title": "AI Data Engineer",
    "posted": "2026-04-28",
    "salary": null,
    "location": "Boston, United States (Remote)",
    "detailUrl": "https://apply.workable.com/c-the-signs/jobs/view/0BC90FBDB6.md",
    "department": "Other"
  },
  "detail_meta": {
    "url": "https://apply.workable.com/c-the-signs/jobs/view/0BC90FBDB6.md",
    "http_status": 200,
    "content_type": "text/markdown; charset=utf-8",
    "response_bytes": 4186
  },
  "detail_errors": []
}
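The Source URL, Apply URL, and `detailUrl` values in this record all combine the board slug `c-the-signs` with the provider job key `0BC90FBDB6`. The sketch below captures that pattern as seen in this one record. It is an inference from the captured values, not a documented Workable contract, and the function names `workable_urls` and `parse_workable_url` are hypothetical.

```python
from urllib.parse import urlparse

# Host taken from the URLs captured in this record.
WORKABLE_APPLY_HOST = "https://apply.workable.com"


def workable_urls(board, job_key):
    """Build the three URL shapes observed in this record."""
    base = f"{WORKABLE_APPLY_HOST}/{board}"
    return {
        "source_url": f"{base}/jobs/view/{job_key}",
        "detail_url": f"{base}/jobs/view/{job_key}.md",
        "apply_url": f"{base}/j/{job_key}/apply",
    }


def parse_workable_url(url):
    """Return (board, job_key) for the observed URL shapes, else None."""
    segments = [segment for segment in urlparse(url).path.split("/") if segment]
    if len(segments) >= 4 and segments[1:3] == ["jobs", "view"]:
        key = segments[3]
        if key.endswith(".md"):
            key = key[: -len(".md")]
        return segments[0], key
    if len(segments) >= 3 and segments[1] == "j":
        return segments[0], segments[2]
    return None


if __name__ == "__main__":
    for name, url in workable_urls("c-the-signs", "0BC90FBDB6").items():
        print(name, url)
```

Recovering the job key from a URL is one way to cross-check it against the Provider Job Key field and `list_job.id`.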
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/15af990f294dc2a44c051fe7cf7a9a8dd2a0bd96?include=description (JSON)
GET https://api.bluedoor.sh/job-postings/v1/orgs/80a156fe-fe28-40ac-8f76-79f62cd700cf (JSON)
GET https://api.bluedoor.sh/job-postings/v1/sources/c0cdce61-3e3d-481f-ac34-799da9c624b4 (JSON)
GET https://api.bluedoor.sh/job-postings/v1/jobs/15af990f294dc2a44c051fe7cf7a9a8dd2a0bd96/events (JSON)
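The four requests above could be issued from a script along these lines. This is a minimal sketch using only the Python standard library. The helper names (`build_url`, `page_urls`, `fetch_json`) are hypothetical, and whether the API needs an API key or other headers is not stated on this page, so the `headers` parameter is an assumption left for the caller to fill in.

```python
import json
import urllib.request
from urllib.parse import urlencode

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"

# IDs copied from the "Full job record" section of this page.
JOB_ID = "15af990f294dc2a44c051fe7cf7a9a8dd2a0bd96"
ORG_ID = "80a156fe-fe28-40ac-8f76-79f62cd700cf"
SOURCE_ID = "c0cdce61-3e3d-481f-ac34-799da9c624b4"


def build_url(path, params=None):
    """Join the base URL, a resource path, and optional query parameters."""
    url = f"{BASE_URL}/{path.lstrip('/')}"
    if params:
        url += "?" + urlencode(params)
    return url


def page_urls(job_id, org_id, source_id):
    """Return the four GET URLs listed on this page, keyed by resource."""
    return {
        "job": build_url(f"jobs/{job_id}", {"include": "description"}),
        "org": build_url(f"orgs/{org_id}"),
        "source": build_url(f"sources/{source_id}"),
        "events": build_url(f"jobs/{job_id}/events"),
    }


def fetch_json(url, headers=None, timeout=10.0):
    """GET a URL and decode the JSON body.

    Any authentication header is an assumption; pass it via `headers`
    if the API requires one.
    """
    merged = {"Accept": "application/json", **(headers or {})}
    request = urllib.request.Request(url, headers=merged)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Print the URLs only; call fetch_json(url) to perform the requests.
    for name, url in page_urls(JOB_ID, ORG_ID, SOURCE_ID).items():
        print(name, url)
```

Building the URLs separately from fetching them keeps the request list easy to inspect before any network call is made.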