bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesCareers Berkley Icims ComDatabricks Data Engineer

Databricks Data Engineer

Careers Berkley Icims Com · Manassas, VA, US · On Site · Active · $1 · iCIMS

Job facts

FieldValue
CompanyCareers Berkley Icims Com
TitleDatabricks Data Engineer
Normalized title-
Department / teamFinancial ServicesInsurance
LocationManassas, VA, United States
Work modelOn Site
Employment typeOTHER
Salary$1
Statusactive
ATS provideriCIMS
Posted / first seen2026-03-25 / 2026-05-31
Changed / last seen2026-06-01 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Careers Berkley Icims Com.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through iCIMS.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Manassas.Open
Department jobsActive postings in Financial ServicesInsurance.Open
Work model jobsActive On Site postings.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyCareers Berkley Icims Com
Source127a694f-8b17-4da1-aadb-c31f41496ac9
ATS provideriCIMS

Description

Company Details We started in early 2019 as a small group of technologists with a passion for making insurance better. Today we are working with a team of industry experts who run five different insurance brands and collectively control $1 billion in annual premiums. We believe in an idea and execution meritocracy. In other words, a place where the best ideas win and the people who deliver the most value get the most opportunities. As we grow our team, we are looking for inquisitive, entrepreneurial people who are excited to reimagine the insurance industry. Insurance is too complex. Help us make it better. Responsibilities This position requires on-site work Monday–Thursday at either our Manassas, VA or Chesterfield, MO location. The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision advantage at scale. This role will work with cutting-edge tools like Databricks, Delta Lake, PySpark, and AI/BI genie to transform raw data into actionable insights. As a hands-on Databricks Data Engineer with deep expertise in Azure Databricks and MLOps, this role will have the opportunity to migrate and translate legacy SSIS ETL logic into scalable, cloud-native data pipelines in Databricks. This role will partner with data engineers, data scientists, and product manager to design features, train/evaluate models, and deploy them to production using MLflow, Databricks and Workflows—with rigorous observability, governance (Unity Catalog), and CI/CD automation. Data Pipeline Engineering Design, build, and maintain high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark. Convert and modernize existing SSIS package logic into cloud-native Databricks pipelines using PySpark notebooks, Delta Live Tables (DLT), and Databricks Workflows. Implement reliable batch and streaming pipelines with robust data quality and validation frameworks. Optimize pipeline performance using Photon, efficient file formats, partitioning, Z-ordering, and caching strategies. Lakehouse Platform Development Develop and manage datasets within Delta Lake, ensuring ACID reliability, schema evolution, versioning, and time travel. Architect feature-rich data layers including: Bronze (raw ingestion) Silver (validated, conformed) Gold (analytics-ready and ML-ready) Implement data governance using Unity Catalog for fine-grained access control, lineage, auditability, and metadata management. MLOps & ML-Enabled Data Pipelines Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines. Deploy and operationalize models using MLflow, Databricks Model Registry, and Databricks Workflows. Use Databricks built-in AI SQL functions such as ai_query, ai_forecast, ai_analyze_sentiment to generate actionable insight from large amount of unstructured or structured raw data Implement monitoring for: Pipeline failures Data/feature drift Model performance degradation Operational SLAs/SLIs/SLOs Build automated CI/CD workflows using GitHub Actions or Azure DevOps for notebook deployment, pipeline testing, and environment promotion. Data Platform, Data Security & Data Governance Collaborate with data engineers to design reliable data products on Delta Lake ; leverage Delta Live Tables (DLT) for declarative pipelines when applicable. Enforce Unity Catalog for lineage, permissions, and audit; manage secrets, tokens, and keys securely (e.g., Databricks secrets , Key Vault/Secrets Manager ). Collaboration & Leadership Work closely with cross-functional teams: data engineering, data scientist, product manager, and business stakeholders. Serve as a Databricks SME—championing best practices, code standards, governance, and reusable frameworks. Document architecture, workflows, data models, runbooks, and operational procedures. Qualifications Minimum of 3 years of experience in Databricks, PySpark notebooks, Python, DevOps, software development, and data engineering. Certified Databricks Data Engineer Associate or Professional is a plus. Skills & Competencies Proficient in designing, building, deploying, and maintaining high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark Notebook. Proficient in building, deploying, and operating production ML models such as supervised, unsupervised, and anomaly detection, including techniques for imbalanced datasets Proficient with ML engineering and MLOps, including model versioning, CI/CD for ML, monitoring, drift detection, and automated retraining Proficiency in Python including Pandas and PySpark Dataframes Expert level of SQL skills including Stored Procedure, experience with SSIS, SSRS, Power BI is a plus. Proficient with cloud data engineering platforms, such as Azure, Databricks, Spark, or SQL, and batch and streaming pipelines Familiar with Databricks AI Built-In Functions such as AI_Query, AI_Gen, AI_Classify, AI_Forecast, AI_Analyze_Sentiment, able to use them to extract actionable insights from large amount of unstructured or structured raw data Experience with Python and ML frameworks, such as PyTorch or TensorFlow Experience improving data quality, lineage, and observability in enterprise data environments and operationalizing rules and model-driven scoring for prioritization, routing, or case selection Experience with predictive analytics, machine learning and artificial intelligence desired. Education A Bachelor’s degree in Computer Science, Management Information Systems, Engineering, Math, Physics, or a related quantitative field is required (4-year degree). Master’s degree preferred Experience in the commercial insurance industry is a plus. Additional Company Details The Company is an equal employment opportunity employer. We do not accept any unsolicited resumes from external recruiting firms. The company offers a competitive compensation plan and robust benefits package for full time regular employees. Base salary & Benefits include Health, dental, vision, life, disability, wellness, paid time off, 401(k) and profit-sharing plans. The actual salary for this position will be determined by a number of factors, including the scope, complexity and location of the role; the skills, education, training, credentials and experience of the candidate; and other conditions of employment. Additional Requirements • Ability to travel locally and nationally up to 5% of the time Sponsorship Details Sponsorship not Offered for this Role

Full job record

Job ID9af1490f7b4ffdc6f981a0737723420754131ee7
Org IDcf2e78de-6d29-45b2-b00e-598433eb6813
Source ID127a694f-8b17-4da1-aadb-c31f41496ac9
Board ID127a694f-8b17-4da1-aadb-c31f41496ac9
Providericims
Provider Job Key13782
TitleDatabricks Data Engineer
Normalized Title
Statusactive
Activeyes
Location TextManassas, VA, US
DepartmentFinancial ServicesInsurance
Team
Employment TypeOTHER
Workplace Typeon_site
Remote Policy
CountryUnited States
RegionVA
CityManassas
Salary RawCompany Details We started in early 2019 as a small group of technologists with a passion for making insurance better. Today we are working with a team of industry experts who run five different insurance brands and collectively control $1 billion in annual premiums. We believe in an idea and execution meritocracy. In other words, a place where the best ideas win and the people who deliver the most value get the most opportunities. As we grow our team, we are looking for inquisitive, entrepreneurial people who are excited to reimagine the insurance industry. Insurance is too complex. Help us make it better. Responsibilities This position requires on-site work Monday–Thursday at either our Manassas, VA or Chesterfield, MO location. The Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision advantage at scale. This role will work with cutting-edge tools like Databricks, Delta Lake, PySpark, and AI/BI genie to transform raw data into actionable insights. As a hands-on Databricks Data Engineer with deep expertise in Azure Databricks and MLOps, this role will have the opportunity to migrate and translate legacy SSIS ETL logic into scalable, cloud-native data pipelines in Databricks. This role will partner with data engineers, data scientists, and product manager to design features, train/evaluate models, and deploy them to production using MLflow, Databricks and Workflows—with rigorous observability, governance (Unity Catalog), and CI/CD automation. Data Pipeline Engineering Design, build, and maintain high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark. Convert and modernize existing SSIS package logic into cloud-native Databricks pipelines using PySpark notebooks, Delta Live Tables (DLT), and Databricks Workflows. Implement reliable batch and streaming pipelines with robust data quality and validation frameworks. Optimize pipeline performance using Photon, efficient file formats, partitioning, Z-ordering, and caching strategies. Lakehouse Platform Development Develop and manage datasets within Delta Lake, ensuring ACID reliability, schema evolution, versioning, and time travel. Architect feature-rich data layers including: Bronze (raw ingestion) Silver (validated, conformed) Gold (analytics-ready and ML-ready) Implement data governance using Unity Catalog for fine-grained access control, lineage, auditability, and metadata management. MLOps & ML-Enabled Data Pipelines Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines. Deploy and operationalize models using MLflow, Databricks Model Registry, and Databricks Workflows. Use Databricks built-in AI SQL functions such as ai_query, ai_forecast, ai_analyze_sentiment to generate actionable insight from large amount of unstructured or structured raw data Implement monitoring for: Pipeline failures Data/feature drift Model performance degradation Operational SLAs/SLIs/SLOs Build automated CI/CD workflows using GitHub Actions or Azure DevOps for notebook deployment, pipeline testing, and environment promotion. Data Platform, Data Security & Data Governance Collaborate with data engineers to design reliable data products on Delta Lake ; leverage Delta Live Tables (DLT) for declarative pipelines when applicable. Enforce Unity Catalog for lineage, permissions, and audit; manage secrets, tokens, and keys securely (e.g., Databricks secrets , Key Vault/Secrets Manager ). Collaboration & Leadership Work closely with cross-functional teams: data engineering, data scientist, product manager, and business stakeholders. Serve as a Databricks SME—championing best practices, code standards, governance, and reusable frameworks. Document architecture, workflows, data models, runbooks, and operational procedures. Qualifications Minimum of 3 years of experience in Databricks, PySpark notebooks, Python, DevOps, software development, and data engineering. Certified Databricks Data Engineer Associate or Professional is a plus. Skills & Competencies Proficient in designing, building, deploying, and maintaining high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark Notebook. Proficient in building, deploying, and operating production ML models such as supervised, unsupervised, and anomaly detection, including techniques for imbalanced datasets Proficient with ML engineering and MLOps, including model versioning, CI/CD for ML, monitoring, drift detection, and automated retraining Proficiency in Python including Pandas and PySpark Dataframes Expert level of SQL skills including Stored Procedure, experience with SSIS, SSRS, Power BI is a plus. Proficient with cloud data engineering platforms, such as Azure, Databricks, Spark, or SQL, and batch and streaming pipelines Familiar with Databricks AI Built-In Functions such as AI_Query, AI_Gen, AI_Classify, AI_Forecast, AI_Analyze_Sentiment, able to use them to extract actionable insights from large amount of unstructured or structured raw data Experience with Python and ML frameworks, such as PyTorch or TensorFlow Experience improving data quality, lineage, and observability in enterprise data environments and operationalizing rules and model-driven scoring for prioritization, routing, or case selection Experience with predictive analytics, machine learning and artificial intelligence desired. Education A Bachelor’s degree in Computer Science, Management Information Systems, Engineering, Math, Physics, or a related quantitative field is required (4-year degree). Master’s degree preferred Experience in the commercial insurance industry is a plus. Additional Company Details The Company is an equal employment opportunity employer. We do not accept any unsolicited resumes from external recruiting firms. The company offers a competitive compensation plan and robust benefits package for full time regular employees. Base salary & Benefits include Health, dental, vision, life, disability, wellness, paid time off, 401(k) and profit-sharing plans. The actual salary for this position will be determined by a number of factors, including the scope, complexity and location of the role; the skills, education, training, credentials and experience of the candidate; and other conditions of employment. Additional Requirements • Ability to travel locally and nationally up to 5% of the time Sponsorship Details Sponsorship not Offered for this Role
Salary Min1
Salary Max
Salary CurrencyUSD
Salary Period
Source URLhttps://careers-berkley.icims.com/jobs/13782/databricks-data-engineer/job
Apply URLhttps://careers-berkley.icims.com/jobs/13782/databricks-data-engineer/job
First Seen At2026-05-31 18:35:52Z
Last Seen At2026-06-06 19:18:17Z
Last Checked At2026-06-06 19:18:17Z
Last Changed At2026-06-01 13:36:56Z
Inactive At
Source Posted At2026-03-25 04:00:00Z
Source Updated At2026-04-14 18:41:12Z
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=icims/board=careers-berkley.icims.com/date=2026-06-06/2026-06-06T19-18-01-336Z-6e84da442d0714cb57ad5f35c4d425b9a9c615ca8b37605eafc5ae1170347bea.json
Event Fields
{
  "content_hash": "934bf3c832fa2a5919ba43472f3a834453c12539f9b310f8529aaec45aae98e4",
  "source_hash": "b4f6d96b95bb45b6332dc12f9dce5b4ee3e27e1396833824d63842adb1ec9f50",
  "last_changed_at": "2026-06-01T13:36:56.009Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Manassas, VA, US",
    "city": "Manassas",
    "region": "VA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": 1,
  "inferred_at": "2026-06-06T19:18:17.790Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Manassas, VA, US",
      "city": "Manassas",
      "region": "VA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": "USD"
}
Extensions
{}
Native Structured
{
  "json_ld": {
    "url": "https://careers-berkley.icims.com/jobs/13782/databricks-data-engineer/job",
    "@type": "JobPosting",
    "title": "Databricks Data Engineer",
    "@context": "http://schema.org",
    "industry": "Financial ServicesInsurance",
    "datePosted": "2026-03-25T04:00:00.000Z",
    "description": "<h2>Company Details</h2>\n<p></p>\n<p> </p>\n<p>We started in early 2019 as a small group of technologists with a passion for making insurance better. Today we are working with a team of industry experts who run five different insurance brands and collectively control $1 billion in annual premiums.</p>\n<p> </p>\n<p>We believe in an idea and execution meritocracy. In other words, a place where the best ideas win and the people who deliver the most value get the most opportunities.</p>\n<p> </p>\n<p>As we grow our team, we are looking for inquisitive, entrepreneurial people who are excited to reimagine the insurance industry.</p>\n<p> </p>\n<p>Insurance is too complex. Help us make it better.</p>\n<h2>Responsibilities</h2>\n<p> </p>\n<em><strong>This position requires on-site work Monday–Thursday at either our Manassas, VA or Chesterfield, MO location. </strong></em>\n<p> </p>\n<p> </p>\n<p>The<strong> Databricks Data Engineer </strong>will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision advantage at scale.  This role will work with cutting-edge tools like Databricks, Delta Lake, PySpark, and AI/BI genie to transform raw data into actionable insights.  As a hands-on Databricks Data Engineer with deep expertise in Azure Databricks and MLOps, this role will have the opportunity to migrate and translate legacy SSIS ETL logic into scalable, cloud-native data pipelines in Databricks. This role will partner with data engineers, data scientists, and product manager to design features, train/evaluate models, and deploy them to production using MLflow, Databricks and Workflows—with rigorous observability, governance (Unity Catalog), and CI/CD automation.</p>\n<p> </p>\n<p><strong>Data Pipeline Engineering</strong></p>\n<p> </p>\n<ul>\n <li>Design, build, and maintain high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark.</li>\n <li>Convert and modernize existing SSIS package logic into cloud-native Databricks pipelines using PySpark notebooks, Delta Live Tables (DLT), and Databricks Workflows.</li>\n <li>Implement reliable batch and streaming pipelines with robust data quality and validation frameworks.</li>\n <li>Optimize pipeline performance using Photon, efficient file formats, partitioning, Z-ordering, and caching strategies.</li>\n</ul>\n<p> </p>\n<p><strong>Lakehouse Platform Development</strong></p>\n<p> </p>\n<ul>\n <li>Develop and manage datasets within Delta Lake, ensuring ACID reliability, schema evolution, versioning, and time travel.</li>\n <li>Architect feature-rich data layers including:\n  <ul>\n   <li>Bronze (raw ingestion)</li>\n   <li>Silver (validated, conformed)</li>\n   <li>Gold (analytics-ready and ML-ready)</li>\n  </ul></li>\n <li>Implement data governance using Unity Catalog for fine-grained access control, lineage, auditability, and metadata management.</li>\n</ul>\n<p> </p>\n<p> </p>\n<p><strong>MLOps & ML-Enabled Data Pipelines</strong></p>\n<p> </p>\n<ul>\n <li>Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines.</li>\n <li>Deploy and operationalize models using MLflow, Databricks Model Registry, and Databricks Workflows.</li>\n <li>Use Databricks built-in AI SQL functions such as ai_query, ai_forecast, ai_analyze_sentiment to generate actionable insight from large amount of unstructured or structured raw data</li>\n <li>Implement monitoring for:\n  <ul>\n   <li>Pipeline failures</li>\n   <li>Data/feature drift</li>\n   <li>Model performance degradation</li>\n   <li>Operational SLAs/SLIs/SLOs</li>\n  </ul></li>\n <li>Build automated CI/CD workflows using GitHub Actions or Azure DevOps for notebook deployment, pipeline testing, and environment promotion.</li>\n</ul>\n<p> </p>\n<p><strong>Data Platform, Data Security & Data Governance</strong></p>\n<p> </p>\n<ul>\n <li>Collaborate with data engineers to design reliable data products on <strong>Delta Lake</strong>; leverage <strong>Delta Live Tables (DLT)</strong> for declarative pipelines when applicable.</li>\n <li>Enforce <strong>Unity Catalog</strong> for lineage, permissions, and audit; manage secrets, tokens, and keys securely (e.g., <strong>Databricks secrets</strong>, <strong>Key Vault/Secrets Manager</strong>).</li>\n</ul>\n<p> </p>\n<p><strong>Collaboration & Leadership</strong></p>\n<p> </p>\n<ul>\n <li>Work closely with cross-functional teams: data engineering, data scientist, product manager, and business stakeholders.</li>\n <li>Serve as a Databricks SME—championing best practices, code standards, governance, and reusable frameworks.</li>\n <li>Document architecture, workflows, data models, runbooks, and operational procedures.</li>\n</ul>\n<p> </p>\n<h2>Qualifications</h2>\n<ul>\n <li>Minimum of 3 years of experience in Databricks, PySpark notebooks, Python, DevOps, software development, and data engineering.</li>\n <li>Certified Databricks Data Engineer Associate or Professional is a plus.</li>\n</ul>\n<p><strong> </strong></p>\n<p><strong>Skills & Competencies</strong></p>\n<ul>\n <li>Proficient in designing, building, deploying, and maintaining high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark Notebook.</li>\n <li>Proficient in building, deploying, and operating production ML models such as supervised, unsupervised, and anomaly detection, including techniques for imbalanced datasets</li>\n <li>Proficient with ML engineering and MLOps, including model versioning, CI/CD for ML, monitoring, drift detection, and automated retraining</li>\n</ul>\n<ul>\n <li>Proficiency in Python including Pandas and PySpark Dataframes</li>\n <li>Expert level of SQL skills including Stored Procedure, experience with SSIS, SSRS, Power BI is a plus.</li>\n</ul>\n<ul>\n <li>Proficient with cloud data engineering platforms, such as Azure, Databricks, Spark, or SQL, and batch and streaming pipelines</li>\n <li>Familiar with Databricks AI Built-In Functions such as AI_Query, AI_Gen, AI_Classify, AI_Forecast, AI_Analyze_Sentiment, able to use them to extract actionable insights from large amount of unstructured or structured raw data</li>\n <li>Experience with Python and ML frameworks, such as PyTorch or TensorFlow</li>\n <li>Experience improving data quality, lineage, and observability in enterprise data environments and operationalizing rules and model-driven scoring for prioritization, routing, or case selection</li>\n <li>Experience with predictive analytics, machine learning and artificial intelligence desired.</li>\n</ul>\n<p><strong>Education</strong></p>\n<ul>\n <li>A Bachelor’s degree in Computer Science, Management Information Systems, Engineering, Math, Physics, or a related quantitative field is required (4-year degree). Master’s degree preferred</li>\n <li>Experience in the commercial insurance industry is a plus.</li>\n</ul>\n<p><strong> </strong></p>\n<h2>Additional Company Details</h2>The Company is an equal employment opportunity employer. We do not accept any unsolicited resumes from external recruiting firms. The company offers a competitive compensation plan and robust benefits package for full time regular employees. Base salary & Benefits include Health, dental, vision, life, disability, wellness, paid time off, 401(k) and profit-sharing plans. The actual salary for this position will be determined by a number of factors, including the scope, complexity and location of the role; the skills, education, training, credentials and experience of the candidate; and other conditions of employment.\n<h2>Additional Requirements</h2>• Ability to travel locally and nationally up to 5% of the time\n<h2>Sponsorship Details</h2>Sponsorship not Offered for this Role",
    "directApply": true,
    "jobLocation": [
      {
        "@type": "Place",
        "address": {
          "@type": "PostalAddress",
          "postalCode": "20110",
          "addressRegion": "VA",
          "streetAddress": "9301 Innovation Drive",
          "addressCountry": "US",
          "addressLocality": "Manassas",
          "postOfficeBoxNumber": "UNAVAILABLE"
        }
      }
    ],
    "validThrough": "2027-03-25T04:00:00.000Z",
    "employmentType": "OTHER",
    "responsibilities": " \r\nThis position requires on-site work Monday–Thursday at either our Manassas, VA or Chesterfield, MO location. \r\n \r\n \r\nThe Databricks Data Engineer will help design, build, deploy, and maintain scalable and production grade data pipelines in modern cloud environments, enabling analytics, AI, ML, and decision advantage at scale.  This role will work with cutting-edge tools like Databricks, Delta Lake, PySpark, and AI/BI genie to transform raw data into actionable insights.  As a hands-on Databricks Data Engineer with deep expertise in Azure Databricks and MLOps, this role will have the opportunity to migrate and translate legacy SSIS ETL logic into scalable, cloud-native data pipelines in Databricks. This role will partner with data engineers, data scientists, and product manager to design features, train/evaluate models, and deploy them to production using MLflow, Databricks and Workflows—with rigorous observability, governance (Unity Catalog), and CI/CD automation.\r\n \r\nData Pipeline Engineering\r\n \r\n- Design, build, and maintain high-performance, scalable ETL/ELT pipelines using Azure Databricks, Delta Lake, and PySpark.\r\n- Convert and modernize existing SSIS package logic into cloud-native Databricks pipelines using PySpark notebooks, Delta Live Tables (DLT), and Databricks Workflows.\r\n- Implement reliable batch and streaming pipelines with robust data quality and validation frameworks.\r\n- Optimize pipeline performance using Photon, efficient file formats, partitioning, Z-ordering, and caching strategies.\r\n \r\nLakehouse Platform Development\r\n \r\n- Develop and manage datasets within Delta Lake, ensuring ACID reliability, schema evolution, versioning, and time travel.\r\n- Architect feature-rich data layers including:\r\n- Bronze (raw ingestion)\r\n- Silver (validated, conformed)\r\n- Gold (analytics-ready and ML-ready)\r\n\r\n- Implement data governance using Unity Catalog for fine-grained access control, lineage, auditability, and metadata management.\r\n \r\n \r\nMLOps & ML-Enabled Data Pipelines\r\n \r\n- Partner with data scientists and data engineers to create feature pipelines, model training pipelines, and production scoring pipelines.\r\n- Deploy and operationalize models using MLflow, Databricks Model Registry, and Databricks Workflows.\r\n- Use Databricks built-in AI SQL functions such as ai_query, ai_forecast, ai_analyze_sentiment to generate actionable insight from large amount of unstructured or structured raw data\r\n- Implement monitoring for:\r\n- Pipeline failures\r\n- Data/feature drift\r\n- Model performance degradation\r\n- Operational SLAs/SLIs/SLOs\r\n\r\n- Build automated CI/CD workflows using GitHub Actions or Azure DevOps for notebook deployment, pipeline testing, and environment promotion.\r\n \r\nData Platform, Data Security & Data Governance\r\n \r\n- Collaborate with data engineers to design reliable data products on Delta Lake; leverage Delta Live Tables (DLT) for declarative pipelines when applicable.\r\n- Enforce Unity Catalog for lineage, permissions, and audit; manage secrets, tokens, and keys securely (e.g., Databricks secrets, Key Vault/Secrets Manager).\r\n \r\nCollaboration & Leadership\r\n \r\n- Work closely with cross-functional teams: data engineering, data scientist, product manager, and business stakeholders.\r\n- Serve as a Databricks SME—championing best practices, code standards, governance, and reusable frameworks.\r\n- Document architecture, workflows, data models, runbooks, and operational procedures.\r\n ",
    "hiringOrganization": {
      "name": "Berkley",
      "@type": "Organization",
      "sameAs": "UNAVAILABLE"
    },
    "occupationalCategory": "Information Technology",
    "experienceRequirements": "Mid-Senior Level"
  },
  "detail_meta": {
    "url": "https://careers-berkley.icims.com/jobs/13782/databricks-data-engineer/job?in_iframe=1",
    "http_status": 200,
    "content_type": "text/html;charset=UTF-8",
    "response_bytes": 51002,
    "compact_response_bytes": 12679,
    "original_response_bytes": 51002
  },
  "sitemap_job": {
    "id": "13782",
    "url": "https://careers-berkley.icims.com/jobs/13782/databricks-data-engineer/job",
    "slug": "databricks-data-engineer",
    "lastmod": "2026-04-14T14:41:12-04:00"
  },
  "detail_errors": []
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/9af1490f7b4ffdc6f981a0737723420754131ee7?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/cf2e78de-6d29-45b2-b00e-598433eb6813JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/127a694f-8b17-4da1-aadb-c31f41496ac9JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/9af1490f7b4ffdc6f981a0737723420754131ee7/eventsJSON