Data Engineer
Virtusa Ex En · US-NY-New York · Active · $933,538 / year · Oracle Taleo Enterprise
Job facts
| Field | Value |
|---|---|
| Company | Virtusa Ex En |
| Title | Data Engineer |
| Normalized title | - |
| Department / team | Full-time |
| Location | New York, NY, United States |
| Work model | - |
| Employment type | Full Time |
| Salary | $933,538 / year |
| Status | active |
| ATS provider | Oracle Taleo Enterprise |
| Posted / first seen | — / 2026-06-05 |
| Changed / last seen | 2026-06-05 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Virtusa Ex En. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Oracle Taleo Enterprise. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in New York. | Open |
| Department jobs | Active postings in Full-time. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Field | Value |
|---|---|
| Company | Virtusa Ex En |
| Source | 9a0ed1cb-24de-45ee-a170-2dd3f0d8fc1f |
| ATS provider | Oracle Taleo Enterprise |
Description
Role Summary
The Data Engineer will design and implement scalable, distributed data pipelines and enterprise data lakes, leveraging Spark, Databricks, Python, and SQL. The role focuses on building high-performance ETL/ELT pipelines, metadata-driven architectures, and governed analytical data stores supporting advanced analytics and machine learning workloads in cloud environments.
Key Responsibilities
Pipeline Design & Optimization: Design high-performance ETL/ELT pipelines using PySpark and Spark SQL that translate business requirements into optimized data pipelines, with a demonstrated ability to reduce processing latency by up to 50%.
Cloud Data Architecture: Design and implement data models for the Medallion architecture (Bronze, Silver, Gold) using Delta Lake, enabling scalable and reusable data processing.
Data Ingestion & Orchestration: Orchestrate data pipelines using Azure Data Factory (ADF) to reliably ingest, transform, and load enterprise datasets. Implement data ingestion pipelines, including those connecting on-premises HDFS with Azure Data Factory and Databricks, to create curated Gold-layer datasets supporting Microsoft Fabric analytics.
Data Governance & Security: Implement centralized data governance using Unity Catalog to manage catalogs, schemas, role-based access controls (RBAC), and fine-grained permissions.
Quality Assurance & Cost Management: Build scenario-based test frameworks in Databricks using PySpark for data validation. Optimize storage costs (e.g., a 25% reduction in Azure Storage) by managing the retained history and versions of Delta tables.
Operational Monitoring: Build an automated email reporting framework that sends pipeline failure alerts, reducing manual support effort by 40%.
Full job record
| Field | Value |
|---|---|
| Job ID | 052e58b903173e89a9e0022117a4609facfe0f58 |
| Org ID | bac2a245-aafb-4707-a720-29dd7b4a1f45 |
| Source ID | 9a0ed1cb-24de-45ee-a170-2dd3f0d8fc1f |
| Board ID | 9a0ed1cb-24de-45ee-a170-2dd3f0d8fc1f |
| Provider | oracle_taleo |
| Provider Job Key | 933538 |
| Title | Data Engineer |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | US-NY-New York |
| Department | Full-time |
| Team | — |
| Employment Type | full_time |
| Workplace Type | — |
| Remote Policy | — |
| Country | United States |
| Region | NY |
| City | New York |
| Salary Raw | $933538 - $Submission for the position: Data Engineer - (Job Number: CREQ259345) false |
| Salary Min | 933,538 |
| Salary Max | — |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://virtusa.taleo.net/careersection/ex/jobdetail.ftl?job=933538&lang=en |
| Apply URL | https://virtusa.taleo.net/careersection/ex/jobdetail.ftl?job=933538&lang=en |
| First Seen At | 2026-06-05 01:53:12Z |
| Last Seen At | 2026-06-06 13:49:40Z |
| Last Checked At | 2026-06-06 13:49:40Z |
| Last Changed At | 2026-06-05 01:53:12Z |
| Inactive At | — |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=oracle_taleo/board=virtusa|ex|en/date=2026-06-06/2026-06-06T13-49-28-189Z-c6e76519748991600c536284d050ee5172ad3948734ba04367ab0b8209e016a2.json |
Event Fields
{
"content_hash": "27788ba05f89d20d0c932f6d6eeee4d749c1605228c6d82b359fe843887672d4",
"source_hash": "3d39663eca8b592704689e87ca2765f28a0a8a33780878081d6fb4caffdceed3",
"last_changed_at": "2026-06-05T01:53:12.618Z",
"active_status": "active"
}
Parsed Structured
{
"language": "en",
"location": {
"raw": "US-NY-New York",
"city": "New York",
"region": "NY",
"country": "United States",
"is_remote": false,
"confidence": 0.95
},
"salary_max": null,
"salary_min": 933538,
"inferred_at": "2026-06-06T13:49:39.989Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "US-NY-New York",
"city": "New York",
"region": "NY",
"country": "United States",
"is_remote": false,
"confidence": 0.95
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": "year",
"workplace_type": null,
"salary_currency": "USD"
}
Extensions
{}
Native Structured
{
"list_job": {
"raw": {
"draft": false,
"jobId": "933538",
"column": [
"Data Engineer",
"[\"US-NY-New York\"]",
"04/06/2026"
],
"hotJob": false,
"contestNo": "CREQ259345",
"toReApply": false,
"linkedColumn": 0,
"addedToJobCart": false,
"alreadyAppliedOn": false,
"locationsColumns": [
1
]
},
"jobId": "933538",
"title": "Data Engineer",
"legacy": false,
"category": null,
"schedule": null,
"contestNo": "CREQ259345",
"detailUrl": "https://virtusa.taleo.net/careersection/ex/jobdetail.ftl?job=933538&lang=en",
"locations": [
"US-NY-New York"
],
"postingDate": "04/06/2026"
},
"detail_meta": {
"url": "https://virtusa.taleo.net/careersection/ex/jobdetail.ftl?job=933538&lang=en",
"http_status": 200,
"content_type": "text/html;charset=UTF-8",
"response_bytes": 54864
},
"detail_errors": [],
"detail_values_count": 48
}
Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/052e58b903173e89a9e0022117a4609facfe0f58?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/bac2a245-aafb-4707-a720-29dd7b4a1f45
GET https://api.bluedoor.sh/job-postings/v1/sources/9a0ed1cb-24de-45ee-a170-2dd3f0d8fc1f
GET https://api.bluedoor.sh/job-postings/v1/jobs/052e58b903173e89a9e0022117a4609facfe0f58/events
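The four requests above share one base URL and differ only in the record ID, so they can be assembled from the IDs in the full job record. The following is a minimal sketch of that assembly. The helper name `build_record_urls` is hypothetical, and the page does not state whether the API needs an API key or other headers, so the sketch only builds the URLs and leaves the actual HTTP calls to whichever client you use.

```python
from urllib.parse import urlencode

# Base URL copied from the requests listed on this page.
BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def build_record_urls(job_id: str, org_id: str, source_id: str) -> dict:
    """Return the four GET URLs for one posting (hypothetical helper).

    Authentication is not described on this page, so none is assumed here.
    """
    job_url = f"{BASE_URL}/jobs/{job_id}"
    return {
        # The job record itself, with the description included.
        "job": f"{job_url}?{urlencode({'include': 'description'})}",
        # The organization that owns the posting.
        "org": f"{BASE_URL}/orgs/{org_id}",
        # The source (board) the posting was collected from.
        "source": f"{BASE_URL}/sources/{source_id}",
        # Lifecycle events for the posting.
        "events": f"{job_url}/events",
    }


# IDs taken from the "Full job record" table on this page.
urls = build_record_urls(
    job_id="052e58b903173e89a9e0022117a4609facfe0f58",
    org_id="bac2a245-aafb-4707-a720-29dd7b4a1f45",
    source_id="9a0ed1cb-24de-45ee-a170-2dd3f0d8fc1f",
)

for url in urls.values():
    print(f"GET {url}")
```

Keeping URL construction separate from the network call makes the mapping from record IDs to endpoints easy to check before any request is sent.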