Home › Companies › Nuvitek › Data Engineer
Data Engineer
Nuvitek · Remote, Arlington, Virginia · Remote · Deleted · $115,000–$125,000 / year · Pinpoint
Job facts
| Field | Value |
|---|---|
| Company | Nuvitek |
| Title | Data Engineer |
| Normalized title | - |
| Department / team | Data |
| Location | Arlington, VA, United States |
| Work model | Remote / Remote |
| Employment type | Full Time |
| Salary | $115,000–$125,000 / year |
| Status | deleted |
| ATS provider | Pinpoint |
| Posted / first seen | — / 2026-06-03 |
| Changed / last seen | 2026-06-05 / 2026-06-03 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Nuvitek. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Pinpoint. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Arlington. | Open |
| Department jobs | Active postings in Data. | Open |
| Work model jobs | Active Remote postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Nuvitek |
| Source | aaba8499-2ad9-4d13-98a4-d511eaba5c33 |
| ATS provider | Pinpoint |
Description
At Nüvitek, customer success is our Ethos; together, we drive transformational outcomes. We only succeed when our customers succeed. We partner with our customers to achieve business objectives by using our proven customer-centric, value-driven business practices and service delivery methodologies.
Nüvitek is seeking a highly skilled Data Engineer to support the design, development, and optimization of advanced AI and data processing solutions. This role will focus on building scalable data pipelines that power large language model (LLM) applications, including retrieval systems, document ingestion workflows, and intelligent search capabilities.
The ideal candidate has hands-on experience with retrieval-augmented generation (RAG), contextual augmentation generation (CAG), OCR processing, vector databases, and modern AI data architectures. This role requires strong technical expertise, problem-solving skills, and the ability to work collaboratively within agile pod-based teams.
In assuming this position, you will be a critical contributor to meeting Nuvitek's mission: To deliver innovative, cost-effective solutions and services that enable our customers to rapidly adapt to dynamic environments.
Design, develop, and maintain scalable RAG/CAG pipelines for AI-powered applications Build and optimize document ingestion workflows for structured and unstructured data sources Manage and maintain vector stores to support semantic search and retrieval capabilities Develop OCR processing pipelines for historical and modern document collections spanning 1781–2025 Optimize retrieval performance, relevance tuning, and ranking strategies for LLM-based systems Build reliable data pipelines that support integrations with large language models and AI services Collaborate with engineers, UX teams, product owners, and stakeholders to deliver scalable AI solutions Ensure data quality, integrity, security, and performance across ingestion and retrieval systems Implement monitoring, logging, and troubleshooting for AI and data processing workflows Contribute to architecture decisions, technical documentation, and engineering best practices Participate in agile pod-based development teams and continuous improvement initiatives
4+ years of experience in data engineering, data platform development, or AI/ML infrastructure Strong experience building RAG and/or CAG pipelines Hands-on experience with vector databases and semantic retrieval systems Experience developing document ingestion and OCR processing workflows Strong understanding of LLM integrations and AI data pipeline architectures Experience working with structured, semi-structured, and unstructured datasets Proficiency with Python and modern data engineering frameworks Familiarity with APIs, ETL/ELT pipelines, and distributed processing systems Experience building and operating data pipelines in secure federal cloud environments, including FedRAMP Moderate and Zero Trust architectures, with appropriate handling of sensitive data and Controlled Unclassified Information (CUI) Ability to obtain and maintain a federal Public Trust (or higher) clearance Strong analytical, troubleshooting, and performance optimization skills Ability to work effectively in agile or pod-based delivery environments Excellent communication and collaboration skills It Would Be Great If You Also Had Experience working with historical archives or large-scale document digitization efforts Familiarity with cloud-native data platforms and AI infrastructure Experience with search relevance tuning and ranking optimization Knowledge of embedding models, chunking strategies, and retrieval optimization techniques Experience with containerization and orchestration technologies such as Docker and Kubernetes Familiarity with accessibility, governance, and secure data handling practices Passion for building scalable AI-driven solutions that improve user experiences and operational efficiency
Nuvitek is proud to offer a comprehensive benefits package: Medical Insurance Dental Insurance Vision Insurance Disability and Life Insurance Parental Leave 401K Paid Time Off Equal Opportunity Employer Statement
Nuvitek is an equal-opportunity employer as to all protected groups, including protected veterans and individuals with disabilities.
Full job record
| Job ID | 6faa134b5d4e7d295c4ade1224fd265dfda49d0f |
| Org ID | eb7e522d-20ec-4562-9aa7-8a18da4594e3 |
| Source ID | aaba8499-2ad9-4d13-98a4-d511eaba5c33 |
| Board ID | aaba8499-2ad9-4d13-98a4-d511eaba5c33 |
| Provider | pinpoint |
| Provider Job Key | 519905 |
| Title | Data Engineer |
| Normalized Title | — |
| Status | deleted |
| Active | no |
| Location Text | Remote, Arlington, Virginia |
| Department | Data |
| Team | — |
| Employment Type | full_time |
| Workplace Type | remote |
| Remote Policy | remote |
| Country | United States |
| Region | VA |
| City | Arlington |
| Salary Raw | $115,000 - $125,000 / year |
| Salary Min | 115,000 |
| Salary Max | 125,000 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://nuvitek.pinpointhq.com/en/postings/13f16cc1-27d5-411b-aa38-de375d7c451a |
| Apply URL | https://nuvitek.pinpointhq.com/en/postings/13f16cc1-27d5-411b-aa38-de375d7c451a |
| First Seen At | 2026-06-03 07:45:30Z |
| Last Seen At | 2026-06-03 07:45:30Z |
| Last Checked At | 2026-06-05 01:29:52Z |
| Last Changed At | 2026-06-05 01:29:52Z |
| Inactive At | 2026-06-05 01:29:52Z |
| Source Posted At | — |
| Source Updated At | — |
| Raw Payload Uri | s3://bluework-jobs-prod-raw-590183727216/raw/provider=pinpoint/board=nuvitek/date=2026-06-03/2026-06-03T07-45-29-326Z-dcf40e18a2be9ee4860dab0ddca879c1ac7fa07f3b2eb3f4fc258415d98f6607.json |
Event Fields
{
"content_hash": "335e8571da447f1f0bacb1ff8dbda6a6b1eb0a63376a30c95129576293bfe747",
"source_hash": "7ae974cfb2aa7d3522a6261ff377da43826e8a63026db62234be8846c7e7cd27",
"last_changed_at": "2026-06-05T01:29:52.247Z",
"active_status": "deleted"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "Remote, Arlington, Virginia",
"city": "Arlington",
"region": "VA",
"country": "United States",
"is_remote": true,
"confidence": 0.85
},
"salary_max": 125000,
"salary_min": 115000,
"inferred_at": "2026-06-03T07:45:30.149Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Remote, Arlington, Virginia",
"city": "Arlington",
"region": "VA",
"country": "United States",
"is_remote": true,
"confidence": 0.85
},
"countries": [
"United States"
]
},
"remote_policy": "remote",
"salary_period": "year",
"workplace_type": "remote",
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"id": "519905",
"job": {
"id": "525858",
"division": null,
"department": {
"id": "39257",
"name": "Data"
},
"requisition_id": "PIN0116",
"structure_custom_group_one": null
},
"url": "https://nuvitek.pinpointhq.com/en/postings/13f16cc1-27d5-411b-aa38-de375d7c451a",
"path": "/en/postings/13f16cc1-27d5-411b-aa38-de375d7c451a",
"title": "Data Engineer",
"benefits": "<div><!--block--><strong>Nuvitek is proud to offer a comprehensive benefits package:</strong></div><ul><li><!--block-->Medical Insurance</li><li><!--block-->Dental Insurance</li><li><!--block-->Vision Insurance</li><li><!--block-->Disability and Life Insurance</li><li><!--block-->Parental Leave</li><li><!--block-->401K</li><li><!--block-->Paid Time Off</li></ul><div><!--block--><strong>Equal Opportunity Employer Statement<br></strong>Nuvitek is an equal-opportunity employer as to all protected groups, including protected veterans and individuals with disabilities.</div>",
"location": {
"id": "32348",
"city": "Arlington",
"name": "Remote",
"province": "Virginia",
"postal_code": "22209"
},
"deadline_at": null,
"description": "<div><!--block-->At <strong>Nüvitek,</strong> customer success is our Ethos; together, we drive transformational outcomes. We only succeed when our customers succeed. We partner with our customers to achieve business objectives by using our proven customer-centric, value-driven business practices and service delivery methodologies.<br><br>Nüvitek is seeking a highly skilled <strong>Data Engineer</strong> to support the design, development, and optimization of advanced AI and data processing solutions. This role will focus on building scalable data pipelines that power large language model (LLM) applications, including retrieval systems, document ingestion workflows, and intelligent search capabilities.<br><br></div><div><!--block-->The ideal candidate has hands-on experience with retrieval-augmented generation (RAG), contextual augmentation generation (CAG), OCR processing, vector databases, and modern AI data architectures. This role requires strong technical expertise, problem-solving skills, and the ability to work collaboratively within agile pod-based teams.<br><br></div><div><!--block--><br></div><div><!--block--><br><br><em>In assuming this position, you will be a critical contributor to meeting Nuvitek's mission: To deliver innovative, cost-effective solutions and services that enable our customers to rapidly adapt to dynamic environments.</em></div>",
"compensation": "$115,000 - $125,000 / year",
"reporting_to": "",
"workplace_type": "remote",
"benefits_header": "Benefits",
"employment_type": "full_time",
"workplace_type_text": "Fully remote",
"compensation_maximum": 125000,
"compensation_minimum": 115000,
"compensation_visible": true,
"employment_type_text": "Full Time",
"key_responsibilities": "<ul><li><!--block-->Design, develop, and maintain scalable RAG/CAG pipelines for AI-powered applications</li><li><!--block-->Build and optimize document ingestion workflows for structured and unstructured data sources</li><li><!--block-->Manage and maintain vector stores to support semantic search and retrieval capabilities</li><li><!--block-->Develop OCR processing pipelines for historical and modern document collections spanning 1781–2025</li><li><!--block-->Optimize retrieval performance, relevance tuning, and ranking strategies for LLM-based systems</li><li><!--block-->Build reliable data pipelines that support integrations with large language models and AI services</li><li><!--block-->Collaborate with engineers, UX teams, product owners, and stakeholders to deliver scalable AI solutions</li><li><!--block-->Ensure data quality, integrity, security, and performance across ingestion and retrieval systems</li><li><!--block-->Implement monitoring, logging, and troubleshooting for AI and data processing workflows</li><li><!--block-->Contribute to architecture decisions, technical documentation, and engineering best practices</li><li><!--block-->Participate in agile pod-based development teams and continuous improvement initiatives</li></ul><div><!--block--><br></div>",
"compensation_currency": "USD",
"compensation_frequency": "year",
"skills_knowledge_expertise": "<ul><li><!--block-->4+ years of experience in data engineering, data platform development, or AI/ML infrastructure</li><li><!--block-->Strong experience building RAG and/or CAG pipelines</li><li><!--block-->Hands-on experience with vector databases and semantic retrieval systems</li><li><!--block-->Experience developing document ingestion and OCR processing workflows</li><li><!--block-->Strong understanding of LLM integrations and AI data pipeline architectures</li><li><!--block-->Experience working with structured, semi-structured, and unstructured datasets</li><li><!--block-->Proficiency with Python and modern data engineering frameworks</li><li><!--block-->Familiarity with APIs, ETL/ELT pipelines, and distributed processing systems</li><li><!--block-->Experience building and operating data pipelines in secure federal cloud environments, including FedRAMP Moderate and Zero Trust architectures, with appropriate handling of sensitive data and Controlled Unclassified Information (CUI)</li><li><!--block-->Ability to obtain and maintain a federal Public Trust (or higher) clearance</li><li><!--block-->Strong analytical, troubleshooting, and performance optimization skills</li><li><!--block-->Ability to work effectively in agile or pod-based delivery environments</li><li><!--block-->Excellent communication and collaboration skills</li></ul><h2><!--block--><strong>It Would Be Great If You Also Had</strong></h2><ul><li><!--block-->Experience working with historical archives or large-scale document digitization efforts</li><li><!--block-->Familiarity with cloud-native data platforms and AI infrastructure</li><li><!--block-->Experience with search relevance tuning and ranking optimization</li><li><!--block-->Knowledge of embedding models, chunking strategies, and retrieval optimization techniques</li><li><!--block-->Experience with containerization and orchestration technologies such as Docker and Kubernetes</li><li><!--block-->Familiarity with accessibility, governance, and secure data handling practices</li><li><!--block-->Passion for building scalable AI-driven solutions that improve user experiences and operational efficiency<br><br></li></ul><div><!--block--><br></div>",
"key_responsibilities_header": "What You Will Do",
"skills_knowledge_expertise_header": "What You Will Bring"
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/6faa134b5d4e7d295c4ade1224fd265dfda49d0f?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/eb7e522d-20ec-4562-9aa7-8a18da4594e3JSONGET https://api.bluedoor.sh/job-postings/v1/sources/aaba8499-2ad9-4d13-98a4-d511eaba5c33JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/6faa134b5d4e7d295c4ade1224fd265dfda49d0f/eventsJSON