Home › Companies › Lts › RAG and Evaluation Engineer
RAG and Evaluation Engineer
Lts · United States - Remote · Remote · Active · Greenhouse
Job facts
| Field | Value |
|---|---|
| Company | Lts |
| Title | RAG and Evaluation Engineer |
| Normalized title | - |
| Department / team | Product & Software Development |
| Location | United States |
| Work model | Remote / Remote |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Greenhouse |
| Posted / first seen | 2026-06-12 / 2026-06-13 |
| Changed / last seen | 2026-06-16 / 2026-06-19 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Lts. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Greenhouse. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| Department jobs | Active postings in Product & Software Development. | Open |
| Work model jobs | Active Remote postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Lts |
| Source | 24631dfb-c289-495a-ac71-fba5cc3c5238 |
| ATS provider | Greenhouse |
Description
LTS is seeking a RAG & Evaluation Engineer to join a small, senior engineering team applying frontier AI to one of the most consequential legacy systems still running in production today.
The mission: build agents that read, translate, and modernize a decades-old codebase that millions of people quietly depend on. The work has executive backing, real users, and a customer who knows exactly what they’re buying. Specifics shared once we’re talking.
The team is small by design. Every seat carries unusual leverage, and we hire people who are already deep in this work. We use AI tooling natively — agents in parallel, model as collaborator, no exceptions.
What You’ll Do:
The RAG & Evaluation Engineer owns the knowledge surface and the eval harness. Ingestion pipelines for source code, structured metadata, technical documentation, patches, and additional corpora the customer provides. Retrieval quality across chunking, embeddings, hybrid retrieval, reranking, freshness. Benchmarks for translation accuracy, dependency-map correctness, and overall agent quality. The feedback loop from production usage back into evals and retrieval lives here.
Own the knowledge surface — ingestion pipelines for source code, structured metadata, technical documentation, patches, and additional corpora the customer provides.
Own retrieval quality — chunking, embeddings, hybrid retrieval, reranking, and freshness.
Own the eval harness — benchmarks for translation accuracy, dependency-map correctness, and overall agent quality.
Run A/B testing and regression detection across prompts, retrieval, and model changes.
Operate the feedback loop from production usage back into evals and retrieval.
Define what “good” means for the platform when no one else has a clear view, so the team can tell whether the agent is actually improving.
Pair with the Agent Engineers on the prompt-and-eval iteration cycle.
What We’re Looking For:
Bachelor’s degree in Computer Science, Engineering, Information Science, or a related field, plus 4 years of professional software engineering experience; equivalent experience may substitute for the degree requirement.
Has shipped a production RAG system with quality the candidate can describe in numbers (rigor matters more than scale).
Ability to work in a fast-paced, collaborative environment.
Production experience with retrieval pipelines — ingestion, chunking, embedding, hybrid retrieval, reranking.
Strong applied evaluation skills — benchmark design, regression detection, LLM-as-judge patterns.
Knows when BM25 beats embeddings and when neither is enough.
Measures everything they ship; opinions about chunking are backed by benchmarks.
Patient with detail; comfortable defining metrics before the team has agreed on them.
Heavy native use of AI tooling: agents in parallel, model as collaborator.
Strong TypeScript or Python.
Demonstrated experience in a remote work environment.
Nice to Have:
Code-as-corpus retrieval (search over source code rather than prose).
Applied IR or search-engine background.
Synthetic data generation and LLM-as-judge patterns.
Open-source contributions to retrieval, eval, or RAG tooling.
Experience integrating retrieval feedback loops with production usage.
Healthcare IT or legacy modernization domain experience.
Public technical writing or conference talks on retrieval or evaluation.
What’s in it for you?
The opportunity to support high visibility federal missions in IT and healthcare
A culture that values innovation, growth, collaboration, and quality
Access to cutting-edge tools and technologies
Comprehensive benefits for you and your family
A career path that rewards ambition and performance
If you’re ready to push boundaries, sharpen your skills, and join a team that is passionate about building what’s next, we’d love to meet you. Apply today and let’s build a future together!
LTS shares salary ranges to promote transparency. Compensation ranges are provided for informational purposes, and final compensation may vary based on experience, skills, location, and role requirements.
LTS is committed to offering eligible employees comprehensive benefits that will provide them with options intended to meet their needs and the needs of their family.
Full job record
| Job ID | d056dbbda4cd697fa45cae66bb038166aec33157 |
| Org ID | 81eb1409-20a7-42da-88a5-4509400827f3 |
| Source ID | 24631dfb-c289-495a-ac71-fba5cc3c5238 |
| Board ID | 24631dfb-c289-495a-ac71-fba5cc3c5238 |
| Provider | greenhouse |
| Provider Job Key | 4284737009 |
| Title | RAG and Evaluation Engineer |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | United States - Remote |
| Department | Product & Software Development |
| Team | — |
| Employment Type | Full-time |
| Workplace Type | remote |
| Remote Policy | remote |
| Country | United States |
| Region | — |
| City | — |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://job-boards.greenhouse.io/lts/jobs/4284737009 |
| Apply URL | https://job-boards.greenhouse.io/lts/jobs/4284737009 |
| First Seen At | 2026-06-13 07:32:31Z |
| Last Seen At | 2026-06-19 07:32:48Z |
| Last Checked At | 2026-06-19 07:32:48Z |
| Last Changed At | 2026-06-16 07:32:11Z |
| Inactive At | — |
| Source Posted At | 2026-06-12 14:31:44Z |
| Source Updated At | 2026-06-15 15:37:12Z |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=lts/date=2026-06-19/2026-06-19T07-32-48-067Z-7746fea2b194e3366f6b352b0d579e81f9a214eb1a003e7d0ed2fbd7774753d8.json |
Event Fields
{
"content_hash": "54cead16676516b6aacbb72322e9b7df98ba6a4000721e829a59dd8e019796aa",
"source_hash": "80453ea014b6fb2afc4932f7a1ca66dca4d3fe13e97295589cdbe948f1f72b81",
"last_changed_at": "2026-06-16T07:32:11.464Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "United States - Remote",
"city": null,
"region": null,
"country": "United States",
"is_remote": true,
"confidence": 0.95
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-19T07:32:48.157Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "United States - Remote",
"city": null,
"region": null,
"country": "United States",
"is_remote": true,
"confidence": 0.95
},
"countries": [
"United States"
]
},
"remote_policy": "remote",
"salary_period": null,
"workplace_type": "remote",
"salary_currency": null
}Extensions
{}Native Structured
{
"title": "RAG and Evaluation Engineer",
"offices": [
{
"id": 4014571009,
"name": "United States - Remote",
"location": null,
"child_ids": [],
"parent_id": null
}
],
"language": "en",
"location": {
"name": "United States - Remote"
},
"metadata": [
{
"id": 4296756009,
"name": "Employment Type",
"value": "Full-time",
"value_type": "single_select"
}
],
"updated_at": "2026-06-15T11:37:12-04:00",
"departments": [
{
"id": 4015429009,
"name": "Product & Software Development",
"child_ids": [],
"parent_id": null
}
],
"company_name": "LTS",
"requisition_id": 4167430009,
"first_published": "2026-06-12T10:31:44-04:00",
"application_deadline": null
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/d056dbbda4cd697fa45cae66bb038166aec33157?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/81eb1409-20a7-42da-88a5-4509400827f3JSONGET https://api.bluedoor.sh/job-postings/v1/sources/24631dfb-c289-495a-ac71-fba5cc3c5238JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/d056dbbda4cd697fa45cae66bb038166aec33157/eventsJSON