Site Reliability Engineer (Lead)

10pearls · Islamabad · Active · JazzHR / ApplyToJob

Job facts

Field	Value
Company	10pearls
Title	Site Reliability Engineer (Lead)
Normalized title	-
Department / team	-
Location	Islamabad
Work model	-
Employment type	Full Time
Salary	-
Status	active
ATS provider	JazzHR / ApplyToJob
Posted / first seen	2026-04-17 / 2026-05-30
Changed / last seen	2026-05-30 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from 10pearls.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through JazzHR / ApplyToJob.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	10pearls
Source	ecc85604-d4af-4971-b467-d3e9f14798bc
ATS provider	JazzHR / ApplyToJob

Description

Company Overview

10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good and know how to balance the two.

Role

As an SRE Lead, you will be responsible for owning and scaling the organization’s core platform infrastructure, ensuring high availability and reliability of distributed systems. You will manage the Kubernetes-based substrate along with key components like identity, secrets, storage, registry, and gateway systems.

You will drive SLO frameworks, incident response processes, and production reliability through on-call practices, postmortems, and error budget management. Additionally, you will co-own release strategies and lead initiatives around performance, capacity planning, and system resilience.

You will work closely with engineering and platform teams to enforce infrastructure standards, automate operations, and lead a high-performing platform squad in building reliable, scalable systems.

Key Responsibilities

- Substrate operation: own the Kubernetes cluster plus Keycloak (identity), Vault (secrets), MinIO (object storage), Harbor (registry), Kong (gateway), from bootstrap to day-2 operations.
- SLO framework: define, publish, and defend SLOs for every tier-1 service; own error budgets and burn-rate alerting.
- Incident response: build the on-call rotation, paging, runbook library, and post mortem culture; lead incident command during P1/P2 events.
- Release operations: co-own the blue-green / canary release model with L6 Delivery; sign off production-bound releases.
- Air-gap operations: ensure every operational runbook works in a fully offline environment, with no assumption of external dependencies.
- Lead the Platform squad: technically lead 1 Infrastructure Engineer, 1 Observability Engineer, 2 DevOps Engineers; set standards for infra-as-code and automation.

Required Qualifications & Skills

- Bachelor's degree in computer science or related field.
- 5–8 years in SRE or production-engineering roles running distributed systems at scale.
- Deep Kubernetes expertise: operators, RBAC, network policy, storage, upgrades.
- Hands-on with Keycloak / Vault / MinIO / Harbor / Kong or equivalent identity/secrets/storage/registry/gateway stacks.
- Strong Linux fundamentals and at least one systems language (Go, Rust) or shell/Python for tooling.
- Proven SLO/SLI authorship and error-budget-driven decision-making.
- Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, Loki, Tempo).
- Calm, clear communication during incidents; strong post-mortem writing.
- Hands-on with infra-as-code: Helm, Kustomize, Terraform.

Nice to Have

- Prior experience running air-gapped or on-prem platforms for regulated customers
- Cilium/Istio service-mesh operation
- GitOps delivery with ArgoCD or Flux
- FinOps / cost-attribution experience
- Certified Kubernetes Administrator (CKA) or equivalent

Full job record

Job ID	9cb01d099730e020daee286678febc42884756dc
Org ID	e69a6fcc-024f-4d99-ada4-5630f4f934d3
Source ID	ecc85604-d4af-4971-b467-d3e9f14798bc
Board ID	ecc85604-d4af-4971-b467-d3e9f14798bc
Provider	jazzhr
Provider Job Key	lB9FBwpPpT
Title	Site Reliability Engineer (Lead)
Normalized Title	—
Status	active
Active	yes
Location Text	Islamabad
Department	—
Team	—
Employment Type	full_time
Workplace Type	—
Remote Policy	—
Country	Islamabad
Region	—
City	—
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://10pearls.applytojob.com/apply/lB9FBwpPpT/Site-Reliability-Engineer-Lead
Apply URL	https://10pearls.applytojob.com/apply/lB9FBwpPpT/Site-Reliability-Engineer-Lead
First Seen At	2026-05-30 06:11:52Z
Last Seen At	2026-06-06 10:53:47Z
Last Checked At	2026-06-06 10:53:47Z
Last Changed At	2026-05-30 06:11:52Z
Inactive At	—
Source Posted At	2026-04-17 00:00:00Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=jazzhr/board=10pearls/date=2026-06-06/2026-06-06T10-53-46-778Z-dd1be45ea8392f07e55ccd9e692a28f4fb6f163fbfc9b33232e4c22b3d4a5683.json

Event Fields

{
  "content_hash": "8c4539ec98f5ba857b975946e857911b558b3e9e4babeda73c8a2f8388f73a20",
  "source_hash": "658c56794420d4ab93f2309bb8f42ab9bab429c4e9bd5993041061733486efed",
  "last_changed_at": "2026-05-30T06:11:52.568Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Islamabad",
    "city": null,
    "region": null,
    "country": "Islamabad",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T10:53:47.665Z",
  "launch_scope": {
    "reason": "jazzhr_production_catalog",
    "included": true,
    "location": {
      "raw": "Islamabad",
      "city": null,
      "region": null,
      "country": "Islamabad",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Islamabad"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "detail": {
    "url": "https://10pearls.applytojob.com/apply/jobs/details/lB9FBwpPpT?&",
    "heading": "Site Reliability Engineer (Lead)",
    "html_title": "JazzHR &raquo; Job Listings",
    "canonical_url": "https://10pearls.applytojob.com/apply/lB9FBwpPpT/Site-Reliability-Engineer-Lead",
    "description_html": "<p><strong>Company Overview </strong></p><p>10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good and know how to balance the two. </p><p><strong>Role:  </strong></p><p>As an SRE Lead, you will be responsible for owning and scaling the organization’s core platform infrastructure, ensuring high availability and reliability of distributed systems. You will manage the Kubernetes-based substrate along with key components like identity, secrets, storage, registry, and gateway systems. </p><p>You will drive SLO frameworks, incident response processes, and production reliability through on-call practices, postmortems, and error budget management. Additionally, you will co-own release strategies and lead initiatives around performance, capacity planning, and system resilience. 
</p><p>You will work closely with engineering and platform teams to enforce infrastructure standards, automate operations, and lead a high-performing platform squad in building reliable, scalable systems. </p><p><strong>Key Responsibilities </strong></p><ul><li><p>Substrate operation — own the Kubernetes cluster plus Keycloak (identity), Vault (secrets), MinIO (object storage), Harbor (registry), Kong (gateway) — from bootstrap to day-2 operations. </p></li></ul><ul><li><p>SLO framework — define, publish, and defend SLOs for every tier-1 service; own error budgets and burn-rate alerting. </p></li></ul><ul><li><p>Incident response — build the on-call rotation, paging, runbook library, and post mortem culture; lead incident command during P1/P2 events. </p></li></ul><ul><li><p>Release operations — co-own the blue-green / canary release model with L6 Delivery; sign off production-bound releases. </p></li></ul><ul><li><p>Air-gap operations — ensure every operational runbook works in a fully offline environment — no assumption of external dependencies. </p></li></ul><ul><li><p>Lead the Platform squad — technically lead 1 Infrastructure Engineer, 1 Observability Engineer, 2 DevOps Engineers; set standards for infra-as-code and automation </p></li></ul><p><strong>Required Qualifications & Skills </strong></p><ul><li><p>Bachelor's degree in computer science or related field. </p></li></ul><ul><li><p>5–8 years in SRE or production-engineering roles running distributed systems at scale. </p></li></ul><ul><li><p>Deep Kubernetes expertise — operators, RBAC, network policy, storage, upgrades. </p></li></ul><ul><li><p>Hands-on with Keycloak / Vault / MinIO / Harbor / Kong or equivalent identity/secrets/storage/registry/gateway stacks. </p></li></ul><ul><li><p>Strong Linux fundamentals and at least one systems language (Go, Rust) or shell/Python for tooling.  
</p></li></ul><ul><li><p>Proven SLO/SLI authorship and error-budget-driven decision-making  </p></li></ul><ul><li><p>Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, Loki, Tempo).  </p></li></ul><ul><li><p>Calm, clear communication during incidents; strong post-mortem writing. </p></li></ul><ul><li><p>Hands-on with infra-as-code — Helm, Kustomize, Terraform. </p></li></ul><p><strong>Nice to Have  </strong></p><ul><li><p>Prior experience running air-gapped or on-prem platforms for regulated customers  </p></li></ul><ul><li><p>Cilium/Istio service-mesh operation  </p></li></ul><ul><li><p>GitOps delivery with ArgoCD or Flux  </p></li></ul><ul><li><p>FinOps / cost-attribution experience  </p></li></ul><ul><li><p>Certified Kubernetes Administrator (CKA) or equivalent </p></li></ul>",
    "description_text": "Company Overview\n 10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good and know how to balance the two.\n Role:\n As an SRE Lead, you will be responsible for owning and scaling the organization’s core platform infrastructure, ensuring high availability and reliability of distributed systems. You will manage the Kubernetes-based substrate along with key components like identity, secrets, storage, registry, and gateway systems.\n You will drive SLO frameworks, incident response processes, and production reliability through on-call practices, postmortems, and error budget management. 
Additionally, you will co-own release strategies and lead initiatives around performance, capacity planning, and system resilience.\n You will work closely with engineering and platform teams to enforce infrastructure standards, automate operations, and lead a high-performing platform squad in building reliable, scalable systems.\n Key Responsibilities\n Substrate operation — own the Kubernetes cluster plus Keycloak (identity), Vault (secrets), MinIO (object storage), Harbor (registry), Kong (gateway) — from bootstrap to day-2 operations.\n SLO framework — define, publish, and defend SLOs for every tier-1 service; own error budgets and burn-rate alerting.\n Incident response — build the on-call rotation, paging, runbook library, and post mortem culture; lead incident command during P1/P2 events.\n Release operations — co-own the blue-green / canary release model with L6 Delivery; sign off production-bound releases.\n Air-gap operations — ensure every operational runbook works in a fully offline environment — no assumption of external dependencies.\n Lead the Platform squad — technically lead 1 Infrastructure Engineer, 1 Observability Engineer, 2 DevOps Engineers; set standards for infra-as-code and automation\n Required Qualifications & Skills\n Bachelor's degree in computer science or related field.\n 5–8 years in SRE or production-engineering roles running distributed systems at scale.\n Deep Kubernetes expertise — operators, RBAC, network policy, storage, upgrades.\n Hands-on with Keycloak / Vault / MinIO / Harbor / Kong or equivalent identity/secrets/storage/registry/gateway stacks.\n Strong Linux fundamentals and at least one systems language (Go, Rust) or shell/Python for tooling.\n Proven SLO/SLI authorship and error-budget-driven decision-making\n Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, Loki, Tempo).\n Calm, clear communication during incidents; strong post-mortem writing.\n Hands-on with infra-as-code — Helm, Kustomize, 
Terraform.\n Nice to Have\n Prior experience running air-gapped or on-prem platforms for regulated customers\n Cilium/Istio service-mesh operation\n GitOps delivery with ArgoCD or Flux\n FinOps / cost-attribution experience\n Certified Kubernetes Administrator (CKA) or equivalent",
    "jsonld_jobposting": {
      "url": "https://10pearls.applytojob.com/apply/lB9FBwpPpT/Site-Reliability-Engineer-Lead",
      "@type": "JobPosting",
      "title": "Site Reliability Engineer (Lead)",
      "@context": "http://schema.org/",
      "datePosted": "2026-04-17",
      "description": "<p><strong>Company Overview </strong></p><p>10Pearls is an award-winning end-to-end digital innovation company that helps businesses imagine and build the future. We are proud to announce that 10Pearls was named as winner of the Best Tech Work Culture Timmy Award in Washington DC by Tech in Motion, recognized on the Inc. 5000 Fastest-Growing Companies List, and was ranked the #1 Most Diverse Midsize Company in Greater Washington. We partner with businesses to help them transform, scale, and accelerate by adopting digital and exponential technologies. Our work has ranged from creating highly usable, secure digital experiences, mobile and software products, to helping businesses modernize through cloud adoption and development and the digitalization of their business processes. Our clientele is highly diverse, including Global 1000 enterprises, mid-market businesses, and even high-growth start-ups. But those are just facts. What makes us unique is that we have a true heart and soul. We have a strong focus on a double bottom line and actively support and engage with the communities where we live and work to make the world a better place. In a nutshell, we believe in doing well, while doing good and know how to balance the two. </p><p><strong>Role:  </strong></p><p>As an SRE Lead, you will be responsible for owning and scaling the organization’s core platform infrastructure, ensuring high availability and reliability of distributed systems. You will manage the Kubernetes-based substrate along with key components like identity, secrets, storage, registry, and gateway systems. </p><p>You will drive SLO frameworks, incident response processes, and production reliability through on-call practices, postmortems, and error budget management. Additionally, you will co-own release strategies and lead initiatives around performance, capacity planning, and system resilience. 
</p><p>You will work closely with engineering and platform teams to enforce infrastructure standards, automate operations, and lead a high-performing platform squad in building reliable, scalable systems. </p><p><strong>Key Responsibilities </strong></p><ul><li><p>Substrate operation — own the Kubernetes cluster plus Keycloak (identity), Vault (secrets), MinIO (object storage), Harbor (registry), Kong (gateway) — from bootstrap to day-2 operations. </p></li></ul><ul><li><p>SLO framework — define, publish, and defend SLOs for every tier-1 service; own error budgets and burn-rate alerting. </p></li></ul><ul><li><p>Incident response — build the on-call rotation, paging, runbook library, and post mortem culture; lead incident command during P1/P2 events. </p></li></ul><ul><li><p>Release operations — co-own the blue-green / canary release model with L6 Delivery; sign off production-bound releases. </p></li></ul><ul><li><p>Air-gap operations — ensure every operational runbook works in a fully offline environment — no assumption of external dependencies. </p></li></ul><ul><li><p>Lead the Platform squad — technically lead 1 Infrastructure Engineer, 1 Observability Engineer, 2 DevOps Engineers; set standards for infra-as-code and automation </p></li></ul><p><strong>Required Qualifications & Skills </strong></p><ul><li><p>Bachelor's degree in computer science or related field. </p></li></ul><ul><li><p>5–8 years in SRE or production-engineering roles running distributed systems at scale. </p></li></ul><ul><li><p>Deep Kubernetes expertise — operators, RBAC, network policy, storage, upgrades. </p></li></ul><ul><li><p>Hands-on with Keycloak / Vault / MinIO / Harbor / Kong or equivalent identity/secrets/storage/registry/gateway stacks. </p></li></ul><ul><li><p>Strong Linux fundamentals and at least one systems language (Go, Rust) or shell/Python for tooling.  
</p></li></ul><ul><li><p>Proven SLO/SLI authorship and error-budget-driven decision-making  </p></li></ul><ul><li><p>Experience with observability stacks (Prometheus, Grafana, OpenTelemetry, Loki, Tempo).  </p></li></ul><ul><li><p>Calm, clear communication during incidents; strong post-mortem writing. </p></li></ul><ul><li><p>Hands-on with infra-as-code — Helm, Kustomize, Terraform. </p></li></ul><p><strong>Nice to Have  </strong></p><ul><li><p>Prior experience running air-gapped or on-prem platforms for regulated customers  </p></li></ul><ul><li><p>Cilium/Istio service-mesh operation  </p></li></ul><ul><li><p>GitOps delivery with ArgoCD or Flux  </p></li></ul><ul><li><p>FinOps / cost-attribution experience  </p></li></ul><ul><li><p>Certified Kubernetes Administrator (CKA) or equivalent </p></li></ul>",
      "jobLocation": {
        "@type": "Place",
        "address": {
          "@type": "PostalAddress",
          "postalCode": "",
          "addressRegion": "",
          "addressLocality": "Islamabad"
        }
      },
      "validThrough": "2026-07-16",
      "uniqueJobCode": "job_20260417105342_CYCVH2NKI798PLP2",
      "employmentType": "FULL_TIME",
      "hiringOrganization": {
        "logo": "https://s3.amazonaws.com/resumator/customer_20200617142926_FIFOKRA3QXMMR03Z/logos/20230316120547_10plogo_2_x50.png",
        "name": "10Pearls",
        "@type": "Organization",
        "sameAs": "https://10pearls.com/"
      },
      "experienceRequirements": "Experienced"
    }
  },
  "list_job": {
    "id": "lB9FBwpPpT",
    "title": "Site Reliability Engineer (Lead)",
    "detailUrl": "https://10pearls.applytojob.com/apply/jobs/details/lB9FBwpPpT?&"
  },
  "detail_errors": []
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/9cb01d099730e020daee286678febc42884756dc?include=description

GET https://api.bluedoor.sh/job-postings/v1/orgs/e69a6fcc-024f-4d99-ada4-5630f4f934d3

GET https://api.bluedoor.sh/job-postings/v1/sources/ecc85604-d4af-4971-b467-d3e9f14798bc

GET https://api.bluedoor.sh/job-postings/v1/jobs/9cb01d099730e020daee286678febc42884756dc/events
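
As a rough sketch, the four requests above could be assembled and issued from Python as follows. The helper names `job_record_urls` and `fetch_json` are hypothetical, and the bearer-token `Authorization` header is an assumption: this page does not say how the API key is passed, so check the API docs before relying on it. The sketch also assumes each path ends at the ID (or at `/events`) and that responses are JSON.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def job_record_urls(job_id, org_id, source_id):
    """Build the four endpoint URLs listed on this page (hypothetical helper)."""
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?" + urlencode({"include": "description"}),
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(url, api_key):
    """GET a URL and decode the JSON body (hypothetical helper).

    ASSUMPTION: the key is sent as a bearer token; the page does not
    document the real authentication scheme.
    """
    request = Request(
        url,
        headers={"Authorization": f"Bearer {api_key}", "Accept": "application/json"},
    )
    with urlopen(request, timeout=30) as response:
        return json.load(response)


# IDs copied from the "Full job record" section of this page.
urls = job_record_urls(
    job_id="9cb01d099730e020daee286678febc42884756dc",
    org_id="e69a6fcc-024f-4d99-ada4-5630f4f934d3",
    source_id="ecc85604-d4af-4971-b467-d3e9f14798bc",
)

for name, url in urls.items():
    print(f"{name}: GET {url}")
```

Calling `fetch_json(urls["job"], api_key)` with a valid key would then issue the first request; the loop above only prints the URLs and makes no network calls.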
