Home › Companies › Firstup › Director of Cloud Operations

Director of Cloud Operations

Firstup · Remote - US · Remote · Active · Lever

Job facts

Field	Value
Company	Firstup
Title	Director of Cloud Operations
Normalized title	-
Department / team	Product Management & Engineering / DevOps
Location	United States
Work model	Remote / Remote
Employment type	Full Time
Salary	-
Status	active
ATS provider	Lever
Posted / first seen	2026-04-17 / 2026-05-29
Changed / last seen	2026-05-29 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Firstup.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Lever.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
Department jobs	Active postings in Product Management & Engineering.	Open
Work model jobs	Active Remote postings.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Firstup
Source	8f017508-dc1c-4b6b-ac24-942bd6092a47
ATS provider	Lever

Description

Who We Are At Firstup, our mission is to improve the employee experience at every moment that matters, large and small. As the communication pipeline for the world's workforce, we now serve 40 of the Fortune 100 companies, reaching and connecting more than 17 million employees daily. Our employees are experts in the employee experience, workforce communications and technology. Joining Firstup means joining a movement to make work better for every worker. As the world’s first intelligent communication platform, Firstup meaningfully engages employees at every moment from hire to retire, and delivers engagement insights to help companies support, promote and retain their talent. Our movement has taken root and is evident in our world-class customer base. Now we need your help. Ready to make a difference in the world? Job Summary: We are seeking a Director of Cloud Operations (CloudOps) to lead and evolve our cloud infrastructure and operational practices across a globally distributed SaaS platform. This is a hands-on leadership role responsible for ensuring the reliability, scalability, and efficiency of our systems running across multiple AWS regions in the United States and Europe. As part of the senior leadership team, you will partner closely with Engineering, Security, and Product to strengthen operational excellence, enhance system observability, and drive continuous improvement in how we build and run services. You will lead a distributed team of engineers across the US and UK, fostering a high-performing, collaborative, and growth-oriented environment. This role is ideal for a leader who combines deep technical expertise with a pragmatic approach to improving systems, processes, and team capabilities. Firstup expects the base salary for this role to be between $200,000-$228,000. The starting rate of pay may vary based on factors including, but not limited to, position offered, location, education, training, and/or experience. Why Firstup? Because you care - about people, the work you do, and the connections you make. Work is such a large part of life; it only makes sense to make it awesome. If you want to engage brilliant minds in a high-growth and inclusive environment where ideas are rewarded regardless of who they come from, join us. This is a rapidly changing space so if you thrive on ambiguity, are hungry for a challenge, and have the guts to speak your mind, you could be a perfect fit. We offer an excellent PTO program, great health benefits, a casual and friendly environment, remote work, and a leadership team who truly believes in your growth – both personally and professionally. Firstup is committed to providing equal employment opportunities to all applicants for employment and to all employees, without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, protected disability, veteran status, or any other protected status in accordance with applicable federal, state or local laws. #LI-TM1 #LI-Remote What You’ll Do Cloud Platform & Reliability Own the availability, performance, and resilience of our multi-region AWS platform. Drive improvements in system reliability through well-defined SLIs/SLOs , error budgets, and proactive engineering practices. Lead efforts to reduce MTTR and improve incident response effectiveness across the organization. Guide architecture decisions for microservices, Kubernetes (EKS), and serverless workloads to ensure scalability and fault tolerance. Observability & Incident Management Advance our observability strategy using Datadog , ensuring actionable insights across infrastructure and applications. Establish and refine incident management practices, including on-call processes, escalation paths, and post-incident reviews. Act as an incident commander for critical events and contribute to the on-call rotation. Operational Excellence & Efficiency Elevate operational standards through automation, standardization, and adoption of modern best practices. Drive cost optimization initiatives across AWS environments without compromising performance or reliability. Leverage AI and automation to improve operational efficiency, accelerate root cause analysis, and enhance system insights. Continuously improve CI/CD pipelines (CircleCI) and infrastructure-as-code practices (Terraform). Team Leadership & Development Lead, mentor, and support a distributed team of CloudOps engineers across the US and UK. Foster a culture of accountability, learning, and continuous improvement. Provide technical guidance while enabling the team to grow in ownership and capability. Hybrid & Legacy Environment Support Ensure stability and support for existing customers while maintaining clear operational boundaries with the cloud platform. What We’re Looking For Experience 10+ years in cloud infrastructure, SRE, or DevOps roles, with 3+ years experience leading CloudOps/SRE teams . Proven track record of leading operational or platform transformations in a SaaS environment. Experience operating multi-region, customer-facing systems at scale . Technical Expertise Strong hands-on experience with: AWS (multi-region architectures) Kubernetes (EKS) and containerized environments Infrastructure as Code (Terraform preferred) CI/CD pipelines (CircleCI or similar) Observability platforms (Datadog or equivalent) Solid understanding of microservices and distributed systems design. Familiarity with serverless architectures and modern cloud-native patterns. Operational Leadership Deep experience with incident management , on-call operations, and reliability engineering practices. Strong understanding of SLO/SLI frameworks , monitoring strategies, and performance optimization. Demonstrated ability to balance hands-on technical work with team leadership . Leadership & Mindset Collaborative, pragmatic leader who can influence across teams and functions. Passion for building and supporting high-performing teams . Focus on continuous improvement, with a bias toward measurable outcomes.

Full job record

Job ID	6d22ba2da451a3b3b867deb08195453c60784cbb
Org ID	8715351a-68ee-40b4-ad54-bee843123509
Source ID	8f017508-dc1c-4b6b-ac24-942bd6092a47
Board ID	8f017508-dc1c-4b6b-ac24-942bd6092a47
Provider	lever
Provider Job Key	36ea56ef-72af-4726-a936-1737b75a67de
Title	Director of Cloud Operations
Normalized Title	—
Status	active
Active	yes
Location Text	Remote - US
Department	Product Management & Engineering
Team	DevOps
Employment Type	Full-time
Workplace Type	remote
Remote Policy	remote
Country	United States
Region	—
City	—
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://jobs.lever.co/firstup/36ea56ef-72af-4726-a936-1737b75a67de
Apply URL	https://jobs.lever.co/firstup/36ea56ef-72af-4726-a936-1737b75a67de/apply
First Seen At	2026-05-29 07:00:45Z
Last Seen At	2026-06-06 07:56:44Z
Last Checked At	2026-06-06 07:56:44Z
Last Changed At	2026-05-29 07:00:45Z
Inactive At	—
Source Posted At	2026-04-17 17:30:19Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=lever/board=firstup/date=2026-06-06/2026-06-06T07-56-42-238Z-b27ff46d26efb638421d0c18bdc76f86023d6f50b84819ddf386497a21615966.json

Event Fields

{
  "content_hash": "47660f974c12a8a8de99a2c1aec9a57b657d5f5f53b34cb6660384f548347656",
  "source_hash": "0137422c8c0d2486dca06eb434bf72d28adbe69a25c08058b0ed27850caf3364",
  "last_changed_at": "2026-05-29T07:00:45.360Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Remote - US",
    "city": null,
    "region": null,
    "country": "United States",
    "is_remote": true,
    "confidence": 0.95
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T07:56:44.234Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Remote - US",
      "city": null,
      "region": null,
      "country": "United States",
      "is_remote": true,
      "confidence": 0.95
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": null,
  "workplace_type": "remote",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "lists": [
    {
      "text": "What You’ll Do",
      "content": "\n<li>\n<h4><strong>Cloud Platform &amp; Reliability</strong></h4>\n\n</li><li>\n<p>Own the availability, performance, and resilience of our multi-region AWS platform.</p>\n</li>\n<li>\n<p>Drive improvements in system reliability through well-defined <strong>SLIs/SLOs</strong>, error budgets, and proactive engineering practices.</p>\n</li>\n<li>\n<p>Lead efforts to reduce <strong>MTTR</strong> and improve incident response effectiveness across the organization.</p>\n</li>\n<li>\n<p>Guide architecture decisions for microservices, Kubernetes (EKS), and serverless workloads to ensure scalability and fault tolerance.</p>\n</li>\n\n\n<li>\n<h4><strong>Observability &amp; Incident Management</strong></h4>\n\n</li><li>\n<p>Advance our observability strategy using <strong>Datadog</strong>, ensuring actionable insights across infrastructure and applications.</p>\n</li>\n<li>\n<p>Establish and refine incident management practices, including on-call processes, escalation paths, and post-incident reviews.</p>\n</li>\n<li>\n<p>Act as an incident commander for critical events and contribute to the on-call rotation.</p>\n</li>\n\n\n<li>\n<h4><strong>Operational Excellence &amp; Efficiency</strong></h4>\n\n</li><li>\n<p>Elevate operational standards through automation, standardization, and adoption of modern best practices.</p>\n</li>\n<li>\n<p>Drive cost optimization initiatives across AWS environments without compromising performance or reliability.</p>\n</li>\n<li>\n<p>Leverage <strong>AI and automation</strong> to improve operational efficiency, accelerate root cause analysis, and enhance system insights.</p>\n</li>\n<li>\n<p>Continuously improve CI/CD pipelines (CircleCI) and infrastructure-as-code practices (Terraform).</p>\n</li>\n\n\n<li>\n<h4><strong>Team Leadership &amp; Development</strong></h4>\n\n</li><li>\n<p>Lead, mentor, and support a distributed team of CloudOps engineers across the US and UK.</p>\n</li>\n<li>\n<p>Foster a culture of accountability, learning, and continuous improvement.</p>\n</li>\n<li>\n<p>Provide technical guidance while enabling the team to grow in ownership and capability.</p>\n</li>\n\n\n<li>\n<h4><strong>Hybrid &amp; Legacy Environment Support</strong></h4>\n\n</li><li>\n<p>Ensure stability and support for existing customers while maintaining clear operational boundaries with the cloud platform.</p>\n</li>\n\n\n"
    },
    {
      "text": "What We’re Looking For",
      "content": "\n<li><strong>Experience</strong>\n\n</li><li>\n<p>10+ years in cloud infrastructure, SRE, or DevOps roles, with<strong> 3+ years</strong> <strong>experience leading CloudOps/SRE teams</strong>.</p>\n</li>\n<li>\n<p>Proven track record of leading <strong>operational or platform transformations</strong> in a SaaS environment.</p>\n</li>\n<li>\n<p>Experience operating <strong>multi-region, customer-facing systems at scale</strong>.</p>\n</li>\n\n\n<li>\n<h4><strong>Technical Expertise</strong></h4>\n\n</li><li>\n<p>Strong hands-on experience with:</p>\n\n</li><li>\n<p><strong>AWS</strong> (multi-region architectures)</p>\n</li>\n<li>\n<p><strong>Kubernetes (EKS)</strong> and containerized environments</p>\n</li>\n<li>\n<p><strong>Infrastructure as Code</strong> (Terraform preferred)</p>\n</li>\n<li>\n<p><strong>CI/CD pipelines</strong> (CircleCI or similar)</p>\n</li>\n<li>\n<p><strong>Observability platforms</strong> (Datadog or equivalent)</p>\n</li>\n\n\n<li>\n<p>Solid understanding of microservices and distributed systems design.</p>\n</li>\n<li>\n<p>Familiarity with serverless architectures and modern cloud-native patterns.</p>\n</li>\n\n\n<li>\n<h4><strong>Operational Leadership</strong></h4>\n\n</li><li>\n<p>Deep experience with <strong>incident management</strong>, on-call operations, and reliability engineering practices.</p>\n</li>\n<li>\n<p>Strong understanding of <strong>SLO/SLI frameworks</strong>, monitoring strategies, and performance optimization.</p>\n</li>\n<li>\n<p>Demonstrated ability to balance <strong>hands-on technical work with team leadership</strong>.</p>\n</li>\n\n\n<li>\n<h4><strong>Leadership &amp; Mindset</strong></h4>\n\n</li><li>\n<p>Collaborative, pragmatic leader who can influence across teams and functions.</p>\n</li>\n<li>\n<p>Passion for building and supporting <strong>high-performing teams</strong>.</p>\n</li>\n<li>\n<p>Focus on continuous improvement, with a bias toward measurable outcomes.</p>\n</li>\n\n\n"
    }
  ],
  "country": "US",
  "createdAt": 1776447019677,
  "updatedAt": null,
  "categories": {
    "team": "DevOps",
    "location": "Remote - US",
    "commitment": "Full-time",
    "department": "Product Management & Engineering",
    "allLocations": [
      "Remote - US"
    ]
  },
  "salaryRange": null,
  "workplaceType": "remote"
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/6d22ba2da451a3b3b867deb08195453c60784cbb?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/8715351a-68ee-40b4-ad54-bee843123509JSON

GET https://api.bluedoor.sh/job-postings/v1/sources/8f017508-dc1c-4b6b-ac24-942bd6092a47JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/6d22ba2da451a3b3b867deb08195453c60784cbb/eventsJSON

Docs · Get an API key