Home › Companies › Egay Fa Us6 Oraclecloud Com CX 4001 › Site Reliability Engineer
Site Reliability Engineer
Egay Fa Us6 Oraclecloud Com CX 4001 · IN KL Kochi NOV DTC, Kochi, Kerala, IN · Active · Oracle Recruiting Cloud / Fusion HCM
Job facts
| Field | Value |
|---|---|
| Company | Egay Fa Us6 Oraclecloud Com CX 4001 |
| Title | Site Reliability Engineer |
| Normalized title | - |
| Department / team | Technical |
| Location | Kerala, IN, United States |
| Work model | - |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | Oracle Recruiting Cloud / Fusion HCM |
| Posted / first seen | 2026-06-23 / 2026-06-23 |
| Changed / last seen | 2026-06-23 / 2026-06-23 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Egay Fa Us6 Oraclecloud Com CX 4001. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Oracle Recruiting Cloud / Fusion HCM. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Kerala. | Open |
| Department jobs | Active postings in Technical. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Egay Fa Us6 Oraclecloud Com CX 4001 |
| Source | fe4e74ed-d842-4ad5-b13f-38d2eb0357cc |
| ATS provider | Oracle Recruiting Cloud / Fusion HCM |
Description
Description
We are looking for an experienced SRE to lead production reliability, performance tuning, and operational excellence across our platform. You’ll work at the intersection of software engineering and systems engineering—driving improvements that directly impact product uptime, velocity, and user satisfaction.
If you are passionate about reliability, automation, and scalability—and have the technical depth to back it up—this role is for you.
Responsibilities:
As a Site Reliability Engineer, you will be responsible for Operational Excellence & Incident Management
Maintain and monitor production systems for availability, latency, and performance.
Lead incident response efforts, including communication, resolution, and postmortem documentation.
Design and implement health checks, alerting systems, and automated remediation workflows.
Drive root cause analysis and implement permanent resolutions for recurring issues.
Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK.
Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement.
Conduct post-incident reviews and use insights to inform future engineering investments.
Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency.
Work with developers to evolve architecture and improve system throughput, latency, and stability.
Optimize PostgreSQL performance, queries, and maintenance strategies.
Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI.
Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency.
Standardize infrastructure as code practices across environments.
Requirements:
5–10 years of experience in SRE, DevOps, or Infrastructure Engineering roles.
Expertise in Kubernetes and container orchestration at scale.
Strong experience with AKKA.NET or similar actor-based frameworks.
Proficiency with scripting and automation (Bash, PowerShell, Python).
Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK).
Hands-on experience with cloud platforms (AWS, Azure, or GCP).
Strong PostgreSQL knowledge—performance tuning, query optimization, maintenance.
Proven ability to lead incident management and drive postmortem processes.
A builder’s mindset with high standards for operational excellence and technical ownership.
Preferred Tools & Ecosystem Experience
CI/CD: GitHub Actions, Azure Pipelines, GitLab CI
Infrastructure: Kubernetes, Docker, Terraform
Monitoring: Phobos (AKKA.NET), Datadog, Prometheus
Source Control: GitHub, GitLab, Azure DevOps
Programming: C#, Python, Bash, PowerShell
Company
Every day, the oil and gas industry’s best minds put more than 150 years of experience to work to help our customers achieve lasting success.
We Power the Industry that Powers the World
Throughout every region in the world and across every area of drilling and production, our family of companies has provided the technical expertise, advanced equipment, and operational support necessary for success—now and in the future.
Global Family
We are a global family of thousands of individuals, working as one team to create a lasting impact for ourselves, our customers, and the communities where we live and work.
Purposeful Innovation
Through purposeful business innovation, product creation, and service delivery, we are driven to power the industry that powers the world better.
Service Above All
This drives us to anticipate our customers’ needs and work with them to deliver the finest products and services on time and on budget.
Full job record
| Job ID | 55303becc78f79794852199a345337dfe9a13e0d |
| Org ID | ed4800e6-a68f-4753-bef9-3311479be754 |
| Source ID | fe4e74ed-d842-4ad5-b13f-38d2eb0357cc |
| Board ID | fe4e74ed-d842-4ad5-b13f-38d2eb0357cc |
| Provider | oracle_hcm |
| Provider Job Key | 41953 |
| Title | Site Reliability Engineer |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | IN KL Kochi NOV DTC, Kochi, Kerala, IN |
| Department | Technical |
| Team | — |
| Employment Type | full_time |
| Workplace Type | — |
| Remote Policy | — |
| Country | United States |
| Region | IN |
| City | Kerala |
| Salary Raw | Description We are looking for an experienced SRE to lead production reliability, performance tuning, and operational excellence across our platform. You’ll work at the intersection of software engineering and systems engineering—driving improvements that directly impact product uptime, velocity, and user satisfaction. If you are passionate about reliability, automation, and scalability—and have the technical depth to back it up—this role is for you. Responsibilities: As a Site Reliability Engineer, you will be responsible for Operational Excellence & Incident Management Maintain and monitor production systems for availability, latency, and performance. Lead incident response efforts, including communication, resolution, and postmortem documentation. Design and implement health checks, alerting systems, and automated remediation workflows. Drive root cause analysis and implement permanent resolutions for recurring issues. Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK. Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement. Conduct post-incident reviews and use insights to inform future engineering investments. Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency. Work with developers to evolve architecture and improve system throughput, latency, and stability. Optimize PostgreSQL performance, queries, and maintenance strategies. Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI. Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency. Standardize infrastructure as code practices across environments. Requirements: 5–10 years of experience in SRE, DevOps, or Infrastructure Engineering roles. Expertise in Kubernetes and container orchestration at scale. Strong experience with AKKA.NET or similar actor-based frameworks. Proficiency with scripting and automation (Bash, PowerShell, Python). Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK). Hands-on experience with cloud platforms (AWS, Azure, or GCP). Strong PostgreSQL knowledge—performance tuning, query optimization, maintenance. Proven ability to lead incident management and drive postmortem processes. A builder’s mindset with high standards for operational excellence and technical ownership. Preferred Tools & Ecosystem Experience CI/CD: GitHub Actions, Azure Pipelines, GitLab CI Infrastructure: Kubernetes, Docker, Terraform Monitoring: Phobos (AKKA.NET), Datadog, Prometheus Source Control: GitHub, GitLab, Azure DevOps Programming: C#, Python, Bash, PowerShell Company Every day, the oil and gas industry’s best minds put more than 150 years of experience to work to help our customers achieve lasting success. We Power the Industry that Powers the World Throughout every region in the world and across every area of drilling and production, our family of companies has provided the technical expertise, advanced equipment, and operational support necessary for success—now and in the future. Global Family We are a global family of thousands of individuals, working as one team to create a lasting impact for ourselves, our customers, and the communities where we live and work. Purposeful Innovation Through purposeful business innovation, product creation, and service delivery, we are driven to power the industry that powers the world better. Service Above All This drives us to anticipate our customers’ needs and work with them to deliver the finest products and services on time and on budget. |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | day |
| Source URL | https://egay.fa.us6.oraclecloud.com/hcmUI/CandidateExperience/en/sites/CX_4001/job/41953 |
| Apply URL | https://egay.fa.us6.oraclecloud.com/hcmUI/CandidateExperience/en/sites/CX_4001/job/41953 |
| First Seen At | 2026-06-23 11:54:32Z |
| Last Seen At | 2026-06-23 11:54:32Z |
| Last Checked At | 2026-06-23 11:54:32Z |
| Last Changed At | 2026-06-23 11:54:32Z |
| Inactive At | — |
| Source Posted At | 2026-06-23 10:56:27Z |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=oracle_hcm/board=egay.fa.us6.oraclecloud.com|CX_4001/date=2026-06-23/2026-06-23T11-54-03-770Z-feddd9c2916ae8cfc5d8a4fbf60009bc71eae82cc49e44f804c45360235721f9.json |
Event Fields
{
"content_hash": "2a016a2cce1783b197afbedddcd59a23564b0b7dd093bb8c9c2a76ae7bc4dad9",
"source_hash": "a2eb69a95dd88a2e9b71a813ed70d34ab19d56a713433b75b30372defe81eea0",
"last_changed_at": "2026-06-23T11:54:32.379Z",
"active_status": "active"
}Parsed Structured
{
"dedupe": null,
"language": "en",
"location": {
"raw": "IN KL Kochi NOV DTC, Kochi, Kerala, IN",
"city": "Kerala",
"region": "IN",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-23T11:54:31.818Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "IN KL Kochi NOV DTC, Kochi, Kerala, IN",
"city": "Kerala",
"region": "IN",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": "day",
"workplace_type": null,
"salary_currency": null
}Extensions
{}Native Structured
{
"detail": {
"Id": "41953",
"Title": "Site Reliability Engineer",
"media": [],
"skills": [],
"JobType": null,
"Category": "Technical",
"JobGrade": null,
"JobLevel": null,
"JobShift": "Evening",
"WorkDays": null,
"WorkHours": null,
"WorkYears": null,
"Department": null,
"HotJobFlag": false,
"StudyLevel": null,
"WorkMonths": null,
"WorkerType": null,
"GeographyId": 300000934580725,
"JobFamilyId": 300000018266896,
"JobFunction": "Engineering",
"JobSchedule": "Full time",
"BusinessUnit": null,
"ContractType": null,
"Organization": null,
"TrendingFlag": false,
"workLocation": [
{
"Country": "IN",
"Region1": null,
"Region2": "Kerala",
"Region3": null,
"Building": null,
"Latitude": "10.0097",
"Longitude": "76.36382",
"LocationId": 300003594952072,
"PostalCode": "682042",
"TownOrCity": "Kochi",
"AddressLine1": "Lulu Cyber Tower 2",
"AddressLine2": "10th Floor, 1003",
"AddressLine3": "Infopark Special Economic Zone, Kakkanad",
"AddressLine4": null,
"LocationName": "IN KL Kochi NOV DTC"
}
],
"ContentLocale": "en",
"HiringManager": null,
"LegalEmployer": null,
"RequisitionId": 300004593366645,
"WorkplaceType": "",
"BusinessUnitId": 300000007962530,
"OrganizationId": 300003405013902,
"GeographyNodeId": 100023524884344,
"JobFunctionCode": "ENGG",
"LegalEmployerId": 300000007923223,
"PrimaryLocation": "Kochi, Kerala, India",
"RequisitionType": "NOV Standard",
"NumberOfOpenings": null,
"WorkplaceTypeCode": null,
"BeFirstToApplyFlag": true,
"otherWorkLocations": [],
"secondaryLocations": [],
"ExternalContactName": null,
"ShortDescriptionStr": "Join our High-Impact Site Reliability Engineering Team\nCustomer experience is at the core of everything we do. To deliver resilient, high-performance systems at scale, we’re expanding our DevOps function into a more robust Site Reliability Engineering (SRE) discipline. We’re looking for an experienced SRE to lead production reliability, performance tuning, and operational excellence across our platform. You’ll work at the intersection of software engineering and systems engineering—driving improvements that directly impact product uptime, velocity, and user satisfaction.\nIf you are passionate about reliability, automation, and scalability—and have the technical depth to back it up—this role is for you.\n",
"ExternalContactEmail": null,
"ExternalPostedEndDate": "2026-08-31T10:56:00+00:00",
"OtherRequisitionTitle": null,
"requisitionFlexFields": [],
"ApplyWhenNotPostedFlag": true,
"DomesticTravelRequired": null,
"ExternalDescriptionStr": "<p>We are looking for an experienced SRE to lead production reliability, performance tuning, and operational excellence across our platform. You’ll work at the intersection of software engineering and systems engineering—driving improvements that directly impact product uptime, velocity, and user satisfaction.</p>\n<p>If you are passionate about reliability, automation, and scalability—and have the technical depth to back it up—this role is for you.</p>\n<p><strong>Responsibilities:</strong></p>\n<ul>\n <li>As a Site Reliability Engineer, you will be responsible for Operational Excellence & Incident Management</li>\n <li>Maintain and monitor production systems for availability, latency, and performance.</li>\n <li>Lead incident response efforts, including communication, resolution, and postmortem documentation.</li>\n <li>Design and implement health checks, alerting systems, and automated remediation workflows.</li>\n <li>Drive root cause analysis and implement permanent resolutions for recurring issues.</li>\n <li>Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK.</li>\n <li>Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement.</li>\n <li>Conduct post-incident reviews and use insights to inform future engineering investments.</li>\n <li>Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency.</li>\n <li>Work with developers to evolve architecture and improve system throughput, latency, and stability.</li>\n <li>Optimize PostgreSQL performance, queries, and maintenance strategies.</li>\n <li>Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI.</li>\n <li>Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency.</li>\n <li>Standardize infrastructure as code practices across environments.</li>\n</ul>\n<p><strong>Requirements:</strong></p>\n<ul>\n <li>5–10 years of experience in SRE, DevOps, or Infrastructure Engineering roles.</li>\n <li>Expertise in Kubernetes and container orchestration at scale.</li>\n <li>Strong experience with AKKA.NET or similar actor-based frameworks.</li>\n <li>Proficiency with scripting and automation (Bash, PowerShell, Python).</li>\n <li>Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK).</li>\n <li>Hands-on experience with cloud platforms (AWS, Azure, or GCP).</li>\n <li>Strong PostgreSQL knowledge—performance tuning, query optimization, maintenance.</li>\n <li>Proven ability to lead incident management and drive postmortem processes.</li>\n <li>A builder’s mindset with high standards for operational excellence and technical ownership.</li>\n <li>Preferred Tools & Ecosystem Experience</li>\n <li>CI/CD: GitHub Actions, Azure Pipelines, GitLab CI</li>\n <li>Infrastructure: Kubernetes, Docker, Terraform</li>\n <li>Monitoring: Phobos (AKKA.NET), Datadog, Prometheus</li>\n <li>Source Control: GitHub, GitLab, Azure DevOps</li>\n <li>Programming: C#, Python, Bash, PowerShell</li>\n</ul>",
"ObjectVerNumberProfile": "1",
"PrimaryLocationCountry": "IN",
"CorporateDescriptionStr": "Every day, the oil and gas industry’s best minds put more than 150 years of experience to work to help our customers achieve lasting success.<br/><br/> \n<b>We Power the Industry that Powers the World</b><br/>Throughout every region in the world and across every area of drilling and production, our family of companies has provided the technical expertise, advanced equipment, and operational support necessary for success—now and in the future.<br/><br/> \n<b>Global Family</b><br/>We are a global family of thousands of individuals, working as one team to create a lasting impact for ourselves, our customers, and the communities where we live and work. <br/><br/> \n<b>Purposeful Innovation</b><br/>Through purposeful business innovation, product creation, and service delivery, we are driven to power the industry that powers the world better.<br/><br/> \n<b>Service Above All</b><br/>This drives us to anticipate our customers’ needs and work with them to deliver the finest products and services on time and on budget.<br/>",
"ExternalPostedStartDate": "2026-06-23T10:56:27+00:00",
"ExternalQualificationsStr": "",
"InternalQualificationsStr": "",
"OrganizationDescriptionStr": "",
"primaryLocationCoordinates": [
{
"Latitude": "9.98995",
"Longitude": "76.31374",
"CountryCode": "IN",
"GeographyId": 300000934580725,
"GeographyNodeId": 100023524884344
}
],
"ExternalResponsibilitiesStr": "",
"InternalResponsibilitiesStr": "",
"InternationalTravelRequired": null
},
"list_job": {
"Id": "41953",
"Title": "Site Reliability Engineer",
"JobType": null,
"Distance": 1782172800000,
"JobShift": null,
"Language": "US",
"WorkDays": null,
"JobFamily": null,
"Relevancy": 9,
"WorkHours": null,
"Department": null,
"HotJobFlag": false,
"PostedDate": "2026-06-23",
"StudyLevel": null,
"WorkerType": null,
"GeographyId": 300000934580725,
"JobFunction": null,
"JobSchedule": null,
"BusinessUnit": null,
"ContractType": null,
"ManagerLevel": null,
"Organization": null,
"TrendingFlag": false,
"workLocation": [
{
"Country": "IN",
"Region1": null,
"Region2": "Kerala",
"Region3": null,
"Building": null,
"Latitude": 10.0097,
"Longitude": 76.36382,
"LocationId": 300003594952072,
"PostalCode": "682042",
"TownOrCity": "Kochi",
"AddressLine1": "Lulu Cyber Tower 2",
"AddressLine2": "10th Floor, 1003",
"AddressLine3": "Infopark Special Economic Zone, Kakkanad",
"AddressLine4": null,
"LocationName": "IN KL Kochi NOV DTC"
}
],
"LegalEmployer": null,
"MediaThumbURL": null,
"WorkplaceType": "",
"BusinessUnitId": 300000007962530,
"OrganizationId": 300003405013902,
"PostingEndDate": null,
"LegalEmployerId": 300000007923223,
"PrimaryLocation": "Kochi, Kerala, India",
"WorkDurationYears": null,
"WorkplaceTypeCode": null,
"BeFirstToApplyFlag": true,
"WorkDurationMonths": null,
"otherWorkLocations": [],
"secondaryLocations": [],
"ShortDescriptionStr": "Join our High-Impact Site Reliability Engineering Team\nCustomer experience is at the core of everything we do. To deliver resilient, high-performance systems at scale, we’re expanding our DevOps function into a more robust Site Reliability Engineering (SRE) discipline. We’re looking for an experienced SRE to lead production reliability, performance tuning, and operational excellence across our platform. You’ll work at the intersection of software engineering and systems engineering—driving improvements that directly impact product uptime, velocity, and user satisfaction.\nIf you are passionate about reliability, automation, and scalability—and have the technical depth to back it up—this role is for you.\n",
"requisitionFlexFields": [],
"DomesticTravelRequired": null,
"PrimaryLocationCountry": "IN",
"ExternalQualificationsStr": null,
"ExternalResponsibilitiesStr": null,
"InternationalTravelRequired": null
},
"detail_meta": {
"url": "https://egay.fa.us6.oraclecloud.com/hcmRestApi/resources/latest/recruitingCEJobRequisitionDetails?expand=all&onlyData=true&finder=ById;Id=%2241953%22,siteNumber=CX_4001",
"http_status": 200,
"content_type": "application/json",
"response_bytes": 8076
},
"detail_errors": []
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/55303becc78f79794852199a345337dfe9a13e0d?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/ed4800e6-a68f-4753-bef9-3311479be754JSONGET https://api.bluedoor.sh/job-postings/v1/sources/fe4e74ed-d842-4ad5-b13f-38d2eb0357ccJSONGET https://api.bluedoor.sh/job-postings/v1/jobs/55303becc78f79794852199a345337dfe9a13e0d/eventsJSON