DevOps Engineer
Menlo · Singapore, Singapore, 180000, Singapore · Hybrid · Active · BambooHR
Job facts
| Field | Value |
|---|---|
| Company | Menlo |
| Title | DevOps Engineer |
| Normalized title | - |
| Department / team | Menlo HQ |
| Location | Singapore, Singapore |
| Work model | Hybrid |
| Employment type | Full Time |
| Salary | - |
| Status | active |
| ATS provider | BambooHR |
| Posted / first seen | 2026-04-15 / 2026-05-30 |
| Changed / last seen | 2026-05-30 / 2026-06-06 |
Linked records
| Field | Value |
|---|---|
| Company | Menlo |
| Source | 0de7fe14-773e-46a7-bf7b-e1b137c04318 |
| ATS provider | BambooHR |
Description
About Menlo
Menlo Research is an Applied R&D lab building Asimov, an open-source humanoid robot platform, and the full software stack that powers it. Our mission is to make humanoid labor economically viable -- turning software into physical labor at scale. We build across the full stack: hardware architecture, locomotion, autonomy, simulation, and infrastructure. We move fast, ship to real robots, and open-source everything we can. If you want your work to matter beyond a paper or a demo, this is the place.
The Role
As a DevOps Engineer, you will own and evolve the platform that everything at Menlo runs on -- from inference serving, to training rigs, to the agentic coding infrastructure that powers day-to-day engineering. You will work deep in the stack across Kubernetes, networking, and, where it matters, bare metal, and help set the technical direction for how Menlo Cloud scales.
What You'll Do
- Operate and evolve our Kubernetes platform across multiple clusters and environments (Prod, Dev, hybrid on-prem and public cloud), covering control plane operations, node lifecycle, upgrades, and autoscaling at every layer (Cluster Autoscaler, HPA, KEDA).
- Architect and manage hybrid cloud infrastructure spanning on-premises and public clouds (GCP, AWS), including workload placement, cross-cloud networking, and unified resource management.
- Own the CI/CD and GitOps experience end-to-end: container build pipelines, image optimization, and progressive delivery via ArgoCD / FluxCD.
- Own the observability stack as a single pane of glass across all clusters: Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus -- and help push toward agent-assisted SRE workflows.
- Manage and improve our inference platform: vLLM serving and AIBrix for multi-model orchestration and autoscaling across a fleet of NVIDIA GPUs.
- Operate platform services: Kafka, Redis, PostgreSQL, OpenSearch.
- Manage identity and access via Keycloak integrated with Google Workspace; harden SSO, RBAC, and secrets management across the platform.
- Harden network security across private load balancers, firewalls, and VPC segmentation; design and maintain hub-and-spoke / multi-AZ topologies.
- Support training infrastructure: self-service VM provisioning, RunPod burst capacity, Weights and Biases integration.
- Drive infrastructure reliability, cost efficiency, and capacity planning as the platform scales.
What We're Looking For
- Kubernetes -- deep, hands-on. Strong production experience with Kubernetes, fluent in workloads and controllers, networking (Services, Ingress, CNI basics), storage (PV/PVC, CSI), RBAC, and the autoscaling story end-to-end (HPA, VPA, Cluster Autoscaler, KEDA). Cloud-managed Kubernetes (GKE, EKS, AKS) is fine; on-premises / self-managed Kubernetes (kubeadm, Cluster API, k3s, etc.) is a strong plus.
- Networking -- design-level, not just operator-level. You have designed real network topologies at some point in your career -- hub-and-spoke, multi-AZ / multi-VPC, or an equivalent enterprise pattern -- and can defend the tradeoffs. Comfortable with VPCs, firewalls, load balancers, private cluster architecture, DNS, and routing. On-premises networking experience (VLANs, BGP, L2/L3 fabrics, pfSense / Fortinet / Palo Alto / Cisco) is a strong plus.
- CI/CD and Docker -- concepts over tooling. You can build and optimize Dockerfiles (multi-stage builds, layer caching, small/secure base images) and have owned full CI/CD pipelines end-to-end. Tooling is flexible -- GitHub Actions, GitLab CI, Azure Pipelines, Jenkins, Argo Workflows, etc. -- but you should be able to clearly articulate the full lifecycle of a typical pipeline, and explain how CI/CD changes when the deployment target is Kubernetes (ArgoCD / FluxCD, GitOps patterns, progressive delivery).
- Observability -- you have built this before. You have stood up a full observability stack from scratch and operated it in production -- metrics, logs, traces, alerting, on-call. Familiarity with the Grafana stack (Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus) is a strong plus. Bonus points if you have experimented with agent-assisted SRE workflows or LLM-driven incident triage.
- SSO and identity. When you bring a new tool into the platform, your instinct is to wire it into a central IdP rather than leave it on local accounts. Comfortable with OpenID Connect, SAML, and traditional directory services (LDAP / Active Directory), and you have integrated tools with an IdP like Keycloak, Okta, Azure AD, or equivalent.
- Linux and automation fundamentals. Strong Linux proficiency (RHEL/Ubuntu or equivalent) including basic performance and networking debugging. Comfort with infrastructure-as-code (Terraform / Terragrunt / Pulumi or equivalent) and configuration management.
- Ownership mindset. Comfortable operating in a high-ownership environment where you make architecture decisions, push them to production, and own the outcomes.
- Optional but valuable: hands-on experience operating any of Kafka, Redis, PostgreSQL, OpenSearch -- at production scale, including HA, backup/restore, and upgrade planning.
Bonus points for:
- Experience with OpenStack in production: Nova, Neutron, Cinder, Trove, Horizon, and CLI administration.
- Experience with KVM virtualization and storage backends like Ceph or Rook-Ceph on Kubernetes.
- Familiarity with vLLM internals: PagedAttention, continuous batching, tensor parallelism.
- Background in AI/ML infrastructure or GPU cluster operations at scale.
- Hands-on production experience with KEDA or event-driven autoscaling patterns.
- Prior open-source contributions to Kubernetes, OpenStack, or adjacent projects.
- Kernel-level Linux debugging and performance tuning.
Why Join Menlo?
Most infrastructure teams manage someone else's cloud. At Menlo, you own the metal. Menlo Cloud is a first-class investment built from the ground up, and it sits at the center of everything we do, from coding agents to humanoid robots. You will have genuine ownership over a platform that is technically ambitious, cost-conscious by design, and critical to the mission. If you want to build infrastructure that actually matters and have the autonomy to do it right, this is the place.
Full job record
| Field | Value |
|---|---|
| Job ID | 86da6f104853f01842e3b60c2bf85c99a8b0f3a1 |
| Org ID | 6016aaf4-50e1-4e4b-831c-a615ce20aa74 |
| Source ID | 0de7fe14-773e-46a7-bf7b-e1b137c04318 |
| Board ID | 0de7fe14-773e-46a7-bf7b-e1b137c04318 |
| Provider | bamboohr |
| Provider Job Key | 128 |
| Title | DevOps Engineer |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Singapore, Singapore, 180000, Singapore |
| Department | Menlo HQ |
| Team | — |
| Employment Type | full_time |
| Workplace Type | hybrid |
| Remote Policy | hybrid |
| Country | — |
| Region | Singapore |
| City | Singapore |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://menlo.bamboohr.com/careers/128 |
| Apply URL | https://menlo.bamboohr.com/careers/128 |
| First Seen At | 2026-05-30 05:40:37Z |
| Last Seen At | 2026-06-06 10:21:32Z |
| Last Checked At | 2026-06-06 10:21:32Z |
| Last Changed At | 2026-05-30 05:40:37Z |
| Inactive At | — |
| Source Posted At | 2026-04-15 00:00:00Z |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=bamboohr/board=menlo/date=2026-06-06/2026-06-06T10-21-31-028Z-e4d32e244cc793dec40d75fb8836cb748032f5f97138ec86f1116441c77e211f.json |
Event Fields
{
"content_hash": "420ea9c56d0f669702c5ac262c5046c824401d63f3dbe82c3f5893578c6101f1",
"source_hash": "d0f0c23a6c22a5eb5d1ed4b39c0a330e387d392beb2ac54ba3cf68e5b5cf0e90",
"last_changed_at": "2026-05-30T05:40:37.676Z",
"active_status": "active"
}
Parsed Structured
{
"language": "en",
"location": {
"raw": "Singapore, Singapore, 180000, Singapore",
"city": "Singapore",
"region": "Singapore",
"country": null,
"is_remote": false,
"confidence": 0.8
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T10:21:32.708Z",
"launch_scope": {
"reason": "bamboohr_production_catalog",
"included": true,
"location": {
"raw": "Singapore, Singapore, 180000, Singapore",
"city": "Singapore",
"region": "Singapore",
"country": null,
"is_remote": false,
"confidence": 0.8
},
"countries": []
},
"remote_policy": "hybrid",
"salary_period": null,
"workplace_type": "hybrid",
"salary_currency": null
}
Extensions
{}
Native Structured
{
"list_job": {
"id": "128",
"isRemote": null,
"location": {
"city": "Singapore",
"state": "Singapore"
},
"atsLocation": {
"city": null,
"state": null,
"country": null,
"province": null
},
"departmentId": "18580",
"locationType": "0",
"jobOpeningName": "DevOps Engineer",
"departmentLabel": "Menlo HQ",
"employmentStatusLabel": "Full-Time"
},
"detail_errors": [],
"detail_job_opening": {
"location": {
"city": "Singapore",
"state": "Singapore",
"postalCode": "180000",
"addressCountry": "Singapore"
},
"datePosted": "2026-04-15",
"atsLocation": {
"city": null,
"state": null,
"country": null,
"countryId": null
},
"description": "<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt; font-weight: bold\">About Menlo</span></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">Menlo Research is an Applied R&D lab building Asimov, an open-source humanoid robot platform, and the full software stack that powers it. Our mission is to make humanoid labor economically viable -- turning software into physical labor at scale. We build across the full stack: hardware architecture, locomotion, autonomy, simulation, and infrastructure. We move fast, ship to real robots, and open-source everything we can. If you want your work to matter beyond a paper or a demo, this is the place.</span></p>\n<p><br></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt; font-weight: bold\">The Role</span></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif\">As an DevOps Engineer, you will own and evolve the platform that everything at Menlo runs on -- from inference serving, to training rigs, to the agentic coding infrastructure that powers day-to-day engineering. 
You will work deep in the stack across Kubernetes, networking, and where it matters bare metal, and help set the technical direction for how Menlo Cloud scales.</span></p>\n<p><br></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt; font-weight: bold\">What You'll Do</span></p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Operate and evolve our Kubernetes platform across multiple clusters and environments (Prod, Dev, hybrid on-prem and public cloud), covering control plane operations, node lifecycle, upgrades, and autoscaling at every layer (Cluster Autoscaler, HPA, KEDA).</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Architect and manage hybrid cloud infrastructure spanning on-premises and public clouds (GCP, AWS), including workload placement, cross-cloud networking, and unified resource management.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Own the CI/CD and GitOps experience end-to-end: container build pipelines, image optimization, and progressive delivery via ArgoCD / FluxCD.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Own the observability stack as a single pane of glass across all clusters: Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus -- and help push toward agent-assisted SRE workflows.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Manage and improve our inference platform: vLLM serving and AIBrix for multi-model orchestration and autoscaling across a fleet of NVIDIA GPUs.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Operate platform services: Kafka, Redis, PostgreSQL, OpenSearch.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Manage identity and access via Keycloak integrated with Google Workspace; harden SSO, RBAC, and secrets management across the platform.</span></li>\n<li><span style=\"font-family: arial, 
helvetica, sans-serif\">Harden network security across private load balancers, firewalls, and VPC segmentation; design and maintain hub-and-spoke / multi-AZ topologies.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Support training infrastructure: self-service VM provisioning, RunPod burst capacity, Weights and Biases integration.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Drive infrastructure reliability, cost efficiency, and capacity planning as the platform scales.</span><br></li>\n</ul>\n<p><br></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt; font-weight: bold\">What We're Looking For</span></p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Kubernetes -- deep, hands-on. Strong production experience with Kubernetes, fluent in workloads and controllers, networking (Services, Ingress, CNI basics), storage (PV/PVC, CSI), RBAC, and the autoscaling story end-to-end (HPA, VPA, Cluster Autoscaler, KEDA). Cloud-managed Kubernetes (GKE, EKS, AKS) is fine; on-premises / self-managed Kubernetes (kubeadm, Cluster API, k3s, etc.) is a strong plus.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Networking -- design-level, not just operator-level. You have designed real network topologies at some point in your career -- hub-and-spoke, multi-AZ / multi-VPC, or an equivalent enterprise pattern -- and can defend the tradeoffs. Comfortable with VPCs, firewalls, load balancers, private cluster architecture, DNS, and routing. On-premises networking experience (VLANs, BGP, L2/L3 fabrics, pfSense / Fortinet / Palo Alto / Cisco) is a strong plus.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">CI/CD and Docker -- concepts over tooling. You can build and optimize Dockerfiles (multi-stage builds, layer caching, small/secure base images) and have owned full CI/CD pipelines end-to-end. 
Tooling is flexible -- GitHub Actions, GitLab CI, Azure Pipelines, Jenkins, Argo Workflows, etc. -- but you should be able to clearly articulate the full lifecycle of a typical pipeline, and explain how CI/CD changes when the deployment target is Kubernetes (ArgoCD / FluxCD, GitOps patterns, progressive delivery).</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Observability -- you have built this before. You have stood up a full observability stack from scratch and operated it in production -- metrics, logs, traces, alerting, on-call. Familiarity with the Grafana stack (Grafana, Mimir, Tempo, Loki, Pyroscope, OnCall, Prometheus) is a strong plus. Bonus points if you have experimented with agent-assisted SRE workflows or LLM-driven incident triage.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">SSO and identity. When you bring a new tool into the platform, your instinct is to wire it into a central IdP rather than leave it on local accounts. Comfortable with OpenID Connect, SAML, and traditional directory services (LDAP / Active Directory), and you have integrated tools with an IdP like Keycloak, Okta, Azure AD, or equivalent.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Linux and automation fundamentals. Strong Linux proficiency (RHEL/Ubuntu or equivalent) including basic performance and networking debugging. Comfort with infrastructure-as-code (Terraform / Terragrunt / Pulumi or equivalent) and configuration management.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Ownership mindset. 
Comfortable operating in a high-ownership environment where you make architecture decisions, push them to production, and own the outcomes.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Optional but valuable: hands-on experience operating any of Kafka, Redis, PostgreSQL, OpenSearch -- at production scale, including HA, backup/restore, and upgrade planning.</span><br></li>\n</ul>\n<p><br><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt; font-weight: bold\">Bonus points for:</span><br></p>\n<ul>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Experience with OpenStack in production: Nova, Neutron, Cinder, Trove, Horizon, and CLI administration.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Experience with KVM virtualization and storage backends like Ceph or Rook-Ceph on Kubernetes.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Familiarity with vLLM internals: PagedAttention, continuous batching, tensor parallelism.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Background in AI/ML infrastructure or GPU cluster operations at scale.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Experience with KEDA or event-driven autoscaling patterns in anger.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Prior open-source contributions to Kubernetes, OpenStack, or adjacent projects.</span></li>\n<li><span style=\"font-family: arial, helvetica, sans-serif\">Kernel-level Linux debugging and performance tuning.</span><br></li>\n</ul>\n<p><br></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt; font-weight: bold\">Why Join Menlo?</span></p>\n<p><span style=\"font-family: arial, helvetica, sans-serif; font-size: 12pt\">Most infrastructure teams manage someone else's cloud. At Menlo, you own the metal. 
Menlo Cloud is a first-class investment built from the ground up, and it sits at the center of everything we do, from coding agents to humanoid robots. You will have genuine ownership over a platform that is technically ambitious, cost-conscious by design, and critical to the mission. If you want to build infrastructure that actually matters and have the autonomy to do it right, this is the place.</span></p>",
"compensation": null,
"departmentId": "18580",
"locationType": "0",
"seekPromoted": false,
"jobCategoryId": null,
"jobOpeningName": "DevOps Engineer",
"departmentLabel": "Menlo HQ",
"jobOpeningStatus": "Open",
"minimumExperience": "Experienced",
"jobOpeningShareUrl": "https://menlo.bamboohr.com/careers/128",
"employmentStatusLabel": "Full-Time"
}
}
Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/86da6f104853f01842e3b60c2bf85c99a8b0f3a1?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/6016aaf4-50e1-4e4b-831c-a615ce20aa74
GET https://api.bluedoor.sh/job-postings/v1/sources/0de7fe14-773e-46a7-bf7b-e1b137c04318
GET https://api.bluedoor.sh/job-postings/v1/jobs/86da6f104853f01842e3b60c2bf85c99a8b0f3a1/events
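The four requests above could be assembled and issued from a script along these lines. This is a minimal sketch, not an official client: the helper names `build_request_urls` and `fetch_json` are hypothetical, the `Accept` header is an assumption, and the page does not say whether the API needs an API key or other authentication, so the example only prints the URLs and leaves the actual fetch to the reader.

```python
import json
import urllib.request

# Base path taken from the request lines on this page.
BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def build_request_urls(job_id: str, org_id: str, source_id: str) -> dict[str, str]:
    """Return the four GET URLs listed on this page, keyed by record type."""
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?include=description",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(url: str, timeout: float = 10.0):
    """Issue a GET and decode the JSON body.

    Assumption: the endpoint answers unauthenticated requests with JSON.
    If the API requires a key, add the matching header here.
    """
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    urls = build_request_urls(
        job_id="86da6f104853f01842e3b60c2bf85c99a8b0f3a1",
        org_id="6016aaf4-50e1-4e4b-831c-a615ce20aa74",
        source_id="0de7fe14-773e-46a7-bf7b-e1b137c04318",
    )
    # Only print the URLs; call fetch_json(url) once access requirements are known.
    for name, url in urls.items():
        print(name, url)
```

The IDs passed in are the Job ID, Org ID, and Source ID from the full job record; swapping them for another posting's IDs should yield the equivalent requests for that posting.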