Home › Companies › Bot Auto › Senior ML/RL Engineer, Behavior Planning
Senior ML/RL Engineer, Behavior Planning
Bot Auto · Houston, TX or San Francisco Bay Area · Active · Greenhouse
Job facts
| Field | Value |
|---|---|
| Company | Bot Auto |
| Title | Senior ML/RL Engineer, Behavior Planning |
| Normalized title | - |
| Department / team | Algorithm |
| Location | Houston, TX, United States |
| Work model | - |
| Employment type | - |
| Salary | - |
| Status | active |
| ATS provider | Greenhouse |
| Posted / first seen | 2026-05-27 / 2026-05-29 |
| Changed / last seen | 2026-05-29 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Bot Auto. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Greenhouse. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Houston. | Open |
| Department jobs | Active postings in Algorithm . | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Bot Auto |
| Source | eab57010-dfaf-44bf-aaa8-d29d627f848a |
| ATS provider | Greenhouse |
Description
Company Introduction
At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a startup and the wisdom of seasoned experts, our team has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create groundbreaking solutions that propel the future of transportation. Join us and transform your ideas into reality.
Role Overview
We are seeking a Senior ML/RL Engineer to join our Algo team and drive the development of our unified behavioral architecture. In this role, you will help bridge the gap between simulation and the real world by developing a scalable policy framework that represents both our L4 ego-policy and a diverse population of simulated agents. You will work at the intersection of Multi-Agent Reinforcement Learning (MARL) and safety-critical system design to ensure our autonomous semi-trucks navigate highways with superhuman safety and precision.
Key Responsibilities
Behavioral Modeling: Develop and train diverse, conditioned policies that simulate realistic driving behaviors to stress-test and validate our autonomous driving stack.
Safety-Constrained Learning: Lead the research and implementation of advanced RL algorithms to ensure safety metrics are treated as primary constraints in the learning process.
Reward & Objective Design: Collaborate with cross-functional teams to design robust reward functions and evaluation metrics that balance safety, progress, and comfort.
Scalable Training Pipelines: Contribute to the optimization of our large-scale, high-throughput training environments to enable rapid iteration on complex multi-agent scenarios.
Model Architecture: Advance our state-of-the-art neural architectures to improve spatial reasoning, long-horizon planning, and interaction modeling.
Cross-Team Collaboration: Work closely with Simulation and Planning teams to integrate research-grade models into production-quality, safety-critical software.
Required Qualifications
Professional RL Experience: Proven track record of training and deploying deep RL algorithms (e.g., PPO, SAC) for complex, real-world robotic or autonomous systems.
Technical Mastery: Expertise in Python and PyTorch ; strong understanding of modern deep learning architectures and optimization techniques.
Academic Background: MS or PhD in Computer Science, Robotics, or a related quantitative field.
Scientific Intuition: Ability to diagnose and solve fundamental challenges in RL training, such as variance management and distribution shift.
Preferred Qualifications
Safe RL Specialization: Experience with constrained optimization or safety-critical learning frameworks.
Multi-Agent Systems: Background in MARL training stability, including self-play and decentralized execution strategies.
Autonomous Driving Domain: Familiarity with vehicle dynamics and behavior planning, particularly for long-haul highway environments.
Additional Information
Compensation: Competitive salary based on experience, with opportunities for performance bonuses and equity.
Benefits: Comprehensive health insurance, paid time off, and the opportunity to work at the forefront of the autonomous trucking industry.
Why Bot Auto?
We are a small, hyper-focused team on a mission to beat human cost-per-mile through technology. We recently successfully completed the industry’s first fully humanless commercial truckload, proving that our vision is a reality. If you are passionate about AI, safety, and transforming logistics, we want to hear from you.
Full job record
| Job ID | 1dd588179c998d4a683a40efcebff2bcaf374a9c |
| Org ID | c4a9840f-06e1-4766-a812-7620f723b497 |
| Source ID | eab57010-dfaf-44bf-aaa8-d29d627f848a |
| Board ID | eab57010-dfaf-44bf-aaa8-d29d627f848a |
| Provider | greenhouse |
| Provider Job Key | 5233238008 |
| Title | Senior ML/RL Engineer, Behavior Planning |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Houston, TX or San Francisco Bay Area |
| Department | Algorithm |
| Team | — |
| Employment Type | — |
| Workplace Type | — |
| Remote Policy | — |
| Country | United States |
| Region | TX |
| City | Houston |
| Salary Raw | — |
| Salary Min | — |
| Salary Max | — |
| Salary Currency | — |
| Salary Period | — |
| Source URL | https://job-boards.greenhouse.io/botauto/jobs/5233238008 |
| Apply URL | https://job-boards.greenhouse.io/botauto/jobs/5233238008 |
| First Seen At | 2026-05-29 22:42:24Z |
| Last Seen At | 2026-06-06 07:35:26Z |
| Last Checked At | 2026-06-06 07:35:26Z |
| Last Changed At | 2026-05-29 22:42:24Z |
| Inactive At | — |
| Source Posted At | 2026-05-27 20:25:11Z |
| Source Updated At | 2026-05-27 20:25:11Z |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=botauto/date=2026-06-06/2026-06-06T07-35-26-284Z-ccb05dbb347ba614b293f5d1ba32db0e1a25bcf04344f90f2b14e404a853d04b.json |
Event Fields
{
"content_hash": "b545948c7f474fdcec301b57b02914ea528a02bbade82c0c3b36cb290240a81f",
"source_hash": "3a8b1e2dc88036c4ab14ea385d294841a1a9b0ea78cac97975d8fdcca4fc006d",
"last_changed_at": "2026-05-29T22:42:24.026Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "Houston, TX",
"city": "Houston",
"region": "TX",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"salary_max": null,
"salary_min": null,
"inferred_at": "2026-06-06T07:35:26.409Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Houston, TX",
"city": "Houston",
"region": "TX",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"countries": [
"United States"
]
},
"remote_policy": null,
"salary_period": null,
"workplace_type": null,
"salary_currency": null
}Extensions
{}Native Structured
{
"title": "Senior ML/RL Engineer, Behavior Planning",
"offices": [
{
"id": 4021863008,
"name": "Houston Office",
"location": "Houston, Texas, United States",
"child_ids": [],
"parent_id": null
}
],
"language": "en",
"location": {
"name": "Houston, TX or San Francisco Bay Area"
},
"metadata": [],
"updated_at": "2026-05-27T16:25:11-04:00",
"departments": [
{
"id": 4025660008,
"name": "Algorithm ",
"child_ids": [],
"parent_id": null
}
],
"company_name": "Bot Auto",
"requisition_id": 4481007008,
"first_published": "2026-05-27T16:25:11-04:00",
"application_deadline": null
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/1dd588179c998d4a683a40efcebff2bcaf374a9c?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/c4a9840f-06e1-4766-a812-7620f723b497JSONGET https://api.bluedoor.sh/job-postings/v1/sources/eab57010-dfaf-44bf-aaa8-d29d627f848aJSONGET https://api.bluedoor.sh/job-postings/v1/jobs/1dd588179c998d4a683a40efcebff2bcaf374a9c/eventsJSON