Home › Companies › Autonomous Teaming › Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)

Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)

Autonomous Teaming · Munich (DEU) · Active · Personio

Job facts

Field	Value
Company	Autonomous Teaming
Title	Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)
Normalized title	-
Department / team	Engineering & Tech / Recruitingprozess ATS - Tech
Location	Munich (DEU)
Work model	-
Employment type	Full Time
Salary	-
Status	active
ATS provider	Personio
Posted / first seen	2026-04-08 / 2026-05-30
Changed / last seen	2026-05-30 / 2026-06-06

Related slices

Page	What it contains	Open
Company jobs	Active postings from Autonomous Teaming.	Open
Company breakdowns	Role, location, ATS, and work model facets for this company.	Open
ATS provider jobs	Active postings observed through Personio.	Open
Provider filtered search	The same provider as a filtered job collection.	Open
Department jobs	Active postings in Engineering & Tech.	Open
Lifecycle events	Open, update, close, and reopen events for this posting.	Open
Original posting	Canonical source or apply URL captured from the ATS.	Open

Linked records

Company	Autonomous Teaming
Source	6f006c31-6a96-4a1c-8a79-ff5caa669177
ATS provider	Personio

Description

What we offer Opportunity to work on a new solution from scratch in a technical complex environment Work in an international, agile, cross-functional team creating the future of autonomous systems Grow your career in a expanding and ambitious engineering team Build innovative products using state-of-the-art technologies in AI, robotics, and autonomy Benefit from a steep learning curve and continuous development Enjoy team events and a strong, collaborative culture Your mission Build real autonomous systems that operate in the real world, not in the lab. Join our engineering team of a new product and help build the core autonomy that powers our next generation robotic systems used for defense and mission-critical operations. You will design, implement, and harden robotic software that must perform under real operational conditions - outdoors, under uncertainty, with real consequences. Your work will directly shape the reliability, safety, and tactical capability of the systems we deliver. Research and prototype novel RL algorithms (e.g. exploration, POMDPs, multi-agent systems) Define, design and implement use-cases for DRL on edge devices Translate theory into scalable systems with support from our engineering teams Collaborate with simulation, autonomy and AI infrastructure teams Develop decision-making for intelligent behavior and architectures Your profile Deep knowledge of RL theory and practice: policy gradients, value iteration, Q-learning, etc. Experience with ML training in physics based simulation (Gazebo, IsaacSim, Mujoco, Carla, etc.). Strong Programming proficiency (Python, C/C++). Comfortable with ML tooling and maintaining ML pipelines (Pytorch Lightning, MlFlow, etc.). Have experience with deploying ML methods to physical devices. Experience with version control (git). Familiarity with statistics, evaluation methods and experiment design. You think rigorously and build practically. Nice to have PhD in Reinforcement Learning, Robot Engineering or equivalent with experience in deploying developed methods to real robots. OR masters degree in relevant field with extensive experience in RL. Experience with sensor based end-to-end ML architectures. Familiar with Transformers, Attention, Graphs, VLAs and other modern day ML building blocks. Publications at NeurIPS, ICLR, ICML, ICRA, IROS, etc. are a plus Experience with robotics middleware (ISAAC, ROS/ROS2, etc.) Why us? Willingness to travel Citizenship of NATO member country or closed allied are mandatory

Full job record

Job ID	1a570640f10a72a4c18fb2635e2fc5c034f4aa96
Org ID	623c37ab-5724-4c7a-8195-6a809e0f1b8b
Source ID	6f006c31-6a96-4a1c-8a79-ff5caa669177
Board ID	6f006c31-6a96-4a1c-8a79-ff5caa669177
Provider	personio
Provider Job Key	2594292
Title	Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)
Normalized Title	—
Status	active
Active	yes
Location Text	Munich (DEU)
Department	Engineering & Tech
Team	Recruitingprozess ATS - Tech
Employment Type	full_time
Workplace Type	—
Remote Policy	—
Country	Munich (DEU)
Region	—
City	—
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://autonomous-teaming.jobs.personio.de/job/2594292?language=en
Apply URL	https://autonomous-teaming.jobs.personio.de/job/2594292?language=en
First Seen At	2026-05-30 05:52:53Z
Last Seen At	2026-06-06 07:53:15Z
Last Checked At	2026-06-06 07:53:15Z
Last Changed At	2026-05-30 05:52:53Z
Inactive At	—
Source Posted At	2026-04-08 09:50:43Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=personio/board=autonomous-teaming.de/date=2026-06-06/2026-06-06T07-53-15-195Z-17ed904b677b8771b12fae73531035d5a79ddc9ecd46205605b63d0aa1694412.json

Event Fields

{
  "content_hash": "a5cc94af1ce7db23b1c96e46029a7053c76fa8fb0187956496deb9233fa531f2",
  "source_hash": "a974178670d19c2e268ff30972f4665f1345885107f21fabd2c58ac35694cc9d",
  "last_changed_at": "2026-05-30T05:52:53.220Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Munich (DEU)",
    "city": null,
    "region": null,
    "country": "Munich (DEU)",
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T07:53:15.906Z",
  "launch_scope": {
    "reason": "personio_production_catalog",
    "included": true,
    "location": {
      "raw": "Munich (DEU)",
      "city": null,
      "region": null,
      "country": "Munich (DEU)",
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": [
      "Munich (DEU)"
    ]
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": null,
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "id": "2594292",
  "name": "Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)",
  "office": "Munich (DEU)",
  "keywords": [],
  "schedule": "full-time",
  "createdAt": "2026-04-08T09:50:43+00:00",
  "seniority": "experienced",
  "department": "Engineering & Tech",
  "occupation": "software_and_system_architecture",
  "subcompany": "Autonomous Teaming Solutions ATS GmbH",
  "employmentType": "permanent",
  "jobDescriptions": [
    {
      "name": "What we offer",
      "value": "<ul style=\"border:0px solid;color:rgb(32,32,32);font-family:Inter, '-apple-system', BlinkMacSystemFont, 'Segoe UI', Roboto, 'Helvetica Neue', 'Open Sans', 'system-ui', '-apple-system', 'Segoe UI', Roboto, Ubuntu, Cantarell, 'Noto Sans', sans-serif, 'Apple Color Emoji', 'Segoe UI Emoji';font-size:14px;font-style:normal;font-weight:400;text-transform:none;background-color:rgb(255,255,255);\"><li style=\"border:0px solid;margin:0px;\">Opportunity to work on a new solution from scratch in a technical complex environment</li><li style=\"border:0px solid;margin:0px;\">Work in an international, agile, cross-functional team creating the future of autonomous systems</li><li style=\"border:0px solid;margin:0px;\">Grow your career in a expanding and ambitious engineering team</li><li style=\"border:0px solid;margin:0px;\">Build innovative products using state-of-the-art technologies in AI, robotics, and autonomy </li><li style=\"border:0px solid;margin:0px;\">Benefit from a steep learning curve and continuous development</li><li style=\"border:0px solid;margin:0px;\">Enjoy team events and a strong, collaborative culture</li></ul>"
    },
    {
      "name": "Your mission",
      "value": "<div style=\"border:0px solid;margin:0px;color:rgb(32,32,32);font-family:Inter, '-apple-system', BlinkMacSystemFont, 'Segoe UI', Roboto, 'Helvetica Neue', 'Open Sans', 'system-ui', '-apple-system', 'Segoe UI', Roboto, Ubuntu, Cantarell, 'Noto Sans', sans-serif, 'Apple Color Emoji', 'Segoe UI Emoji';font-size:14px;font-style:normal;font-weight:400;text-transform:none;background-color:rgb(255,255,255);\">Build real autonomous systems that operate in the real world, not in the lab. </div><br><div style=\"border:0px solid;margin:0px;color:rgb(32,32,32);font-family:Inter, '-apple-system', BlinkMacSystemFont, 'Segoe UI', Roboto, 'Helvetica Neue', 'Open Sans', 'system-ui', '-apple-system', 'Segoe UI', Roboto, Ubuntu, Cantarell, 'Noto Sans', sans-serif, 'Apple Color Emoji', 'Segoe UI Emoji';font-size:14px;font-style:normal;font-weight:400;text-transform:none;background-color:rgb(255,255,255);\">Join our engineering team of a new product and help build the core autonomy that powers our next generation robotic systems used for defense and mission-critical operations. You will design, implement, and harden robotic software that must perform under real operational conditions - outdoors, under uncertainty, with real consequences. Your work will directly shape the reliability, safety, and tactical capability of the systems we deliver. </div><div style=\"border:0px solid;margin:0px;color:rgb(32,32,32);font-family:Inter, '-apple-system', BlinkMacSystemFont, 'Segoe UI', Roboto, 'Helvetica Neue', 'Open Sans', 'system-ui', '-apple-system', 'Segoe UI', Roboto, Ubuntu, Cantarell, 'Noto Sans', sans-serif, 'Apple Color Emoji', 'Segoe UI Emoji';font-size:14px;font-style:normal;font-weight:400;text-transform:none;background-color:rgb(255,255,255);\"> </div><ul><li><span style=\"font-size:inherit;\">Research and prototype novel RL algorithms (e.g. exploration, POMDPs, multi-agent systems) </span></li><li><span style=\"font-size:inherit;\">Define, design and implement use-cases for DRL on edge devices </span></li><li><span style=\"font-size:inherit;\">Translate theory into scalable systems with support from our engineering teams </span></li><li><span style=\"font-size:inherit;\">Collaborate with simulation, autonomy and AI infrastructure teams </span></li><li><span style=\"font-size:inherit;\">Develop decision-making for intelligent behavior and architectures </span></li></ul>"
    },
    {
      "name": "Your profile",
      "value": "<ul><li><span style=\"font-size:inherit;\">Deep knowledge of RL theory and practice: policy gradients, value iteration, Q-learning, etc. </span></li><li><span style=\"font-size:inherit;\">Experience with ML training in physics based simulation (Gazebo, IsaacSim, Mujoco, Carla, etc.).</span></li><li><span style=\"font-size:inherit;\">Strong Programming proficiency (Python, C/C++).</span></li><li><span style=\"font-size:inherit;\">Comfortable with ML tooling and maintaining ML pipelines (Pytorch Lightning, MlFlow, etc.).</span></li><li><span style=\"font-size:inherit;\">Have experience with deploying ML methods to physical devices.</span></li><li><span style=\"font-size:inherit;\">Experience with version control (git).</span></li><li><span style=\"font-size:inherit;\">Familiarity with statistics, evaluation methods and experiment design.</span></li><li><span style=\"font-size:inherit;\">You think rigorously and build practically.</span></li></ul>"
    },
    {
      "name": "Nice to have",
      "value": "<ul><li><span style=\"font-size:inherit;\">PhD in Reinforcement Learning, Robot Engineering or equivalent with experience in deploying developed methods to real robots.</span></li><li><span style=\"font-size:inherit;\">OR masters degree in relevant field with extensive experience in RL.</span></li><li><span style=\"font-size:inherit;\">Experience with sensor based end-to-end ML architectures.</span></li><li><span style=\"font-size:inherit;\">Familiar with Transformers, Attention, Graphs, VLAs and other modern day ML building blocks.</span></li><li><span style=\"font-size:inherit;\">Publications at NeurIPS, ICLR, ICML, ICRA, IROS, etc. are a plus </span></li><li><span style=\"font-size:inherit;\"><span>Experience with robotics middleware (ISAAC, ROS/ROS2, etc.)</span></span></li></ul>"
    },
    {
      "name": "Why us?",
      "value": "<ul style=\"border:0px solid;color:rgb(32,32,32);font-family:Inter, '-apple-system', BlinkMacSystemFont, 'Segoe UI', Roboto, 'Helvetica Neue', 'Open Sans', 'system-ui', '-apple-system', 'Segoe UI', Roboto, Ubuntu, Cantarell, 'Noto Sans', sans-serif, 'Apple Color Emoji', 'Segoe UI Emoji';font-size:14px;font-style:normal;font-weight:400;text-transform:none;background-color:rgb(255,255,255);\"><li style=\"border:0px solid;margin:0px;\">Willingness to travel</li><li style=\"border:0px solid;margin:0px;\">Citizenship of NATO member country or closed allied are mandatory</li></ul>"
    }
  ],
  "occupationCategory": "it_software",
  "recruitingCategory": "Recruitingprozess ATS - Tech"
}

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/1a570640f10a72a4c18fb2635e2fc5c034f4aa96?include=descriptionJSON

GET https://api.bluedoor.sh/job-postings/v1/orgs/623c37ab-5724-4c7a-8195-6a809e0f1b8bJSON

GET https://api.bluedoor.sh/job-postings/v1/sources/6f006c31-6a96-4a1c-8a79-ff5caa669177JSON

GET https://api.bluedoor.sh/job-postings/v1/jobs/1a570640f10a72a4c18fb2635e2fc5c034f4aa96/eventsJSON

Docs · Get an API key