Senior Data Scientist, Reinforcement Learning

Resaroai · Munich, Bavaria, 80333, Germany · On Site · Active · BambooHR

Job facts

Field	Value
Company	Resaroai
Title	Senior Data Scientist, Reinforcement Learning
Normalized title	-
Department / team	Embedded Team - DE
Location	Munich, Bavaria
Work model	On Site
Employment type	Full Time
Salary	-
Status	active
ATS provider	BambooHR
Posted / first seen	2026-03-01 / 2026-05-30
Changed / last seen	2026-05-30 / 2026-06-06

Linked records

Company	Resaroai
Source	73c75124-6f1a-4720-ab36-38de6a06689e
ATS provider	BambooHR

Description

Resaro builds advanced AI testing software to help organizations verify, validate, and trust their most critical AI systems, from computer vision to generative AI and autonomous systems. Our mission is to ensure that AI technologies deployed in real-world, high-stakes environments are robust, explainable, and secure.

We work closely with our customers through embedded delivery teams who operate on-site or in close collaboration. These teams tailor solutions to specific mission needs, helping organizations, especially in the public safety and national security sectors, evaluate and improve the performance of their AI-enabled systems.

About the Role:

As the Senior Expert Reinforcement Learning, you will be the primary architect of our AI Test, Evaluation, Verification, and Validation (TEVV) product suite for reinforcement learning systems. You will lead the development of next-generation AI testing and assurance frameworks with applications in Autonomous Driving and Robotics. Your mission is to scale our capabilities in Reinforcement Learning to ensure autonomous agents are safe, robust, and explainable in the field.

Key Responsibilities

- Independently implement Resaro’s RL validation prototype to expose agent instability and vulnerability in a mission-critical and complex environment.
- Scale, lead and mentor a global, cross-functional, high-performing team of AI researchers and engineers, drawing on experience steering organizations of 30+ experts.
- Define the long-term vision and technical roadmap for RL TEVV, focusing on validating RL algorithms and learned policies in complex environments with mission-critical applications across system control, autonomous vehicles, and robotics.
- Advance methods for learning probabilistic reward functions from human feedback (RLHF) to align AI behavior with mission goals.
- Partner with Product Management to translate product vision, customer problems, and market opportunities into end-to-end solution architecture and technical roadmaps that support a product-led growth strategy.

Must-Have Skills and Experience

- Master / Ph.D. in Robot Reinforcement Learning or a closely related field.
  - Proven track record in developing and implementing novel RL and ML algorithms, e.g. research or commercial implementation.
  - Demonstrated deep theoretical understanding of and practical experience with the RL framework, including bandit setting, (in-)finite horizon setting, on- and off-policy RL, and trust-region RL approaches.
- Experience in Bayesian Machine Learning and probabilistic models.
- Understanding of AI/ML/RL lifecycle and the state-of-the-art approaches and limitations of testing and validating complex use cases.
- Strong skills in requirements gathering, stakeholder communication, and solution scoping.

Nice-to-Have

- Experience with fully differentiable deep learning for highly unstable systems.
- Experience with Active Learning and RLHF.
- Background in model compression and pruning for deploying large RL models onto edge devices.
- Hands-on experience with Bayesian Meta-Learning to reduce training time and absolute error in complex models.
- A strong portfolio of innovation, including multiple successful paper submissions at conferences like NeurIPS, ICML, ICLR, IROS, ICRA, CoRL, and a deep patent history (e.g., 17+ patents).
- Experience spearheading global AI initiatives and delivering AI solutions for both B2G (Unmanned Systems) and B2B (IoT) sectors.
- Demonstrated success in leading cross-functional teams to deliver technical solutions.
- Knowledge of deployment constraints in high-security or classified environments.
- Prior exposure or experience with directly engaging senior stakeholders from Director to C-suite level.
- Prior security clearance at Government CONFIDENTIAL and above.

Why Join Resaro

- Work on mission-critical AI systems in defence, aerospace, and public safety.
- Help define the future of AI testing and assurance in real-world environments.
- Collaborate with a tight-knit, expert team working at the intersection of AI, systems engineering, and policy.
- Shape product direction while being close to the operational reality of AI deployments.

Resaro is an Equal Opportunity Employer. We respect each individual and support the diverse cultures, perspectives, skills and experiences within our teams.

Full job record

Job ID	5431e49cef707d6fe37ac77ef017acef2fa196c7
Org ID	f0b57942-c9a0-4c9c-b8a7-6f665516be58
Source ID	73c75124-6f1a-4720-ab36-38de6a06689e
Board ID	73c75124-6f1a-4720-ab36-38de6a06689e
Provider	bamboohr
Provider Job Key	90
Title	Senior Data Scientist, Reinforcement Learning
Normalized Title	—
Status	active
Active	yes
Location Text	Munich, Bavaria, 80333, Germany
Department	Embedded Team - DE
Team	—
Employment Type	full_time
Workplace Type	on_site
Remote Policy	—
Country	—
Region	Bavaria
City	Munich
Salary Raw	—
Salary Min	—
Salary Max	—
Salary Currency	—
Salary Period	—
Source URL	https://resaroai.bamboohr.com/careers/90
Apply URL	https://resaroai.bamboohr.com/careers/90
First Seen At	2026-05-30 05:52:50Z
Last Seen At	2026-06-06 10:19:42Z
Last Checked At	2026-06-06 10:19:42Z
Last Changed At	2026-05-30 05:52:50Z
Inactive At	—
Source Posted At	2026-03-01 00:00:00Z
Source Updated At	—
Raw Payload Uri	s3://job-postings-prod-raw-590183727216/raw/provider=bamboohr/board=resaroai/date=2026-06-06/2026-06-06T10-19-41-202Z-3847ba5b4a09c2248e9b5614ae7849714b0ebe68070ca10e436d186b652b1137.json

Event Fields

{
  "content_hash": "44d44729896f5565eb14099b2c2cd2f6000694a075d1a29eb943ec8ddb3626de",
  "source_hash": "6b7d04e94bd6248e8c3e91471df1a45471bda2f3b311626008a7babe3fddf134",
  "last_changed_at": "2026-05-30T05:52:50.295Z",
  "active_status": "active"
}

Parsed Structured

{
  "language": "en",
  "location": {
    "raw": "Munich, Bavaria, 80333, Germany",
    "city": "Munich",
    "region": "Bavaria",
    "country": null,
    "is_remote": false,
    "confidence": 0.8
  },
  "salary_max": null,
  "salary_min": null,
  "inferred_at": "2026-06-06T10:19:42.229Z",
  "launch_scope": {
    "reason": "bamboohr_production_catalog",
    "included": true,
    "location": {
      "raw": "Munich, Bavaria, 80333, Germany",
      "city": "Munich",
      "region": "Bavaria",
      "country": null,
      "is_remote": false,
      "confidence": 0.8
    },
    "countries": []
  },
  "remote_policy": null,
  "salary_period": null,
  "workplace_type": "on_site",
  "salary_currency": null
}

Extensions

{}

Native Structured

{
  "list_job": {
    "id": "90",
    "isRemote": null,
    "location": {
      "city": "Munich",
      "state": "Bavaria"
    },
    "atsLocation": {
      "city": null,
      "state": null,
      "country": null,
      "province": null
    },
    "departmentId": "18605",
    "locationType": "2",
    "jobOpeningName": "Senior Data Scientist, Reinforcement Learning ",
    "departmentLabel": "Embedded Team - DE",
    "employmentStatusLabel": "Full-Time"
  },
  "detail_errors": [],
  "detail_job_opening": {
    "location": {
      "city": "Munich",
      "state": "Bavaria",
      "postalCode": "80333",
      "addressCountry": "Germany"
    },
    "datePosted": "2026-03-01",
    "atsLocation": {
      "city": null,
      "state": null,
      "country": null,
      "countryId": null
    },
    "description": "<p><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Resaro</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> builds advanced </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">AI testing software</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> to help organizations verify, validate, and trust their most critical AI systems — from </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">computer vision</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> to </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">generative AI</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> and </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">autonomous systems</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">. Our mission is to ensure that AI technologies deployed in real-world, high-stakes environments are robust, explainable, and secure.</span></p>\n<p><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">We work closely with our customers through </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">embedded delivery teams</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> who operate on-site or in close collaboration. 
These teams tailor solutions to specific mission needs, helping organizations — especially in the </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">public safety and national security sectors</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> — evaluate and improve the performance of their AI-enabled systems.</span></p>\n<p><br></p>\n<p><span style=\"color: rgb(67, 67, 67); font-family: Arial, sans-serif; font-size: 12pt; font-weight: bold\">About the Role:</span></p>\n<p><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">As the </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Senior Expert Reinforcement Learning</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">, you will be the primary architect of our AI </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Test, Evaluation, Verification, and Validation (TEVV)</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\"> product suite for reinforcement learning systems. You will lead the development of next-generation AI testing and assurance frameworks with applications in </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Autonomous Driving</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\"> and </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Robotics</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">. 
Your mission is to scale our capabilities in </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Reinforcement Learning</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">, </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">to ensure autonomous agents are safe, robust, and explainable in the field.</span></p>\n<p><br></p>\n<p><span style=\"color: rgb(67, 67, 67); font-family: Arial, sans-serif; font-size: 12pt; font-weight: bold\">Key Responsibilities</span></p>\n<ul>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Independently implement Resaro’s RL validation prototype to expose agent instability and vulnerability in a mission-critical and complex environment.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Scale, lead and mentor a global, cross-functional, high-performing team of AI researchers and engineers, drawing on experience steering organizations of 30+ experts.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Define the long-term vision and technical roadmap for RL TEVV, focusing on validating RL algorithms and learned policies in complex environments with mission-critical applications across </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">system control</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">, </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">autonomous vehicles, and robotics</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 
10pt\">Advance methods for learning probabilistic reward functions from human feedback (RLHF) to align AI behavior with mission goals.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Partner with Product Management to translate product vision, customer problems, and market opportunities into end‑to‑end solution architecture and technical roadmaps that support a product-led growth strategy.</span></li>\n</ul>\n<p><br></p>\n<p><span style=\"color: rgb(67, 67, 67); font-family: Arial, sans-serif; font-size: 12pt; font-weight: bold\">Must-Have Skills and Experience</span></p>\n<ul>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Master / Ph.D. in Robot Reinforcement Learning or a closely related field.</span>\n<ul>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Proven track record in developing and implementing novel RL and ML algorithms, e.g. research or commercial implementation.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Demonstrated deep theoretical understanding of and practical experience with the RL framework, including bandit setting, (in-)finite horizon setting, on- and off-policy RL, and trust-region RL approaches.</span></li>\n</ul>\n</li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Experience in Bayesian Machine Learning and probabilistic models.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Understanding of AI/ML/RL lifecycle and the state-of-the-art approaches and limitations  of testing and validating complex use cases.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Strong skills in requirements gathering, stakeholder communication, and solution scoping.</span></li>\n</ul>\n<p><br></p>\n<p><span style=\"color: rgb(67, 67, 67); font-family: Arial, 
sans-serif; font-size: 12pt; font-weight: bold\">Nice-to-Have</span></p>\n<ul>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Experience with </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">fully differentiable deep learning</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\"> for highly unstable systems.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Experience with Active Learning and RLHF.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Background in </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">model compression and pruning</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\"> for deploying large RL models onto edge devices.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">Hands-on experience with </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">Bayesian Meta-Learning</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\"> to reduce training time and absolute error in complex models.</span></li>\n<li><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\">A strong portfolio of innovation, including multiple successful </span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">paper submissions</span><span style=\"color: rgb(31, 31, 31); font-family: Arial, sans-serif; font-size: 10pt\"> at conferences like NeurIPS, ICML, ICLR, IROS, ICRA, CoRL, and a deep patent history (e.g., 17+ patents).</span></li>\n<li><span style=\"color: rgb(31, 31, 31); 
font-family: Arial, sans-serif; font-size: 10pt\">Experience spearheading global AI initiatives and delivering AI solutions for both B2G (Unmanned Systems) and B2B (IoT) sectors.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Demonstrated success in </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">leading cross-functional teams</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> to deliver technical solutions.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Knowledge of </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">deployment constraints</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> in high-security or classified environments.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Prior exposure or experience with directly engaging senior stakeholders from Director to C-suite level.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Prior security clearance at Government CONFIDENTIAL and above.</span><br></li>\n</ul>\n<p><br></p>\n<p><span style=\"font-family: Arial, sans-serif; font-size: 12pt; font-weight: bold\">Why Join Resaro</span></p>\n<ul>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Work on </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">mission-critical AI systems</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> in defence, aerospace, and public safety.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Help define the future of </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">AI testing and assurance</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> in real-world environments.</span></li>\n<li><span style=\"font-family: Arial, 
sans-serif; font-size: 10pt\">Collaborate with a </span><span style=\"font-family: Arial, sans-serif; font-size: 10pt; font-weight: bold\">tight-knit, expert team</span><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"> working at the intersection of AI, systems engineering, and policy.</span></li>\n<li><span style=\"font-family: Arial, sans-serif; font-size: 10pt\">Shape product direction while being close to the operational reality of AI deployments.</span><br></li>\n</ul>\n<p><br></p>\n<p><span style=\"font-family: Arial, sans-serif; font-size: 10pt\"><span style=\"font-family: Arial, sans-serif\">Resaro is an Equal Opportunity Employer. We respect each individual and support the diverse cultures, perspectives, skills and experiences within our teams.</span></span></p>",
    "compensation": null,
    "departmentId": "18605",
    "locationType": "2",
    "seekPromoted": false,
    "jobCategoryId": null,
    "jobOpeningName": "Senior Data Scientist, Reinforcement Learning ",
    "departmentLabel": "Embedded Team - DE",
    "jobOpeningStatus": "Open",
    "minimumExperience": "Senior Manager/Supervisor",
    "jobOpeningShareUrl": "https://resaroai.bamboohr.com/careers/90",
    "employmentStatusLabel": "Full-Time"
  }
}
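
In this record the parsed location has "country": null, while the native BambooHR detail payload carries "addressCountry": "Germany". The following is a minimal sketch of how a consumer of the record might fall back to the native value when the parsed one is missing. The helper name `resolve_country` is hypothetical and not part of the API; the sample dictionaries are trimmed copies of the fields shown above.

```python
from typing import Optional


def resolve_country(parsed: dict, native: dict) -> Optional[str]:
    """Return the parsed country, or fall back to the native ATS value.

    Hypothetical helper: the key names mirror the record shown above,
    but the fallback rule itself is an assumption, not documented behavior.
    """
    parsed_country = (parsed.get("location") or {}).get("country")
    if parsed_country:
        return parsed_country
    detail = native.get("detail_job_opening") or {}
    return (detail.get("location") or {}).get("addressCountry")


# Trimmed copies of the "Parsed Structured" and "Native Structured" fields.
parsed_structured = {
    "location": {"city": "Munich", "region": "Bavaria", "country": None},
}
native_structured = {
    "detail_job_opening": {
        "location": {
            "city": "Munich",
            "state": "Bavaria",
            "postalCode": "80333",
            "addressCountry": "Germany",
        }
    }
}

country = resolve_country(parsed_structured, native_structured)
print(country)
```

For this posting the call falls through to the native payload, since the parsed country is null.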

Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/5431e49cef707d6fe37ac77ef017acef2fa196c7?include=description

GET https://api.bluedoor.sh/job-postings/v1/orgs/f0b57942-c9a0-4c9c-b8a7-6f665516be58

GET https://api.bluedoor.sh/job-postings/v1/sources/73c75124-6f1a-4720-ab36-38de6a06689e

GET https://api.bluedoor.sh/job-postings/v1/jobs/5431e49cef707d6fe37ac77ef017acef2fa196c7/events
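
The four requests above follow a simple pattern: a base URL, a resource type, an ID, and an optional sub-resource or query parameter. As a rough sketch, the URLs could be assembled as below. The function names are hypothetical, the base URL and paths are copied from the requests listed here, and authentication is left out because this page does not describe how the API key is sent.

```python
from typing import Optional
from urllib.parse import urlencode

# Base URL taken from the requests listed on this page.
BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def job_url(job_id: str, include: Optional[str] = None) -> str:
    """URL for a single job record, optionally with an include parameter."""
    url = f"{BASE_URL}/jobs/{job_id}"
    if include:
        url += "?" + urlencode({"include": include})
    return url


def job_events_url(job_id: str) -> str:
    """URL for the lifecycle events of a job."""
    return f"{BASE_URL}/jobs/{job_id}/events"


def org_url(org_id: str) -> str:
    """URL for the organization record."""
    return f"{BASE_URL}/orgs/{org_id}"


def source_url(source_id: str) -> str:
    """URL for the source (board) record."""
    return f"{BASE_URL}/sources/{source_id}"


# IDs copied from the "Full job record" table.
JOB_ID = "5431e49cef707d6fe37ac77ef017acef2fa196c7"
ORG_ID = "f0b57942-c9a0-4c9c-b8a7-6f665516be58"
SOURCE_ID = "73c75124-6f1a-4720-ab36-38de6a06689e"

for url in (
    job_url(JOB_ID, include="description"),
    org_url(ORG_ID),
    source_url(SOURCE_ID),
    job_events_url(JOB_ID),
):
    print("GET", url)
```

The sketch only builds the URLs; sending them requires an HTTP client and whatever credentials the API expects.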