Home › Companies › Valo Health › Staff Data Scientist, Machine Learning in Epidemiology and Patient Data Products
Staff Data Scientist, Machine Learning in Epidemiology and Patient Data Products
Valo Health · Lexington, Massachusetts, United States; Remote; San Francisco, California, United States · Remote · Active · $165,000–$190,000 / year · Greenhouse
Job facts
| Field | Value |
|---|---|
| Company | Valo Health |
| Title | Staff Data Scientist, Machine Learning in Epidemiology and Patient Data Products |
| Normalized title | - |
| Department / team | Translational Data Science |
| Location | Lexington, MA, United States |
| Work model | Remote / Remote |
| Employment type | - |
| Salary | $165,000–$190,000 / year |
| Status | active |
| ATS provider | Greenhouse |
| Posted / first seen | 2026-05-15 / 2026-05-29 |
| Changed / last seen | 2026-05-29 / 2026-06-06 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Valo Health. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Greenhouse. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Lexington. | Open |
| Department jobs | Active postings in Translational Data Science. | Open |
| Work model jobs | Active Remote postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Valo Health |
| Source | 41c57134-c3f9-46f9-a14e-97c32355cfa8 |
| ATS provider | Greenhouse |
Description
About Us
Valo Health is a human-centric, AI-enabled biotechnology company working to make new drugs for patients faster. The company’s Opal Computational Platform transforms drug discovery and development through a unique combination of real-world data, AI, human translational models and predictive chemistry.
Our talented team of biologists, chemists and engineers, armed with advanced AI/ML tools, work together to break down traditional R&D silos and accelerate the speed and scale of drug discovery and development.
Valo is committed to hiring diverse talent, prioritizing growth and development, fostering an inclusive environment, and creating opportunities to bring together a group of different experiences, backgrounds, and voices to work together. We embrace new ways of learning, solve complex problems and welcome diverse perspectives that can help us advance patient-centric innovation.
Valo is headquartered in Lexington, MA, with additional offices in New York, NY and Tel Aviv, Israel. To learn more, visit www.valohealth.com .
About the Role...
As a Staff Data Scientist, Machine Learning in Epidemiology and Patient Data Products, you will be a core member on a team of data scientists building a powerful computational platform for advancing the discovery and development of new medicines. In this role, you will develop machine learning tools for patient data and drive their adoption across teams, under the guidance of epidemiology and biology program leads. Successful candidates will work with a diverse group of scientists and domain experts, in ways that cut across traditional industry boundaries in an innovative startup environment.
What You’ll Do…
Your primary areas of responsibility will be:
As a senior member of our team, you will lead the development of machine learning (ML) methods and analyses of patient data with diverse stakeholders. For example, integrate clinical insights into supervised and unsupervised learning approaches and generate patient profiles.
Perform project-specific hands-on analysis and modeling of high-dimensional longitudinal real-world data, spanning electronic medical records (EHRs), clinical notes, sequencing data, and multi-omics, using modern data science tools in cloud environments.
Contribute to the design, implementation, and evaluation of innovative machine learning approaches for patient data to provide novel clinical insights.
Be comfortable with scientific uncertainty and embrace curiosity and creative solutions. Many of the challenges we tackle don’t have known solutions or established pathways.
Use your technical knowledge and intuition to articulate and break down large problems into solvable pieces. There are a lot of problems to solve; you’ll need to prioritize which of these are critical-path today from those that can wait.
Be a dynamic and active team member, championing shared coding standards, participating in code reviews, and providing regular updates on your work and input into the work of your colleagues.
What You Bring…
MS, MPH, or PhD in health data science, biostatistics, or a related quantitative field, with 5 years of experience developing and applying ML methods, including at least 3 years working directly with real-world patient data. Experience in a biopharmaceutical, epidemiological or biostatistical setting is a plus.
Extensive experience developing and implementing machine learning solutions in healthcare databases, including EHRs, administrative claims, and patient registries. Familiarity with U.S. and global medical coding ontologies and data models (ICD, ATC, LOINC, SNOMED, CPT, HCPCS, OMOP, etc.). Confident working with highly sparse and high-dimensional data. Experience processing and mining clinical notes is a plus.
Extensive experience building, maintaining, and operationalizing ML pipelines, and translating model outputs into meaningful insights for diverse audiences.
Broad proficiency across core ML paradigms (e.g., supervised, unsupervised, semi-supervised) and experience with linear and logistic regression, classification and tree‑based methods, clustering and dimensionality‑reduction techniques, and deep learning architectures. Hands-on experience with representation learning and transformer-based and other sequence models is a plus.
Strong grounding in key components of the ML development lifecycle, including evaluation metrics, hyperparameter tuning, model selection, feature engineering and selection, model explainability, and MLOps best practices.
Mastery of Python and modern data science tools (e.g., scikit-learn, PyTorch, statsmodels, SciPy, MLlib, MLflow). Experience with AI-assisted coding tools (e.g., Claude Code) is a plus.
Comfortable working in ambiguous problem spaces; experience working in a start-up or agile work environment as part of cross-functional project teams.
Ability to lead and facilitate meetings and work collaboratively on multi-disciplinary project teams.
Exceptional time management, ability to prioritize multiple tasks simultaneously, and deliver products on time every time.
Enthusiastic about documentation–ensuring that all analyses are clear and reproducible with thorough documentation of key assumptions and decision points.
You May Also Bring…
Advanced knowledge of biostatistics approaches, including inferential and predictive modeling. Experience in causal approaches for observational studies, including propensity score methods, bias adjustment, and covariate selection and adjustment.
Familiarity with or exposure to traditional drug discovery and development processes and approaches.
Remote Salary Range $165,000 — $190,000 USD CA Salary Range $175,000 — $220,000 USD
Compensation for the role will depend on a number of factors, including a candidate’s qualifications, skills, competencies, and experience. Valo Health currently offers healthcare coverage, annual incentive program, retirement benefits and a broad range of other benefits. Compensation and benefits information is based on Valo Health's good faith estimate as of the date of publication and may be modified in the future.
Please note: At this time, we are only able to consider candidates who currently have permanent US work authorization without the need for immediate or future sponsorship.
Full job record
| Job ID | 83fe949ae9f2dd151e38a4e2c990900539ce18bf |
| Org ID | 2af1e08e-b2ab-4706-8b5e-433cd74c3437 |
| Source ID | 41c57134-c3f9-46f9-a14e-97c32355cfa8 |
| Board ID | 41c57134-c3f9-46f9-a14e-97c32355cfa8 |
| Provider | greenhouse |
| Provider Job Key | 8550220002 |
| Title | Staff Data Scientist, Machine Learning in Epidemiology and Patient Data Products |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Lexington, Massachusetts, United States; Remote; San Francisco, California, United States |
| Department | Translational Data Science |
| Team | — |
| Employment Type | — |
| Workplace Type | remote |
| Remote Policy | remote |
| Country | United States |
| Region | MA |
| City | Lexington |
| Salary Raw | Salary Range $165,000 — $190,000 USD CA Salary Range $175,000 — $220,000 USD Compensation |
| Salary Min | 165,000 |
| Salary Max | 190,000 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://job-boards.greenhouse.io/valohealth/jobs/8550220002 |
| Apply URL | https://job-boards.greenhouse.io/valohealth/jobs/8550220002 |
| First Seen At | 2026-05-29 22:58:12Z |
| Last Seen At | 2026-06-06 20:02:34Z |
| Last Checked At | 2026-06-06 20:02:34Z |
| Last Changed At | 2026-05-29 22:58:12Z |
| Inactive At | — |
| Source Posted At | 2026-05-15 22:35:49Z |
| Source Updated At | 2026-05-28 18:46:29Z |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=valohealth/date=2026-06-06/2026-06-06T20-02-33-959Z-3790ef60c9fe3678d7b0a4bc11852ed19d9821d0ecc26cf2bd64d4988ab83f2f.json |
Event Fields
{
"content_hash": "1eda7a86ba52fc9ba1e8859e35e846513ff26370b782a26bd1bf278b45c9b5c4",
"source_hash": "1e1496254f24034e236d645ecab190ae467ce0ca9cb052ff91c101060910f703",
"last_changed_at": "2026-05-29T22:58:12.088Z",
"active_status": "active"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "Lexington, Massachusetts, United States",
"city": "Lexington",
"region": "MA",
"country": "United States",
"is_remote": true,
"confidence": 0.95
},
"salary_max": 190000,
"salary_min": 165000,
"inferred_at": "2026-06-06T20:02:34.032Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Lexington, Massachusetts, United States",
"city": "Lexington",
"region": "MA",
"country": "United States",
"is_remote": true,
"confidence": 0.95
},
"countries": [
"United States"
]
},
"remote_policy": "remote",
"salary_period": "year",
"workplace_type": "remote",
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"title": "Staff Data Scientist, Machine Learning in Epidemiology and Patient Data Products",
"offices": [
{
"id": 4036211002,
"name": "Valo HQ - Lexington, MA",
"location": "Lexington, Massachusetts, United States",
"child_ids": [],
"parent_id": null
},
{
"id": 4088760002,
"name": "Valo - Remote",
"location": "Remote",
"child_ids": [],
"parent_id": null
},
{
"id": 4120756002,
"name": "Valo - San Francisco, CA",
"location": "San Francisco, California, United States",
"child_ids": [],
"parent_id": null
}
],
"language": "en",
"location": {
"name": "Lexington, Massachusetts, United States; Remote; San Francisco, California, United States"
},
"metadata": [],
"updated_at": "2026-05-28T14:46:29-04:00",
"departments": [
{
"id": 4061147002,
"name": "Translational Data Science",
"child_ids": [],
"parent_id": 4061136002
}
],
"company_name": "Valo Health",
"requisition_id": 6415095002,
"first_published": "2026-05-15T18:35:49-04:00",
"application_deadline": null
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/83fe949ae9f2dd151e38a4e2c990900539ce18bf?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/2af1e08e-b2ab-4706-8b5e-433cd74c3437JSONGET https://api.bluedoor.sh/job-postings/v1/sources/41c57134-c3f9-46f9-a14e-97c32355cfa8JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/83fe949ae9f2dd151e38a4e2c990900539ce18bf/eventsJSON