bluedoor data·Job Postings API·bluedoor.sh ↗

HomeCompaniesLila SciencesML Research Scientist I/II, Multimodal Data Extraction

ML Research Scientist I/II, Multimodal Data Extraction

Lila Sciences · Cambridge, MA USA · Active · $176,000–$304,000 / year · Greenhouse

Job facts

FieldValue
CompanyLila Sciences
TitleML Research Scientist I/II, Multimodal Data Extraction
Normalized title-
Department / teamPhysical Sciences AI
LocationCambridge, MA, United States
Work model-
Employment type-
Salary$176,000–$304,000 / year
Statusactive
ATS providerGreenhouse
Posted / first seen2025-11-03 / 2026-05-29
Changed / last seen2026-05-29 / 2026-06-06

Related slices

PageWhat it containsOpen
Company jobsActive postings from Lila Sciences.Open
Company breakdownsRole, location, ATS, and work model facets for this company.Open
ATS provider jobsActive postings observed through Greenhouse.Open
Provider filtered searchThe same provider as a filtered job collection.Open
City jobsActive postings in Cambridge.Open
Department jobsActive postings in Physical Sciences AI.Open
Lifecycle eventsOpen, update, close, and reopen events for this posting.Open
Original postingCanonical source or apply URL captured from the ATS.Open

Linked records

CompanyLila Sciences
Sourcea1e67975-fd33-4f8d-940f-2dbc2480c450
ATS providerGreenhouse

Description

Your Impact at LILA As a ML Research Scientist - Multimodal Data Extraction , you will advance Lila’s vision of scientific superintelligence by developing foundation models that autonomously read, interpret, and structure scientific knowledge across text, images, and experimental data in the physical sciences. Your research will help unify the world’s scientific information into machine-understandable form, powering reasoning, prediction, and autonomous discovery across materials science and chemistry. What You'll Be Building Research and develop AI systems that extract and structure knowledge from diverse scientific sources. Design and fine-tune large language, multi-modal and specialized models for factual, interpretable data extraction. Build scalable pipelines for unstructured and heterogeneous scientific data , integrating text, tables, and visuals. Collaborate with domain experts to align extracted data with real-world discovery workflows. Publish research that advances the state of the art in multimodal understanding and AI-driven knowledge extraction. What You’ll Need to Succeed PhD (or equivalent research experience) in Computer Science, Chemistry, Materials Science, or related field. Expertise in machine learning , NLP , and vision–language modeling using PyTorch and Hugging Face Transformers . Proven ability to train, fine-tune, and evaluate LLMs and multimodal models for scientific data extraction. Strong understanding of data structures and representations used in the physical sciences. Demonstrated research impact through publications, preprints, or open-source work (e.g., NeurIPS, ICLR, ICML, ACL, EMNLP, Scientific Journals). Bonus Points For Experience with multimodal fusion architectures and document-level understanding. Knowledge of scientific document parsing (OCR, table extraction, figure-caption linking). Familiarity with knowledge graph construction or reasoning systems for science. Experience with noisy or heterogeneous real-world scientific data. Collaborative mindset and passion for advancing AI in the physical sciences. Compensation We offer competitive base compensation with bonus potential and generous early-stage equity. Your final offer will reflect your background, expertise, and expected impact. U.S. Benefits. Full-time U.S. employees receive a comprehensive benefits program including medical, dental, and vision coverage; employer-paid life and disability insurance; flexible time off with generous company wide holidays; paid parental leave; an educational assistance program; commuter benefits, including bike share memberships for office based employees; and a company subsidized lunch program. International Benefits. Full-time employees outside the U.S. receive a comprehensive benefits program tailored to their region. USD salary ranges apply only to U.S.-based positions; international salaries are set to local market. Expected Base Salary Range $176,000 — $304,000 USD About LILA Lila Sciences is building Scientific Superintelligence™ to solve humankind's greatest challenges. We believe science is the most inspiring frontier for AI. Rather than hard-coding expert knowledge into tools, LILA builds systems that can learn for themselves. LILA combines advanced AI models with proprietary AI Science Factory™ instruments into an operating system for science that executes the entire scientific method autonomously, accelerating discovery at unprecedented speed, scale, and impact across medicine, materials, and energy. Learn more at www.lila.ai. Guided by our core values of truth, trust, curiosity, grit, and velocity, we move with startup speed while tackling problems of historic importance. If this sounds like an environment you'd love to work in, even if you don't meet every qualification listed above, we encourage you to apply. We’re All In Lila Sciences is committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. Information you provide during your application process will be handled in accordance with our Candidate Privacy Policy . A Note to Agencies Lila Sciences does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Lila Sciences or its employees is strictly prohibited unless contacted directly by Lila Science’s internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Lila Sciences, and Lila Sciences will not owe any referral or other fees with respect thereto.

Full job record

Job ID2419284734fade2bb2818b48d967511cec2b7ae7
Org IDffee088c-1794-41ee-9ae7-d3e130389319
Source IDa1e67975-fd33-4f8d-940f-2dbc2480c450
Board IDa1e67975-fd33-4f8d-940f-2dbc2480c450
Providergreenhouse
Provider Job Key4052832009
TitleML Research Scientist I/II, Multimodal Data Extraction
Normalized Title
Statusactive
Activeyes
Location TextCambridge, MA USA
DepartmentPhysical Sciences AI
Team
Employment Type
Workplace Type
Remote Policy
CountryUnited States
RegionMA
CityCambridge
Salary RawSalary Range $176,000 — $304,000 USD About LILA Lila Sciences is building Scientific Superintel
Salary Min176,000
Salary Max304,000
Salary CurrencyUSD
Salary Periodyear
Source URLhttps://job-boards.greenhouse.io/lilasciences/jobs/4052832009
Apply URLhttps://job-boards.greenhouse.io/lilasciences/jobs/4052832009
First Seen At2026-05-29 23:01:25Z
Last Seen At2026-06-06 07:34:08Z
Last Checked At2026-06-06 07:34:08Z
Last Changed At2026-05-29 23:01:25Z
Inactive At
Source Posted At2025-11-03 18:31:24Z
Source Updated At2026-05-14 21:07:20Z
Raw Payload Uris3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=lilasciences/date=2026-06-06/2026-06-06T07-34-07-839Z-8910748710b03f2325a47d95b68801d1b5e36c899ac6edbc67e980d6caead799.json
Event Fields
{
  "content_hash": "c12f344cce41f6e2c3f3ad1b45ede82db6cf93469e8382dc4c1865dcd648bdba",
  "source_hash": "9c617c4707db05180a970f470f5df64b382c41f89643f92e3b27f63efcc1f7b5",
  "last_changed_at": "2026-05-29T23:01:25.475Z",
  "active_status": "active"
}
Parsed Structured
{
  "language": "en",
  "location": {
    "raw": "Cambridge, MA USA",
    "city": "Cambridge",
    "region": "MA",
    "country": "United States",
    "is_remote": false,
    "confidence": 0.95
  },
  "salary_max": 304000,
  "salary_min": 176000,
  "inferred_at": "2026-06-06T07:34:08.138Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Cambridge, MA USA",
      "city": "Cambridge",
      "region": "MA",
      "country": "United States",
      "is_remote": false,
      "confidence": 0.95
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": null,
  "salary_period": "year",
  "workplace_type": null,
  "salary_currency": "USD"
}
Extensions
{}
Native Structured
{
  "title": "ML Research Scientist I/II, Multimodal Data Extraction",
  "offices": [
    {
      "id": 4011687009,
      "name": "One Charles Park, Cambridge, MA",
      "location": null,
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Cambridge, MA USA"
  },
  "metadata": [
    {
      "id": 4340245009,
      "name": "Confidential?",
      "value": false,
      "value_type": "yes_no"
    }
  ],
  "updated_at": "2026-05-14T17:07:20-04:00",
  "departments": [
    {
      "id": 4013388009,
      "name": "Physical Sciences AI",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "company_name": "Lila Sciences",
  "requisition_id": 4035982009,
  "first_published": "2025-11-03T13:31:24-05:00",
  "application_deadline": null
}
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/2419284734fade2bb2818b48d967511cec2b7ae7?include=descriptionJSON
GET https://api.bluedoor.sh/job-postings/v1/orgs/ffee088c-1794-41ee-9ae7-d3e130389319JSON
GET https://api.bluedoor.sh/job-postings/v1/sources/a1e67975-fd33-4f8d-940f-2dbc2480c450JSON
GET https://api.bluedoor.sh/job-postings/v1/jobs/2419284734fade2bb2818b48d967511cec2b7ae7/eventsJSON