bluedoor data·Job Postings API·bluedoor.sh ↗

Data Engineer

Arva Intelligence · Houston, Texas · Remote · Active · $95,000–$130,000 / year · Greenhouse

Job facts

Field: Value
Company: Arva Intelligence
Title: Data Engineer
Normalized title: -
Department / team: Research and Innovation
Location: Houston, TX, United States
Work model: Remote / Remote
Employment type: -
Salary: $95,000–$130,000 / year
Status: active
ATS provider: Greenhouse
Posted / first seen: 2026-06-17 / 2026-06-18
Changed / last seen: 2026-06-18 / 2026-06-22

Related slices

Page: What it contains
Company jobs: Active postings from Arva Intelligence.
Company breakdowns: Role, location, ATS, and work model facets for this company.
ATS provider jobs: Active postings observed through Greenhouse.
Provider filtered search: The same provider as a filtered job collection.
City jobs: Active postings in Houston.
Department jobs: Active postings in Research and Innovation.
Work model jobs: Active Remote postings.
Lifecycle events: Open, update, close, and reopen events for this posting.
Original posting: Canonical source or apply URL captured from the ATS.

Linked records

Company: Arva Intelligence
Source: 0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1
ATS provider: Greenhouse

Description

Job Title: Data Engineer
Department: Modeling & Analytics
Reports to: Lead Modeling Scientist
Location: Remote
Base Salary Range: $95k - $130k

General Position Description

The Data Engineer is responsible for building and scaling the data and computational backbone that supports Arva’s ecosystem modeling and measurement, reporting, and verification platforms. This role sits within a multidisciplinary Data Science team and focuses on designing reliable, auditable, and scalable data systems that enable biogeochemical modeling and optimization at production scale.

In this role, the Data Engineer will design and maintain production-grade data pipelines that integrate diverse datasets including field measurements, management practices, soils, and weather with process-based ecosystem models. The role plays a critical part in ensuring data quality, reproducibility, and traceability so that scientific outputs can be translated into trusted, credit-grade results with real-world impact.

Primary Job Responsibilities

Data Pipeline and Workflow Development
- Design, implement, and maintain scalable data pipelines supporting ecosystem and biogeochemical modeling
- Build reproducible workflows that generate standardized model inputs and manage outputs across space, time, and scenario analysis
- Integrate heterogeneous datasets, including field data, management data, soil data, and weather data, into modeling pipelines

Cloud Infrastructure and Data Systems
- Develop and maintain cloud-based infrastructure to support modeling pipelines and optimization workflows
- Implement data storage solutions using relational, spatial, and object-based databases
- Support efficient data access and processing using platforms such as PostgreSQL, PostGIS, and cloud object storage

Data Quality, Governance, and Auditability
- Ensure data quality, versioning, traceability, and auditability to support measurement, reporting, and verification requirements
- Implement validation and monitoring processes to ensure reliability of model inputs and outputs
- Support transparent, repeatable workflows suitable for regulatory and credit market review

Software Engineering and Collaboration
- Write clean, modular, and well-documented production code that supports maintainable and scalable data systems
- Apply software engineering best practices including testing, version control, and documentation
- Collaborate closely with Data Science and Technology teams to align data infrastructure with modeling, analytics, and production needs

Key Competencies / Requirements
- 3+ years demonstrated experience building and maintaining data pipelines for large, complex, and heterogeneous datasets
- Strong proficiency in Python and modern data engineering tools, with experience writing production-grade, testable code
- Experience working with cloud platforms, with AWS strongly preferred
- Familiarity with containerization tools such as Docker and version control systems such as GitHub
- Experience with relational and spatial databases, including PostgreSQL and PostGIS
- Experience working with geospatial data formats and spatial data processing
- Experience supporting scientific or ecosystem modeling workflows preferred
- Familiarity with workflow orchestration tools such as Airflow or Prefect preferred
- Bachelor’s or Master’s degree or equivalent experience in Data Engineering, Computer Science, Environmental Informatics, or a related field

Full job record

Job ID: 88122df77fa9b2addcfb7cacc056f7673ac5e1a6
Org ID: f06fbd7c-6a07-4004-a76b-9d42f3eb0880
Source ID: 0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1
Board ID: 0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1
Provider: greenhouse
Provider Job Key: 5265566008
Title: Data Engineer
Normalized Title:
Status: active
Active: yes
Location Text: Houston, Texas
Department: Research and Innovation
Team:
Employment Type:
Workplace Type: remote
Remote Policy: remote
Country: United States
Region: TX
City: Houston
Salary Raw: Salary Range: $95k - $130k General Position Description The Data Engineer is responsible for building and
Salary Min: 95,000
Salary Max: 130,000
Salary Currency: USD
Salary Period: year
Source URL: https://job-boards.greenhouse.io/arvaintelligence/jobs/5265566008
Apply URL: https://job-boards.greenhouse.io/arvaintelligence/jobs/5265566008
First Seen At: 2026-06-18 07:31:58Z
Last Seen At: 2026-06-22 07:38:04Z
Last Checked At: 2026-06-22 07:38:04Z
Last Changed At: 2026-06-18 07:31:58Z
Inactive At:
Source Posted At: 2026-06-17 18:27:18Z
Source Updated At: 2026-06-17 18:27:18Z
Raw Payload Uri: s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=arvaintelligence/date=2026-06-22/2026-06-22T07-38-04-024Z-a4d405f90ca119fee7e78ab41cbc6c6abdc27dc4bdcfca7319b3511cd3af36fc.json
Event Fields
{
  "content_hash": "9fd41433216bad60b7839c3b81e3a4af109e4306a701c79d54377eb5c9d772b1",
  "source_hash": "c7bf062c513fb917b4b8ea69cb89a6d100350099b4640d020378f4c1e1de770d",
  "last_changed_at": "2026-06-18T07:31:58.384Z",
  "active_status": "active"
}
Parsed Structured
{
  "dedupe": null,
  "language": "en",
  "location": {
    "raw": "Houston, Texas",
    "city": "Houston",
    "region": "TX",
    "country": "United States",
    "is_remote": true,
    "confidence": 0.85
  },
  "salary_max": 130000,
  "salary_min": 95000,
  "inferred_at": "2026-06-22T07:38:04.089Z",
  "launch_scope": {
    "reason": "english_us_canada",
    "included": true,
    "language": "en",
    "location": {
      "raw": "Houston, Texas",
      "city": "Houston",
      "region": "TX",
      "country": "United States",
      "is_remote": true,
      "confidence": 0.85
    },
    "countries": [
      "United States"
    ]
  },
  "remote_policy": "remote",
  "salary_period": "year",
  "workplace_type": "remote",
  "salary_currency": "USD"
}
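The `salary_min` and `salary_max` values above line up with the "$95k - $130k" range in the Salary Raw field. This page does not say how bluedoor derives them, so the following is only a minimal sketch of one way such a normalization could work. The `parse_salary_range` helper and its regular expression are hypothetical and not part of the bluedoor API.

```python
import re
from typing import Optional, Tuple

# Hypothetical pattern: a dollar amount with optional thousands separators
# and an optional "k" suffix, e.g. "$95k" or "$95,000".
_AMOUNT = re.compile(r"\$\s*(\d[\d,]*(?:\.\d+)?)([kK])?")


def parse_salary_range(raw: str) -> Optional[Tuple[int, int]]:
    """Return (min, max) from the first two dollar amounts in raw text.

    Returns None when fewer than two amounts are found.
    """
    amounts = []
    for number, suffix in _AMOUNT.findall(raw):
        value = float(number.replace(",", ""))
        if suffix:
            value *= 1000  # "95k" means 95,000
        amounts.append(int(round(value)))
    if len(amounts) < 2:
        return None
    low, high = amounts[0], amounts[1]
    return (min(low, high), max(low, high))


raw = (
    "Salary Range: $95k - $130k General Position Description "
    "The Data Engineer is responsible for building and"
)
print(parse_salary_range(raw))  # (95000, 130000)
```

Taking only the first two amounts keeps the sketch simple; a production parser would also need to handle currency symbols other than "$", pay periods, and single-value salaries.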
Extensions
{}
Native Structured
{
  "title": "Data Engineer ",
  "offices": [
    {
      "id": 4029447008,
      "name": "Remote",
      "location": "Houston, Texas, United States",
      "child_ids": [],
      "parent_id": null
    }
  ],
  "language": "en",
  "location": {
    "name": "Houston, Texas"
  },
  "metadata": [],
  "updated_at": "2026-06-17T14:27:18-04:00",
  "departments": [
    {
      "id": 4033774008,
      "name": "Research and Innovation",
      "child_ids": [],
      "parent_id": 4033770008
    }
  ],
  "company_name": "Arva Intelligence",
  "requisition_id": 4489653008,
  "first_published": "2026-06-17T14:27:18-04:00",
  "application_deadline": null
}
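The native payload reports `updated_at` and `first_published` with a -04:00 offset, while the record's Source Posted At and Source Updated At fields show the same instant in UTC (2026-06-17 18:27:18Z). A minimal sketch of that conversion using only the Python standard library follows. The `to_utc_z` name is illustrative, and the output format is chosen to match how timestamps are displayed on this page rather than any documented bluedoor convention.

```python
from datetime import datetime, timezone


def to_utc_z(iso_with_offset: str) -> str:
    """Convert an ISO 8601 timestamp with a UTC offset to 'YYYY-MM-DD HH:MM:SSZ'."""
    parsed = datetime.fromisoformat(iso_with_offset)
    if parsed.tzinfo is None:
        # Refuse naive timestamps instead of silently assuming a local zone.
        raise ValueError("timestamp must include a UTC offset")
    return parsed.astimezone(timezone.utc).strftime("%Y-%m-%d %H:%M:%SZ")


print(to_utc_z("2026-06-17T14:27:18-04:00"))  # 2026-06-17 18:27:18Z
```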
Get this page with API

Rendered from the bluedoor Job Postings API. Reproduce it:

GET https://api.bluedoor.sh/job-postings/v1/jobs/88122df77fa9b2addcfb7cacc056f7673ac5e1a6?include=description
GET https://api.bluedoor.sh/job-postings/v1/orgs/f06fbd7c-6a07-4004-a76b-9d42f3eb0880
GET https://api.bluedoor.sh/job-postings/v1/sources/0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1
GET https://api.bluedoor.sh/job-postings/v1/jobs/88122df77fa9b2addcfb7cacc056f7673ac5e1a6/events
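The four requests above can be issued from a script. The sketch below builds the same URLs from the job, org, and source IDs shown in the full job record and fetches each one with the Python standard library. The helper names (`page_urls`, `fetch_json`) are illustrative. This page does not state whether the API requires authentication, so the optional `headers` argument is an assumption that leaves room for whatever credentials the API may expect.

```python
import json
import urllib.request
from typing import Any, Dict, Optional

BASE_URL = "https://api.bluedoor.sh/job-postings/v1"


def page_urls(job_id: str, org_id: str, source_id: str) -> Dict[str, str]:
    """Build the four endpoint URLs this page lists for one job posting."""
    return {
        "job": f"{BASE_URL}/jobs/{job_id}?include=description",
        "org": f"{BASE_URL}/orgs/{org_id}",
        "source": f"{BASE_URL}/sources/{source_id}",
        "events": f"{BASE_URL}/jobs/{job_id}/events",
    }


def fetch_json(
    url: str,
    headers: Optional[Dict[str, str]] = None,
    timeout: float = 10.0,
) -> Any:
    """GET a URL and decode the JSON body.

    `headers` is an assumption: pass auth headers here if the API needs them.
    """
    request = urllib.request.Request(url, headers=headers or {})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


urls = page_urls(
    job_id="88122df77fa9b2addcfb7cacc056f7673ac5e1a6",
    org_id="f06fbd7c-6a07-4004-a76b-9d42f3eb0880",
    source_id="0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1",
)
for name, url in urls.items():
    print(name, url)
```

Calling `fetch_json(urls["job"])` would then perform the first request. URL construction is kept separate from the network call so the URLs can be inspected or logged before any request is sent.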