Home › Companies › Arva Intelligence › Data Engineer
Data Engineer
Arva Intelligence · Houston, Texas · Remote · Active · $95,000–$130,000 / year · Greenhouse
Job facts
| Field | Value |
|---|---|
| Company | Arva Intelligence |
| Title | Data Engineer |
| Normalized title | - |
| Department / team | Research and Innovation |
| Location | Houston, TX, United States |
| Work model | Remote / Remote |
| Employment type | - |
| Salary | $95,000–$130,000 / year |
| Status | active |
| ATS provider | Greenhouse |
| Posted / first seen | 2026-06-17 / 2026-06-18 |
| Changed / last seen | 2026-06-18 / 2026-06-22 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Arva Intelligence. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Greenhouse. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in Houston. | Open |
| Department jobs | Active postings in Research and Innovation. | Open |
| Work model jobs | Active Remote postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Arva Intelligence |
| Source | 0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1 |
| ATS provider | Greenhouse |
Description
Job Title: Data Engineer
Department : Modeling & Analytics
Reports to : Lead Modeling Scientist
Location : Remote
Base Salary Range: $95k - $130k
General Position Description
The Data Engineer is responsible for building and scaling the data and computational backbone that supports Arva’s ecosystem modeling and measurement, reporting, and verification platforms. This role sits within a multidisciplinary Data Science team and focuses on designing reliable, auditable, and scalable data systems that enable biogeochemical modeling and optimization at production scale.
In this role, the Data Engineer will design and maintain production-grade data pipelines that integrate diverse datasets including field measurements, management practices, soils, and weather with process-based ecosystem models. The role plays a critical part in ensuring data quality, reproducibility, and traceability so that scientific outputs can be translated into trusted, credit-grade results with real-world impact.
Primary Job Responsibilities
Data Pipeline and Workflow Development
Design, implement, and maintain scalable data pipelines supporting ecosystem and biogeochemical modeling
Build reproducible workflows that generate standardized model inputs and manage outputs across space, time, and scenario analysis
Integrate heterogeneous datasets, including field data, management data, soil data, and weather data, into modeling pipelines
Cloud Infrastructure and Data Systems
Develop and maintain cloud-based infrastructure to support modeling pipelines and optimization workflows
Implement data storage solutions using relational, spatial, and object-based databases
Support efficient data access and processing using platforms such as PostgreSQL, PostGIS, and cloud object storage
Data Quality, Governance, and Auditability
Ensure data quality, versioning, traceability, and auditability to support measurement, reporting, and verification requirements
Implement validation and monitoring processes to ensure reliability of model inputs and outputs
Support transparent, repeatable workflows suitable for regulatory and credit market review
Software Engineering and Collaboration
Write clean, modular, and well-documented production code that supports maintainable and scalable data systems
Apply software engineering best practices including testing, version control, and documentation
Collaborate closely with Data Science and Technology teams to align data infrastructure with modeling, analytics, and production needs
Key Competencies / Requirements
3+ years demonstrated experience building and maintaining data pipelines for large, complex, and heterogeneous datasets
Strong proficiency in Python and modern data engineering tools, with experience writing production-grade, testable code
Experience working with cloud platforms, with AWS strongly preferred
Familiarity with containerization tools such as Docker and version control systems such as GitHub
Experience with relational and spatial databases, including PostgreSQL and PostGIS
Experience working with geospatial data formats and spatial data processing
Experience supporting scientific or ecosystem modeling workflows preferred
Familiarity with workflow orchestration tools such as Airflow or Prefect preferred
Bachelor’s or Master’s degree or equivalent experience in Data Engineering, Computer Science, Environmental Informatics, or a related field
Full job record
| Job ID | 88122df77fa9b2addcfb7cacc056f7673ac5e1a6 |
| Org ID | f06fbd7c-6a07-4004-a76b-9d42f3eb0880 |
| Source ID | 0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1 |
| Board ID | 0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1 |
| Provider | greenhouse |
| Provider Job Key | 5265566008 |
| Title | Data Engineer |
| Normalized Title | — |
| Status | active |
| Active | yes |
| Location Text | Houston, Texas |
| Department | Research and Innovation |
| Team | — |
| Employment Type | — |
| Workplace Type | remote |
| Remote Policy | remote |
| Country | United States |
| Region | TX |
| City | Houston |
| Salary Raw | Salary Range: $95k - $130k General Position Description The Data Engineer is responsible for building and |
| Salary Min | 95,000 |
| Salary Max | 130,000 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://job-boards.greenhouse.io/arvaintelligence/jobs/5265566008 |
| Apply URL | https://job-boards.greenhouse.io/arvaintelligence/jobs/5265566008 |
| First Seen At | 2026-06-18 07:31:58Z |
| Last Seen At | 2026-06-22 07:38:04Z |
| Last Checked At | 2026-06-22 07:38:04Z |
| Last Changed At | 2026-06-18 07:31:58Z |
| Inactive At | — |
| Source Posted At | 2026-06-17 18:27:18Z |
| Source Updated At | 2026-06-17 18:27:18Z |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=greenhouse/board=arvaintelligence/date=2026-06-22/2026-06-22T07-38-04-024Z-a4d405f90ca119fee7e78ab41cbc6c6abdc27dc4bdcfca7319b3511cd3af36fc.json |
Event Fields
{
"content_hash": "9fd41433216bad60b7839c3b81e3a4af109e4306a701c79d54377eb5c9d772b1",
"source_hash": "c7bf062c513fb917b4b8ea69cb89a6d100350099b4640d020378f4c1e1de770d",
"last_changed_at": "2026-06-18T07:31:58.384Z",
"active_status": "active"
}Parsed Structured
{
"dedupe": null,
"language": "en",
"location": {
"raw": "Houston, Texas",
"city": "Houston",
"region": "TX",
"country": "United States",
"is_remote": true,
"confidence": 0.85
},
"salary_max": 130000,
"salary_min": 95000,
"inferred_at": "2026-06-22T07:38:04.089Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "Houston, Texas",
"city": "Houston",
"region": "TX",
"country": "United States",
"is_remote": true,
"confidence": 0.85
},
"countries": [
"United States"
]
},
"remote_policy": "remote",
"salary_period": "year",
"workplace_type": "remote",
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"title": "Data Engineer ",
"offices": [
{
"id": 4029447008,
"name": "Remote",
"location": "Houston, Texas, United States",
"child_ids": [],
"parent_id": null
}
],
"language": "en",
"location": {
"name": "Houston, Texas"
},
"metadata": [],
"updated_at": "2026-06-17T14:27:18-04:00",
"departments": [
{
"id": 4033774008,
"name": "Research and Innovation",
"child_ids": [],
"parent_id": 4033770008
}
],
"company_name": "Arva Intelligence",
"requisition_id": 4489653008,
"first_published": "2026-06-17T14:27:18-04:00",
"application_deadline": null
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/88122df77fa9b2addcfb7cacc056f7673ac5e1a6?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/f06fbd7c-6a07-4004-a76b-9d42f3eb0880JSONGET https://api.bluedoor.sh/job-postings/v1/sources/0bf34f3a-a7c8-4f1e-af0b-6b7580a9dca1JSONGET https://api.bluedoor.sh/job-postings/v1/jobs/88122df77fa9b2addcfb7cacc056f7673ac5e1a6/eventsJSON