Home › Companies › Payjoy › Staff Data Engineer
Staff Data Engineer
Payjoy · San Francisco, CA · Hybrid · Deleted · $270,932–$337,644 / year · Lever
Job facts
| Field | Value |
|---|---|
| Company | Payjoy |
| Title | Staff Data Engineer |
| Normalized title | - |
| Department / team | Product Development / Core Engineering |
| Location | San Francisco, CA, United States |
| Work model | Hybrid / Hybrid |
| Employment type | Full Time |
| Salary | $270,932–$337,644 / year |
| Status | deleted |
| ATS provider | Lever |
| Posted / first seen | 2025-12-08 / 2026-05-29 |
| Changed / last seen | 2026-06-11 / 2026-06-09 |
Related slices
| Page | What it contains | Open |
|---|---|---|
| Company jobs | Active postings from Payjoy. | Open |
| Company breakdowns | Role, location, ATS, and work model facets for this company. | Open |
| ATS provider jobs | Active postings observed through Lever. | Open |
| Provider filtered search | The same provider as a filtered job collection. | Open |
| City jobs | Active postings in San Francisco. | Open |
| Department jobs | Active postings in Product Development. | Open |
| Work model jobs | Active Hybrid postings. | Open |
| Lifecycle events | Open, update, close, and reopen events for this posting. | Open |
| Original posting | Canonical source or apply URL captured from the ATS. | Open |
Linked records
| Company | Payjoy |
| Source | ae38936e-612a-42ce-b00c-8808afce14fa |
| ATS provider | Lever |
Description
About PayJoy
PayJoy, a Public Benefit Corporation, is a mission-first credit provider dedicated to helping under-served customers in emerging markets to achieve financial stability and success. Our patented technology for secured credit provides an on-ramp for new customers to enter the credit system. Through PayJoy’s point-of-sale financing and card offerings, customers gain access to a modern quality of life. PayJoy’s credit also allows our customers to seize opportunities as micro-entrepreneurs, and acts as insurance for tough times. Through our cutting-edge machine learning, data science, and anti-fraud AI, we have served over 18 million customers as of 2025 while achieving solid profitability for sustainable growth.
This role
The Staff Data Engineer is responsible for ensuring that the organization has reliable, accessible, and well-organized data by designing and maintaining systems that process information in real time and in batches, defining a data strategy aligned with business goals, ensuring data quality and security, optimizing database performance, automating processes to increase efficiency, overseeing monitoring and rapid issue resolution, and providing leadership and guidance to other teams to ensure best practices in data usage and management
PayJoy is proud to be an Equal Employment Opportunity employer and we welcome and encourage people of all backgrounds. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
PayJoy Principles
Finance for the next billion * Ownership * Break Through Walls * Live Communication * Transparency & Directness * Focus on Scale * Work-Life Balance * Embrace Diversity * Speed * Active Listening
Responsibilities
Architect and Build Data Pipelines: Build, optimize, and maintain reliable, scalable, and efficient data pipelines for both batch and real-time data processing.
Data Strategy: Develop and maintain a data strategy aligned with business objectives, ensuring data infrastructure supports current and future needs.
Streaming Expertise: Lead the development of real-time ingestion pipelines using Kafka/Kinesis, and design data models optimized for streaming workloads.
Data Quality & Governance: Implement data quality checks, schema evolution, lineage tracking, and compliance using tools like Unity Catalog and Delta Lake etc.
Tool & Technology Selection: Evaluate and implement the latest data engineering tools and technologies that will best serve our needs, balancing innovation with practicality.
Automation and CI/CD : Drive automation of pipeline deployments, testing and monitoring using Terraform, CircleCi or similar tools.
Performance Tuning: Regularly review, refine, and optimize SQL queries across different systems to maintain peak performance. Identify and address bottlenecks, query performance issues, and resource utilization. Setup best practices and work with developers on education of what they should be doing in the software development lifecycle to ensure optimal performance.
Database Administration: Manage and maintain production AWS RDS MySQL, Aurora and postgres databases. Perform routine database operations, including backups, restores, and disaster recovery planning. Monitor database health, diagnose and resolve issues in a timely manner.
Knowledge and Training: Serve as the primary point of contact for database performance and usage related knowledge, providing guidance, training, and expertise to other teams and stakeholders.
Monitoring & Troubleshooting: Implement monitoring solutions to ensure high availability and troubleshoot data pipeline issues in real-time.
Documentation: Maintain comprehensive documentation of systems, pipelines, and processes for easy onboarding and collaboration.
Mentorship & Leadership: Mentor other engineers, review PRs, and establish best practices in data engineering.
Requirements
Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or a related field.
12+ years of experience in data engineering, with at least 3+ years working in Databricks.
Deep hands-on experience with Apache Spark (PySpark/SQL), Delta Lake, and Structured Streaming.
Technical Expertise: Deep understanding of data engineering concepts, including ETL/ELT processes, data warehousing, big data technologies, and cloud platforms (e.g., AWS, Azure, GCP).
Strong proficiency in Python, SQL, and data modeling for both OLTP and OLAP systems.
Architectural Knowledge: Strong experience in designing and implementing data architectures, including real-time data processing, data lakes, and data warehouses.
Tool Proficiency: Hands-on experience with data engineering tools such as Apache Spark, Kafka, Databricks, Airflow, and modern data orchestration frameworks.
Innovation Mindset: A track record of implementing innovative solutions and reimagining data engineering practices.
Experience with Databricks Workflows, Delta Live Tables (DLT), and Unity Catalog.
Familiarity with stream processing patterns (exactly-once semantics, watermarking, checkpointing)
Benefits
100% Company-funded health insurance for employee and immediate family
Company-funded employee life and disability insurance
Paid vacation days, unlimited sick leave
$2,000 USD annual Co-working Travel perk
$2,000 USD annual Professional Development perk
Phone finance, headphone benefit, home office equipment allowance and wellness perks
Catered lunches
Commuter benefit
Full job record
| Job ID | 380bb61c37933200183e0e5d1734c474c20ce86b |
| Org ID | d303b3f7-bc2e-45ac-a63a-8b512151bb70 |
| Source ID | ae38936e-612a-42ce-b00c-8808afce14fa |
| Board ID | ae38936e-612a-42ce-b00c-8808afce14fa |
| Provider | lever |
| Provider Job Key | 6ca046be-9f53-405e-9654-074a37e32652 |
| Title | Staff Data Engineer |
| Normalized Title | — |
| Status | deleted |
| Active | no |
| Location Text | San Francisco, CA |
| Department | Product Development |
| Team | Core Engineering |
| Employment Type | Full-Time |
| Workplace Type | hybrid |
| Remote Policy | hybrid |
| Country | United States |
| Region | CA |
| City | San Francisco |
| Salary Raw | USD 270932-337644 per-year-salary |
| Salary Min | 270,932 |
| Salary Max | 337,644 |
| Salary Currency | USD |
| Salary Period | year |
| Source URL | https://jobs.lever.co/payjoy/6ca046be-9f53-405e-9654-074a37e32652 |
| Apply URL | https://jobs.lever.co/payjoy/6ca046be-9f53-405e-9654-074a37e32652/apply |
| First Seen At | 2026-05-29 07:01:30Z |
| Last Seen At | 2026-06-09 07:56:56Z |
| Last Checked At | 2026-06-11 07:58:24Z |
| Last Changed At | 2026-06-11 07:58:24Z |
| Inactive At | 2026-06-11 07:58:24Z |
| Source Posted At | 2025-12-08 19:53:09Z |
| Source Updated At | — |
| Raw Payload Uri | s3://job-postings-prod-raw-590183727216/raw/provider=lever/board=payjoy/date=2026-06-09/2026-06-09T07-56-55-956Z-0393df4dc9030bdf3141aebe91ffb197a7ed71075a41568be91b80ab7191bc88.json |
Event Fields
{
"content_hash": "537710e04e734c48f8a1ffb9178ad87485cf9d8895b3a5d989b663e19c9a5069",
"source_hash": "7c350bb1aaef811e589507b51a9b3a14090fe578c3d888912355321fd6db571b",
"last_changed_at": "2026-06-11T07:58:24.200Z",
"active_status": "deleted"
}Parsed Structured
{
"language": "en",
"location": {
"raw": "San Francisco, CA",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"salary_max": 337644,
"salary_min": 270932,
"inferred_at": "2026-06-09T07:56:56.291Z",
"launch_scope": {
"reason": "english_us_canada",
"included": true,
"language": "en",
"location": {
"raw": "San Francisco, CA",
"city": "San Francisco",
"region": "CA",
"country": "United States",
"is_remote": false,
"confidence": 0.9
},
"countries": [
"United States"
]
},
"remote_policy": "hybrid",
"salary_period": "year",
"workplace_type": "hybrid",
"salary_currency": "USD"
}Extensions
{}Native Structured
{
"lists": [
{
"text": "Responsibilities",
"content": "\n<li><strong>Architect and Build Data Pipelines:</strong> Build, optimize, and maintain reliable, scalable, and efficient data pipelines for both batch and real-time data processing.</li>\n<li><strong>Data Strategy:</strong> Develop and maintain a data strategy aligned with business objectives, ensuring data infrastructure supports current and future needs.</li>\n<li><strong>Streaming Expertise:</strong> Lead the development of real-time ingestion pipelines using Kafka/Kinesis, and design data models optimized for streaming workloads.</li>\n<li><strong>Data Quality & Governance: </strong>Implement data quality checks, schema evolution, lineage tracking, and compliance using tools like Unity Catalog and Delta Lake etc. </li>\n<li><strong>Tool & Technology Selection: </strong>Evaluate and implement the latest data engineering tools and technologies that will best serve our needs, balancing innovation with practicality.</li>\n<li><strong>Automation and CI/CD</strong>: Drive automation of pipeline deployments, testing and monitoring using Terraform, CircleCi or similar tools. </li>\n<li><strong>Performance Tuning: </strong>Regularly review, refine, and optimize SQL queries across different systems to maintain peak performance. Identify and address bottlenecks, query performance issues, and resource utilization. Setup best practices and work with developers on education of what they should be doing in the software development lifecycle to ensure optimal performance. </li>\n<li><strong>Database Administration: </strong>Manage and maintain production AWS RDS MySQL, Aurora and postgres databases. Perform routine database operations, including backups, restores, and disaster recovery planning. Monitor database health, diagnose and resolve issues in a timely manner.</li>\n<li><strong>Knowledge and Training: </strong>Serve as the primary point of contact for database performance and usage related knowledge, providing guidance, training, and expertise to other teams and stakeholders.</li>\n<li><strong>Monitoring & Troubleshooting: </strong>Implement monitoring solutions to ensure high availability and troubleshoot data pipeline issues in real-time.</li>\n<li><strong>Documentation: </strong>Maintain comprehensive documentation of systems, pipelines, and processes for easy onboarding and collaboration.</li>\n<li><strong>Mentorship & Leadership: </strong>Mentor other engineers, review PRs, and establish best practices in data engineering.</li>\n"
},
{
"text": "Requirements",
"content": "\n<li>Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or a related field.</li>\n<li>12+ years of experience in data engineering, with at least 3+ years working in Databricks.</li>\n<li>Deep hands-on experience with Apache Spark (PySpark/SQL), Delta Lake, and Structured Streaming.</li>\n<li>Technical Expertise: Deep understanding of data engineering concepts, including ETL/ELT processes, data warehousing, big data technologies, and cloud platforms (e.g., AWS, Azure, GCP).</li>\n<li>Strong proficiency in Python, SQL, and data modeling for both OLTP and OLAP systems.</li>\n<li>Architectural Knowledge: Strong experience in designing and implementing data architectures, including real-time data processing, data lakes, and data warehouses.</li>\n<li>Tool Proficiency: Hands-on experience with data engineering tools such as Apache Spark, Kafka, Databricks, Airflow, and modern data orchestration frameworks.</li>\n<li>Innovation Mindset: A track record of implementing innovative solutions and reimagining data engineering practices.</li>\n<li>Experience with Databricks Workflows, Delta Live Tables (DLT), and Unity Catalog.</li>\n<li>Familiarity with stream processing patterns (exactly-once semantics, watermarking, checkpointing)</li>\n"
},
{
"text": "Benefits",
"content": "\n<li>100% Company-funded health insurance for employee and immediate family</li>\n<li>Company-funded employee life and disability insurance</li>\n<li>Paid vacation days, unlimited sick leave</li>\n<li>$2,000 USD annual Co-working Travel perk</li>\n<li>$2,000 USD annual Professional Development perk</li>\n<li>Phone finance, headphone benefit, home office equipment allowance and wellness perks</li>\n<li>Catered lunches</li>\n<li>Commuter benefit</li>\n"
}
],
"country": "US",
"createdAt": 1765223589428,
"updatedAt": null,
"categories": {
"team": "Core Engineering",
"location": "San Francisco, CA",
"commitment": "Full-Time",
"department": "Product Development",
"allLocations": [
"San Francisco, CA"
]
},
"salaryRange": {
"max": 337644,
"min": 270932,
"currency": "USD",
"interval": "per-year-salary"
},
"workplaceType": "hybrid"
}Get this page with API
Rendered from the bluedoor Job Postings API. Reproduce it:
GET https://api.bluedoor.sh/job-postings/v1/jobs/380bb61c37933200183e0e5d1734c474c20ce86b?include=descriptionJSONGET https://api.bluedoor.sh/job-postings/v1/orgs/d303b3f7-bc2e-45ac-a63a-8b512151bb70JSONGET https://api.bluedoor.sh/job-postings/v1/sources/ae38936e-612a-42ce-b00c-8808afce14faJSONGET https://api.bluedoor.sh/job-postings/v1/jobs/380bb61c37933200183e0e5d1734c474c20ce86b/eventsJSON