leap

Senior Data Engineer

Remote New York, NY
USD 200k - 250k
BigQuery Python SQL dbt Snowflake Airflow Dagster Prefect Kafka GCP AWS Salesforce HubSpot API
Description

Senior Data Engineer

Department: Engineering & Analytics

Location: US - Remote, New York City

Compensation: $200K – $250K • Offers Equity

Employment Type: FullTime

About Leap

Leap is one of the fastest-growing benefits solutions and a category-defining pioneer in employer specialty pharmacy. We are reshaping how life-changing therapies are delivered and financed, ensuring patients get the treatment they need while employers finally get a fair deal.


Specialty drugs and infusions represent nearly 10% of all healthcare spend and are the fastest-growing cost category for employers. Leap tackles this challenge with a novel approach: eliminating hidden markups, expanding access to high-quality infusion providers, and bringing clarity and fairness to how therapies are priced and paid for.


We’re proud to partner with numerous Fortune 500 companies and leading TPAs. Each patient we serve creates immediate ROI: lower costs, improved access, and better care. Join us as we redefine what’s possible in specialty care.

About the Role

You'd be the most senior data person on the team. You'll own the pipelines, the warehouse, and the reporting layer, and you'll make the design decisions about how they're built. You'll report to the engineering lead and work directly with clinical ops, business operations, and leadership. Small engineering team, high ownership.

Key Responsibilities

Pipelines and Warehouse

  • Build and own data pipelines and ETL for claims ingestion, drug pricing, and CRM sync (BigQuery, Python)

  • Design production pipelines for batch and streaming workloads — claims data is high-volume today, and new large-scale data sources are coming

  • Design warehouse schemas and transforms with clear separation between raw, staging, and modeled layers

  • Maintain data quality and reliability across systems that feed both human users and AI workloads — this means row-count checks, schema drift detection, anomaly alerting, and knowing when upstream sources have silently changed, not just whether the job ran

Data Governance

  • Build pipeline monitoring that tells you whether the data is right, not just whether the job ran

  • Design for recoverability. Pipelines should be idempotent and replayable, with raw data always preserved so you can reprocess when logic changes

  • Track data lineage: where it comes from, how it's transformed, and what depends on it

  • Validate data at every stage before it reaches a dashboard or an AI system

Reporting Infrastructure

  • Build reporting systems that give sales, clinical, and leadership teams live visibility into the business

  • Create automated alerting that surfaces when something has changed, so the team acts on data instead of asking for it

AI-Ready Data Infrastructure

  • Build PHI-safe pipelines that feed LLM workloads, agent systems, and automation

  • Design data architecture that connects claims, drug pricing, patient records, CRM activity, and clinical workflows into a usable whole

  • Own the ingestion of external data from non-standard formats and sources — we work with many providers who each send data differently, and new sources are added regularly

Qualifications

Required

  • Python, SQL, and dbt. You've worked with BigQuery, Snowflake, or a similar cloud warehouse and know your way around orchestration tools (Airflow, Dagster, Prefect, or similar).

  • You've architected data platforms, not just written pipelines. You've made decisions about batch vs streaming, incremental vs full-refresh, and warehouse structure — and you can explain why.

  • You care about monitoring, lineage, and governance. You've built systems where you can trace data from source to report.

  • You use AI tools in your own work and you know how to build data infrastructure that AI systems can rely on in production.

  • You've been an early employee, a solo data person, or the one who built the data stack from scratch.

Preferred

  • Healthcare or HIPAA experience, Fivetran or similar ingestion tools, CRM integrations (Salesforce, HubSpot), or experience building data infrastructure for LLM/AI workloads

  • Experience with streaming frameworks (Kafka, Pub/Sub, Flink) or designing systems that handle both batch and real-time data flows

  • Comfort with cloud infrastructure (GCP, AWS) or Linux/sysadmin fundamentals — you can debug a VM, read logs, and manage services, not just write SQL

  • A bias toward simple, cost-effective solutions — you reach for open-source first and know when a managed service is worth the price and lock-in

At Leap, we’re building an outlier company with real impact — and that takes focus, energy, and commitment. If that excites you, we’d love to hear from you.

Leap is an equal opportunity employer and welcomes applicants from all backgrounds. We’re committed to building a team that reflects a diversity of perspectives, experiences, and identities.

leap
leap

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say