| Senior Backend Engineer - Provider Directory

Overview

We are seeking a Senior Backend Engineer to own and scale the b.well Provider Directory - a critical infrastructure component that powers healthcare provider search and discovery for millions of users nationwide. As our provider directory continues to grow in scope and complexity, you will be the dedicated technical expert who enables us to:

- Onboard new data sources efficiently and at scale
- Establish data quality standards that power life-changing healthcare decisions
- Build the data infrastructure that helps people find the right care at the right time

This role combines the technical challenges of large-scale data engineering with the meaningful impact of improving healthcare access for millions of Americans. This is a full-time role open to fully remote work.

Key Responsibilities:

Data Ingestion & Pipelines

- Design and implement scalable data ingestion pipelines for new data sources
- Integrate data feeds into the provider directory
- Onboard EHR brand files and other third-party data sources
- Leverage AI/LLMs to enhance provider data:
  - Infer organization characteristics from unstructured data
  - Identify practitioner-organization relationships
  - Classify provider specialties and services
  - Detect and resolve duplicate provider records
  - Enrich provider profiles with additional context
- Build robust, fault-tolerant ETL processes to handle diverse data formats and volumes
- Monitor and maintain existing data ingestion pipelines

Data Standards & Governance

- Define and document technical standards for provider directory data
- Stage and refine raw source data into standardized formats, normalizing and enhancing it along the way
- Establish data schemas, validation rules, quality thresholds, and confidence scores
- Ensure compliance with industry standards (e.g., FHIR, HL7) where applicable

Data Quality & Analysis

- Partner with Analytics and Reporting teams to develop automated data quality reporting and monitoring systems
- Conduct data quality analysis to identify issues and improvement opportunities
- Build dashboards and alerting mechanisms for data quality metrics
- Work with stakeholders to resolve data quality issues
- Implement data validation and reconciliation processes

Collaboration & Support

- Partner with the Data Refinery team on data transformation and enhancement initiatives
- Collaborate with Product and Business teams to understand data requirements and prioritize work
- Work closely with Analytics and Reporting teams to build data quality monitoring systems
- Provide technical guidance on data integration best practices to cross-functional partners
- Lead incident response and troubleshooting for data-related issues

What We’re Looking For:

Required Qualifications

Technical Skills:

- 5+ years of backend engineering experience with a focus on data-intensive applications
- Strong proficiency in Python (primary language) or Java/Scala
- Hands-on experience with ETL/ELT frameworks: Spark, Databricks, or similar distributed processing systems
- Data pipeline orchestration: Prefect (preferred), Airflow, or similar workflow engines
- Cloud infrastructure: AWS (preferred), GCP, or Azure - including S3, Lambda, ECS/EKS
- Database expertise: Both relational (PostgreSQL, MySQL) and NoSQL (MongoDB, DynamoDB)
- Data quality frameworks: Great Expectations, dbt, or custom validation systems
- Containerization & orchestration: Docker, Kubernetes
- Version control & CI/CD: Git, GitHub Actions, automated testing

Soft Skills:

- Strong analytical and problem-solving abilities
- Excellent communication and collaboration skills
- Detail-oriented, with a commitment to data quality
- Ability to work cross-functionally with technical and non-technical stakeholders

Preferred Qualifications:

- Healthcare data experience strongly preferred: Provider data, claims, EHR systems, or health IT
- Knowledge of healthcare data standards: FHIR (especially Practitioner, PractitionerRole, Organization, Endpoint, Location resources), HL7, NPI registry, NPPES
- Understanding of provider data challenges: Credentialing, network management, directory accuracy
- Experience with data governance and compliance (HIPAA awareness a plus)
- Familiarity with AI/ML for data enrichment: Entity resolution, classification, relationship inference

The target salary range for this position is $160,000 - $190,000 annually and is part of a competitive total rewards package including stock options, benefits, and incentive pay for eligible roles. Individual pay may vary from the target range and is determined by a number of factors including experience, location, internal pay equity, and other relevant business considerations. We review all employee pay and compensation programs annually at minimum to ensure competitive and fair pay.

Data shows that women, people of color, and other underrepresented groups may be less likely to apply for jobs unless they believe they are a perfect match. But b.well holds diversity amongst its key values, and we have a strong commitment to building our workforce and products through that lens. You don't have to check every box in this job description to be a great fit for the role! If you're excited about this position and the prospect of working for b.well, please apply. If it turns out this role isn't for you, there may be other openings that could align with your experience and expertise!

We are committed to an inclusive and diverse b.well. We are an equal opportunity employer. We do not discriminate based on race, ethnicity, color, ancestry, national origin, religion, sex, sexual orientation, gender identity, age, disability, veteran status, genetic information, marital status, or any other legally protected status. |