Lead the development and evolution of data engineering systems in support of a mission-driven healthcare organization focused on preventive and precision medicine. This role is central to building reliable, scalable data pipelines and ensuring high-quality, governed data is available across clinical research, analytics, and product teams.
Key Responsibilities
- Design and manage data workflows using modern ETL/ELT practices to support analytics and reporting at scale
- Enhance and maintain the Snowflake data warehouse, applying sound modeling and performance optimization techniques
- Develop and enforce data quality standards through automated testing, monitoring, and validation frameworks
- Collaborate with product teams to define data contracts and embed data collection into application architectures
- Support analytics and business intelligence initiatives by delivering clean, timely data products
- Work with security and compliance to implement data governance, privacy controls, and auditability
- Partner with clinicians and researchers to enable data access for patient care and long-term studies
- Reduce data latency and improve integration with de-identification systems to meet regulatory requirements
- Guide junior engineers through mentorship and contribute to the development of team-wide engineering standards
- Advocate for automation, observability, and continuous improvement across data platforms
Required Expertise
- Strong proficiency in Python and SQL for data processing and transformation
- Hands-on experience with data orchestration tools such as Dagster and dbt
- Cloud platform experience, particularly with Google Cloud Platform and infrastructure-as-code using Terraform
- Familiarity with containerization and orchestration via Docker and Kubernetes
- Working knowledge of data storage systems including Snowflake, PostgreSQL, and MySQL
- Experience with event streaming platforms like Pub/Sub or Kafka
- Exposure to CI/CD pipelines, particularly GitHub Actions
- Proficiency with monitoring and observability tools such as Datadog, Grafana, or OpenTelemetry
Work Environment
This position supports remote work with periodic travel to designated locations. The role enables impactful contributions to long-term medical research and patient care through data innovation, offering opportunities for technical leadership and professional growth. The organization emphasizes interdisciplinary collaboration, preventive health, and advancing science through nonprofit research.


