Position Overview
Design, build, and maintain scalable batch and streaming pipelines using Google Cloud Dataflow, Google Cloud Datastream, Airbyte, and orchestration tools (Airflow/Prefect/Dagster).
Develop and optimize ETL/ELT processes across AWS Postgres, Google FHIR Store, and Google BigQuery.
Build and maintain unified data models that integrate multiple healthcare data sources (EHR/FHIR, claims/X12, ADT/HL7, CRM, transactional Postgres, Tuva, and third-party APIs).
Implement dbt/OBT transformations to create curated semantic layers for AI/LLM, BI, and predictive analytics.
Ensure data quality, lineage, validation, and governance while maintaining HIPAA compliance and PHI/PII security.
Collaborate with AI/ML engineers, BI developers, and product teams to enable data-driven features, dashboards, and predictive models.
Implement monitoring, anomaly detection, and pipeline optimization for performance, reliability, and cost efficiency.
Participate in architecture discussions, code reviews, and mentori...