- Design and develop scalable, parallelized data systems to process complex, time-series health data captured from consumer-grade sensors.- Implement robust ETL pipelines using distributed computing frameworks such as Apache Spark to ingest and transform binary and proprietary data formats.- Collaborate with study stakeholders to gather requirements and contribute to the study design process, ensuring data architecture supports evolving research needs.- Build tools and dashboards to visualize health data and support data exploration, analysis, and research insights.- Work closely with cross-functional teams including Software, Algorithms, and Quality Engineering to support design, integration, testing, and deployment of health data solutions.- Own and evolve data architecture by driving standardization and reusability across studies, contributing to the development of a company-wide health data platform.