INTRODUCTION
Electronic health records (EHR) are increasingly being leveraged for public health surveillance. EHR-based small area estimates (SAEs) are often validated by comparison to survey data such as the Behavioral Risk Factor Surveillance System (BRFSS). However, survey and EHR-based SAEs are expected to differ. In this cross-sectional study, SAEs were generated using MDPHnet, a distributed EHR-based surveillance network, for all Massachusetts municipalities and zip code tabulation areas (ZCTAs), compared to BRFSS PLACES SAEs, and reasons for differences explored.
METHODS
This study delineated reasons a priori for how SAEs derived using EHRs may differ from surveys by comparing each strategy's case classification criteria and reviewing the literature. Hypertension, diabetes, obesity, asthma, and smoking EHR-based SAEs for 2021 in all ZCTAs and municipalities in Massachusetts were estimated with Bayesian mixed effects modeling and poststratification in the summer/fall of 2023. These SAEs were compared to BRFSS PLACES SAEs published by the US Centers for Disease Control and Prevention.
RESULTS
Mean prevalence was higher in EHR data versus BRFSS in both municipalities and ZCTAs for all outcomes except asthma. ZCTA and municipal symmetric mean absolute percentages ranged from 12.0-38.2% and 13.1-39.8%, respectively. There was greater variability in EHR-based SAEs versus BRFSS PLACES in both municipalities and ZCTAs.
CONCLUSIONS
EHR-based SAEs tended to be higher than BRFSS and more variable. Possible explanations include detection of undiagnosed cases and over-classification using EHR data, and under-reporting within BRFSS. Both EHR and survey-based surveillance have strengths and limitations that should inform their preferred uses in public health surveillance.