A Distributed Regression Analysis Application Package Using SAS

Abstract

Distributed regression is a privacy-preserving analytical method that performs multiple regression analysis using only summary-level information from participating data partners in multi-center studies. To our knowledge, there are no distributed regression applications in SAS, the statistical software used by several large national distributed data networks (DDNs) in the United States, including the Sentinel System. This manuscript presents a SAS software package for distributed regression analysis in DDNs. We describe a distributed regression application developed for use in Base SAS and SAS/STAT modules. This application supports distributed linear, logistic, and stratified Cox proportional hazards regression analysis within horizontally partitioned DDNs. Real data examples are used to demonstrate the utility of the software package.
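To illustrate the idea of fitting a regression from summary-level information only, here is a minimal sketch for the linear case in a horizontally partitioned setting. It is written in Python for illustration and is not the SAS package described above. The function names (`site_summaries`, `combine_and_fit`, `solve`) and the toy data are assumptions made for this example. Each data partner computes the cross-product summaries X'X and X'y locally, and the coordinating site adds them up and solves the normal equations, so no row-level records are exchanged.

```python
from typing import List, Tuple

Matrix = List[List[float]]
Vector = List[float]


def site_summaries(X: Matrix, y: Vector) -> Tuple[Matrix, Vector]:
    """Run locally at one data partner: return X'X and X'y only."""
    p = len(X[0])
    xtx = [[sum(row[i] * row[j] for row in X) for j in range(p)] for i in range(p)]
    xty = [sum(row[i] * yi for row, yi in zip(X, y)) for i in range(p)]
    return xtx, xty


def solve(A: Matrix, b: Vector) -> Vector:
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [bi] for row, bi in zip(A, b)]  # augmented matrix [A | b]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def combine_and_fit(summaries: List[Tuple[Matrix, Vector]]) -> Vector:
    """Run at the coordinating site: sum the site summaries, then solve."""
    p = len(summaries[0][1])
    xtx = [[sum(s[0][i][j] for s in summaries) for j in range(p)] for i in range(p)]
    xty = [sum(s[1][i] for s in summaries) for i in range(p)]
    return solve(xtx, xty)


# Hypothetical toy data: two sites, design matrix columns are (intercept, x).
site_a = ([[1.0, 0.0], [1.0, 1.0]], [1.0, 3.0])
site_b = ([[1.0, 2.0], [1.0, 3.0], [1.0, 4.0]], [5.0, 7.0, 9.0])

# Distributed fit: only the summaries cross site boundaries.
beta_distributed = combine_and_fit([site_summaries(*site_a), site_summaries(*site_b)])

# Reference fit on the pooled rows, for comparison only.
pooled_X = site_a[0] + site_b[0]
pooled_y = site_a[1] + site_b[1]
beta_pooled = combine_and_fit([site_summaries(pooled_X, pooled_y)])
```

In this sketch the distributed and pooled coefficient estimates agree, because the cross-product matrices are additive across sites. Logistic and Cox models have no closed-form solution, so a distributed fit for them typically needs several rounds of summary exchange (for example, gradient and Hessian contributions at each iteration); that is not shown here.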

Publication Date
2024-07-09
Full Citation
Her, Q.L., Li, D., Vilk, Y. et al. A Distributed Regression Analysis Application Package Using SAS. Stat Biosci (2024). https://doi.org/10.1007/s12561-024-09445-6
Full Title
A Distributed Regression Analysis Application Package Using SAS
Authors
Qoua L. Her, Dongdong Li, Yury Vilk, Jessica Young, Zilu Zhang, Jessica M. Malenfant, Sarah Malek & Sengwee Toh
External Publication ID
https://doi.org/10.1007/s12561-024-09445-6