This site needs JavaScript to work properly. Please enable it to take advantage of the complete set of features!
Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

NIH NLM Logo
Log in
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
doi: 10.1145/3097983.3098118.

Federated Tensor Factorization for Computational Phenotyping

Affiliations

Federated Tensor Factorization for Computational Phenotyping

Yejin Kim et al. KDD. 2017 Aug.

Abstract

Tensor factorization models offer an effective approach to convert massive electronic health records into meaningful clinical concepts (phenotypes) for data analysis. These models need a large amount of diverse samples to avoid population bias. An open challenge is how to derive phenotypes jointly across multiple hospitals, in which direct patient-level data sharing is not possible (e.g., due to institutional policies). In this paper, we developed a novel solution to enable federated tensor factorization for computational phenotyping without sharing patient-level data. We developed secure data harmonization and federated computation procedures based on alternating direction method of multipliers (ADMM). Using this method, the multiple hospitals iteratively update tensors and transfer secure summarized information to a central server, and the server aggregates the information to generate phenotypes. We demonstrated with real medical datasets that our method resembles the centralized training model (based on combined datasets) in terms of accuracy and phenotypes discovery while respecting privacy.

Keywords: ADMM; Federated approach; Phenotype.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Process of federated tensor factorization.
Figure 2
Figure 2
Equivalence between tensor factorization with respect to each local tensor Ok and tensor factorization with respect to global tensor O. Without O, tensor factorization that is globally optimal across hospitals can be achieved via local tensor factorization.
Figure 3
Figure 3
Example of secure alignment on feature mode.
Figure 4
Figure 4
RMSE and total time over the number of nonzeros (Fig. 4a, 4b). The first, second, and third stacked bars in Fig. 4b refer to central model, Trip, and local model, respectively. RMSE of Trip, central model, and local model over iteration (Fig. 4c, 4d).
Figure 5
Figure 5
RMSE over the number of hospitals (Fig. 5a) and skewness (Fig. 5b). Total time over the number of hospitals (Fig. 5c) and skewness (Fig. 5d). The former and latter stacked bars in Fig. 5c, 5d refer to Trip and local model, respectively.

Comment in

References

    1. Bennett C, Doub T, Selove R. EHRs connect research and practice: Where predictive modeling, artificial intelligence, and clinical decision support intersect. Health Policy and Technology. 2012 Jun;1(2):105–114.
    1. Boyd S, Parikh N, Chu E, Peleato B, Eckstein J. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends® in Machine Learning. 2011;3(1):1–122.
    1. Carroll JD, Chang J-J. Analysis of individual differences in multidimensional scaling via an n-way generalization of "eckart-young" decomposition. Psychometrika. 1970;35(3):283–319.
    1. Carroll RJ, Eyler AE, Denny JC. Naïve electronic health record phenotype identification for rheumatoid arthritis. Proceedings of the American Medical Informatics Association Annual Symposium; Jan. 2011.pp. 189–196. - PMC - PubMed
    1. Carroll RJ, Thompson WK, Eyler AE, Mandelin AM, Cai T, Zink RM, Pacheco JA, Boomershine CS, Lasko TA, Xu H, Karlson EW, Perez RG, Gainer VS, Murphy SN, Ruderman EM, Pope RM, Plenge RM, Kho AN, Liao KP, Denny JC. Portability of an algorithm to identify rheumatoid arthritis in electronic health records. Journal of the American Medical Informatics Association. 2012 Jun;19(1e):e162–e169. - PMC - PubMed
Cite

AltStyle によって変換されたページ (->オリジナル) /