Secure multi-party linear regression at plaintext speed

Jonathan M. Bloom, Hail Team / Neale Lab / Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard

We detail distributed algorithms for scalable, secure multi-party linear regression and feature selection at essentially the same speed as plaintext regression. While the core geometric ideas are simple, their utility in combination for feature selection appears to be novel. Our scheme opens the door to efficient and secure genome-wide association studies across multiple biobanks.

Read the arXiv note and watch a video primer presented at Models, Inference & Algorithms at the Broad Institute.

We recently learned that the approach to vanilla secure multi-party computation (SMC) linear regression in Section 2 has been explored since at least 2005; cf. Secure Regression on Distributed Databases. However, as far as we know, our extensions are novel and critical to applications structured like GWAS.

Run a Python demo (multiparty_linear_regression.ipynb) of the multi-party linear regression algorithm in Section 2.
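The demo notebook itself is not reproduced here. As a rough illustration of the general idea behind regression on data split by samples across parties, the sketch below has each party compress its own rows into small summary matrices, which are then summed and solved centrally. This is an assumption about the general technique, not the note's exact protocol: the function names `local_statistics` and `pooled_fit` and the simulated data are hypothetical, and the sum is computed in the clear, whereas a real deployment would replace it with a secure-sum protocol.

```python
import numpy as np

rng = np.random.default_rng(0)


def local_statistics(X, y):
    """Compress one party's rows into K x K and length-K summaries (hypothetical helper)."""
    return X.T @ X, X.T @ y


def pooled_fit(stats):
    """Sum the per-party summaries and solve the normal equations.

    The sum is done in the clear here; a secure-sum protocol would
    take its place in an actual multi-party setting.
    """
    XtX = sum(s[0] for s in stats)
    Xty = sum(s[1] for s in stats)
    return np.linalg.solve(XtX, Xty)


# Simulated data (illustrative only): three parties with 50, 80, and 120 samples.
beta_true = np.array([1.0, -2.0, 0.5, 3.0])
parties = []
for n in (50, 80, 120):
    X = rng.normal(size=(n, 4))
    y = X @ beta_true + 0.1 * rng.normal(size=n)
    parties.append((X, y))

# Multi-party estimate built only from the small per-party summaries.
beta_multi = pooled_fit([local_statistics(X, y) for X, y in parties])

# Reference: ordinary least squares on the pooled plaintext data.
X_all = np.vstack([X for X, _ in parties])
y_all = np.concatenate([y for _, y in parties])
beta_plain, *_ = np.linalg.lstsq(X_all, y_all, rcond=None)

print(np.allclose(beta_multi, beta_plain))
```

Each party shares summaries whose size depends only on the number of covariates, not on the number of samples, so the shared computation stays small however many rows each party holds.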

Run a Python demo (dash.ipynb) and an R demo (dash.r) of the distributed association scan hammer (DASH) in Section 4.

Hail uses the single-party distributed algorithm in Section 3 to enable massive genomic analyses and will include multi-party algorithms someday. Also check out the exciting work on secure genomics by Hoon Cho, Bonnie Berger and colleagues.

An OpenMined implementation is being supported by an RAAIS OpenMined Grant, awarded to André Farias.

Feedback welcome! Write Jon: [email protected]

