From 653b5850a4d7833e935889d545e2c0680a5e65d8 Mon Sep 17 00:00:00 2001 From: Arun Isaac Date: Mon, 8 Sep 2025 11:29:25 +0100 Subject: Update README instructions with phenotype encryption. --- README.md | 13 +++++++------ 1 file changed, 7 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index 2b3ab5d..399b39a 100644 --- a/README.md +++ b/README.md @@ -63,11 +63,11 @@ pyhegp --help ![Simple data sharing workflow](doc/simple-workflow.png) -In this simple scenario, there is only one data owner and they wish to share their encrypted data with a researcher. The data owner encrypts their data with: +In this simple scenario, there is only one data owner and they wish to share their encrypted data with a researcher. The data owner encrypts their genotype and phenotype data with: ``` -pyhegp encrypt genotype.tsv +pyhegp encrypt genotype.tsv phenotype.tsv ``` -They then send the encrypted data `genotype.tsv.hegp` to the researcher. Note that data sharing is carried out-of-band and is outside the scope of `pyhegp`. +They then send the encrypted `genotype.tsv.hegp` and `phenotype.tsv.hegp` to the researcher. Note that data sharing is carried out-of-band and is outside the scope of `pyhegp`. ## Joint/federated analysis with many data owners @@ -81,13 +81,14 @@ They share this with the data broker who pools it to compute the summary statist ``` pyhegp pool -o complete-summary summary1 summary2 ... ``` -The data broker shares these summary statistics with the data owners. The data owners standardize their data using these summary statistics, and encrypt their data using a random key. +The data broker shares these summary statistics with the data owners. The data owners standardize their data using these summary statistics, and encrypt their genotype and phenotype data using a random key. ``` -pyhegp encrypt -s complete-summary genotype.tsv +pyhegp encrypt -s complete-summary genotype.tsv phenotype.tsv ``` -Finally, the data owners share the encrypted data `genotype.tsv.hegp` with the broker who concatenates it and shares it with all parties. +Finally, the data owners share the encrypted `genotype.tsv.hegp` and `phenotype.tsv.hegp` with the broker who concatenates it and shares it with all parties. ``` pyhegp cat-genotype -o complete-genotype.tsv.hegp genotype1.tsv.hegp genotype2.tsv.hegp ... +pyhegp cat-phenotype -o complete-phenotype.tsv.hegp phenotype1.tsv.hegp phenotype2.tsv.hegp ... ``` Note that all data sharing is carried out-of-band and is outside the scope of `pyhegp`. -- cgit 1.4.1