about summary refs log tree commit diff
diff options
context:
space:
mode:
authorArun Isaac2025-09-08 11:33:43 +0100
committerArun Isaac2025-09-08 11:34:00 +0100
commita662340e3005500cf3d76a937ee13e103ff71fb3 (patch)
treefe4926e7719dc6c2d313317525de5a36c80f92e7
parent653b5850a4d7833e935889d545e2c0680a5e65d8 (diff)
downloadpyhegp-a662340e3005500cf3d76a937ee13e103ff71fb3.tar.gz
pyhegp-a662340e3005500cf3d76a937ee13e103ff71fb3.tar.lz
pyhegp-a662340e3005500cf3d76a937ee13e103ff71fb3.zip
Detail SNP dropping.
-rw-r--r--README.md4
1 files changed, 2 insertions, 2 deletions
diff --git a/README.md b/README.md
index 399b39a..de93fd2 100644
--- a/README.md
+++ b/README.md
@@ -77,11 +77,11 @@ Data owners generate summary statistics for their data.
 ```
 pyhegp summary genotype.tsv -o summary
 ```
-They share this with the data broker who pools it to compute the summary statistics of the complete dataset.
+They share this with the data broker who pools it to compute the summary statistics of the complete dataset. Any SNPs not common to all summaries will be dropped.
 ```
 pyhegp pool -o complete-summary summary1 summary2 ...
 ```
-The data broker shares these summary statistics with the data owners. The data owners standardize their data using these summary statistics, and encrypt their genotype and phenotype data using a random key.
+The data broker shares these summary statistics with the data owners. The data owners standardize their data using these summary statistics, and encrypt their genotype and phenotype data using a random key. Any SNPs not in `complete-summary` or have a zero standard deviation are dropped. SNPs with a zero standard deviation have no discriminatory power in the analysis.
 ```
 pyhegp encrypt -s complete-summary genotype.tsv phenotype.tsv
 ```