pyhegp - Homomorphic encryption of genotypes and phenotypes

Age	Commit message (Collapse)	Author
2026-01-28	Require that (chromosome, position) in genotype frames is unique. HEAD main	Arun Isaac
	Earlier, we required that (chromosome, position, reference) was unique. We tighten this restriction requiring (chromosome, position) be unique. Therefore, there can be only one reference allele at any given chromosome and position.
2026-01-28	Restrict reference to single character.	Arun Isaac

2026-01-28	Do not drop zero standard deviation SNPs with --only-center.	Arun Isaac

2026-01-28	Remove unnecessary is_phenotype_metadata_column import.	Arun Isaac

2026-01-28	Explain why SNPs are being dropped.	Arun Isaac

2026-01-28	Move SNP deletion out of encrypt_genotype function.	Arun Isaac

2026-01-27	Recognize "intercept" as a phenotype metadata column.	Arun Isaac

2026-01-27	Remove key_blocks variable name in click.option.	Arun Isaac
	The key_blocks variable name is optional since the click option has the same name.
2026-01-16	Add --key-blocks argument.	Arun Isaac

2026-01-16	Allow generation of block diagonal keys.	Arun Isaac
	Allow generation of block diagonal keys, and extend tests to test with different number of blocks.
2026-01-16	Add BlockDiagonalMatrix.	Arun Isaac

2026-01-16	Remove unnecessary imports from test_serialization.py.	Arun Isaac

2026-01-16	Add --only-center option.	Arun Isaac

2026-01-16	Separate centering from normalization.	Arun Isaac

2026-01-16	Link to RFC2119 for definition of capitalized terms.	Arun Isaac

2026-01-16	Add intercept column in phenotypes file.	Arun Isaac

2026-01-15	Add --version option.	Arun Isaac

2026-01-15	Allow passing in a key.	Arun Isaac

2025-12-01	README: Add FAQ about standardization.	Arun Isaac

2025-11-28	Handle absent optional reference column.	Arun Isaac
	pyhegp was crashing if the optional reference column was absent. We handle it correctly now. And, we add several test cases to catch this in the future.
2025-11-28	Clarify that only one of pip or Guix is required.	Arun Isaac

2025-11-13	Disable tests for julia-jwas.	Arun Isaac

2025-11-13	Depend explicitly on the guix-rust-past-crates channel.	Arun Isaac

2025-11-13	Add JWAS tests to HSmice test.	Arun Isaac

2025-11-13	Fold line.	Arun Isaac

2025-11-13	Parameterize query when checking for the QTL.	Arun Isaac
	This will allow generalization to JWAS later.
2025-11-12	Replace python-hypothesis-next with python-hypothesis.	Arun Isaac
	python-hypothesis has now been updated on Guix upstream, and python-hypothesis-next has been deprecated.
2025-09-08	Link to PlantUML images built by the CI.	Arun Isaac
	We no longer commit the built images into the git repository. Committing build artifacts into the repository is never good practice. And, linking to CI build artifacts is also more automatic and seamless.
2025-09-08	Specify PlantUML output filenames.	Arun Isaac
	This is so that we don't end up with long Guix store hashes in the output filename when these images are built on CI using the Guix computed-file.
2025-09-08	Replace PlantUML image generation script with Guix computed-file.	Arun Isaac

2025-09-08	Hint at pyhegp commands in UML visualization.	Arun Isaac

2025-09-08	Add @startuml and @enduml tags.	Arun Isaac

2025-09-08	Split up hsmice test into smaller computed files.	Arun Isaac
	This will give us slightly better caching and code organization.
2025-09-08	Exclude .guix and e2e-tests from the pyhegp package.	Arun Isaac

2025-09-08	Detail SNP dropping.	Arun Isaac

2025-09-08	Update README instructions with phenotype encryption.	Arun Isaac

2025-09-08	Comment on QTL checking code.	Arun Isaac

2025-09-06	Update r-mixed-model-gwas to 1.3.1.	Arun Isaac

2025-09-06	Add end-to-end tests for hsmice dataset.	Arun Isaac
	* Add hsmice dataset wrangling and test scripts. * Add G-expression script to run test. * Depend on the guix-bioinformatics Guix channel for r-genio.
2025-09-06	Construct empty genotype frame using series.	Arun Isaac

2025-09-06	Roll cat_data_frames into cat_genotype.	Arun Isaac
	cat_data_frames is no longer a special function used by both cat_genotype and cat_phenotype. Specialize it and roll it into cat_genotype.
2025-09-06	Catenate phenotype frames along the index.	Arun Isaac

2025-09-06	Split catenable phenotype frames along the index.	Arun Isaac
	Phenotype frames are split by sample IDs. This corresponds to splitting along the index, unlike genotype frames which need to be split along the columns.
2025-09-06	Generalize split_data_frame to split along any axis.	Arun Isaac

2025-09-06	Simplify split_data_frame so it is more composable.	Arun Isaac
	split_data_frame should only split the data frame. It should not be filtering out metadata columns.
2025-09-05	Generate unique SNPs in genotype frames without dropping duplicates.	Arun Isaac
	Earlier, we were generating unique SNPs in genotype frames by dropping duplicates. This meant we couldn't control the number of SNPs. Rejection sampling is also not an option because it is too expensive. So, we now generate unique SNPs directly, by first generating a list with unique elements and then converting to a data frame.
2025-09-05	Deduplicate genotype frame metadata generation.	Arun Isaac
	Abstract out generation of genotype frame metadata (namely chromosome, position and reference) from summaries and genotype_frames into a new helper function genotype_metadata.
2025-09-05	Drop SNPs with a zero standard deviation.	Arun Isaac

2025-09-05	Fix typo in comment: tha->that.	Arun Isaac

2025-09-04	Avoid wildcard import from helpers.strategies.	Arun Isaac