about summary refs log tree commit diff
AgeCommit message (Collapse)Author
5 daysRemove unnecessary is_phenotype_metadata_column import.Arun Isaac
5 daysExplain why SNPs are being dropped.Arun Isaac
5 daysMove SNP deletion out of encrypt_genotype function.Arun Isaac
5 daysRecognize "intercept" as a phenotype metadata column.Arun Isaac
5 daysRemove key_blocks variable name in click.option.Arun Isaac
The key_blocks variable name is optional since the click option has the same name.
2026-01-16Add --key-blocks argument.Arun Isaac
2026-01-16Allow generation of block diagonal keys.Arun Isaac
Allow generation of block diagonal keys, and extend tests to test with different number of blocks.
2026-01-16Add BlockDiagonalMatrix.Arun Isaac
2026-01-16Remove unnecessary imports from test_serialization.py.Arun Isaac
2026-01-16Add --only-center option.Arun Isaac
2026-01-16Separate centering from normalization.Arun Isaac
2026-01-16Link to RFC2119 for definition of capitalized terms.Arun Isaac
2026-01-16Add intercept column in phenotypes file.Arun Isaac
2026-01-15Add --version option.Arun Isaac
2026-01-15Allow passing in a key.Arun Isaac
2025-12-01README: Add FAQ about standardization.Arun Isaac
2025-11-28Handle absent optional reference column.Arun Isaac
pyhegp was crashing if the optional reference column was absent. We handle it correctly now. And, we add several test cases to catch this in the future.
2025-11-28Clarify that only one of pip or Guix is required.Arun Isaac
2025-11-13Disable tests for julia-jwas.Arun Isaac
2025-11-13Depend explicitly on the guix-rust-past-crates channel.Arun Isaac
2025-11-13Add JWAS tests to HSmice test.Arun Isaac
2025-11-13Fold line.Arun Isaac
2025-11-13Parameterize query when checking for the QTL.Arun Isaac
This will allow generalization to JWAS later.
2025-11-12Replace python-hypothesis-next with python-hypothesis.Arun Isaac
python-hypothesis has now been updated on Guix upstream, and python-hypothesis-next has been deprecated.
2025-09-08Link to PlantUML images built by the CI.Arun Isaac
We no longer commit the built images into the git repository. Committing build artifacts into the repository is never good practice. And, linking to CI build artifacts is also more automatic and seamless.
2025-09-08Specify PlantUML output filenames.Arun Isaac
This is so that we don't end up with long Guix store hashes in the output filename when these images are built on CI using the Guix computed-file.
2025-09-08Replace PlantUML image generation script with Guix computed-file.Arun Isaac
2025-09-08Hint at pyhegp commands in UML visualization.Arun Isaac
2025-09-08Add @startuml and @enduml tags.Arun Isaac
2025-09-08Split up hsmice test into smaller computed files.Arun Isaac
This will give us slightly better caching and code organization.
2025-09-08Exclude .guix and e2e-tests from the pyhegp package.Arun Isaac
2025-09-08Detail SNP dropping.Arun Isaac
2025-09-08Update README instructions with phenotype encryption.Arun Isaac
2025-09-08Comment on QTL checking code.Arun Isaac
2025-09-06Update r-mixed-model-gwas to 1.3.1.Arun Isaac
2025-09-06Add end-to-end tests for hsmice dataset.Arun Isaac
* Add hsmice dataset wrangling and test scripts. * Add G-expression script to run test. * Depend on the guix-bioinformatics Guix channel for r-genio.
2025-09-06Construct empty genotype frame using series.Arun Isaac
2025-09-06Roll cat_data_frames into cat_genotype.Arun Isaac
cat_data_frames is no longer a special function used by both cat_genotype and cat_phenotype. Specialize it and roll it into cat_genotype.
2025-09-06Catenate phenotype frames along the index.Arun Isaac
2025-09-06Split catenable phenotype frames along the index.Arun Isaac
Phenotype frames are split by sample IDs. This corresponds to splitting along the index, unlike genotype frames which need to be split along the columns.
2025-09-06Generalize split_data_frame to split along any axis.Arun Isaac
2025-09-06Simplify split_data_frame so it is more composable.Arun Isaac
split_data_frame should only split the data frame. It should not be filtering out metadata columns.
2025-09-05Generate unique SNPs in genotype frames without dropping duplicates.Arun Isaac
Earlier, we were generating unique SNPs in genotype frames by dropping duplicates. This meant we couldn't control the number of SNPs. Rejection sampling is also not an option because it is too expensive. So, we now generate unique SNPs directly, by first generating a list with unique elements and then converting to a data frame.
2025-09-05Deduplicate genotype frame metadata generation.Arun Isaac
Abstract out generation of genotype frame metadata (namely chromosome, position and reference) from summaries and genotype_frames into a new helper function genotype_metadata.
2025-09-05Drop SNPs with a zero standard deviation.Arun Isaac
2025-09-05Fix typo in comment: tha->that.Arun Isaac
2025-09-04Avoid wildcard import from helpers.strategies.Arun Isaac
2025-09-04Limit values in genotype and phenotype strategies.Arun Isaac
2025-09-04Test that ciphertext does not contain NA values.Arun Isaac
2025-09-04Parameterize number of samples in phenotype frame strategy.Arun Isaac