Age | Commit message | Author
2020-09-04 | added in the sra script an option to include only a subset of ids | AndreaGuarracino
2020-09-04 | sra script updated for managing more locations | AndreaGuarracino
2020-09-04 | synchronized the create_sra_metadata.py script with the latest updates | AndreaGuarracino
2020-08-29 | fixed few countries ontology terms; added a new species | AndreaGuarracino
2020-08-28 | added script to remove entries on Arvados | AndreaGuarracino
2020-08-28 | local QC is optional | AndreaGuarracino
2020-08-28 | added control (locally and in the validation) that sample_id has to be the same in the metadata and in the FASTA header #103 | AndreaGuarracino
2020-08-28 | qc_fasta returns also the seq_type; verified that only one FASTA is sent at a time; code cleaning | AndreaGuarracino
2020-08-28 | fixed local QC, modifying the regex for FASTA/FASTQ to be more general | AndreaGuarracino
2020-08-27 | updated dependency from clustalw to minimap2; the genbank script no longer creates YAML/FASTA pairs for too short sequences | AndreaGuarracino
2020-08-26 | Increase RAM for odgi-build-from-spoa-gfa | Peter Amstutz
    Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz@curii.com>
2020-08-26 | updated parameter name | AndreaGuarracino
2020-08-26 | added option in the genbank script to ignore (already validated) IDs; code cleaning; typos | AndreaGuarracino
2020-08-26 | typos in the code; little code refactoring | AndreaGuarracino
2020-08-26 | Docs | Pjotr Prins
2020-08-26 | Fix underscores | Pjotr Prins
2020-08-25 | Increase RAM requirement for sort_fasta_by_quality_and_len | Peter Amstutz
    Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz@curii.com>
2020-08-25 | Using Arvados | Pjotr Prins
2020-08-25 | the YAML/FASTA pair is not created for samples where at least one mandatory field is missing | AndreaGuarracino
2020-08-24 | fixed protocol for the dictionary entries that caused validation problems | AndreaGuarracino
2020-08-24 | Fix dependency conflict with PyShExC | Peter Amstutz
2020-08-24 | Adding meta keywords to HTML | Pjotr Prins
2020-08-24 | genbank script documented in blog | Pjotr Prins
2020-08-24 | reverted the relaxation on the location attribute | AndreaGuarracino
2020-08-24 | Merge pull request #102 from AndreaGuarracino/patch-4 | Pjotr Prins
    a lot of new ontology terms, genbank/sra scripts updated to be more general
2020-08-24 | Genbank upload | Pjotr Prins
2020-08-24 | Add note on most fields being optional | Pjotr Prins
2020-08-24 | Images | Pjotr Prins
2020-08-23 | More info | Pjotr Prins
2020-08-23 | Textual changes | Pjotr Prins
2020-08-23 | genbank/sra scripts update to be more generic with the specimen sources | AndreaGuarracino
2020-08-23 | added new countries and specimen sources; fixed few country entries | AndreaGuarracino
2020-08-22 | genbank/sra scripts updated to read the dictionaries in a more general way | AndreaGuarracino
2020-08-22 | lots of new dictionary terms | AndreaGuarracino
2020-08-22 | Generated | Pjotr Prins
2020-08-22 | Relaxing the location attribute | Pjotr Prins
2020-08-22 | FASTQ in capitals | Pjotr Prins
2020-08-22 | More on contacting | Pjotr Prins
2020-08-22 | Repositioning content | Pjotr Prins
2020-08-22 | Reorganize footer | Pjotr Prins
2020-08-22 | Reworded blurb | Pjotr Prins
2020-08-22 | Small text fixes | Pjotr Prins
2020-08-22 | guix-deploy script | Pjotr Prins
2020-08-20 | Update links to reflect latest workflow | Peter Amstutz
    Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz@curii.com>
2020-08-19 | Fix output parameters | Peter Amstutz
    Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz@curii.com>
2020-08-19 | Scaling pangenome generation | Peter Amstutz
    Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz@curii.com>
2020-08-19 | Consolidate steps to scale graph generation workflow | Peter Amstutz
    Arvados-DCO-1.1-Signed-off-by: Peter Amstutz <peter.amstutz@curii.com>
2020-08-19 | Merge pull request #101 from AndreaGuarracino/patch-3 | Peter Amstutz
    used builtin hashlib md5 for the deduplication step
2020-08-19 | used builtin hashlib md5 for the deduplication step | AndreaGuarracino
2020-08-19 | Merge pull request #100 from AndreaGuarracino/patch-3 | Peter Amstutz
    integrated the deduplication step in the sorting by quality and length script
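
The last entries above mention using the builtin hashlib md5 for a deduplication step. The log does not show the code, so the following is only a minimal sketch of how such a step might look. The function name `dedup_by_md5`, the `(header, sequence)` record shape, and the case-insensitive comparison are all assumptions made for illustration, not the repository's actual implementation.

```python
import hashlib


def dedup_by_md5(records):
    """Keep the first record for each distinct sequence.

    `records` is an iterable of (header, sequence) pairs (hypothetical shape).
    Sequences are compared by the MD5 digest of their upper-cased text, so
    only fixed-size digests are kept in memory instead of whole sequences.
    Upper-casing before hashing is an assumption of this sketch.
    """
    seen = set()
    unique = []
    for header, sequence in records:
        digest = hashlib.md5(sequence.upper().encode("ascii")).hexdigest()
        if digest in seen:
            continue  # an identical sequence was already kept
        seen.add(digest)
        unique.append((header, sequence))
    return unique


if __name__ == "__main__":
    example = [("s1", "ACGT"), ("s2", "acgt"), ("s3", "ACGG")]
    for header, sequence in dedup_by_md5(example):
        print(header, sequence)
```

MD5 is used here only as a fingerprint for spotting identical sequences, not for any security purpose.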