<feed xmlns='http://www.w3.org/2005/Atom'>
<title>bh20-seq-resource/scripts/download_genbank_data, branch fasta-subset-from-query</title>
<subtitle>Tool to upload SARS-CoV-2 sequences to BH20 Arvados instance and orchestrate analysis</subtitle>
<id>http://git.systemreboot.net/bh20-seq-resource/atom?h=fasta-subset-from-query</id>
<link rel='self' href='http://git.systemreboot.net/bh20-seq-resource/atom?h=fasta-subset-from-query'/>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/'/>
<updated>2020-09-28T09:57:51+00:00</updated>
<entry>
<title>new countries; updated genbank/sra scripts to manage more specimen sources</title>
<updated>2020-09-28T09:57:51+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-09-28T09:57:51+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=c72dab2788d010153d5406f2d5ecbe3824571931'/>
<id>urn:sha1:c72dab2788d010153d5406f2d5ecbe3824571931</id>
<content type='text'>
</content>
</entry>
<entry>
<title>genbank and sra scripts more picky on the ontologies; added utils.py for shared functions</title>
<updated>2020-09-28T07:13:50+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-09-28T07:13:50+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=bc2e51bc8418876cc826482ece10874b2a61fa03'/>
<id>urn:sha1:bc2e51bc8418876cc826482ece10874b2a61fa03</id>
<content type='text'>
</content>
</entry>
<entry>
<title>added in the sra script an option to include only a subset of ids</title>
<updated>2020-09-04T09:13:25+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-09-04T09:13:25+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=327b6db4e7c9ceda190617f2a793e6a646bb30cc'/>
<id>urn:sha1:327b6db4e7c9ceda190617f2a793e6a646bb30cc</id>
<content type='text'>
</content>
</entry>
<entry>
<title>synchronized the create_sra_metadata.py script with the latest updates</title>
<updated>2020-09-04T07:59:47+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-09-04T07:59:47+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=842f1a25cc2632771e8a816dbb4898db81d7a6c2'/>
<id>urn:sha1:842f1a25cc2632771e8a816dbb4898db81d7a6c2</id>
<content type='text'>
</content>
</entry>
<entry>
<title>added control (locally and in the validation) that sample_id has to be the same in the metadata and in the FASTA header #103</title>
<updated>2020-08-28T09:16:24+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-08-28T09:16:24+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=3165a31e321cbf4641f9afdcbea511ee66f673bb'/>
<id>urn:sha1:3165a31e321cbf4641f9afdcbea511ee66f673bb</id>
<content type='text'>
</content>
</entry>
<entry>
<title>updated dependency from clustalw to minimap2; the genbank script no longer creates YAML/FASTA pairs for too short sequences</title>
<updated>2020-08-26T22:18:24+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-08-26T22:18:24+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=4299c750728bbad4bdbf0311ff2a4b9c65d9883c'/>
<id>urn:sha1:4299c750728bbad4bdbf0311ff2a4b9c65d9883c</id>
<content type='text'>
</content>
</entry>
<entry>
<title>added option in the genbank script to ignore (already validated) IDs; code cleaning; typos</title>
<updated>2020-08-26T14:56:47+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-08-26T14:56:47+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=9107e59b2e69c4f3f3369175d57631ed878009e7'/>
<id>urn:sha1:9107e59b2e69c4f3f3369175d57631ed878009e7</id>
<content type='text'>
</content>
</entry>
<entry>
<title>the YAML/FASTA pair is not created for samples where at least one mandatory field is missing</title>
<updated>2020-08-24T22:02:32+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-08-24T22:02:32+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=2baa88b766ec540bd34b96599014dd16e393af39'/>
<id>urn:sha1:2baa88b766ec540bd34b96599014dd16e393af39</id>
<content type='text'>
</content>
</entry>
<entry>
<title>genbank/sra scripts update to be more generic with the specimen sources</title>
<updated>2020-08-23T09:28:02+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-08-23T09:28:02+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=3b9423891c4e90499a40c1be029ef40160efb557'/>
<id>urn:sha1:3b9423891c4e90499a40c1be029ef40160efb557</id>
<content type='text'>
</content>
</entry>
<entry>
<title>genbank/sra scripts updated to read the dictionaries in a more general way</title>
<updated>2020-08-22T14:41:02+00:00</updated>
<author>
<name>AndreaGuarracino</name>
</author>
<published>2020-08-22T14:41:02+00:00</published>
<link rel='alternate' type='text/html' href='http://git.systemreboot.net/bh20-seq-resource/commit/?id=d6559a65865ebbb14e041893c2dae742fa146143'/>
<id>urn:sha1:d6559a65865ebbb14e041893c2dae742fa146143</id>
<content type='text'>
</content>
</entry>
</feed>
