aboutsummaryrefslogtreecommitdiff
path: root/doc/blog/using-covid-19-pubseq-part3.org
diff options
context:
space:
mode:
Diffstat (limited to 'doc/blog/using-covid-19-pubseq-part3.org')
-rw-r--r--doc/blog/using-covid-19-pubseq-part3.org13
1 files changed, 12 insertions, 1 deletions
diff --git a/doc/blog/using-covid-19-pubseq-part3.org b/doc/blog/using-covid-19-pubseq-part3.org
index e8fee36..fda7be8 100644
--- a/doc/blog/using-covid-19-pubseq-part3.org
+++ b/doc/blog/using-covid-19-pubseq-part3.org
@@ -146,7 +146,6 @@ instead on https and entity instead of wiki) the submission went
through. Reload the page (it won't empty the fields) to re-enable the
submit button.
-
* Step 4: Check output
The current pipeline takes 5.5 hours to complete! Once it completes
@@ -237,3 +236,15 @@ The web interface using this exact same script so it should just work
We also use above script to bulk upload GenBank sequences with a [[https://github.com/arvados/bh20-seq-resource/blob/master/scripts/download_genbank_data/from_genbank_to_fasta_and_yaml.py][FASTA
and YAML]] extractor specific for GenBank. This means that the steps we
took above for uploading a GenBank sequence are already automated.
+
+The steps are: from the
+~bh20-seq-resource/scripts/download_genbank_data/~ directory
+
+#+BEGIN_SRC sh
+python3 from_genbank_to_fasta_and_yaml.py
+dir_fasta_and_yaml=~/bh20-seq-resource/scripts/download_genbank_data/fasta_and_yaml
+ls $dir_fasta_and_yaml/*.yaml | while read path_code_yaml; do
+ path_code_fasta=${path_code_yaml%.*}.fasta
+ bh20-seq-uploader --skip-qc $path_code_yaml $path_code_fasta
+done
+#+END_SRC