From fe1402f898e3673aba725df62109df5bb2d8eef4 Mon Sep 17 00:00:00 2001 From: Pjotr Prins Date: Mon, 24 Aug 2020 09:12:31 +0100 Subject: Images --- doc/web/about.html | 153 ++++++++++++++++++++++++++++---------------------- doc/web/about.org | 18 ++++-- doc/web/download.html | 110 +++++++++++++++++++----------------- doc/web/download.org | 5 ++ 4 files changed, 164 insertions(+), 122 deletions(-) (limited to 'doc/web') diff --git a/doc/web/about.html b/doc/web/about.html index aa12851..2d1f51d 100644 --- a/doc/web/about.html +++ b/doc/web/about.html @@ -3,7 +3,7 @@ "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
- +PubSeq, the public sequence resource, aims to provide a generic and @@ -322,8 +323,8 @@ follow.
We presented at the BOSC 2020 Have a look at the video (alternative
@@ -332,8 +333,8 @@ link) and the
-
The public sequence resource is an initiative by bioinformatics and
@@ -354,8 +355,8 @@ wrangling experts. Thank you everyone!
The short version is that we use state-of-the-art practices in
@@ -378,8 +379,8 @@ such as GISAID.
Funny question. There are only good reasons to upload your data here
@@ -433,8 +434,8 @@ for bulk uploads!
On uploading a sequence with metadata it will automatically be
@@ -445,8 +446,8 @@ using workflows from the High Performance Open Biology Lab defined
The Swiss Institute of Bioinformatics has included this data in
@@ -464,8 +465,8 @@ for monitoring, protein prediction and drug development.
You can contribute by submitting sequences, updating metadata, submit
@@ -477,8 +478,8 @@ point.
All data is published under a Creative Commons 4.0 attribution license
@@ -488,8 +489,8 @@ data and store it for further processing.
Absolutely. Free software allows for fully reproducible pipelines. You
@@ -498,8 +499,8 @@ can take our workflows and data and run it elsewhere!
We are preparing raw sequence data pipelines (fastq and BAM). The
@@ -514,8 +515,8 @@ assembly variations into consideration. This is all work in progress.
See the http://covid19.genenetwork.org/blog!
@@ -523,8 +524,8 @@ See the http://covid19.genenetwork
Workflows are on github and can be modified. See also the BLOG
@@ -533,8 +534,8 @@ Workflows are on
-
Go to our source code repositories, fork/clone the repository, change
@@ -544,8 +545,8 @@ many PRs we already merged.
Restrictive data licenses are hampering data sharing and reproducible
@@ -561,8 +562,8 @@ In all honesty: we prefer both data and software to be free.
We do output a RDF file with the pangenome built in, and you can parse it because it has variants implicitly.
@@ -574,8 +575,8 @@ We are also writing tools to generate VCF files directly from the pangenome.
A public sequence resource is about public data. Metadata can refer to
@@ -586,8 +587,8 @@ plan to combine identifiers with clinical data stored securely at
We are planning to remove reads that match the human reference.
@@ -595,8 +596,8 @@ We are planning to remove reads that match the human reference.
To date, PubSeq is a resource specific to SARS-CoV-2, but we are designing it to be able to support other species in the future.
@@ -605,8 +606,8 @@ To date, PubSeq is a resource specific to SARS-CoV-2, but we are designing it to
We use a gitter channel you can join. See also contact.
@@ -614,10 +615,26 @@ We use a
-
+We have two publications in the works. Until we have a DOI please cite
+PubSeq in the following way:
+
+We made use of the COVID-19 public sequence (PubSeq) resources hosted
+at http://covid19.genenetwork.org/.
+
The main sponsors are listed in the footer. In addition to the time
generously donated by many contributors we also acknowledge Amazon AWS
for donating COVID-19 related compute time.
@@ -626,7 +643,7 @@ for donating COVID-19 related compute time.
The last runs can be viewed here. If you click on a run you can see
@@ -285,8 +286,8 @@ is listed under
The public sequence resource provides all uploaded sequences as
@@ -296,8 +297,8 @@ also provide a single file
-
Metadata can be downloaded as Turtle RDF as a mergedmetadat.ttl which
@@ -319,8 +320,8 @@ graph can be downloaded from below Pangenome RDF format.
Pangenome data is made available in multiple guises. Variation graphs
@@ -328,8 +329,8 @@ Pangenome data is made available in multiple guises. Variation graphs
ODGI is a format that supports an optimised dynamic genome/graph
@@ -348,8 +349,8 @@ implementation.
An RDF file that includes the sequences themselves in a variation
@@ -360,8 +361,8 @@ graph can be downloaded from
The many JSON files that are named as
@@ -372,8 +373,8 @@ Pangenome browser.
Including in below link is a log file of the last workflow runs.
@@ -381,8 +382,8 @@ Including in below link is a log file of the last workflow runs.
https://collections.lugli.arvadosapi.com/c=lugli-4zz18-z513nlpqm03hpca/
@@ -390,16 +391,16 @@ Including in below link is a log file of the last workflow runs.
We are planning the add the following output (see also
See fastq tracker and BAM tracker.
@@ -407,8 +408,8 @@ See fastq track
See MSA tracker.
@@ -416,8 +417,8 @@ See MSA tracker
See Phylo tracker.
@@ -425,8 +426,8 @@ See Phylo track
We aim to make protein predictions available.
@@ -435,8 +436,8 @@ We aim to make protein predictions available.
All source code for this website and tooling is available
@@ -445,9 +446,18 @@ from
+See the FAQ.
+3 Who created the public sequence resource?
+3 Who created the public sequence resource?
4 How does the public sequence resource compare to other data resources?
+4 How does the public sequence resource compare to other data resources?
5 Why should I upload my data here?
+5 Why should I upload my data here?
6 Why should I not upload by data here?
+6 Why should I not upload by data here?
7 How does the public sequence resource work?
+7 How does the public sequence resource work?
8 Who uses the public sequence resource?
+8 Who uses the public sequence resource?
9 How can I contribute?
+9 How can I contribute?
10 Is this about open data?
+10 Is this about open data?
11 Is this about free software?
+11 Is this about free software?
12 How do I upload raw data?
+12 How do I upload raw data?
13 How do I change metadata?
+13 How do I change metadata?
14 How do I change the work flows?
+14 How do I change the work flows?
15 How do I change the source code?
+15 How do I change the source code?
16 Should I choose CC-BY or CC0?
+16 Should I choose CC-BY or CC0?
17 Are there also variant in the RDF databases? *
+17 Are there also variant in the RDF databases?
18 How do I deal with private data and privacy?
+18 How do I deal with private data and privacy?
19 Do you have any checks or concerns if human sequence accidentally submitted to your service as part of a fastq? *
+19 Do you have any checks or concerns if human sequence accidentally submitted to your service as part of a fastq? *
20 Does PubSeq support only SARS-CoV-2 data? *
+20 Does PubSeq support only SARS-CoV-2 data?
21 How do I communicate with you?
+21 How do I communicate with you?
22 Who are the sponsors?
+22 Citing PubSeq
23 Who are the sponsors?
+
Created by Pjotr Prins (pjotr.public768 at thebird 'dot' nl) using Emacs org-mode and a healthy dose of Lisp!
Modified 2020-08-23 Sun 04:26.
+
Created by Pjotr Prins (pjotr.public768 at thebird 'dot' nl) using Emacs org-mode and a healthy dose of Lisp!
Modified 2020-08-24 Mon 03:12.
Table of Contents
1 Workflow runs
+1 Workflow runs
Data collections
. All current data is listed
2 FASTA files
+2 FASTA files
3 Metadata
+3 Metadata
4 Pangenome
+4 Pangenome
4.1 Pangenome GFA format
+4.1 Pangenome GFA format
4.2 Pangenome in ODGI format
+4.2 Pangenome in ODGI format
4.3 Pangenome RDF format
+4.3 Pangenome RDF format
4.4 Pangenome Browser format
+4.4 Pangenome Browser format
5 Log of workflow output
+5 Log of workflow output
6 All files
+6 All files
7 Planned
+7 Planned
7.1 Raw sequence data
+7.1 Raw sequence data
7.2 Multiple Sequence Alignment (MSA)
+7.2 Multiple Sequence Alignment (MSA)
7.3 Phylogenetic tree
+7.3 Phylogenetic tree
7.4 Protein prediction
+7.4 Protein prediction
8 Source code
+8 Source code
9 Citing PubSeq
+
Created by Pjotr Prins (pjotr.public768 at thebird 'dot' nl) using Emacs org-mode and a healthy dose of Lisp!
Modified 2020-06-12 Fri 04:41.
+
Created by Pjotr Prins (pjotr.public768 at thebird 'dot' nl) using Emacs org-mode and a healthy dose of Lisp!
Modified 2020-08-24 Mon 03:07.