From bf6d73b68ab685ff4ad18ca883171e0fef40bc87 Mon Sep 17 00:00:00 2001 From: Pjotr Prins Date: Sun, 23 Aug 2020 10:41:03 +0100 Subject: Textual changes --- doc/web/about.html | 196 +++++++++++++++++++++++++++++++++-------------------- 1 file changed, 123 insertions(+), 73 deletions(-) (limited to 'doc/web/about.html') diff --git a/doc/web/about.html b/doc/web/about.html index f6d1761..aa12851 100644 --- a/doc/web/about.html +++ b/doc/web/about.html @@ -3,7 +3,7 @@ "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
- +-The public sequence resource aims to provide a generic and useful -resource for COVID-19 research. The focus is on providing the best -possible sequence data with associated metadata that can be used for -sequence comparison and protein prediction. +PubSeq, the public sequence resource, aims to provide a generic and +useful resource for COVID-19 research. The focus is on providing the +best possible sequence data with associated metadata that can be used +for sequence comparison and protein prediction. +
+ ++Just to avoid the notion that PubSeq is a biorepository: it is far +more than that. We are not competing with the likes of GenBank, +EBI/ENA and GISAID - in fact, PubSeq sources much of the data from +those and they get ours. +
+ ++There is no conflict of posting data to multiple repositories. +
+ ++It is better to view PubSeq as an open precision medicine initiative that +allows for early detection of new variants and strains; very valuable +for both testing and treatment. We have live analysis work flows that +run on uploading a new sequence. Contributing sequences helps the +overall detection. At this moment the output is limited. We will soon +add interactive discovery tools. Uploading data from a sequencer will +give researchers all information they want in 5 hours without any +bioinformatics knowledge. We may even provide workflows that take +data straight from the sequencer. In fact, you get very little out of +existing biorepositories, in our opinion, unless you have solid +bioinformatics support. And even then there is the question about +comparing data that was created using different technologies and +workflows. How do you deal with that? +
+ ++With PubSeq we are in a position to reanalyse that data from raw +material when new insights arise. It is likely that COVID19 will be +around for another year at least. After that we should prepare for +the next pandemic. PubSeq is meant for that. We take the long view and +it will only get more powerful. Who knows: maybe the live analysis +part of PubSeq will be an approach that other biorepositories will +follow.
We presented at the BOSC 2020 Have a look at the video (alternative
@@ -282,8 +332,8 @@ link) and the
-
The public sequence resource is an initiative by bioinformatics and
@@ -304,8 +354,8 @@ wrangling experts. Thank you everyone!
The short version is that we use state-of-the-art practices in
@@ -328,8 +378,8 @@ such as GISAID.
Funny question. There are only good reasons to upload your data here
@@ -383,8 +433,8 @@ for bulk uploads!
On uploading a sequence with metadata it will automatically be
@@ -395,8 +445,8 @@ using workflows from the High Performance Open Biology Lab defined
The Swiss Institute of Bioinformatics has included this data in
@@ -414,8 +464,8 @@ for monitoring, protein prediction and drug development.
You can contribute by submitting sequences, updating metadata, submit
@@ -427,8 +477,8 @@ point.
All data is published under a Creative Commons 4.0 attribution license
@@ -438,8 +488,8 @@ data and store it for further processing.
Absolutely. Free software allows for fully reproducible pipelines. You
@@ -448,8 +498,8 @@ can take our workflows and data and run it elsewhere!
We are preparing raw sequence data pipelines (fastq and BAM). The
@@ -464,8 +514,8 @@ assembly variations into consideration. This is all work in progress.
See the http://covid19.genenetwork.org/blog!
@@ -473,8 +523,8 @@ See the http://covid19.genenetwork
Workflows are on github and can be modified. See also the BLOG
@@ -483,8 +533,8 @@ Workflows are on
-
Go to our source code repositories, fork/clone the repository, change
@@ -494,8 +544,8 @@ many PRs we already merged.
Restrictive data licenses are hampering data sharing and reproducible
@@ -511,8 +561,8 @@ In all honesty: we prefer both data and software to be free.
We do output a RDF file with the pangenome built in, and you can parse it because it has variants implicitly.
@@ -524,8 +574,8 @@ We are also writing tools to generate VCF files directly from the pangenome.
A public sequence resource is about public data. Metadata can refer to
@@ -536,8 +586,8 @@ plan to combine identifiers with clinical data stored securely at
We are planning to remove reads that match the human reference.
@@ -545,8 +595,8 @@ We are planning to remove reads that match the human reference.
To date, PubSeq is a resource specific to SARS-CoV-2, but we are designing it to be able to support other species in the future.
@@ -555,8 +605,8 @@ To date, PubSeq is a resource specific to SARS-CoV-2, but we are designing it to
We use a gitter channel you can join. See also contact.
@@ -564,8 +614,8 @@ We use a
-
The main sponsors are listed in the footer. In addition to the time
@@ -576,7 +626,7 @@ for donating COVID-19 related compute time.
3 Who created the public sequence resource?
+3 Who created the public sequence resource?
4 How does the public sequence resource compare to other data resources?
+4 How does the public sequence resource compare to other data resources?
5 Why should I upload my data here?
+5 Why should I upload my data here?
6 Why should I not upload by data here?
+6 Why should I not upload by data here?
7 How does the public sequence resource work?
+7 How does the public sequence resource work?
8 Who uses the public sequence resource?
+8 Who uses the public sequence resource?
9 How can I contribute?
+9 How can I contribute?
10 Is this about open data?
+10 Is this about open data?
11 Is this about free software?
+11 Is this about free software?
12 How do I upload raw data?
+12 How do I upload raw data?
13 How do I change metadata?
+13 How do I change metadata?
14 How do I change the work flows?
+14 How do I change the work flows?
15 How do I change the source code?
+15 How do I change the source code?
16 Should I choose CC-BY or CC0?
+16 Should I choose CC-BY or CC0?
17 Are there also variant in the RDF databases? *
+17 Are there also variant in the RDF databases? *
18 How do I deal with private data and privacy?
+18 How do I deal with private data and privacy?
19 Do you have any checks or concerns if human sequence accidentally submitted to your service as part of a fastq? *
+19 Do you have any checks or concerns if human sequence accidentally submitted to your service as part of a fastq? *
20 Does PubSeq support only SARS-CoV-2 data? *
+20 Does PubSeq support only SARS-CoV-2 data? *
21 How do I communicate with you?
+21 How do I communicate with you?
22 Who are the sponsors?
+22 Who are the sponsors?
Created by Pjotr Prins (pjotr.public768 at thebird 'dot' nl) using Emacs org-mode and a healthy dose of Lisp!
Modified 2020-08-22 Sat 04:42.
+
Created by Pjotr Prins (pjotr.public768 at thebird 'dot' nl) using Emacs org-mode and a healthy dose of Lisp!
Modified 2020-08-23 Sun 04:26.