diff options
author | Pjotr Prins | 2021-01-04 08:58:38 +0000 |
---|---|---|
committer | Pjotr Prins | 2021-01-04 08:58:38 +0000 |
commit | 1c4e055b8a9dc53b7fdbdf12d4b0a7e877fbc2ef (patch) | |
tree | 34cc42ef12b81c05be8a57ca2a973b97e52f8461 /workflows/tools/normalize/README.md | |
parent | ba4161b1660c3a67090dd3715e9862906fb1cc5f (diff) | |
download | bh20-seq-resource-1c4e055b8a9dc53b7fdbdf12d4b0a7e877fbc2ef.tar.gz bh20-seq-resource-1c4e055b8a9dc53b7fdbdf12d4b0a7e877fbc2ef.tar.lz bh20-seq-resource-1c4e055b8a9dc53b7fdbdf12d4b0a7e877fbc2ef.zip |
Started on normalization
Diffstat (limited to 'workflows/tools/normalize/README.md')
-rw-r--r-- | workflows/tools/normalize/README.md | 14 |
1 files changed, 14 insertions, 0 deletions
diff --git a/workflows/tools/normalize/README.md b/workflows/tools/normalize/README.md new file mode 100644 index 0000000..b780a68 --- /dev/null +++ b/workflows/tools/normalize/README.md @@ -0,0 +1,14 @@ +# Normalization steps + +This library contains generic logic to normalize (string) data and +transforms strings to URIs. It should be applicable to data from +any source (GenBank, ENA etc). + +Important: missing data should be missing or None! Do not fill +in data by 'guessing'. + +When data is malformed a warning should be logged and added to the +warning list. Functions should be small enough to return only 1 +warning! + +Pjotr Prins (c) 2021 |