diff options
Diffstat (limited to 'scripts/db_enrichment')
-rw-r--r-- | scripts/db_enrichment/readme.md | 9 |
1 files changed, 5 insertions, 4 deletions
diff --git a/scripts/db_enrichment/readme.md b/scripts/db_enrichment/readme.md index 83297dc..55ec496 100644 --- a/scripts/db_enrichment/readme.md +++ b/scripts/db_enrichment/readme.md @@ -4,16 +4,17 @@ We have two files in the folder *semantic_enrichment* that are used to enrich th Static label about the ontology vocabulary terms we use. This file has to be updated manually. Use the OLS or bioportal to find more information about a used ontology term. ### semantic_enrichment/countries.ttl -File containing information about the countries in our database. Additional information about countries are e.g. the label or GPS coordinates. We enricht the country identifier via wikidata. +File containing information about the countries in our database. Additional information about countries are e.g. the label or GPS coordinates. We enricht the country identifier via wikidata. Please ensure that the .ttl file is valid by e.g. using his online validator (http://ttl.summerofcode.be/). #### Update process - What countries (=wikidata identifier) do we have to enrich? -This query retrieves all countries (ids) from our database that do not have a label yet: +This SPARQL query (http://sparql.genenetwork.org/sparql/) retrieves all countries (ids) from our database that do not have a label yet: + >SELECT DISTINCT ?geoLocation WHERE >{ ->?fasta ?x [<http://purl.obolibrary.org/obo/GAZ_00000448> ?geoLocation] . ->FILTER NOT EXISTS {?geoLocation <http://www.w3.org/2000/01/rdf-schema#label> ?geoLocation_tmp_label} +>?fasta ?x [ <<http://purl.obolibrary.org/obo/GAZ_00000448>> ?geoLocation] . +>FILTER NOT EXISTS {?geoLocation <<http://www.w3.org/2000/01/rdf-schema#label>> ?geoLocation_tmp_label} >} - Use the list of identifiers created with the query above as input for the update script *country_enrichment.py*. The script creates a temporary .ttl file in this folder |