Darwin Core Archive Controlled Field Assessor

John Wieczorek edited this page Nov 8, 2016 · 10 revisions

This workflow:

  • creates a given directory as a workspace
  • downloads a Darwin Core Archive from a given URL
  • downloads vocabulary lookup files from https://github.com/kurator-org/kurator-validation/tree/master/packages/kurator_dwca/data/vocabularies
  • extracts the core file of the Darwin Core Archive to a tab-separated text file
  • for each field in the list of Darwin Core Controlled Value fields (see below), creates a report of counts of distinct values
  • for each field in the list of Darwin Core Controlled Value fields (see below), creates a report of recommended replacements for values that are not standard
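
The last two steps above can be sketched in a few lines of Python. This is only an illustration and not the workflow's actual implementation: the function names, the sample data, and the shape of the lookup (a plain dict from observed value to recommended value) are all assumptions made for the example, and the real vocabulary lookup files may be structured differently.

```python
import csv
import io
from collections import Counter


def count_distinct_values(tsv_text, field):
    """Count how often each distinct value of `field` occurs in tab-separated text."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    return Counter(row[field] for row in reader)


def recommend_values(counts, lookup):
    """Map each non-standard value to a recommended value (None if unknown).

    `lookup` is a hypothetical dict from observed value to standard value.
    Values that are already standard are left out of the result.
    """
    standard = set(lookup.values())
    return {
        value: lookup.get(value)
        for value in counts
        if value not in standard
    }


# Hypothetical content of an extracted core file (tab-separated).
core_tsv = (
    "occurrenceID\tbasisOfRecord\n"
    "1\tPreservedSpecimen\n"
    "2\tspecimen\n"
    "3\tspecimen\n"
    "4\tFossil\n"
)

# Hypothetical vocabulary lookup: observed value -> recommended value.
basis_lookup = {
    "specimen": "PreservedSpecimen",
    "Fossil": "FossilSpecimen",
}

counts = count_distinct_values(core_tsv, "basisOfRecord")
recommendations = recommend_values(counts, basis_lookup)
print(counts)
print(recommendations)
```

Keeping the counting and the recommendation as separate functions mirrors the two separate reports the workflow produces for each controlled value field.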

The files produced by this workflow are:

References

Workflow configuration file: https://github.com/kurator-org/kurator-validation/blob/master/packages/kurator_dwca/workflows/dwca_controlled_term_assessor.yaml

Darwin Core Controlled Value lookup files: https://github.com/kurator-org/kurator-validation/tree/master/packages/kurator_dwca/data/vocabularies

Darwin Core Controlled Value fields (from http://rs.tdwg.org/dwc/terms/index.htm):
