-
Notifications
You must be signed in to change notification settings - Fork 3
Darwin Core Archive Assessor
John Wieczorek edited this page Oct 12, 2016
·
1 revision
This workflow:
- takes a Darwin Core CSV file as input
- runs a scientific name validation against a chosen taxonomic authority
- runs an eventDate validation (including checking the event date against birth/death dates for the collector, if known).
- runs a georeference validation
- runs an basisOfRecord validation
- creates an Excel spreadsheet containing the original data with highlights on fields with validation issues, and extra worksheets with annotations about the validation processes for each record.
The files produced by this workflow are:
- error_log_[a].xls - a log file of the the workflow run. a is an identifier produced by the workflow.
- output_[b].xls - an Excel spreadsheet containing the results of the workflow. b is an identifier produced by the workflow.
- output_[c].json - a json file containing the results of the workflow. c is an identifier produced by the workflow.
- output_log_[d].txt - a summary of the workflow run. d is an identifier produced by the workflow.
- README_[e].txt - a summary of the workflow run. e is an identifier produced by the workflow.
This workflow is effectively Workflow 1 (All-in-1:SciName-Georef-Date (DwCa)) described on: http://wiki.datakurator.net/web/FP-Akka_User_Documentation, but with a different data loading actor, with an Excel summary writer and postprocessor into a result spreadsheet.
Workflow source code: https://sourceforge.net/p/filteredpush/svn/HEAD/tree/trunk/FP-Tools/FP-CurationServices/
Actors in the workflow: