Biogeo : an R package for assessing and improving data quality of occurrence record datasets

dc.contributor.authorRobertson, Mark P.
dc.contributor.authorVisser, Vernon
dc.contributor.authorHui, Cang
dc.contributor.emailmrobertson@zoology.up.ac.zaen_ZA
dc.date.accessioned2017-03-09T07:55:36Z
dc.date.issued2016-04
dc.description.abstractOccurrence data from museum and herbarium collections are valuable for mapping biodiversity patterns in space and time. Unfortunately these collections datasets contain many errors and suffer from several data quality issues that can influence the quality of the products derived from them. It is up to the user to identify these errors and data quality issues when using these data. Despite the large number of potential users of these datasets there are few software tools dedicated to error detection and correction of collections datasets. The R package biogeo was developed for detecting and correcting errors and for assessing of data quality of collections datasets consisting of occurrence records. Features of the package include error detection, such as mismatches between the recorded country and the country where the record is plotted, records of terrestrial species that fall into the sea and outlier detection. A key feature of the package is the ability to identify likely alternative positions for points that represent obvious errors in the dataset and functions to explore records in geographical and environmental space in order to identify possible errors in the dataset. Functions are also available for converting coordinates that are in various text formats into degrees, minutes and seconds and then into decimal degrees.en_ZA
dc.description.departmentZoology and Entomologyen_ZA
dc.description.embargo2017-04-30
dc.description.librarianhb2017en_ZA
dc.description.sponsorshipThe DST-NRF Centre for Invasion Biology, the National Research Foundation and the University of Pretoria.en_ZA
dc.description.urihttp://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1600-0587en_ZA
dc.identifier.citationRobertson, MP, Visser, V & Hui, C 2016, 'Biogeo : an R package for assessing and improving data quality of occurrence record datasets', Ecography, vol. 39, no. 4, pp. 394-401.en_ZA
dc.identifier.issn0906-7590 (print)
dc.identifier.issn1600-0587 (online)
dc.identifier.other10.1111/ecog.02118
dc.identifier.urihttp://hdl.handle.net/2263/59342
dc.language.isoenen_ZA
dc.publisherWileyen_ZA
dc.rights© 2016 The Authors and Nordic Society Oikos. This is the pre-peer reviewed version of the following article : Biogeo : an R package for assessing and improving data quality of occurrence record datasets, Ecography, vol. 39, no. 4, pp. 394-401, 2016. doi : 10.1111/ecog.02118 . The definite version is available at : http://onlinelibrary.wiley.comjournal/10.1111/(ISSN)1600-0587.en_ZA
dc.subjectBiogeoen_ZA
dc.subjectAssessing and improving data qualityen_ZA
dc.subjectR packageen_ZA
dc.subjectOccurrence record datasetsen_ZA
dc.titleBiogeo : an R package for assessing and improving data quality of occurrence record datasetsen_ZA
dc.typePostprint Articleen_ZA

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Robertson_Biogeo_2016.pdf
Size:
1.13 MB
Format:
Adobe Portable Document Format
Description:
Postprint Article

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.75 KB
Format:
Item-specific license agreed upon to submission
Description: