Applying FAIR principles to research data is crucial for assuring effective and efficient data re-use which is becoming a prerequisite in contemporary research endeavors. However, despite the catchy acronym, ensuring FAIRness of research data can be tricky and tedious, discouraging rearchers from publishing their data under these constraints. This represents a great loss for the scientific community since the already acquired data will not be available for replication and reuse in the future.
With the goal of ensuring data FAIRness as the new standard, the webinar Tools and Resources for FAIR Data, organised by SSHOC and delivered by Anca Vlad (UK Data Service) on May 18, 2020, offered an overview of FAIR principles and available tools and resources that can help researchers easily prepare their data in adherence with FAIR guidelines. The webinar also showcased the QAMyData tool which enables researchers across scientific domains to check and improve the quality of their datasets.
F: To ensure that data is FINDABLE, check whether it has a globally unique and persistent digital identifier (such as a DOI in the citation). This video explains why citing data appropriately using such identifiers is so important. If data is published with an accredited archive or data repository, it will be assigned this unique identifier before publishing. Check whether the archive or repository of your choice is CoreTrustSeal accredited, since this guarantees their trustworthiness. If you are unsure where you can publish your research outputs (data, code or other), check out the re3data.org platform which contains over 2000 data repositories you can choose from.
A: To be ACCESSIBLE, (meta)data should be retrievable using an open, universally applicable and standardized protocol. Data does not need to be open to ensure its accessibility; what is essential is that the process of obtaining data is made clear and described in plain language for each dataset. Depending on the content of the data, this protocol would provide information about the access level, terms and conditions applicable. There is not much variation in terms of access restrictions across data providers (generally the options are open, safeguarded, or restricted), however the process and conditions for obtaining the data can be different depending on the archive/repository/data publisher.
I: To be INTEROPERABLE, (meta)data needs to be in specific formats, and use community agreed standards and vocabularies/ontologies, such as the DDI Schema. Specific formats should be respected because digital data is very software dependent and as such prone to corruption/loss due to the obsolescence of software/hardware. The EMM Survey Registry is a good example here as it uses a multitude of machine readable metadata fields to describe its datasets (such as scope, region(s), start date, end date of survey, target population, etc.).
R: And finally, to be REUSABLE, the quality of supporting documentation, metadata and of course data itself, as well as the applicable license, are important. The license should allow the data to be available to the widest possible audience with the widest possible range of uses. This webinar looked in particular at the QAMyData tool that can help researchers check and increase the quality of quantitative data, and, therefore, its reusability.
The FAIR concept pertains to the whole data lifecycle. In order to simplify editing at the end, it is important to keep the requirements of FAIR in mind from study design onwards. In this process of ensuring FAIRness of data, certain tools and resources can help overcome the challenges.
If you’re part of the Archaeology or Heritage Science, you might also be interested in our webinar on Use and Re-use of Scientific Data in Archaeology and Heritage.