Guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of datasets, known as FAIR principles, were introduced in 2016 to enable machines to perform automatic actions on a variety of digital objects, including datasets. Since then, the principles have been widely adopted by data creators and users worldwide with the ‘FAIR’ acronym becoming a common part of the vocabulary of data scientists. However, there is still some controversy on how datasets should be interpreted since not all datasets that are claimed to be FAIR, necessarily follow the principles. In this research, we propose the OntoUML FAIR Principles Schema, as an ontological representation of FAIR principles for data practitioners. The work is based on OntoUML, an ontologically well-founded language for Ontology-driven Conceptual Modeling. OntoUML is a proxy for ontological analysis that has proven effective in supporting the explanation of complex domains. Our schema aims to disentangle the intricacies of the FAIR principles’ definition, by resolving aspects that are ambiguous, under-specified, recursively-specified, or implicit. The schema can be considered as a blueprint, or a template to follow when the FAIR classification strategy of a dataset must be designed. To demonstrate the usefulness of the schema, we present a practical example based on genomic data and discuss how the results provided by the OntoUML FAIR Principles Schema contribute to existing data guidelines.

Ontological Representation of FAIR Principles: A Blueprint for FAIRer Data Sources

Bernasconi, Anna;
2023-01-01

Abstract

Guidelines to improve the Findability, Accessibility, Interoperability, and Reuse of datasets, known as FAIR principles, were introduced in 2016 to enable machines to perform automatic actions on a variety of digital objects, including datasets. Since then, the principles have been widely adopted by data creators and users worldwide with the ‘FAIR’ acronym becoming a common part of the vocabulary of data scientists. However, there is still some controversy on how datasets should be interpreted since not all datasets that are claimed to be FAIR, necessarily follow the principles. In this research, we propose the OntoUML FAIR Principles Schema, as an ontological representation of FAIR principles for data practitioners. The work is based on OntoUML, an ontologically well-founded language for Ontology-driven Conceptual Modeling. OntoUML is a proxy for ontological analysis that has proven effective in supporting the explanation of complex domains. Our schema aims to disentangle the intricacies of the FAIR principles’ definition, by resolving aspects that are ambiguous, under-specified, recursively-specified, or implicit. The schema can be considered as a blueprint, or a template to follow when the FAIR classification strategy of a dataset must be designed. To demonstrate the usefulness of the schema, we present a practical example based on genomic data and discuss how the results provided by the OntoUML FAIR Principles Schema contribute to existing data guidelines.
2023
Advanced Information Systems Engineering. CAiSE 2023.
978-3-031-34559-3
978-3-031-34560-9
FAIR data
OntoUML FAIR Principles Schema
FAIRness guidance
Ontological Modeling Language
OntoUML
File in questo prodotto:
File Dimensione Formato  
CAiSE_2023_FAIR_OntoUML.pdf

accesso aperto

: Post-Print (DRAFT o Author’s Accepted Manuscript-AAM)
Dimensione 687.33 kB
Formato Adobe PDF
687.33 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1240078
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? ND
social impact