SLICES Data Management Infrastructure for Reproducible Experimental Research on Digital Technologies | IEEE Conference Publication | IEEE Xplore

SLICES Data Management Infrastructure for Reproducible Experimental Research on Digital Technologies


Abstract:

This paper presents the ongoing research effort related to the design of the Data Management Infrastructure (DMI) to support experimental research on digital technologies with application to the ESFRI SLICES scientific instrument. We consider the experiment documentation and data collection across the whole continuum of access network, IoT, edge, cloud, and data processing workflow. The paper includes the requirements analysis for DMI to enable research reproducibility of complex and large-scale experimentation. We provide an analysis of data collected and processed in SLICES and explain approaches and solutions used in SLICES for experimental research reproducibility, primarily based on the plain orchestration service and supported by metadata collection tools. The proposed multi-layer DMI includes: data (storage) access, data processing, data ingest, experiment management, and virtual research environment. The paper also provides recommendations for the selection of existing standards and tools for data and metadata management, in particular those developed by EOSC and supported by the RDA community to ensure wide compatibility and integration.
Date of Conference: 04-08 December 2023
Date Added to IEEE Xplore: 21 March 2024
Conference Location: Kuala Lumpur, Malaysia

I. Introduction

Wider adoption of Open Science requires a modern research infrastructure, and it requires scientists to pay more attention to consistent data management in order to support effective data sharing and communication between researchers [1]. The introduction of the FAIR data principles, together with the ongoing development and implementation of supporting standards, frameworks, and tools, has in recent years significantly improved the possibilities for sharing research data and other publishable research results, with research reproducibility as a goal, via popular Open Access and self-archiving services such as OpenAIRE [2] and Zenodo [3]. The European Open Science Cloud (EOSC) [4] provides the federated data sharing infrastructure. Recent developments such as RO-Crate [5,6] have the potential to support complex research objects and their evolution. This is especially important for experimental research reproducibility, which requires documenting a large volume of information related to the experiment setup, workflow, input data, and measurement data [7].
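To make the documentation requirement concrete, the following is a minimal sketch of how one experiment might be described as an RO-Crate 1.1 metadata document [6]. It is an illustration only, not the SLICES implementation: the function name `build_experiment_crate`, the file names, the experiment title, the date, and the license are all hypothetical. Only the JSON-LD layout (context URL, metadata descriptor, root `Dataset`, `File` entities linked through `hasPart`) follows the RO-Crate 1.1 specification.

```python
import json


def build_experiment_crate(name, description, date_published, license_url, files):
    """Return an RO-Crate 1.1 metadata document (a JSON-LD dict).

    `files` is a list of (relative_path, human_readable_name) pairs.
    All argument values used below are hypothetical examples.
    """
    # One File entity per artefact that documents the experiment.
    file_entities = [
        {"@id": path, "@type": "File", "name": label}
        for path, label in files
    ]
    # Root data entity: describes the experiment as a whole and
    # references every file through hasPart.
    root = {
        "@id": "./",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "datePublished": date_published,
        "license": {"@id": license_url},
        "hasPart": [{"@id": entity["@id"]} for entity in file_entities],
    }
    # Metadata descriptor: identifies this document as an RO-Crate 1.1 file.
    descriptor = {
        "@id": "ro-crate-metadata.json",
        "@type": "CreativeWork",
        "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
        "about": {"@id": "./"},
    }
    return {
        "@context": "https://w3id.org/ro/crate/1.1/context",
        "@graph": [descriptor, root, *file_entities],
    }


# Hypothetical experiment: setup, workflow, and measurement data files.
crate = build_experiment_crate(
    name="Example latency experiment",
    description="Illustrative packaging of experiment setup, workflow, and results.",
    date_published="2023-12-04",
    license_url="https://creativecommons.org/licenses/by/4.0/",
    files=[
        ("setup/topology.yaml", "Experiment setup"),
        ("workflow/run.sh", "Experiment workflow script"),
        ("results/measurements.csv", "Measurement data"),
    ],
)

# The serialized document would be stored as ro-crate-metadata.json
# in the root directory of the packaged experiment.
print(json.dumps(crate, indent=2))
```

Keeping the setup, workflow, and measurement files as separate entities under one root dataset is one way to record the categories of information listed above in a single shareable object.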

References

1. Open Science, [online] Available: https://www.fosteropenscience.eu/content/whatopen-science-introduction.
2. OpenAIRE, [online] Available: https://www.openaire.eu/en/home.
3. Zenodo, [online] Available: https://zenodo.org/.
4. EOSC Association, [online] Available: https://eosc.eu/about-eosc.
5. Research Object Crate (RO-Crate), [online] Available: https://www.researchobject.org/ro-crate/.
6. RO-Crate Metadata Specification 1.1, [online] Available: https://www.researchobject.org/ro-crate/1.1/.
7. Y. Demchenko, S. Gallenmüller, S. Fdida, P. Andreou, C. Cretaz and M. Kirkeng, "Experimental Research Reproducibility and Experiment Workflow Management", TASIR Workshop Proc. COMSNETS 2023 Conf.
8. S. Fdida, N. Makris, T. Korakis, R. Bruno, A. Passarella, P. Andreou, et al., "SLICES, a scientific instrument for the networking community", Comput. Commun., vol. 193, pp. 189-203, 2022.
9. Strategic Research and Innovation Agenda (SRIA) of the European Open Science Cloud (EOSC) Version 1.0 21, June 2021, [online] Available: https://op.europa.eu/nl/publication-detail/-/publication/f9b12d1d-74ea-11ec-9136-01aa75ed71a1.
10. EOSC Portal Catalog and Marketplace, [online] Available: https://marketplace.eosc-portal.eu/.
11. EOSC Components FAIRCORE4EOSC Project, [online] Available: https://faircore4eosc.eu/eosc-core-components.
12. RELIANCE Project, [online] Available: https://www.reliance-project.eu/.
13. ROHub, [online] Available: http://reliance.rohub.org.
14. S. Soiland-Reyes et al., "Packaging research artefacts with RO-Crate", Data Science, vol. 5, no. 2, pp. 97-138.
15. FAIR Data Principles, [online] Available: https://www.go-fair.org/fair-principles/.
16. "FAIR Data Maturity Model: specification and guidelines", Research Data Alliance, 2020.
17. EGI, [online] Available: https://www.egi.eu/.
18. Y. Demchenko, C. de Laat, W. Los and L. Gommans, "Defining platform research infrastructure as a service (PRIaaS) for future scientific data infrastructure" in Designing Data Spaces: The Ecosystem Approach to Competitive Advantage, Springer, pp. 241-260, 2022.
19. "Deliverable D4.3 Definition of the SLICES metadata profiles to support FAIR principles" in SLICES-DS Project, 31 August 2021.
20. "Deliverable D4.5 SLICES infrastructure and services integration with EOSC Open Science and FAIR: Recommendations and design patterns (final report)" in SLICES-DS Project, 31 August 2022.
21. Fed4FIRE, [online] Available: https://www.fed4fire.eu/.
22. OneLab, [online] Available: https://onelab.eu/.
23. Planetlab, 2022-11-16, [online] Available: www.planet.com/.
24. Geni, 11 2022, [online] Available: https://portal.geni.net/.
25. T. Rakotoarivelo, M. Ott, G. Jourjon and I. Seskar, "OMF: a control and management framework for networking testbeds", ACM SIGOPS Oper. Syst. Rev., vol. 43, no. 4, pp. 54-59, 2009.
26. A. Quereilhac, M. Lacage, C. D. Freire, T. Turletti and W. Dabbous, "NEPI: an integration framework for network experimentation", 19th Int Conf on Software, pp. 1-5, September 15-17, 2011.
27. GitHub Features, [online] Available: https://github.com/features.
28. Redhat Ansible, [online] Available: https://www.ansible.com/.
29. Terraform, [online] Available: https://www.terraform.io/.
30. Jupyter, [online] Available: https://jupyter.org/.