Course description
We will explore the ELIXIR Data Stewardship Wizard (DSW) and see how it can be adopted and adapted for a community or research project. We will take the first steps towards FAIRifying metadata by means of linked data. Finally, we will see how FAIR data can be used in computational workflows and how to gather provenance data while working with data assets. The training is organised by ELIXIR-NL and will be given by DTL, ELIXIR-CZ, SURF and Wageningen University.
Target audience
The training is aimed at data stewards, data scientists and research software engineers who would like to deepen their knowledge of the concepts and practical implementations of data management and stewardship tools. We assume that you are familiar with programming in a language such as Python, R or MATLAB.
Learning outcomes
Participants will gain insight into the Data Stewardship Wizard and learn how to employ the tool and adjust it to a specific use case. They will learn about the principles of data modeling and of making data FAIR by means of linked data. During the course we will apply these technologies to a real-world data set. Afterwards, participants will know how to approach their own data and will be able to take the first steps in the FAIRification process. Finally, participants will know how to manage data and metadata in iRODS and how to use data and metadata stored in iRODS in calculations on a remote compute cluster.
Technologies
ELIXIR Data Stewardship Wizard; Linked data; iRODS; Python; SURF compute clusters
Prerequisites
Own laptop; Bash shell, login to a VM via SSH; Elementary knowledge of Python
Draft programme
Monday December 2, 2019 – 12:00 – 17:00 (onsite & remote training)
Trainers: Rob Hooft (ELIXIR-NL, DTL), Robert Pergl (ELIXIR-CZ, CCMI-FIT), Jan Slifka (ELIXIR-CZ, CCMI-FIT)
- 12:00 – arrival with lunch
- Introduction to data management plans
- Introduction and practical demonstration of the features of DSW: user view and data steward view
- Hands-on session
Tuesday December 3, 2019 – 09:00 – 17:00
Trainers: Ben Schaap (Agrosystems Research, WUR), Erik van den Berg (WDCC, WUR), Jasper Koehorst (Systems and Synthetic Biology, WUR), Anand Gavai (WFSR, WUR)
- Linked data training
- Introduction to Linked Open Data
- Exercise 1: pencil & paper exercise
- Presentation on SPARQL and Wikidata
- Hands-on session with real data, putting everything into practice
Wednesday December 4, 2019 – 09:00 – 17:00
Trainers: Arthur Newton, Claudia Behnke, Claudio Cacciari (SURFsara)
- iRODS and compute
You can find all the information and registration possibilities here