Bring Your Own Data workshops

At a Bring Your Own Data (BYOD) workshop, experts assist you in improving the FAIRness of your research data. The main goal of these three-day events is to learn how to FAIRify the datasets using linked data technology, and to combine the datasets with other FAIR datasets to answer a scientific question.

At a BYOD, data owners, domain experts (usually biologists or chemists), and FAIR data experts jointly work on specific data sets. At the start, data owners present the data they wish to make FAIR. The data experts have extensive knowledge about FAIR data formats and principles, and support the data owners in choosing the optimal data model. In addition, they make sure that FAIR linked data is produced in the end. Domain experts can assist the data owners and data experts to solve intellectually challenging data modelling issues and to demonstrate the added value of FAIR data in answering specific research questions.

Organising a BYOD
DTL is happy to help you organise a BYOD. We have developed scripts, working documents, planning documents, a budget, and training materials for life science researchers and future data experts. Furthermore, we have a pool of FAIR data experts standing by, eager to share their knowledge. The general procedure is as follows:

  • Send us an email outlining the goal, motivation, possible data owners, and preferred dates of the BYOD.
  • DTL will contact you and set up the logistics.
  • Once it is clear which data owners will attend the prospective BYOD, DTL will contact the FAIR data experts. At least two data experts are needed per data owner, and preferably at least one domain expert.
  • A definitive date is set for the BYOD.
  • Webinars are planned in which the data owners learn more about linked data and the FAIR principles, and in which demonstration use cases/ research questions for the BYOD will be defined. Data owners will also receive documentation prior to the workshop.
  • A report/blog about the outcomes of the BYOD will be available at the DTL website.

FAIR Data Training
The need for linked and FAIR data experts is growing rapidly. Hence, the upcoming BYODs will function as hands-on training events for FAIR/linked data experts. Every BYOD will have at least one participant that is responsible for training the new experts, for monitoring lessons learned, and for producing written documentation about the experience gained and new data types modelled.

Previous BYODs
Since 2014, we have organised multiple BYODs, supporting organisations and companies in producing FAIR data (e.g., Human Protein Atlas, Enza Zaden, Rijk Zwaan, the rare disease community). Since 2016, we have also organised BYODs for projects such as ELIXIR EXCELERATE and Odex4all.

<!–In order to make optimal use of research data and methods it is essential to connect and functionally interlink datasets and to publish these in a FAIR manner. FAIR data should be:

  1. Findable – easy to find by both humans and computer systems and based on mandatory description of the metadata that allows the discovery of interesting datasets;
  2. Accessible – stored for long term such that the data can be easily accessed and/or downloaded with well-defined license and access conditions (Open Access when possible), whether at the level of metadata, or at the level of the actual data content;
  3. Interoperable – ready to be combined with other datasets by humans as well as by computer systems;
  4. Reusable – ready to be used for future research and to be processed further using computational methods.

DTL and ELIXIR have developed the concept of Bring Your Own Data (BYOD) parties as a low barrier approach to get data owners acquainted with the possibilities opened by ‘functionally interlinking’ data with other important datasets.

Data owners and trainers

In a BYOD people or organisations that have data they want to make FAIR get together with data experts/trainers. The data owners bring their data to the BYOD and get two days of undivided attention and ‘hands-on’ transformation of the data into FAIR format. Furthermore, the data owners and data experts work out a showcase together to demonstrate the added value of FAIR data in answering research questions using multiple resources. At the end of a BYOD party data owners are familiar with the basic principles of making data FAIR, in such a way that they can start using the FAIR data approach themselves.

DTL provides a growing list of international data experts who are willing to come to BYOD parties; please let us know if you are interested to attend BYODs as a trainer.

Domain experts

A third expertise category that comes in handy at a BYOD are domain experts, i.e. biologists who understand the context of the data and can help solving content related challenges that may occur. Minimally one or two ‘real biologists’ need to be at the party, especially when the  biological knowledge present amongst the data owners is not sufficient.–>

A tailored programme is set up for each BYOD because it involves a unique combination of data owners and data sets. In principle, all BYODs contain the following elements:

Preparation
Prior to the BYOD, one or two webinars are organised to introduce the principles to the attendees, to point them to preparatory materials, and to provide the starting points for a BYOD: a list of questions from data owners and a list of existing linked data sets by the data experts. Having specific research questions or workflows at hand that cannot be answered with the data in its original format is also helpful to demonstrate the added value of the FAIR linked data approach. Good preparation is very important since a lot of time can be saved during the BYOD if data models and, for instance, the most relevant vocabularies and ontologies have been defined beforehand.

Execution
The first two days of the three-day BYOD are dedicated to transforming the data owners’ data into FAIR data using the FAIR Data unit’s technology and guidelines, meanwhile providing hands-on training for both the data owners and the future FAIR linked data experts.

Together, data owners and data experts work out a showcase to demonstrate the added value of FAIR data in answering specific research questions of the data owner using multiple integrated data resources. At the end of every BYOD, data owners will also get an overview of the potential of the interlinked data. In our experience, this demonstration will trigger the imagination and will bring about novel questions that can be answered using unimagined combinations of data.

The newly transformed FAIR data is then deposited in either an open or closed (in case of proprietary data) repository with the original data remaining at the data owner’s location. The FAIR Data Unit will host this repository.

The last day of the BYOD is dedicated to exploration of the data using analytics tools.

Follow up
After a BYOD, data owners are familiar enough with the basic principles of making data FAIR, allowing them to use the FAIR data approach themselves. For further support, two teleconferences are planned to follow up after 2-3 and 6-8 weeks, to tackle any problems the data owners might encounter.

One of the biggest challenges of data-intensive science is to facilitate knowledge discovery. Life scientists, both in the public and the private domain, produce large amounts of data that are both complex and heterogeneous. They also make use of multiple ‘core’ data resources like UniProt or ChEMBL. Researchers spend many hours in projects coupling these data sources, struggling to decrypt the data and to transform them into actionable knowledge. Connecting and functionally interlinking datasets is therefore essential for knowledge discovery.

The FAIR Data Unit (FDU) at DTL offers a helping hand in linking data. FDU organises Bring Your Own Data (BYOD) workshops in which experts in modeling data and content experts support data owners to make data FAIR using linked data technology. The acronym FAIR for data means that they are Findable, Accessible, Interoperable and Reusable, by humans and computers.

To generate value for a research community beyond the initial researchers, funding agencies are increasingly setting requirements for proper data stewardship of research data. Since FAIR data is vital to enable appropriate data stewardship and will be mandated by funding agencies and national governments alike, there will be a definitive need to publish FAIR data for new and legacy data sets. FAIR Data publishing will need to be a service provided by many certified entities across Europe.

We have developed a methodology for making data FAIR via BYOD workshops. A BYOD is a low barrier approach to get data owners acquainted with the possibilities opened up by ‘functionally interlinking’ data with other relevant datasets and demonstrating the added value of FAIR data for knowledge discovery. It is a lightweight, very effective and also fun way to collaborate across disciplinary and political borders, often yielding eye opening results. It typically is a three day event in which FAIR data are produced and analysed, and hands-on training modules.

Since 2014, we have organised multiple BYODs, supporting organisations and companies (a.o. Human Protein Atlas, Enza Zaden, Rijk Zwaan) and for instance the rare disease community, to produce FAIR data. Over time we developed scripts, working and planning documents, a budget, and training materials for life science researchers and future data experts. Furthermore, we have a pool of FAIR data experts standing by, eager to share their knowledge. From 2016 onwards, BYODs are scheduled for the ELIXIR EXCELERATE project, the Odex4all project and many other projects.