RO-Crate:
packaging metadata love notes into
FAIR Digital Objects
Professor Carole Goble CBE FREng
The University of Manchester, UK
ELIXIR-UK Head of Node
carole.goble@manchester.ac.uk
Significant contributor & RO-crate leader:
Stian Soiland-Reyes, The University of Manchester/ The University of Amsterdam
soiland-reyes@manchester.ac.uk
https://helmholtz-metadaten.de/en/events/hmc-conference-2022
HMC Conference 2022, 05 October 2022
Multi-institute Team Science
Collaborators
Using different platforms
Almeida, A., Mitchell, A.L., Boland, M. et al. A new genomic blueprint of the human gut microbiota.
Nature 568, 499–504 (2019). https://doi.org/10.1038/s41586-019-0965-1
FAIR Mixed and Multi Object
Source data and results
Instruments, software, workflows, scripts…
Different data types…
Public archives, spreadsheets, project ftp
servers…
FAIR Reproducible Methods
Parameter settings. Configurations. Test data.
Scattered and diverse metadata
Multiple platforms and repositories
Big data, Sensitive data
Data remains at home.
Metadata references the data.
Manage the integrity of the
referencing.
Metadata love letter delivery
Each object in its own repository or platform with its own metadata
Research Objects
Integrated
view
Package files and URL addressable resources
Describe package
and parts
Need something Infrastructure
independent
• Exchange between repositories,
registries and services.
• Avoid vendor lock-in
Overlaying the
Research Digital Ecosystem
Repositories have their own approaches
DataONE data package
CodeOcean capsule
WholeTale capsules
Compendiums
CombineArchive
DataCrate
Quilt Data Package …
Package files and URL addressable resources
My Platform
Currency of exchange across
Research Digital Ecosystem
BioConnect Data Packages
Abigail Miller https://zenodo.org/record/7116702#.YzinYLTMKF4
An index of biological research data, analysis tools, and models
hosted internally or externally: LIMS, machine generated data files, manually generated data files, spreadsheets
Import, record, and curate study metadata.
Search on metadata. Export data with their metadata.
Connect data with tools
A snapshot of living objects: Science Changes
Software, reference datasets, methods change
Results may vary
What if we released research rather than
published it?
Like software releases?
Science 2.0 Repositories: Time for a Change in Scholarly Communication. Assante, Candela, Castelli, Manghi, Pagano, D-Lib 2015
Do Research
Research
Infrastructure
Publish Research
Scholarship Market
place
Release
Research
Objects
Metadata delivery on released research objects
All the related research objects needed to reuse & reproduce results
Living object for released
research as it's being created
rather than “publish” it?
A way of exchanging, archiving, reporting, citing research entities
combine open science with open scholarship
Self-described metadata
objects context,
dependencies and
relationships between the
objects.
Virtual objects referencing
scattered resources
Scale up and working
across all platforms.
Moving knowledge
between different teams.
Actionable knowledge units
Including digital twins.
Metadata is a love note
to collaborators & peers
We need frameworks to be FAIR.
For boxing stuff up.
For platform independent
unified reporting, exchange,
archiving of metadata.
That copes with diversity and legacy.
And that mortals can use.
https://www.flickr.com/photos/ryanishungry/5796976028/
https://www.researchobject.org/
A little bit of packaging goes a long way
http://www.researchobject.org/ro-crate/
Practical lightweight packaging
approach to aggregate files
and/or any URI-addressable
content, with contextual
information into a machine
actionable metadata rich
structured archive.
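As a rough illustration of the definition above, the sketch below builds a minimal RO-Crate 1.1 metadata document using only the Python standard library. The helper `build_metadata`, the crate name, the date and the file `results.csv` are hypothetical; the context URL, the `ro-crate-metadata.json` file name and the `conformsTo` value are taken from the RO-Crate 1.1 specification cited in this talk.

```python
import json

# Identifiers defined by the RO-Crate 1.1 specification.
CONTEXT = "https://w3id.org/ro/crate/1.1/context"
SPEC = "https://w3id.org/ro/crate/1.1"


def build_metadata(name, description, date_published, files):
    """Return a minimal RO-Crate 1.1 JSON-LD document (illustrative helper)."""
    # The metadata descriptor says which entity is the root of the crate.
    descriptor = {
        "@id": "ro-crate-metadata.json",
        "@type": "CreativeWork",
        "conformsTo": {"@id": SPEC},
        "about": {"@id": "./"},
    }
    # The root dataset describes the crate as a whole and lists its parts.
    root = {
        "@id": "./",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "datePublished": date_published,
        "hasPart": [{"@id": path} for path in files],
    }
    # One data entity per aggregated file.
    parts = [
        {"@id": path, "@type": "File", "name": label}
        for path, label in files.items()
    ]
    return {"@context": CONTEXT, "@graph": [descriptor, root, *parts]}


metadata = build_metadata(
    "Example crate",
    "Hypothetical crate with one result file.",
    "2022-10-05",
    {"results.csv": "Result table"},
)
print(json.dumps(metadata, indent=2))
```

Saving the printed JSON as `ro-crate-metadata.json` next to `results.csv` would turn that directory into a crate; a real crate would normally also carry a license and authors.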
A little bit of packaging goes a long way
Familiar, developer friendly Lo-Tek - web native, off-the-shelf,
machine and human readable, search engine accessible:
PIDs + JSON-LD + Schema.org + BagIT/Zip/OCFL.
Infrastructure independent to overcome repository
and service silos: Practical, lightweight, robust.
One size does not fit all - embrace diversity, legacy,
unknowns – open-ended, multi-interpretation, self-describing.
Extensible metadata + pre-existing ontologies:
Duck type profiling.
It takes an open village,
with sponsors, leaders and application drivers
https://www.researchobject.org/ro-crate/community.html
Packaging research artefacts with RO-Crate.
Data Science https://doi.org/10.3233/DS-210053
RO-Crate Specification 1.1
https://w3id.org/ro/crate/1.1
Biohackathon
https://biohackathon-europe.org/
Structured self-describing, machine readable,
metadata objects
RO-Crate Metadata file
Archive file format / packaging system
type
id
description
datePublished
…
license
author organisation
Linked Data
JSON-LD
Schema.org
Structured
metadata about
the RO-Crate
and content
Standard
Packaging
BagIT, Zip
https://github.com/o/script
files
links to web
resources
RO-Crate Content
directories
type, id
description
datePublished
creator
size
format …
https://zenodo.org/record/3541888
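The anatomy above (a metadata file plus payload files and links to web resources, wrapped in a standard package) can be sketched with Python's `zipfile` module. This is only an illustration: the helper `write_crate_zip`, the file `data/readings.csv` and the `https://example.org/...` URL are hypothetical, and the crate is written to an in-memory buffer rather than to disk.

```python
import io
import json
import zipfile

metadata = {
    "@context": "https://w3id.org/ro/crate/1.1/context",
    "@graph": [
        {"@id": "ro-crate-metadata.json", "@type": "CreativeWork",
         "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
         "about": {"@id": "./"}},
        {"@id": "./", "@type": "Dataset", "name": "Example crate",
         "datePublished": "2022-10-05",
         "hasPart": [{"@id": "data/readings.csv"},
                     {"@id": "https://example.org/remote/large-file.bin"}]},
        # A file that travels inside the package.
        {"@id": "data/readings.csv", "@type": "File",
         "encodingFormat": "text/csv"},
        # A web resource that is referenced but not copied (hypothetical URL).
        {"@id": "https://example.org/remote/large-file.bin", "@type": "File",
         "name": "Large remote file"},
    ],
}


def write_crate_zip(target, metadata, payload):
    """Write the metadata file and the local payload files into a Zip archive."""
    with zipfile.ZipFile(target, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("ro-crate-metadata.json", json.dumps(metadata, indent=2))
        for path, content in payload.items():
            zf.writestr(path, content)


buffer = io.BytesIO()
write_crate_zip(buffer, metadata, {"data/readings.csv": "time,value\n0,1.5\n"})

# Only the local file and the metadata file are in the archive;
# the remote resource is present purely as a reference.
with zipfile.ZipFile(buffer) as zf:
    names = sorted(zf.namelist())
print(names)
```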
Mixed Objects
Data, Software, Documents…
Unbounded
External PIDs References
Unbounded Boundary Machine Actionable Objects
Descriptive Profiles
Checklist-style typing
must, should, optionals
Unbounded
+ community vocabularies,
formats and standards
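One way to picture checklist-style typing is a small check that reports which must and should properties an entity lacks while ignoring anything extra. The sketch below is an assumption-laden illustration, not an RO-Crate profile implementation: the `PROFILE` dictionary, its property lists and the `check_profile` helper are all hypothetical.

```python
def check_profile(entity, profile):
    """Return (errors, warnings) for one entity against a checklist profile."""
    errors = [
        f"missing required property: {prop}"
        for prop in profile.get("must", [])
        if prop not in entity
    ]
    warnings = [
        f"missing recommended property: {prop}"
        for prop in profile.get("should", [])
        if prop not in entity
    ]
    # Properties outside the checklist are neither errors nor warnings:
    # the entity stays open-ended ("unbounded").
    return errors, warnings


# Hypothetical profile: which properties a dataset must, should and may carry.
PROFILE = {
    "must": ["@id", "@type", "name"],
    "should": ["license", "author"],
    "optional": ["keywords"],
}

entity = {
    "@id": "./",
    "@type": "Dataset",
    "name": "Example crate",
    "author": {"@id": "https://orcid.org/0000-0003-1219-2137"},
    "funder": {"@id": "#some-funder"},  # not in the checklist, still allowed
}

errors, warnings = check_profile(entity, PROFILE)
print(errors, warnings)
```

The entity passes because it has everything the checklist requires, whatever else it carries; this is the duck-typing idea the slides refer to.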
Unbounded Boundary Machine Actionable Objects
Open-endedness
Known knowns, known unknowns and
unknown unknowns
Multi-interpretation
Interlingua across domains
Mixed profiles
Cross domains
Interpret what you care about
Profiles: Unbounded Boundary Objects
Run
Testing
Data Cubes
Descriptive Profiles
Checklist-style typing
Unbounded
Profile portfolio
Self-describing Profiles using Just Enough Linked Data
A FAIR Knowledge Web of Research Objects
Metadata Graph inside the RO-Crate
Contextual entities and PIDs connect to
the outside world & other RO-Crates
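The metadata graph can be walked with a few lines of code: index the entities by `@id`, then sort every reference into those described inside the crate and those that point outward through a PID. The sketch below assumes a simplified `@graph` (the metadata descriptor is omitted for brevity); the helper names and the file `slides.pdf` are hypothetical, while the ORCID and DOI are ones that appear in this talk.

```python
def referenced_ids(value):
    """Yield every @id referenced from a property value (dict, list or scalar)."""
    if isinstance(value, dict):
        if "@id" in value:
            yield value["@id"]
    elif isinstance(value, list):
        for item in value:
            yield from referenced_ids(item)


def split_references(graph):
    """Split references into those described in the crate and those that are not."""
    described = {entity["@id"] for entity in graph}
    internal, external = set(), set()
    for entity in graph:
        for key, value in entity.items():
            if key.startswith("@"):
                continue  # skip JSON-LD keywords such as @id and @type
            for ref in referenced_ids(value):
                (internal if ref in described else external).add(ref)
    return internal, external


graph = [
    {"@id": "./", "@type": "Dataset",
     "author": {"@id": "https://orcid.org/0000-0003-1219-2137"},
     "citation": {"@id": "https://doi.org/10.3233/DS-210053"},
     "hasPart": [{"@id": "slides.pdf"}]},
    {"@id": "slides.pdf", "@type": "File"},
    # Contextual entity: the author is described inside the crate.
    {"@id": "https://orcid.org/0000-0003-1219-2137", "@type": "Person",
     "name": "Carole Goble"},
]

internal, external = split_references(graph)
print(sorted(internal))
print(sorted(external))
```

The external set is where a crate connects to the outside world: each entry is a PID that another service, or another RO-Crate, can describe.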
Descriptive Profiles
contextual entities +
community vocabularies
and standards
Developers Matter – this is Middleware!
RSECon 2022 – Research Software Engineers!
https://society-rse.org/
Developer Friendly, Problem Driven
DataCrate
simple web
stack
ROs
rich RDF
stack +
simplifications rather than generalisations
fewer features, more directed
easier to understand, conceptually simpler
opinionated guide to current best practices
constrained and predictable but not too
cumbersome to work with
retain just enough linked data for benefits
querying, vocabularies, clickable URIs,
knowledge graphs
with all the stuff developers need
documentation, examples, libraries, tools
Adoptability!!!!
Developer Friendly, Tool development
Infrastructure facing
Software libraries
https://www.npmjs.com/package/ro-crate
https://github.com/ResearchObject/ro-crate-ruby
https://pypi.org/project/rocrate/
https://github.com/kit-data-manager/ro-crate-java
Contact: andreas.pfeil@kit.edu
Developer Friendly, Tool development
User Facing Describo
https://uts-eresearch.github.io/describo/
FAIR Research Data Packaging
A data curation service for endangered
languages: 500K+ files, 28K+ items, 574
collections
Archiving and accessibility
Mixed artefacts, mixed metadata
Repositories & registries
Peter Sefton, Marco La Rosa
Ana Trisovic
Submission / download
Exchange between repositories
Mixed objects
Search metadata
Aggregate data collections
Back to BioConnect ….
RO-Crate (packaging) + ISA format (structure) + Frictionless data (file type definitions for tabular data)
Abigail Miller https://zenodo.org/record/7116702#.YzinYLTMKF4
Exporting data using an interchange format
HMC Hub Energy
Time series data from different databases exported with metadata
description of their structure and content into a single web service
Jan Schweikert - Institute for Automation and Applied Informatics
Web service using
ro-crate-java
Data file format: CSV
LD-Vocabularies: RO-Crate
Context, QUDT, CSVW
https://youtu.be/Rsuxn0m4bIM
https://www.reliance-project.eu/
EOSC, Copernicus, Earth Science
RO-Crate + Data Cubes
Mixed object sharing
Reproducibility
Raul Palma, Oscar Corcho, Daniel Garijo
Computational Workflows
Packaging workflow files &
companion objects
Submission / download
Exchange between services
and systems
Reproducibility
Citation
Data provenance collection and
Pipeline Provenance Packaging
Renske de Wit
PROV
CWLPROV
https://www.researchobject.org/workflow-run-crate/
Simone Leo
2022-09-27 Renske de Wit: A Non-Intimidating Approach to Workflow Reproducibility in Bioinformatics
https://www.researchobject.org/ro-crate/1.1/provenance
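The RO-Crate 1.1 provenance guidance linked above describes a run as a schema.org `CreateAction` that ties a tool (`instrument`) to its inputs (`object`) and outputs (`result`). The sketch below shows that shape and a simple lookup of what produced a given file. The file names, the `#run-1` identifier, the timestamp and both helper functions are hypothetical, and the Workflow Run Crate profiles define considerably more than is shown here.

```python
def create_action(action_id, tool_id, inputs, outputs, end_time):
    """Build a CreateAction entity linking one tool run to its data."""
    return {
        "@id": action_id,
        "@type": "CreateAction",
        "instrument": {"@id": tool_id},             # what was run
        "object": [{"@id": i} for i in inputs],     # what it consumed
        "result": [{"@id": o} for o in outputs],    # what it produced
        "endTime": end_time,
    }


def producers_of(graph, file_id):
    """Return the @id of every CreateAction whose result includes file_id."""
    return [
        entity["@id"]
        for entity in graph
        if entity.get("@type") == "CreateAction"
        and any(r.get("@id") == file_id for r in entity.get("result", []))
    ]


graph = [
    {"@id": "workflow.cwl", "@type": ["File", "SoftwareSourceCode"]},
    {"@id": "reads.fastq", "@type": "File"},
    {"@id": "assembly.fasta", "@type": "File"},
    create_action("#run-1", "workflow.cwl", ["reads.fastq"],
                  ["assembly.fasta"], "2022-09-27T12:00:00Z"),
]

print(producers_of(graph, "assembly.fasta"))
```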
Workflow, results and traceable provenance packaging,
FAIR Research Objects
https://riojournal.com/article/94042/
Netherlands X-omics Initiative
Human Infectious Disease Modelling
https://doi.org/10.1098/rsta.2021.0300
Federated Pipelines & Provenance Packaging
Federated analytics, distributed research pipelines, over Trusted Research Environments
for sensitive data
• Controlled access to sensitive data
• Exchange between data platforms
• Reporting & sharing pipelines
• Reporting results & provenance
• Common Provenance Model
handoffs between different orgs
• OMOP mapping pipelines
Tom Giles, Rudolf Wittner
Handling big & sensitive data
Scalable collections of references while data stays at host
Big genomic & clinical data, images etc,
distributed over multiple locations.
https://doi.org/10.1109/BigData.2016.7840618
Retain & archive processed datasets
Reference & transfer large data on demand
Controlled access
Moving data between archives
Ravi Madduri, Kyle Chard, Carl Kesselman, Ian Foster
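One common way to keep data at its host while still shipping a verifiable package is BagIt's `fetch.txt`, which lists a URL, a length and a target path per file, alongside a checksum manifest. The sketch below only illustrates those two line formats and a later integrity check; the talk does not spell out the mechanism at this level, and the URL, path, sample bytes and helper names are hypothetical.

```python
import hashlib


def manifest_line(path, content):
    """BagIt manifest entry: checksum followed by the file path."""
    return f"{hashlib.sha256(content).hexdigest()}  {path}"


def fetch_line(url, length, path):
    """BagIt fetch.txt entry: URL, length in octets, and target path."""
    return f"{url} {length} {path}"


def verify(content, manifest_entry):
    """Check fetched bytes against the checksum recorded in the manifest."""
    expected = manifest_entry.split()[0]
    return hashlib.sha256(content).hexdigest() == expected


# Hypothetical remote file; in practice the bytes would stay at the host
# and only be transferred on demand.
content = b"sample,reads\nA,42\n"
url = "https://example.org/archive/reads.csv"
path = "data/reads.csv"

entry = manifest_line(path, content)
print(fetch_line(url, len(content), path))
print(verify(content, entry))           # bytes match the manifest
print(verify(content + b"x", entry))    # altered bytes are detected
```

The package then holds only references and checksums, so it stays small however large the referenced data is.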
Biodiversity Digital Objects and Digital Twinning
Hardisty et al (2022): The Specimen Data Refinery: A Canonical Workflow Framework and FAIR Digital Object Approach to
Speeding up Digital Mobilisation of Natural History Collections. Data Intelligence 4(2): 320–341.
https://doi.org/10.1162/dint_a_00134
Bags of references
courtesy of Alex Hardisty, Dimitris Koureas
Digital Surrogate FAIR Digital Object
https://biodt.eu/
predicting biodiversity dynamics
Package citations, citing
1000s of datasets
GhaithArf/ro-crate-rda-madmp-mapper
10.4126/FRL01-006423291
Lots of stuff needs packaging + metadata …
Conversational Survey
results
https://data.agu.org/DataCitationCoP/
10.1002/essoar.10509966.1
https://coneytoolkit.cefriel.it/
10.4126/FRL01-006429412
Tomasz Miksa
Shelley Stall, Chris Erdmann,
Christine Kirkpatrick
Deb Agarwal
Mario Scrocca, Irene Celino
Back to Mixed Object Publishing …
HERMES Helmholtz Rich Metadata Software Publication
Druskat, S., Bertuch, O., Juckeland, G., Knodel, O., & Schlauch, T. (2022). Software publications with rich metadata: state of the art, automated workflows and HERMES concept. ArXiv, abs/2201.09015.
https://virtual.oxfordabstracts.com/#/event/public/3101/submission/110
FAIR Data Commons
https://helmholtz-metadaten.de/en/fair-data-commons/overview
Testbed for
FAIR Digital
Objects and
RO-Crate
Courtesy: Sören Lorenz, Stefan Stanfed
So a little bit of packaging goes a long way…
Platform independent exchange between
repositories and services
Transfer collections of secure distributed datasets
Describe, export and archive data collections,
datasets, pipelines/workflows with their metadata
Citation aggregation
Reproducibility, connect data with tools
Provenance collection
Mixed object publication
FAIR Digital
Objects
FAIR Digital Objects …. Two Takes
Find, Access
Interoperate, Reuse
RO-Crate support of principles and
adherence to the principles.
FAIR assessment in Research Objects*,
ROHub, Profile registry…
The Principles
*https://dgarijo.com/papers/TPDL2022_gonzalez.pdf https://fairdo.org/ https://www.fdo2022.org
The FDOF Forum
FAIR Digital Object (FDO) – conceptual view
Predictable implementation of FAIR for active objects - not just static data
[Figure: conceptual view of an FDO. Labels: PID (20.301/a); PID Record with Attributes (20.123: “Alice”; 20.789: <http://...>; 20.456: 10.1234/ab); FDO Type; PID Profile; Metadata; Operations; Bytes; Collection of FDOs]
• Distributed architecture
• Self-describing digital objects
• Several types of metadata
• Encapsulation of operations
RO-Crate implements FDO
with current web stack with
FAIR signposting
FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units:
https://doi.org/10.3390/publications8020021
Soiland-Reyes, Sefton, et al (2022):
Creating lightweight FAIR Digital
Objects with RO-Crate. Research
Ideas and Outcomes, 1st Intl Conf on
FAIR Digital Objects
https://signposting.org/FAIR/
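FAIR Signposting works by advertising typed links, such as `cite-as`, `describedby` and `item`, in HTTP `Link` headers so that a machine can navigate from a landing page to the PID, the metadata and the content. The sketch below formats such a header for a crate's landing page. It is an illustration only: the `link_header` helper and every `https://example.org/...` URL are hypothetical.

```python
def link_header(links):
    """Format (target, rel, media_type) triples as an HTTP Link header value."""
    parts = []
    for target, rel, media_type in links:
        part = f'<{target}>; rel="{rel}"'
        if media_type:
            part += f'; type="{media_type}"'
        parts.append(part)
    return ", ".join(parts)


# Hypothetical landing page for a crate: its persistent identifier,
# its RO-Crate metadata file and the downloadable package.
links = [
    ("https://example.org/pid/demo-crate", "cite-as", None),
    ("https://example.org/crate/ro-crate-metadata.json", "describedby",
     "application/ld+json"),
    ("https://example.org/crate/crate.zip", "item", "application/zip"),
]

header = link_header(links)
print("Link: " + header)
```

A client that reads this header can jump straight to the JSON-LD metadata without scraping the HTML page.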
Metadata is a love note to the FAIR future….
….RO-Crate is the delivery package in a multi-
platform, mixed object research ecosystem.
Keep it practical, real and simple
Adoption and diversity friendliness
Metadata middleware to drive the release of research,
reproducible scholarship and knowledge graphs.
It takes a village, like HMC
To make Research Objects normative
Promote to researchers but … target Research Infrastructures to deliver
https://www.researchobject.org/ro-crate/
The RO-Crate team is:
● Peter Sefton https://orcid.org/0000-0002-3545-944X (co-chair)
● Stian Soiland-Reyes https://orcid.org/0000-0001-9842-9718 (co-chair)
● Eoghan Ó Carragáin https://orcid.org/0000-0001-8131-2150 (emeritus)
● Oscar Corcho https://orcid.org/0000-0002-9260-0753
● Daniel Garijo https://orcid.org/0000-0003-0454-7145
● Raul Palma https://orcid.org/0000-0003-4289-4922
● Frederik Coppens https://orcid.org/0000-0001-6565-5145
● Carole Goble https://orcid.org/0000-0003-1219-2137
● José María Fernández https://orcid.org/0000-0002-4806-5140
● Kyle Chard https://orcid.org/0000-0002-7370-4805
● Jose Manuel Gomez-Perez https://orcid.org/0000-0002-5491-6431
● Michael R Crusoe https://orcid.org/0000-0002-2961-9670
● Ignacio Eguinoa https://orcid.org/0000-0002-6190-122X
● Nick Juty https://orcid.org/0000-0002-2036-8350
● Kristi Holmes https://orcid.org/0000-0001-8420-5254
● Jason A. Clark https://orcid.org/0000-0002-3588-6257
● Salvador Capella-Gutierrez https://orcid.org/0000-0002-0309-604X
● Alasdair J. G. Gray https://orcid.org/0000-0002-5711-4872
● Stuart Owen https://orcid.org/0000-0003-2130-0865
● Alan R Williams https://orcid.org/0000-0003-3156-2105
● Giacomo Tartari https://orcid.org/0000-0003-1130-2154
● Finn Bacall https://orcid.org/0000-0002-0048-3300
● Thomas Thelen https://orcid.org/0000-0002-1756-2128
● Hervé Ménager https://orcid.org/0000-0002-7552-1009
● Laura Rodríguez-Navas https://orcid.org/0000-0003-4929-1219
● Paul Walk https://orcid.org/0000-0003-1541-5631
● brandon whitehead https://orcid.org/0000-0002-0337-8610
● Mark Wilkinson https://orcid.org/0000-0001-6960-357X
● Paul Groth https://orcid.org/0000-0003-0183-6910
● Erich Bremer https://orcid.org/0000-0003-0223-1059
● LJ Garcia Castro https://orcid.org/0000-0003-3986-0510
● Karl Sebby https://orcid.org/0000-0001-6022-9825
● Alexander Kanitz https://orcid.org/0000-0002-3468-0652
● Ana Trisovic https://orcid.org/0000-0003-1991-0533
● Gavin Kennedy https://orcid.org/0000-0003-3910-0474
● Mark Graves https://orcid.org/0000-0003-3486-8193
● Jasper Koehorst https://orcid.org/0000-0001-8172-8981
● Simone Leo https://orcid.org/0000-0001-8271-5429
● Marc Portier https://orcid.org/0000-0002-9648-6484
● Paul Brack https://orcid.org/0000-0002-5432-2748
● Milan Ojsteršek https://orcid.org/0000-0003-1743-8300
● Bert Droesbeke https://orcid.org/0000-0003-0522-5674
● Chenxu Niu https://orcid.org/0000-0002-2142-1731
● Kosuke Tanabe https://orcid.org/0000-0002-9986-7223
● Tomasz Miksa https://orcid.org/0000-0002-4929-7875
● Marco La Rosa https://orcid.org/0000-0001-5383-6993
● Cedric Decruw https://orcid.org/0000-0001-6387-5988
● Andreas Czerniak https://orcid.org/0000-0003-3883-4169
● Jeremy Jay https://orcid.org/0000-0002-5761-7533
● Sergio Serra https://orcid.org/0000-0002-0792-8157
● Ronald Siebes https://orcid.org/0000-0001-8772-7904
● Shaun de Witt https://orcid.org/0000-0003-4196-3658
● Shady El Damaty https://orcid.org/0000-0002-2318-4477
● Douglas Lowe https://orcid.org/0000-0002-1248-3594
● Xuanqi Li https://orcid.org/0000-0003-1498-6205
● Sveinung Gundersen https://orcid.org/0000-0001-9888-7954
● Muhammad Radifar https://orcid.org/0000-0001-9156-9478
● Rudolf Wittner https://orcid.org/0000-0002-0003-2024
● Oliver Woolland https://orcid.org/0000-0002-4565-9760
● Paul De Geest https://orcid.org/0000-0002-8940-4946
● Douglas Fils https://orcid.org/0000-0002-2257-9127
● Florian Wetzels https://orcid.org/0000-0002-5526-7138
● Raül Sirvent https://orcid.org/0000-0003-0606-2512
● Abigail Miller https://orcid.org/0000-0001-9228-2882
● Jake Emerson https://orcid.org/0000-0003-0617-9219
● Davide Fucci https://orcid.org/0000-0002-0679-4361
Acknowledgements

Contenu connexe

Similaire à RO-Crate: packaging metadata love notes into FAIR Digital Objects

A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community UpdateCarole Goble
 
Publishing of Scientific Data - Science Foundation Ireland Summit 2010
Publishing of Scientific Data  - Science Foundation Ireland Summit 2010Publishing of Scientific Data  - Science Foundation Ireland Summit 2010
Publishing of Scientific Data - Science Foundation Ireland Summit 2010jodischneider
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...Open Science Fair
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research ObjectsCarole Goble
 
247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research WorkbenchStuart Chalk
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesStefan Dietze
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessdatacite
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...OpenAIRE
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshowMark Wilkinson
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use CasesCarole Goble
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Joachim Neubert
 

Similaire à RO-Crate: packaging metadata love notes into FAIR Digital Objects (20)

A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 
Research Object Community Update
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
 
Publishing of Scientific Data - Science Foundation Ireland Summit 2010
Publishing of Scientific Data  - Science Foundation Ireland Summit 2010Publishing of Scientific Data  - Science Foundation Ireland Summit 2010
Publishing of Scientific Data - Science Foundation Ireland Summit 2010
 
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at ScaleFull Erdmann Ruttenberg Community Approaches to Open Data at Scale
Full Erdmann Ruttenberg Community Approaches to Open Data at Scale
 
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
OSFair2017 Workshop | How FAIR friendly is the FAIRDOM Hub? Exposing metadata...
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
"Cool" metadata for FAIR data
"Cool" metadata for FAIR data"Cool" metadata for FAIR data
"Cool" metadata for FAIR data
 
Dataset Metadata, Tools and Approaches for Access and Preservation
Dataset Metadata, Tools and Approaches for Access and PreservationDataset Metadata, Tools and Approaches for Access and Preservation
Dataset Metadata, Tools and Approaches for Access and Preservation
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
The Rhetoric of Research Objects
The Rhetoric of Research ObjectsThe Rhetoric of Research Objects
The Rhetoric of Research Objects
 
247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench247th ACS Meeting: The Eureka Research Workbench
247th ACS Meeting: The Eureka Research Workbench
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
Semantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital LibrariesSemantic Linking & Retrieval for Digital Libraries
Semantic Linking & Retrieval for Digital Libraries
 
Riding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information accessRiding the wave - Paradigm shifts in information access
Riding the wave - Paradigm shifts in information access
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
 
A Clean Slate?
A Clean Slate?A Clean Slate?
A Clean Slate?
 
IBC FAIR Data Prototype Implementation slideshow
IBC FAIR Data Prototype Implementation   slideshowIBC FAIR Data Prototype Implementation   slideshow
IBC FAIR Data Prototype Implementation slideshow
 
dotte.ppt
dotte.pptdotte.ppt
dotte.ppt
 
The Research Object Initiative: Frameworks and Use Cases
The Research Object Initiative:Frameworks and Use CasesThe Research Object Initiative:Frameworks and Use Cases
The Research Object Initiative: Frameworks and Use Cases
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 

Plus de Carole Goble

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
 
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
 
Research Software Sustainability takes a Village
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
 
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...
Title: Love, Money, Fame, Nudge: Enabling Data-intensive BioScience through D...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Open Research: Manchester leading and learning
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...Carole Goble
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
 
What is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can helpCarole Goble
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the FutureCarole Goble
 
ELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR BoardCarole Goble
 
FAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsCarole Goble
 
Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!Carole Goble
 
Reproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
 

Plus de Carole Goble (20)

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
RO-Crate: packaging metadata love notes into FAIR Digital Objects

  • 1. RO-Crate: packaging metadata love notes into FAIR Digital Objects Professor Carole Goble CBE FREng The University of Manchester, UK ELIXIR-UK Head of Node carole.goble@manchester.ac.uk Significant contributor & RO-Crate leader: Stian Soiland-Reyes, The University of Manchester / The University of Amsterdam soiland-reyes@manchester.ac.uk https://helmholtz-metadaten.de/en/events/hmc-conference-2022 HMC Conference 2022, 05 October 2022
  • 2.
  • 3. Multi-institute Team Science Collaborators Using different platforms Almeida, A., Mitchell, A.L., Boland, M. et al. A new genomic blueprint of the human gut microbiota. Nature 568, 499–504 (2019). https://doi.org/10.1038/s41586-019-0965-1
  • 4. FAIR Mixed and Multi Object Source data and results Instruments, software, workflows, scripts… Different data types… Public archives, spreadsheets, project ftp servers…
  • 5. FAIR Reproducible Methods Parameter settings. Configurations. Test data.
  • 6. Scattered and diverse metadata Multiple platforms and repositories
  • 7. Scattered and diverse metadata Multiple platforms and repositories Big data, Sensitive data Data remains at home. Metadata references the data. Manage the integrity of the referencing.
  • 8. Metadata love letter delivery Each object in its own repository or platform with its own metadata: Research Objects
  • 9. Integrated view Package files and URL addressable resources Describe package and parts Need something Infrastructure independent • Exchange between repositories, registries and services. • Avoid vendor lock-in Overlaying the Research Digital Ecosystem Repositories have their own approaches DataONE data package CodeOcean capsule WholeTale capsules Compendiums CombineArchive DataCrate Quilt Data Package …
  • 10. Package files and URL addressable resources My Platform Repositories have their own approaches DataONE data package CodeOcean capsule WholeTale capsules Compendiums CombineArchive DataCrate Quilt Data Package … Need something Infrastructure independent • Exchange between repositories, registries and services. • Avoid vendor lock-in Currency of exchange across Research Digital Ecosystem
  • 11. BioConnect Data Packages Abigail Miller https://zenodo.org/record/7116702#.YzinYLTMKF4 An index of biological research data, analysis tools, and models hosted internally or externally: LIMS, machine generated data files, manually generated data files, spreadsheets Import, record, and curate study metadata. Search on metadata. Export data with their metadata. Connect data with tools
  • 12. A snapshot of living objects: Science Changes Software, reference datasets, methods change Results may vary What if we released research rather than published it? Like software releases?
  • 13. Science 2.0 Repositories: Time for a Change in Scholarly Communication Assante, Candela, Castelli, Manghi, Pagano, D-Lib 2015 Do Research Research Infrastructure Publish Research Scholarship Market place Release Research Objects
  • 14. Metadata delivery on released research objects All the related research objects needed to reuse & reproduce results Living object for released research as it's being created rather than “publish” it? A way of exchanging, archiving, reporting, citing research entities combine open science with open scholarship Self-described metadata objects context, dependencies and relationships between the objects. Virtual objects referencing scattered resources Scale up and working across all platforms. Moving knowledge between different teams. Actionable knowledge units Including digital twins.
  • 15. Metadata is a love note to collaborators & peers We need frameworks to be FAIR. For boxing stuff up. For platform independent unified reporting, exchange, archiving of metadata. That copes with diversity and legacy. And that mortals can use. https://www.flickr.com/photos/ryanishungry/5796976028/
  • 17. A little bit of packaging goes a long way http://www.researchobject.org/ro-crate/ Practical lightweight packaging approach to aggregate files and/or any URI-addressable content, with contextual information into a machine actionable metadata rich structured archive.
  • 18. A little bit of packaging goes a long way Familiar, developer friendly Lo-Tek - web native, off-the-shelf, machine and human readable, search engine accessible: PIDs + JSON-LD + Schema.org + BagIt/Zip/OCFL. Infrastructure independent to overcome repository and service silos: Practical, lightweight, robust. One size does not fit all - embrace diversity, legacy, unknowns – open-ended, multi-interpretation, self-describing. Extensible metadata + pre-existing ontologies: Duck type profiling.
  • 19. It takes an open village, with sponsors, leaders and application drivers https://www.researchobject.org/ro-crate/community.html Packaging research artefacts with RO-Crate. Data Science https://doi.org/10.3233/DS-210053 RO-Crate Specification 1.1 https://w3id.org/ro/crate/1.1 Biohackathon https://biohackathon-europe.org/
  • 20. Structured self-describing, machine readable, metadata objects RO-Crate Metadata file Archive file format / packaging system type id description datePublished … license author organisation Linked Data JSON-LD Schema.org Structured metadata about the RO-Crate and content Standard Packaging BagIt, Zip https://github.com/o/script files links to web resources RO-Crate Content directories type, id description datePublished creator size format … https://zenodo.org/record/3541888
  • 21. Mixed Objects Data, Software, Documents… Unbounded External PIDs References Unbounded Boundary Machine Actionable Objects Descriptive Profiles Checklist-style typing must, should, optionals Unbounded + community vocabularies, formats and standards
  • 22. Unbounded Boundary Machine Actionable Objects Open-endedness Known knowns, known unknowns and unknown unknowns Multi-interpretation Interlingua cross domains Mixed profiles Cross domains Interpret what you care about Descriptive Profiles Checklist-style typing must, should, optionals Unbounded + community vocabularies, formats and standards
  • 23. Profiles: Unbounded Boundary Objects Run Testing Data Cubes Descriptive Profiles Checklist-style typing Unbounded Profile portfolio
  • 24. Self-describing Profiles using Just Enough Linked Data A FAIR Knowledge Web of Research Objects Metadata Graph inside the RO-Crate Contextual entities and PIDs connect to the outside world & other RO-Crates Descriptive Profiles contextual entities + community vocabularies and standards
  • 25. Developers Matter – this is Middleware! RSECon 2022 – Research Software Engineers! https://society-rse.org/
  • 26. Developer Friendly, Problem Driven DataCrate simple web stack ROs rich RDF stack + simplifications rather than generalisations fewer features, more directed easier to understand, conceptually simpler opinionated guide to current best practices constrained and predictable but not too cumbersome to work with retain just enough linked data for benefits querying, vocabularies, clickable URIs, knowledge graphs with all the stuff developers need documentation, examples, libraries, tools Adoptability!!!!
  • 27. Developer Friendly, Tool development Packaging research artefacts with RO-Crate. Data Science https://doi.org/10.3233/DS-210053 RO-Crate Specification 1.1 https://w3id.org/ro/crate/1.1 Infrastructure facing Software libraries https://www.npmjs.com/package/ro-crate https://github.com/ResearchObject/ro-crate-ruby https://pypi.org/project/rocrate/ https://github.com/kit-data-manager/ro-crate-java Contact: andreas.pfeil@kit.edu
  • 28. Developer Friendly, Tool development Packaging research artefacts with RO-Crate. Data Science https://doi.org/10.3233/DS-210053 User Facing Describo https://uts-eresearch.github.io/describo/
  • 29. FAIR Research Data Packaging A data curation service for endangered languages: 500K+ files, 28K+ items, 574 collections Archiving and accessibility Mixed artefacts, mixed metadata Repositories & registries Peter Sefton, Marco La Rosa Ana Trisovic Submission / download Exchange between repositories Mixed objects Search metadata Aggregate data collections
  • 30. Back to BioConnect …. RO-Crate Abigail Miller https://zenodo.org/record/7116702#.YzinYLTMKF4 Frictionless data Packaging Structure File type defs for tabular data ISA format + +
  • 31. Exporting data using an interchange format HMC Hub Energy Time series data from different databases exported with metadata description of their structure and content into a single web service Jan Schweikert - Institute for Automation and Applied Informatics Web service using ro-crate-java Data file format: CSV LD-Vocabularies: RO-Crate Context, QUDT, CSVW
  • 32. https://youtu.be/Rsuxn0m4bIM https://www.reliance-project.eu/ EOSC, Copernicus, Earth Science RO-Crate + Data Cubes Mixed object sharing Reproducibility Raul Palma, Oscar Corcho, Daniel Garijo
  • 33. Computational Workflows Packaging workflow files & companion objects
  • 34. Computational Workflows Packaging workflow files & companion objects Submission / download Exchange between services and systems Reproducibility Citation
  • 35. Data provenance collection and Pipeline Provenance Packaging Renske de Wit PROV CWLPROV https://www.researchobject.org/workflow-run-crate/ Simone Leo 2022-09-27 Renske de Wit: A Non-Intimidating Approach to Workflow Reproducibility in Bioinformatics https://www.researchobject.org/ro-crate/1.1/provenance
  • 36. Workflow, results and traceable provenance packaging, FAIR Research Objects https://riojournal.com/article/94042/ Netherlands X-omics Initiative Human Infectious Disease Modelling https://doi.org/10.1098/rsta.2021.0300
  • 37. Federated Pipelines & Provenance Packaging Federated analytics, distributed research pipelines, over Trusted Research Environments for sensitive data • Controlled access to sensitive data • Exchange between data platforms • Reporting & sharing pipelines • Reporting results & provenance • Common Provenance Model handoffs between different orgs • OMOP mapping pipelines Tom Giles, Rudolf Wittner
  • 38. Handling big & sensitive data Scalable collections of references while data stays at host Big genomic & clinical data, images etc, distributed over multiple locations. https://doi.org/10.1109/BigData.2016.7840618 Retain & archive processed datasets Reference & transfer large data on demand Controlled access Moving data between archives Ravi Madduri, Kyle Chard, Carl Kesselman, Ian Foster
  • 39. Biodiversity Digital Objects and Digital Twinning Hardisty et al (2022): The Specimen Data Refinery: A Canonical Workflow Framework and FAIR Digital Object Approach to Speeding up Digital Mobilisation of Natural History Collections. Data Intelligence 4(2): 320–341. https://doi.org/10.1162/dint_a_00134 Bags of references courtesy of Alex Hardisty, Dimitris Koureas Digital Surrogate FAIR Digital Object https://biodt.eu/ predicting biodiversity dynamics
  • 40. Package citations, citing 1000s of datasets GhaithArf/ro-crate-rda-madmp-mapper 10.4126/FRL01-006423291 Lots of stuff needs packaging + metadata … Conversational Survey results https://data.agu.org/DataCitationCoP/ 10.1002/essoar.10509966.1 https://coneytoolkit.cefriel.it/ 10.4126/FRL01-006429412 Tomasz Miksa Shelley Stall, Chris Erdmann, Christine Kirkpatrick Deb Agarwal Mario Scrocca, Irene Celino
  • 41. Back to Mixed Object Publishing … HERMES Helmholtz Rich Metadata Software Publication Druskat, S., Bertuch, O., Juckeland, G., Knodel, O., & Schlauch, T. (2022). Software publications with rich metadata: state of the art, automated workflows and HERMES concept. ArXiv, abs/2201.09015. https://virtual.oxfordabstracts.com/#/event/public/3101/submission/110
  • 42. FAIR Data Commons https://helmholtz-metadaten.de/en/fair-data-commons/overview Testbed for FAIR Digital Objects and RO-Crate Courtesy: Sören Lorenz, Stefan Stanfed
  • 43. So a little bit of packaging goes a long way… Platform independent exchange between repositories and services Transfer collections of secure distributed datasets Describe, export and archive data collections, datasets, pipelines/workflows with their metadata Citation aggregation Reproducibility, connect data with tools Provenance collection Mixed object publication FAIR Digital Objects
  • 44. FAIR Digital Objects …. Two Takes Find, Access Interoperate, Reuse RO-Crate support of principles and adherence to the principles. FAIR assessment in Research Objects*, ROHub, Profile registry… The Principles *https://dgarijo.com/papers/TPDL2022_gonzalez.pdf https://fairdo.org/ https://www.fdo2022.org The FDOF Forum
  • 45. FAIR Digital Object (FDO) – conceptual view Predictable implementation of FAIR for active objects - not just static data PID Profile Collection FDO PID 20.301/a Metadata Operation Operation Operation Attributes 20.123: “Alice” 20.789: <http://...> 20.456: 10.1234/ab PID Record Bytes Bytes FDO FDO FDO Type • Distributed architecture • Self-describing digital objects • Several types of metadata • Encapsulation of operations RO-Crate implements FDO with current web stack with FAIR signposting FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units: https://doi.org/10.3390/publications8020021 Soiland-Reyes, Sefton, et al (2022): Creating lightweight FAIR Digital Objects with RO-Crate. Research Ideas and Outcomes, 1st Intl Conf on FAIR Digital Objects https://signposting.org/FAIR/
  • 46. Metadata is a love note to the FAIR future… …RO-Crate is the delivery package in a multi-platform, mixed object research ecosystem. Keep it practical, real and simple Adoption and diversity friendliness Metadata middleware to drive the release of research, reproducible scholarship and knowledge graphs. It takes a village, like HMC To make Research Objects normative Promote to researchers but … target Research Infrastructures to deliver
  • 47. https://www.researchobject.org/ro-crate/ The RO-Crate team is:
    ● Peter Sefton https://orcid.org/0000-0002-3545-944X (co-chair)
    ● Stian Soiland-Reyes https://orcid.org/0000-0001-9842-9718 (co-chair)
    ● Eoghan Ó Carragáin https://orcid.org/0000-0001-8131-2150 (emeritus)
    ● Oscar Corcho https://orcid.org/0000-0002-9260-0753
    ● Daniel Garijo https://orcid.org/0000-0003-0454-7145
    ● Raul Palma https://orcid.org/0000-0003-4289-4922
    ● Frederik Coppens https://orcid.org/0000-0001-6565-5145
    ● Carole Goble https://orcid.org/0000-0003-1219-2137
    ● José María Fernández https://orcid.org/0000-0002-4806-5140
    ● Kyle Chard https://orcid.org/0000-0002-7370-4805
    ● Jose Manuel Gomez-Perez https://orcid.org/0000-0002-5491-6431
    ● Michael R Crusoe https://orcid.org/0000-0002-2961-9670
    ● Ignacio Eguinoa https://orcid.org/0000-0002-6190-122X
    ● Nick Juty https://orcid.org/0000-0002-2036-8350
    ● Kristi Holmes https://orcid.org/0000-0001-8420-5254
    ● Jason A. Clark https://orcid.org/0000-0002-3588-6257
    ● Salvador Capella-Gutierrez https://orcid.org/0000-0002-0309-604X
    ● Alasdair J. G. Gray https://orcid.org/0000-0002-5711-4872
    ● Stuart Owen https://orcid.org/0000-0003-2130-0865
    ● Alan R Williams https://orcid.org/0000-0003-3156-2105
    ● Giacomo Tartari https://orcid.org/0000-0003-1130-2154
    ● Finn Bacall https://orcid.org/0000-0002-0048-3300
    ● Thomas Thelen https://orcid.org/0000-0002-1756-2128
    ● Hervé Ménager https://orcid.org/0000-0002-7552-1009
    ● Laura Rodríguez-Navas https://orcid.org/0000-0003-4929-1219
    ● Paul Walk https://orcid.org/0000-0003-1541-5631
    ● brandon whitehead https://orcid.org/0000-0002-0337-8610
    ● Mark Wilkinson https://orcid.org/0000-0001-6960-357X
    ● Paul Groth https://orcid.org/0000-0003-0183-6910
    ● Erich Bremer https://orcid.org/0000-0003-0223-1059
    ● LJ Garcia Castro https://orcid.org/0000-0003-3986-0510
    ● Karl Sebby https://orcid.org/0000-0001-6022-9825
    ● Alexander Kanitz https://orcid.org/0000-0002-3468-0652
    ● Ana Trisovic https://orcid.org/0000-0003-1991-0533
    ● Gavin Kennedy https://orcid.org/0000-0003-3910-0474
    ● Mark Graves https://orcid.org/0000-0003-3486-8193
    ● Jasper Koehorst https://orcid.org/0000-0001-8172-8981
    ● Simone Leo https://orcid.org/0000-0001-8271-5429
    ● Marc Portier https://orcid.org/0000-0002-9648-6484
    ● Paul Brack https://orcid.org/0000-0002-5432-2748
    ● Milan Ojsteršek https://orcid.org/0000-0003-1743-8300
    ● Bert Droesbeke https://orcid.org/0000-0003-0522-5674
    ● Chenxu Niu https://orcid.org/0000-0002-2142-1731
    ● Kosuke Tanabe https://orcid.org/0000-0002-9986-7223
    ● Tomasz Miksa https://orcid.org/0000-0002-4929-7875
    ● Marco La Rosa https://orcid.org/0000-0001-5383-6993
    ● Cedric Decruw https://orcid.org/0000-0001-6387-5988
    ● Andreas Czerniak https://orcid.org/0000-0003-3883-4169
    ● Jeremy Jay https://orcid.org/0000-0002-5761-7533
    ● Sergio Serra https://orcid.org/0000-0002-0792-8157
    ● Ronald Siebes https://orcid.org/0000-0001-8772-7904
    ● Shaun de Witt https://orcid.org/0000-0003-4196-3658
    ● Shady El Damaty https://orcid.org/0000-0002-2318-4477
    ● Douglas Lowe https://orcid.org/0000-0002-1248-3594
    ● Xuanqi Li https://orcid.org/0000-0003-1498-6205
    ● Sveinung Gundersen https://orcid.org/0000-0001-9888-7954
    ● Muhammad Radifar https://orcid.org/0000-0001-9156-9478
    ● Rudolf Wittner https://orcid.org/0000-0002-0003-2024
    ● Oliver Woolland https://orcid.org/0000-0002-4565-9760
    ● Paul De Geest https://orcid.org/0000-0002-8940-4946
    ● Douglas Fils https://orcid.org/0000-0002-2257-9127
    ● Florian Wetzels https://orcid.org/0000-0002-5526-7138
    ● Raül Sirvent https://orcid.org/0000-0003-0606-2512
    ● Abigail Miller https://orcid.org/0000-0001-9228-2882
    ● Jake Emerson https://orcid.org/0000-0003-0617-9219
    ● Davide Fucci https://orcid.org/0000-0002-0679-4361
    Acknowledgements
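
Slides 20 to 24 describe an RO-Crate as a JSON-LD metadata file whose graph describes the package, its files, web-addressable resources and contextual entities, with profiles acting as must/should checklists. The following is a minimal sketch of that idea using only the Python standard library, not the official tooling. The context and specification URLs are the RO-Crate 1.1 identifiers cited on the slides; the file names, the example.org URLs, the person "Alice" and the checklist in `HYPOTHETICAL_PROFILE` are assumptions made up for illustration.

```python
import json

# RO-Crate 1.1 identifiers, as cited on the slides.
CONTEXT = "https://w3id.org/ro/crate/1.1/context"
SPEC = "https://w3id.org/ro/crate/1.1"

# Hypothetical checklist-style profile (illustration only, not the spec).
HYPOTHETICAL_PROFILE = {
    "must": ["name", "description", "datePublished"],
    "should": ["license", "author"],
}


def build_metadata():
    """Return a minimal crate description as a plain dict (all values are examples)."""
    graph = [
        {   # Metadata descriptor: says which entity is the root of the crate.
            "@id": "ro-crate-metadata.json",
            "@type": "CreativeWork",
            "conformsTo": {"@id": SPEC},
            "about": {"@id": "./"},
        },
        {   # Root dataset: the package itself.
            "@id": "./",
            "@type": "Dataset",
            "name": "Example crate",
            "description": "A hypothetical crate mixing a local file and a remote one.",
            "datePublished": "2022-10-05",
            "license": {"@id": "https://creativecommons.org/licenses/by/4.0/"},
            "author": {"@id": "https://example.org/people/alice"},
            "hasPart": [
                {"@id": "results/table.csv"},
                {"@id": "https://example.org/data/reads.fastq.gz"},
            ],
        },
        {   # Data entity packaged inside the crate.
            "@id": "results/table.csv",
            "@type": "File",
            "name": "Result table",
            "encodingFormat": "text/csv",
        },
        {   # Data entity referenced by URL: the data stays at its host.
            "@id": "https://example.org/data/reads.fastq.gz",
            "@type": "File",
            "name": "Large remote data file",
        },
        {   # Contextual entity: connects the crate to the outside world.
            "@id": "https://example.org/people/alice",
            "@type": "Person",
            "name": "Alice",
        },
    ]
    return {"@context": CONTEXT, "@graph": graph}


def resolve(metadata, reference):
    """Follow an {"@id": ...} reference to the entity it names, or None."""
    by_id = {entity["@id"]: entity for entity in metadata["@graph"]}
    return by_id.get(reference["@id"])


def missing_properties(entity, required):
    """List the required property names that the entity does not carry."""
    return [name for name in required if name not in entity]


if __name__ == "__main__":
    metadata = build_metadata()
    root = resolve(metadata, {"@id": "./"})
    print("missing MUST:", missing_properties(root, HYPOTHETICAL_PROFILE["must"]))
    print("author:", resolve(metadata, root["author"])["name"])
    print(json.dumps(metadata, indent=2)[:200], "...")
```

Writing the returned dict to a file named `ro-crate-metadata.json` next to `results/table.csv` would give a directory laid out like the crate on slide 20. The remote file is only referenced by its URL, which is the "metadata references the data" pattern from slides 7 and 38.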