SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
Digital Infrastructure for Reproducibility
in Science
RDA – Paris 2015
@mrgunn
https://orcid.org/0000-0002-3555-2054
Old Model: Single type of content;
single mode of distribution
Scholar
Library
Scholar
Publisher
FORCE11.org: Future of research communications and e-scholarship
Credit: Maryann Martone, hypothes.is
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Social
NetworksSocial
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
“Hidden web”
FORCE11.org: Future of research
communications and e-scholarship
Credit: Maryann Martone, hypothes.is
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Social
NetworksSocial
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
“Hidden web”
FORCE11.org: Future of research
communications and e-scholarship
Credit: Maryann Martone, hypothes.is
Make your research data citable
 DOIs for your data
 Force11 compliant citations
 Unique DOI for each version
 Integrated into article
submission
 data linked to article
 See metrics of reuse
Built for collaboration
Private
Full control over who
can see and download
Add and remove
collaborators
Local hosting option
ELN integration
Built for mobile
Mendeley Data has been
created to work on a wide
range of devices - from your
lab workstation to your mobile
phone. Browse datasets on the
go, or take a picture in the lab
and upload it instantly to your
research data.
Scholar
Consumer
Libraries
Data Repositories
Code Repositories
Community
databases/platforms
OA
Curators
Social
Networks
Social
NetworksSocial
Networks
Peer Reviewers
Narrative
Workflows
Data
Models
Multimedia
Nanopublications
Code
“Hidden web”
FORCE11.org: Future of research
communications and e-scholarship
Credit: Maryann Martone, hypothes.is
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Studies looking at the prevalence of irreproducibility estimate a rate of 50% or more
Vasilevsky et alBayer Healthcare
(Prinz et al.)
Amgen
(Begley and Ellis)
Glaziou et al.
51%
(n=80)
Hartshorne
and Schachner
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
51%
(n=257)
54%
(n=238)
78%
(n=67)
89%
(n=53)
64%
Source: GBSI 2015
Economic impact of irreproducibility
• A 50% irreproducibility rate within preclinical research implies that more
$28B/year is spent on research that is irreproducible
Reproducible
Irreproducible
Preclinical Research
$56.4B
$28.2B
(50%)
$28.2B
(50%)
US Preclinical Spend
2014E ($B)
$56B
(49%)
$58B
(51%)
Clinical
& Other
US Life Science Spend ($B)
Preclinical
R&D
$114B
Total US Life Science
Spend 2014E ($B)
Source: GBSI 2015
Retraction?
Only 0.2% of the literature (vs 50%+ irreproducibility)
Negative findings?
Less than 30% of researchers who could not reproduce published findings
published their failure1
Only 14% of the literature reports any negative results2
Additional publications from other academic
labs?
“We didn’t see that a target is more likely to be validated if it was reported in ten
publications or in two publications”3
Example: Retraction of PLOS4 and Science5 papers by Pamela Ronald at UC
Davis
• Self retraction due to reagent error
• Results had been ‘confirmed’ independently by three other groups6-8
1. Mobley et al. PLOS ONE. 8, e63221 (2013)
2. Fanelli. Scientometrics. 90, 891 (2012)
3. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011)
4. Han et al. PLOS ONE. 6, e29192 (2011)
5. Lee et al. Science. 326, 850 (2009)
6. McCarthy et al. J Bacteriology. 193, 6375 (2011)
7. Shuguo et al. Appl Biochem Biotechnol. 166, 1368 (2012)
8. Qian et al. J Proteome Res. 12, 3327 (2013)
What about citations?
None of the replication studies reported
have found any correlation with citations (or
journal impact factor):
• NINDS - No significant difference1
• Bayer - No significant difference2
• Amgen - “We saw no significant difference in citation rates
between papers that were reproducible versus non-
reproducible”3
1. Stuart et al. Experimental Neurology 233, 597–605 (2012).
2. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011).
3. Begley and Ellis. Nature. 483, 531-3 (2012)
Reproducibility Project: Cancer Biology
Reproducibility Project: Cancer Biology
Independently replicating key experimental results from
the top 50 cancer biology studies from 2010-2012
Learn more at http://cos.io/cancerbiology
Science Exchange
Validation Projects: lessons to date
Validation Projects: lessons to date
Validation Projects: lessons to date
Project 1. Cancer Biology Reproducibility Project
Resource identification:
Suggestions for improvements
Reduce reliance on contacting original authors who may
have moved on or forgotten details by the time of deposit
• Build tools to help capture research workflow and data
• (semi)-automated e.g. Riffyn, GenomeStudio
• Update journal formats to capture more information and
data e.g. GigaScience, Nature
• Uniquely identify commercial reagents e.g. Resource
Identification Initiative
• Repositories for materials e.g. Addgene, JAX, ATCC
Push over pull
DATA!
How and why we fail
at infrastructure
Definition – that which you only
notice when it breaks
We’re terrible at it, because it’s very
bureaucratic and boring
Funders want to talk about
infrastructure (meetings like
this!), but researchers don't.
How we do infrastructure now
1. A funder offers money to build infrastructure.
2. Researchers propose to do what they’re already doing
but generate infrastructure as a “side effect”.
3. A project gets funded using words such as “open,
distributed, lightweight, framework, etc”.
4. The distributed system gets co-opted by a centralized
one (or it fades away)
Examples: Handles & URIs-> DOIs, profiles -> ORCID
Bilder/Lin/Neylon org principles
• Trust can’t be solved by technology.
• A stakeholder-governed organization needs to manage
infrastructure.
• Transparency and openness is necessary but not
sufficient
• Org should aim for sustainability, but must not lobby and
needs formal incentives to replace itself
• Revenue should be based on services, not data
– builds trust, as community can go elsewhere if needed
www.mendeley.com
william.gunn@mendeley.com
@mrgunn

Contenu connexe

Tendances

Laurie Goodman: Overcoming Hurdles to Data Publication
Laurie Goodman: Overcoming Hurdles to Data PublicationLaurie Goodman: Overcoming Hurdles to Data Publication
Laurie Goodman: Overcoming Hurdles to Data PublicationGigaScience, BGI Hong Kong
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...Carole Goble
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...GigaScience, BGI Hong Kong
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMichel Dumontier
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkJean-Claude Bradley
 
Leveraging publication metadata to help overcome the data ingest bottleneck
Leveraging publication metadata to help overcome the data ingest bottleneck Leveraging publication metadata to help overcome the data ingest bottleneck
Leveraging publication metadata to help overcome the data ingest bottleneck Todd Vision
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesMichel Dumontier
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Sandra Binning
 
Data reuse and scholarly reward: understanding practice and building infrastr...
Data reuse and scholarly reward: understanding practice and building infrastr...Data reuse and scholarly reward: understanding practice and building infrastr...
Data reuse and scholarly reward: understanding practice and building infrastr...Todd Vision
 
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...GigaScience, BGI Hong Kong
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theoryC. Tobin Magle
 
Open Notebook Science in Drug Discovery
Open Notebook Science in Drug DiscoveryOpen Notebook Science in Drug Discovery
Open Notebook Science in Drug DiscoveryJean-Claude Bradley
 
Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...Todd Vision
 
Leveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reportsLeveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reportsPaul Albert
 
The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...Todd Vision
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge DiscoveryMichel Dumontier
 

Tendances (20)

Laurie Goodman: Overcoming Hurdles to Data Publication
Laurie Goodman: Overcoming Hurdles to Data PublicationLaurie Goodman: Overcoming Hurdles to Data Publication
Laurie Goodman: Overcoming Hurdles to Data Publication
 
SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...SEEK for Science: A Data and Model Management Platform to support Open and Re...
SEEK for Science: A Data and Model Management Platform to support Open and Re...
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
Scott Edmunds talk at G3 (Great GigaScience & Galaxy) workshop: Open Data: th...
 
Third-Party PubMed Tools
Third-Party PubMed ToolsThird-Party PubMed Tools
Third-Party PubMed Tools
 
MLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed ToolsMLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed Tools
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science Talk
 
Leveraging publication metadata to help overcome the data ingest bottleneck
Leveraging publication metadata to help overcome the data ingest bottleneck Leveraging publication metadata to help overcome the data ingest bottleneck
Leveraging publication metadata to help overcome the data ingest bottleneck
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?Public Data Archiving in Ecology and Evolution: How well are we doing?
Public Data Archiving in Ecology and Evolution: How well are we doing?
 
Data reuse and scholarly reward: understanding practice and building infrastr...
Data reuse and scholarly reward: understanding practice and building infrastr...Data reuse and scholarly reward: understanding practice and building infrastr...
Data reuse and scholarly reward: understanding practice and building infrastr...
 
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
Laurie Goodman at #aibsdata: Beyond Data Release Mandates - Helping Authors M...
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
 
Open Notebook Science in Drug Discovery
Open Notebook Science in Drug DiscoveryOpen Notebook Science in Drug Discovery
Open Notebook Science in Drug Discovery
 
Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...Research data and scholarly publications: going from casual acquaintances to ...
Research data and scholarly publications: going from casual acquaintances to ...
 
Leveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reportsLeveraging VIVO data: visualizations, queries, and reports
Leveraging VIVO data: visualizations, queries, and reports
 
The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...The Dryad Digital Repository: Published evolutionary data as part of the gre...
The Dryad Digital Repository: Published evolutionary data as part of the gre...
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 

En vedette

How to Use the Average Function
How to Use the Average FunctionHow to Use the Average Function
How to Use the Average Functionmurphyk0209
 
Panoramicas IV
Panoramicas IVPanoramicas IV
Panoramicas IVF. Ovies
 
Gala NAZARENA 2.012
Gala NAZARENA 2.012Gala NAZARENA 2.012
Gala NAZARENA 2.012CDANazareno
 
Definition of Mesoscale
Definition of MesoscaleDefinition of Mesoscale
Definition of MesoscaleBen Dewell
 
Redifice Arlington Promenade
Redifice Arlington PromenadeRedifice Arlington Promenade
Redifice Arlington PromenadeBangalore Prj
 
Internet e' Cambiato
Internet e' CambiatoInternet e' Cambiato
Internet e' CambiatoLuca Boldori
 
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALEISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALEBangalore Prj
 
Rory Green - "Building a Major Gift Pipeline From Scratch"
Rory Green - "Building a Major Gift Pipeline From Scratch"Rory Green - "Building a Major Gift Pipeline From Scratch"
Rory Green - "Building a Major Gift Pipeline From Scratch"WeDidIt
 
Translanguaging as pedagogic strategy and as resource for identity performanc...
Translanguaging as pedagogic strategy and as resource for identity performanc...Translanguaging as pedagogic strategy and as resource for identity performanc...
Translanguaging as pedagogic strategy and as resource for identity performanc...RMBorders
 
7. Stress Management Physiological
7. Stress Management Physiological7. Stress Management Physiological
7. Stress Management Physiologicalrossbiology
 
Proyecto final_flipped classroom
Proyecto final_flipped classroomProyecto final_flipped classroom
Proyecto final_flipped classroomPiedad Cerro
 
Análise estática de código Python
Análise estática de código PythonAnálise estática de código Python
Análise estática de código PythonGuilherme Vierno
 
Empreendedorismo e as Oportunidades Disfarçadas
Empreendedorismo e as Oportunidades DisfarçadasEmpreendedorismo e as Oportunidades Disfarçadas
Empreendedorismo e as Oportunidades DisfarçadasHugo Magalhães
 
Aula de Captação de Recursos - parte 2
Aula de Captação de Recursos - parte 2Aula de Captação de Recursos - parte 2
Aula de Captação de Recursos - parte 2Flavia Amorim
 

En vedette (20)

How to Use the Average Function
How to Use the Average FunctionHow to Use the Average Function
How to Use the Average Function
 
Franciaország
FranciaországFranciaország
Franciaország
 
Panoramicas IV
Panoramicas IVPanoramicas IV
Panoramicas IV
 
Charles Porch: Facebook for Nonprofits
Charles Porch: Facebook for NonprofitsCharles Porch: Facebook for Nonprofits
Charles Porch: Facebook for Nonprofits
 
Gala NAZARENA 2.012
Gala NAZARENA 2.012Gala NAZARENA 2.012
Gala NAZARENA 2.012
 
Definition of Mesoscale
Definition of MesoscaleDefinition of Mesoscale
Definition of Mesoscale
 
Redifice Arlington Promenade
Redifice Arlington PromenadeRedifice Arlington Promenade
Redifice Arlington Promenade
 
Internet e' Cambiato
Internet e' CambiatoInternet e' Cambiato
Internet e' Cambiato
 
CIRCUITOS
CIRCUITOSCIRCUITOS
CIRCUITOS
 
Nnv
NnvNnv
Nnv
 
d8ooa.odp
d8ooa.odpd8ooa.odp
d8ooa.odp
 
Alexis Ohanian: The Benevolent Web
Alexis Ohanian: The Benevolent Web  Alexis Ohanian: The Benevolent Web
Alexis Ohanian: The Benevolent Web
 
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALEISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
ISHWERYA QUARTZ 2BHK APARTMENTS FOR SALE
 
Rory Green - "Building a Major Gift Pipeline From Scratch"
Rory Green - "Building a Major Gift Pipeline From Scratch"Rory Green - "Building a Major Gift Pipeline From Scratch"
Rory Green - "Building a Major Gift Pipeline From Scratch"
 
Translanguaging as pedagogic strategy and as resource for identity performanc...
Translanguaging as pedagogic strategy and as resource for identity performanc...Translanguaging as pedagogic strategy and as resource for identity performanc...
Translanguaging as pedagogic strategy and as resource for identity performanc...
 
7. Stress Management Physiological
7. Stress Management Physiological7. Stress Management Physiological
7. Stress Management Physiological
 
Proyecto final_flipped classroom
Proyecto final_flipped classroomProyecto final_flipped classroom
Proyecto final_flipped classroom
 
Análise estática de código Python
Análise estática de código PythonAnálise estática de código Python
Análise estática de código Python
 
Empreendedorismo e as Oportunidades Disfarçadas
Empreendedorismo e as Oportunidades DisfarçadasEmpreendedorismo e as Oportunidades Disfarçadas
Empreendedorismo e as Oportunidades Disfarçadas
 
Aula de Captação de Recursos - parte 2
Aula de Captação de Recursos - parte 2Aula de Captação de Recursos - parte 2
Aula de Captação de Recursos - parte 2
 

Similaire à RDA Scholarly Infrastructure 2015

RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapemhaendel
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersPhilip Bourne
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
2015 12 ebi_ganley_final
2015 12 ebi_ganley_final2015 12 ebi_ganley_final
2015 12 ebi_ganley_finalEmma Ganley
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsPhilip Bourne
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reusevoginip
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
Elsevier - Labs on Line
Elsevier - Labs on Line Elsevier - Labs on Line
Elsevier - Labs on Line Philip Bourne
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data ManagementCarole Goble
 
Biomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterpriseBiomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterprisePhilip Bourne
 
Looking for Data: Finding New Science
Looking for Data: Finding New ScienceLooking for Data: Finding New Science
Looking for Data: Finding New ScienceAnita de Waard
 
On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...Susanna-Assunta Sansone
 
The OpenCon Intro to Open Data
The OpenCon Intro to Open DataThe OpenCon Intro to Open Data
The OpenCon Intro to Open DataRoss Mounce
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyFAIRDOM
 

Similaire à RDA Scholarly Infrastructure 2015 (20)

RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
Force11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscapeForce11: Enabling transparency and efficiency in the research landscape
Force11: Enabling transparency and efficiency in the research landscape
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
One Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific PublishersOne Scientist’s Wish List for Scientific Publishers
One Scientist’s Wish List for Scientific Publishers
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
2015 12 ebi_ganley_final
2015 12 ebi_ganley_final2015 12 ebi_ganley_final
2015 12 ebi_ganley_final
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Data at the NIH: Some Early Thoughts
Data at the NIH: Some Early ThoughtsData at the NIH: Some Early Thoughts
Data at the NIH: Some Early Thoughts
 
Minimal viable data reuse
Minimal viable data reuseMinimal viable data reuse
Minimal viable data reuse
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Elsevier - Labs on Line
Elsevier - Labs on Line Elsevier - Labs on Line
Elsevier - Labs on Line
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
Biomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital EnterpriseBiomedical Research as Part of the Digital Enterprise
Biomedical Research as Part of the Digital Enterprise
 
Looking for Data: Finding New Science
Looking for Data: Finding New ScienceLooking for Data: Finding New Science
Looking for Data: Finding New Science
 
On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...On community-standards, data curation and scholarly communication - BITS, Ita...
On community-standards, data curation and scholarly communication - BITS, Ita...
 
Cartegena051811
Cartegena051811Cartegena051811
Cartegena051811
 
The OpenCon Intro to Open Data
The OpenCon Intro to Open DataThe OpenCon Intro to Open Data
The OpenCon Intro to Open Data
 
The FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
 

Dernier

Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...D. B. S. College Kanpur
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxNandakishor Bhaurao Deshmukh
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayupadhyaymani499
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologycaarthichand2003
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomyDrAnita Sharma
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuinethapagita
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxnoordubaliya2003
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubaikojalkojal131
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringPrajakta Shinde
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024innovationoecd
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)riyaescorts54
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》rnrncn29
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)Columbia Weather Systems
 

Dernier (20)

Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
Fertilization: Sperm and the egg—collectively called the gametes—fuse togethe...
 
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdfPests of safflower_Binomics_Identification_Dr.UPR.pdf
Pests of safflower_Binomics_Identification_Dr.UPR.pdf
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptxTHE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
THE ROLE OF PHARMACOGNOSY IN TRADITIONAL AND MODERN SYSTEM OF MEDICINE.pptx
 
Citronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyayCitronella presentation SlideShare mani upadhyay
Citronella presentation SlideShare mani upadhyay
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Davis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technologyDavis plaque method.pptx recombinant DNA technology
Davis plaque method.pptx recombinant DNA technology
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
basic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomybasic entomology with insect anatomy and taxonomy
basic entomology with insect anatomy and taxonomy
 
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 GenuineCall Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
Call Girls in Majnu Ka Tilla Delhi 🔝9711014705🔝 Genuine
 
preservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptxpreservation, maintanence and improvement of industrial organism.pptx
preservation, maintanence and improvement of industrial organism.pptx
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Volatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -IVolatile Oils Pharmacognosy And Phytochemistry -I
Volatile Oils Pharmacognosy And Phytochemistry -I
 
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In DubaiDubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
Dubai Calls Girl Lisa O525547819 Lexi Call Girls In Dubai
 
Microteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical EngineeringMicroteaching on terms used in filtration .Pharmaceutical Engineering
Microteaching on terms used in filtration .Pharmaceutical Engineering
 
User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)User Guide: Orion™ Weather Station (Columbia Weather Systems)
User Guide: Orion™ Weather Station (Columbia Weather Systems)
 
OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024OECD bibliometric indicators: Selected highlights, April 2024
OECD bibliometric indicators: Selected highlights, April 2024
 
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
(9818099198) Call Girls In Noida Sector 14 (NOIDA ESCORTS)
 
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》《Queensland毕业文凭-昆士兰大学毕业证成绩单》
《Queensland毕业文凭-昆士兰大学毕业证成绩单》
 
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
User Guide: Pulsar™ Weather Station (Columbia Weather Systems)
 

RDA Scholarly Infrastructure 2015

  • 1. Digital Infrastructure for Reproducibility in Science RDA – Paris 2015 @mrgunn https://orcid.org/0000-0002-3555-2054
  • 2. Old Model: Single type of content; single mode of distribution Scholar Library Scholar Publisher FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 3. Scholar Consumer Libraries Data Repositories Code Repositories Community databases/platforms OA Curators Social Networks Social NetworksSocial Networks Peer Reviewers Narrative Workflows Data Models Multimedia Nanopublications Code “Hidden web” FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 4.
  • 5. Scholar Consumer Libraries Data Repositories Code Repositories Community databases/platforms OA Curators Social Networks Social NetworksSocial Networks Peer Reviewers Narrative Workflows Data Models Multimedia Nanopublications Code “Hidden web” FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 6. Make your research data citable  DOIs for your data  Force11 compliant citations  Unique DOI for each version  Integrated into article submission  data linked to article  See metrics of reuse
  • 7. Built for collaboration Private Full control over who can see and download Add and remove collaborators Local hosting option ELN integration
  • 8. Built for mobile Mendeley Data has been created to work on a wide range of devices - from your lab workstation to your mobile phone. Browse datasets on the go, or take a picture in the lab and upload it instantly to your research data.
  • 9.
  • 10. Scholar Consumer Libraries Data Repositories Code Repositories Community databases/platforms OA Curators Social Networks Social NetworksSocial Networks Peer Reviewers Narrative Workflows Data Models Multimedia Nanopublications Code “Hidden web” FORCE11.org: Future of research communications and e-scholarship Credit: Maryann Martone, hypothes.is
  • 11. 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Studies looking at the prevalence of irreproducibility estimate a rate of 50% or more Vasilevsky et alBayer Healthcare (Prinz et al.) Amgen (Begley and Ellis) Glaziou et al. 51% (n=80) Hartshorne and Schachner 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 51% (n=257) 54% (n=238) 78% (n=67) 89% (n=53) 64% Source: GBSI 2015
  • 12. Economic impact of irreproducibility • A 50% irreproducibility rate within preclinical research implies that more $28B/year is spent on research that is irreproducible Reproducible Irreproducible Preclinical Research $56.4B $28.2B (50%) $28.2B (50%) US Preclinical Spend 2014E ($B) $56B (49%) $58B (51%) Clinical & Other US Life Science Spend ($B) Preclinical R&D $114B Total US Life Science Spend 2014E ($B) Source: GBSI 2015
  • 13. Retraction? Only 0.2% of the literature (vs 50%+ irreproducibility) Negative findings? Less than 30% of researchers who could not reproduce published findings published their failure1 Only 14% of the literature reports any negative results2 Additional publications from other academic labs? “We didn’t see that a target is more likely to be validated if it was reported in ten publications or in two publications”3 Example: Retraction of PLOS4 and Science5 papers by Pamela Ronald at UC Davis • Self retraction due to reagent error • Results had been ‘confirmed’ independently by three other groups6-8 1. Mobley et al. PLOS ONE. 8, e63221 (2013) 2. Fanelli. Scientometrics. 90, 891 (2012) 3. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011) 4. Han et al. PLOS ONE. 6, e29192 (2011) 5. Lee et al. Science. 326, 850 (2009) 6. McCarthy et al. J Bacteriology. 193, 6375 (2011) 7. Shuguo et al. Appl Biochem Biotechnol. 166, 1368 (2012) 8. Qian et al. J Proteome Res. 12, 3327 (2013)
  • 14. What about citations? None of the replication studies reported have found any correlation with citations (or journal impact factor): • NINDS - No significant difference1 • Bayer - No significant difference2 • Amgen - “We saw no significant difference in citation rates between papers that were reproducible versus non- reproducible”3 1. Stuart et al. Experimental Neurology 233, 597–605 (2012). 2. Prinz et al. Nat Rev Drug Discov. 10, 712 (2011). 3. Begley and Ellis. Nature. 483, 531-3 (2012)
  • 16. Reproducibility Project: Cancer Biology Independently replicating key experimental results from the top 50 cancer biology studies from 2010-2012 Learn more at http://cos.io/cancerbiology
  • 20. Validation Projects: lessons to date Project 1. Cancer Biology Reproducibility Project Resource identification:
  • 21. Suggestions for improvements Reduce reliance on contacting original authors who may have moved on or forgotten details by the time of deposit • Build tools to help capture research workflow and data • (semi)-automated e.g. Riffyn, GenomeStudio • Update journal formats to capture more information and data e.g. GigaScience, Nature • Uniquely identify commercial reagents e.g. Resource Identification Initiative • Repositories for materials e.g. Addgene, JAX, ATCC
  • 23. How and why we fail at infrastructure
  • 24. Definition – that which you only notice when it breaks We’re terrible at it, because it’s very bureaucratic and boring
  • 25. Funders want to talk about infrastructure (meetings like this!), but researchers don't.
  • 26. How we do infrastructure now 1. A funder offers money to build infrastructure. 2. Researchers propose to do what they’re already doing but generate infrastructure as a “side effect”. 3. A project gets funded using words such as “open, distributed, lightweight, framework, etc”. 4. The distributed system gets co-opted by a centralized one (or it fades away) Examples: Handles & URIs-> DOIs, profiles -> ORCID
  • 27. Bilder/Lin/Neylon org principles • Trust can’t be solved by technology. • A stakeholder-governed organization needs to manage infrastructure. • Transparency and openness is necessary but not sufficient • Org should aim for sustainability, but must not lobby and needs formal incentives to replace itself • Revenue should be based on services, not data – builds trust, as community can go elsewhere if needed