Adopting DSpace 7 and 8:
Challenges and Solutions from
Real Migration Experiences
AGENDA
4Science: who we are
It is not just an
update, it is always
a migration
A couple of hints
about your data
model
There are more data
that need to be
migrated than what
you expect
Plan,
Do, Check, Finalize
Common pitfalls &
Solution strategies
Takeaways
Today's speaker
Susanna Mornati,
Chief Operating Officer at 4Science
susanna.mornati@4science.com
Who we are
OUR AIM: to enable implementation of the transnationally
important policies
of Open Research,
Research Impact
and Digital Preservation.
DSpace
(CRIS/GLAM)
OJS
Dataverse
Our
services:
Our solutions
support
compliance
with key
international
standards:
Certified Platinum
Provider and leading
contributor to DSpace 7
✓ OpenAIRE
✓ ORCID
✓ CERIF
✓ IIIF
We provide
solutions for
research
information & data
management and
for cultural
heritage
• Installation
• Configuration
• Hosting and
maintenance
• System integration,
customization and
consultancy
What we believe in
Security is not a matter of compromise: our
solutions are secure by design; openness without
security would be counterproductive; security without
openness would be unproductive.
Certification: ISO/IEC 27001:2013,
27017:2015, 27018:2019,
and ISO 9001:2015
Our solutions support
the key defining
transnational
policies, Open
Research and open
digital cultural
heritage, and are
based on:
Open-
source
software
Open
standards
Interoperability
Preservation
Collaboration
Innovation
A fast-growing organization
Over 100 clients
in 5 continents
worldwide
https://www.4science.com/we-work-for/
The context in which we have operated since 2016
We are driven by serving the
open knowledge ecosystem.
Proprietary products often
come with expensive licenses
and pricing fluctuations, can
become obsolete and can
result in vendor lock-in.
Our open solutions (open
standards, open protocols,
open source) are designed to
support open science.
Open knowledge helps to
solve, by collaboration, the
world’s very
pressing problems, and creates
new opportunities, especially
when cross-disciplinary.
4Science's role in the Open Science and DSpace community
Certified Platinum
Provider and leading
contributor of DSpace
Our goal is to anticipate
the future making it
more accessible
2023 DSpace worldwide
community leaders for
hours donated for
DSpace development
Experts in the field and
enablers that can help
with any situation
At 4Science we are driven by serving the open knowledge ecosystem.
Open knowledge
empowering open access,
supporting open science,
advancing open scholarly
communication.
FAIR data
Our solutions enable your
data to be Findable,
Accessible, Interoperable and
Reusable
Interoperable solutions
ORCID and DataCite Certified
Service Provider, CERIF and
IIIF enabler
Compliance & Quality
COAR-NGR, OpenAIRE,
Certified Platinum Provider of
DSpace, ISO 9001:2015
Security
Battle-tested solutions, secure
by design; Trusted Providers
of the Cloud Security Alliance
«Migration» or «update»? Not so different?
In this session we will look at some insights from best practices that we have learned moving from DSpace 5,
DSpace 6, EPrints, Digital Commons, OPUS or even custom solutions, but the first thing we would like to share is…
Even when you are about to upgrade from an old version of DSpace to the new one, keep in mind that it has been
completely reengineered: any update to a major release should therefore be understood (and
planned with the appropriate timing) as if it were a migration to an entirely new platform, in addition to
integrations with systems already in your ecosystem.
Consider it as if it were a migration to a completely different system, although the main paradigms and approaches
are preserved
Entities are
the
foundation of
the new data
model
An effective data model should also be
flexible
Entities are a pivotal part of defining a
whole data model and contribute to its
design: they enable flexibility to reflect
your data in a more granular way
Your data model should be as close as
possible to international standards to
enhance interoperability
The current design of DSpace 7
provides the foundation for flexibility
ensuring that it can be tailored to your
requirements
Relations complete the definition of
your data model: authors, publications,
organizations and more, can be
interconnected to each other
Entities should reflect your data model, enabling
relations and exploring connections
ENTITIES ARE A WAY OF
REPRESENTING DATA AND THEIR
RELATIONS IN A STRUCTURED
MANNER
ENTITIES ARE CONSTITUTED BY
RECORDS THAT CAN BE
DESCRIBED, IDENTIFIED, AND
RELATED TO OTHER RECORDS IN
A REPOSITORY
ENTITIES ARE USED TO
REPRESENT REAL-WORLD
OBJECTS SUCH AS PEOPLE,
ORGANIZATIONS,
PUBLICATIONS AS WELL AS
ABSTRACT CONCEPTS SUCH AS
SUSTAINABILITY GOALS,
RESEARCH LINES, THEMATIC
COLLECTIONS
ENTITIES ARE USED TO PROVIDE
CONTEXT, CONNECTIONS, AND
RELATIONSHIPS BETWEEN
OBJECTS IN THE REPOSITORY,
SUPPORTING DISCOVERY AND
COMPREHENSION OF THE
CONTEXT
But with a correct balance:
when you’re about to migrate…
• You could have processes that
you would like to drop
• Customizations that affect your
maintenance costs
• Metadata representing
information that is no longer
useful
• And processes…you’d like to add,
or change
• New features that can substitute
your old customizations
• Opportunity to add new
information to your repository
How to enable entities
during the upgrade: pt. 1
How to enable entities
during the upgrade: pt. 2
This step/job may be slow!
How to enable
entities during
the
“migration”
from other
platforms
Follow the DSpace documentation, YES but... how do you import all the
metadata, relationships and files?
• The SAF import could be an option (single records), BUT... you
cannot set a relationship with not-yet-created entities: it is
preferable to create all entities individually first, making sure to store a
local.legacyid value for each
• Use the CSV bulk edit (manually or automatically updated) to
create the relationship(s)
Warning: the CSV bulk edit cannot manage ordering between entities and
simple strings (e.g. the ordering of authors when only a few of them have
a profile)
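The two-pass approach described above (create every entity with a local.legacyid first, then add the relationships through a CSV bulk edit) could be sketched roughly as follows. This is only an illustration, not DSpace tooling: the `legacy_to_uuid` mapping and the sample identifiers are hypothetical, and the `relation.isAuthorOfPublication` column name is an assumption that has to match the relationship types configured in your repository. Multiple values are joined with `||`, the separator used by the batch metadata editing format.

```python
import csv
import io


def build_relationship_csv(publication_authors, legacy_to_uuid,
                           column="relation.isAuthorOfPublication"):
    """Return CSV text linking publications to already-created author entities.

    publication_authors: {legacy publication id: [legacy author ids, in order]}
    legacy_to_uuid:      {legacy id: UUID assigned by the new repository}
    The column name is an assumption; adapt it to your relationship type.
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["id", column])
    for pub_legacy_id, author_legacy_ids in publication_authors.items():
        # Fail loudly if an entity was not created during the first pass.
        wanted = [pub_legacy_id, *author_legacy_ids]
        missing = [legacy for legacy in wanted if legacy not in legacy_to_uuid]
        if missing:
            raise KeyError(f"no UUID recorded for legacy ids: {missing}")
        targets = "||".join(legacy_to_uuid[a] for a in author_legacy_ids)
        writer.writerow([legacy_to_uuid[pub_legacy_id], targets])
    return out.getvalue()
```

Raising an error on a missing legacy ID is a deliberate choice here: a relationship that silently points nowhere is harder to spot later than a failed export.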
All of that is
easier in DSpace-
CRIS thanks to
the possibility to
use…
• Denormalized tables where you can prepare your data for import (like
the CSV but on the database) → easier!
• Enhanced Bulk import from Excel instead of CSV (yes, it is a non-standard
format but easier to work with, available for non-technical people →
new lines can be created)
• Promise for future reference that will be resolved once the target item is
created (i.e. you can say will be referenced:ORCID:XXXXX to
create a relation with the item AUTHOR using
person.identifier.orcid = XXXXX)
• You can manage files directly providing a remote URL (no SAF process
needed)
• Ordering between Entities and strings is supported (column with the
specific relationship.type can be ordered by value/promise)
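The idea behind a "promise" (a reference that is recorded now and resolved once the target item exists) can be illustrated with a small registry. This is a generic sketch of deferred reference resolution, not the DSpace-CRIS implementation: the class, its method names and the sample identifiers are all hypothetical.

```python
from collections import defaultdict


class PromiseRegistry:
    """Hold references to items that do not exist yet and resolve them later."""

    def __init__(self):
        self._pending = defaultdict(list)  # (field, value) -> [(source_id, relation)]
        self._known = {}                   # (field, value) -> target_id
        self.relations = []                # resolved (source_id, relation, target_id)

    def reference(self, source_id, relation, field, value):
        """Record that source_id relates to the item whose field equals value."""
        key = (field, value)
        if key in self._known:
            # Target already exists: resolve immediately.
            self.relations.append((source_id, relation, self._known[key]))
        else:
            self._pending[key].append((source_id, relation))

    def item_created(self, target_id, field, value):
        """Register a newly created item and resolve every promise waiting on it."""
        key = (field, value)
        self._known[key] = target_id
        for source_id, relation in self._pending.pop(key, []):
            self.relations.append((source_id, relation, target_id))

    def unresolved(self):
        """Return the (field, value) keys that are still waiting for a target."""
        return sorted(self._pending)
```

After the import, `unresolved()` lists the identifiers that never matched a created item, which is a useful check before declaring a test migration complete.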
Not enough said, but…
Do not customize your DSpace
database tables/structure, nor
backport any feature that changes it
Why?
Because it could cause your
automated database upgrade process
to fail
Create new tables (instead of
modifying existing ones)
ALREADY DID?
Consider replacing your additional contents (tables) → new entities enabled by DSpace 7
Yes, your institution has a lot of data
…and not all of them are visible in plain sight (as metadata of your
items)
There will be more data emerging that you did not imagine
So…please keep this in mind
OAI identifiers should be preserved. This is currently not supported without a code change
(we plan to generalize the solution and open a PR → DSpace 8)
OAI URLs should be preserved as well:
redirection is (almost) good enough, but you should
check it at least with your known harvesters
→ Easy to do in Apache or nginx (light web
server)
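Before writing the redirect rules, it can help to list, for each URL your known harvesters use today, the URL they should land on afterwards. The sketch below does only that mapping, as a pure function; it does not contact any server. The old path prefix `/oai`, the new base `https://repo.example.org/server/oai` and the hostnames are assumptions for illustration and must be replaced with your own values.

```python
from urllib.parse import urlsplit, urlunsplit


def map_oai_url(old_url, old_prefix="/oai",
                new_base="https://repo.example.org/server/oai"):
    """Return the URL a legacy OAI request should be redirected to.

    The path below old_prefix and the query string (verb, metadataPrefix,
    resumptionToken, ...) are carried over unchanged.
    """
    parts = urlsplit(old_url)
    if parts.path != old_prefix and not parts.path.startswith(old_prefix + "/"):
        raise ValueError(f"not a legacy OAI URL: {old_url}")
    suffix = parts.path[len(old_prefix):]
    new = urlsplit(new_base)
    return urlunsplit((new.scheme, new.netloc, new.path + suffix, parts.query, ""))
```

The resulting pairs of old and new URLs can then be used as a checklist when testing the Apache or nginx redirect with each harvester.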
Statistics can be migrated
Upgrade procedures, if followed, will result in a full migration of the data... well, not really:
deleted items / bitstreams are lost
When you migrate from another platform you can bulk import your statistics data
directly into Solr via CSV. The data need to be prepared, so a local.legacyid metadata value will be
crucial to translate your legacy IDs into the new ones
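Preparing the statistics data essentially means rewriting each legacy identifier to the identifier of the corresponding new item. A minimal sketch, assuming the usage events are available as dictionaries and that a `legacy_to_uuid` mapping was built from the stored local.legacyid values; the field names `id`, `type` and `time` are placeholders and must match the columns your Solr import actually expects.

```python
import csv
import io


def translate_statistics(rows, legacy_to_uuid, id_field="id"):
    """Replace legacy ids with new ids; return (translated, skipped) rows."""
    translated, skipped = [], []
    for row in rows:
        new_id = legacy_to_uuid.get(row[id_field])
        if new_id is None:
            # No migrated item for this event: keep it aside for review.
            skipped.append(row)
        else:
            translated.append({**row, id_field: new_id})
    return translated, skipped


def rows_to_csv(rows, fieldnames):
    """Serialize rows to CSV text with a header line."""
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()
```

Keeping the skipped rows instead of dropping them makes it possible to quantify how many events refer to items that were not migrated.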
Step 1: PLAN - ask yourself all relevant questions
Make sure to sync your activities and preparatory/interdependent tasks...
Prepare a new,
separated, environment
for DSpace 7
Do you use the Handle
Server?
Do you mint DOIs?
Integration: what application extracts data from DSpace? What application
pushes data into DSpace? Using which technology: SWORD, REST API? How
much time will third parties need to switch from the old integration to the
new one?
Plan to put your
repository in READ ONLY
mode for enough time
to perform the final
migration
Prepare your UATs, which
should take into account
your customizations,
configurations and
top-priority functionalities
You need to run the migration at least two times and
usually you cannot afford to have your current repository
locked down for a long period
This means that the two runs will use slightly different data!
Even if the repository is
in READ ONLY mode,
there are still running
data... Statistics will
grow!
Step 2: DO
Verify the timing for execution/import/indexing during this phase: you’ll
benefit from these figures for the final migration
Remember to keep track of all of your steps (you’ll have to repeat them
exactly for the final migration)
Do your first test migration
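Keeping track of the steps and of their timing can be as simple as running them through a small wrapper that records what was executed, in which order, and for how long. The sketch below is one possible way to do it; the step names and the callables are placeholders for your own import, indexing and verification commands.

```python
import time


def run_steps(steps, clock=time.perf_counter):
    """Run (name, callable) pairs in order and return a timing log.

    The log doubles as the checklist to replay during the final migration.
    """
    log = []
    for name, action in steps:
        start = clock()
        action()
        log.append({"step": name, "seconds": clock() - start})
    return log
```

The `clock` parameter exists only so that the timing source can be replaced in tests; in normal use the default is sufficient.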
Step 3:
CHECK
Perform UATs to validate and
flag possible issues (and the
related fixes you applied)
If you notice something
wrong that was not covered
by UATs, you should not
ignore it: UATs should be
amended to reflect the
path
Verify that the timing of the
first migration allows you
to meet the
deadlines you were
expecting
Verify which tasks could be
optimized/reviewed
Check data integrity: run the
checksum checker (fixed by
4Science in 7.6)
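Independently of the built-in checksum checker, a quick cross-check of migrated files against the checksums exported from the old system might look like the sketch below. It assumes MD5 checksums and a hypothetical `expected` mapping of file paths to hex digests; adjust the algorithm if your legacy system recorded a different one.

```python
import hashlib


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_mismatches(expected):
    """Return the sorted paths whose current MD5 differs from the expected one.

    expected: {file path: MD5 hex digest recorded in the old system}
    """
    return sorted(
        path for path, checksum in expected.items()
        if md5_of(path) != checksum.lower()
    )
```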
Temporarily disable indexing during intermediate milestones/steps to save some time…
(…but be careful of the interdependencies in further steps and keep in mind that you’ll have to run a full
indexing when needed)
About the automatic initial reindexing: it is not recommended to skip it, unless you will manually reindex at a
later time, or verify that a reindexing is not necessary. Forgetting to reindex your site after an upgrade may result
in unexpected errors or instabilities
Step 4: FINALIZE
Put your current production environment in read-only mode before performing the final
deployment
Alert your partners of integrated systems that the system is frozen
Extract your data from your current frozen repository
Re-run the steps that you successfully ran during the first test migration: even small
differences may lead to unexpected issues
Run the UAT books: if everything goes smoothly, make the final switch into production
DOs
Give notice to your partners that they can resume ordinary activities on their
third-party systems
Move your handle server to your new environment
Enable all of your cron jobs
Update ALL of your URLs to match the ones in production
More pitfalls and solutions we adopted
with experience
…from DSpace 5, DSpace 6, EPrints, Digital
Commons, OPUS, Invenio…
UATs, the world where the obvious is certainly
not – guidelines
A plan should be prepared and followed methodically to test and verify
consistency between the old system and the new one. A few examples:
1. How many items were visible in the old system? How many in the new one?
2. How many items were present in the users' workspace? How many in the new
system?
3. Same for workflows: how many in the various steps, how many assigned to the
various users?
4. Are any items restricted or embargoed? Are restrictions migrated correctly and
working?
5. Are all protocols used by 3rd party systems enabled (SWORD? Legacy REST…)?
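The counting questions above lend themselves to an automated cross-check. A minimal sketch, assuming the counts have already been collected from both systems into two dictionaries; the check names used in the usage example are hypothetical and how the numbers are obtained (SQL, REST, Solr) is left open.

```python
def compare_counts(old, new):
    """Return (check, old value, new value) for every check that differs.

    A check present on only one side is reported with None on the other.
    """
    findings = []
    for check in sorted(set(old) | set(new)):
        before, after = old.get(check), new.get(check)
        if before != after:
            findings.append((check, before, after))
    return findings
```

For example, `compare_counts({"visible_items": 1200, "workspace_items": 35}, {"visible_items": 1198, "workspace_items": 35})` reports only the `visible_items` difference, which then becomes an item to investigate in the UAT.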
Time spent in
UATs is very
well-spent
Through these cross-checks we had the
opportunity to discover inconsistencies
between the database and UI of older
versions of DSpace:
• even fixing the problem in the new
version did not always coincide with
the user's desires (e.g., items
previously not visible by mistake
becoming visible in the new version
and vice versa).
Fun facts and
unapparent trivia
Thumbnails in the new DSpace 7 are now larger
than in the old versions. We learned that
importing the old ones would compromise the
layout.
This resulted in the discovery of the century: all
thumbnails had to be…regenerated.
4Science contributed the fix for the regeneration
of the thumbnails ☺
The moral: consider every possible interaction!
Fun facts and
unapparent trivia
Most viewed item? OH YES PLEASE.
…but the item in the new version turned out to be
different from the item in the old version. Why?
Because slightly different rules had simply been applied,
which led to a different result.
One can never be too cautious: watch out for
inconsistencies and rule changes, even
minimal ones.
What about DSpace 8?
• DSpace 8 is expected to go live in the spring/summer of 2024
• It will not be a major change like DSpace 7 was
Should I upgrade to DSpace 7 or wait for DSpace 8 to be released?
• We suggest cautiously migrating/upgrading to the most stable version at the
moment of the release, assessing what is better for your institution
• The upgrade from DSpace 7 to DSpace 8 will not require such a big effort
compared to the upgrade from DSpace 5 / 6 to 7
• Institutions upgrading from DSpace 7 to DSpace 8 will enjoy features already
implemented in DSpace-CRIS 7, e.g. Notify protocol (contributed by 4Science
+ Harvard), Correction service to enhance data quality (4Science), Duplicate
detection (ported by TLC from our implementation in DSpace-CRIS)
Be sure to check every
minimal step and take careful
note of it.
Time spent in analysis and
double-checks is really well
spent
We, at 4Science, would love
to put our expertise at your
service on behalf of the
entire community.
Contact us at: info@4Science.com
Visit our website: www.4science.com
Follow us on social media!
4Science International 4ScienceDSpace
4ScienceIT
4Science
Join the 4Science
newsletter to keep up to
date with news about
our contributions to
DSpace and much more!

Contenu connexe

Tendances

Project Manual
Project ManualProject Manual
Project ManualJoe Lynn
 
Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...
Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...
Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...Koch Modular Process Systems, LLC.
 
Turnaround Project Planner Primer
Turnaround Project Planner PrimerTurnaround Project Planner Primer
Turnaround Project Planner Primerahmad bassiouny
 
Detecting Anomalous Behavior with Surveillance​ Analytics​
Detecting Anomalous Behavior with Surveillance​ Analytics​Detecting Anomalous Behavior with Surveillance​ Analytics​
Detecting Anomalous Behavior with Surveillance​ Analytics​Databricks
 
Microsoft Projects; a step by-step guide for beginner's
Microsoft Projects; a step by-step guide for beginner'sMicrosoft Projects; a step by-step guide for beginner's
Microsoft Projects; a step by-step guide for beginner'sAbhik Tushar Das
 
Software Project Management (SPM)
Software Project Management (SPM)Software Project Management (SPM)
Software Project Management (SPM)Shahid Riaz
 
import data to model
import data to modelimport data to model
import data to modelSerdar Kaya
 
BW on HANA optimisation answers
BW on HANA optimisation answersBW on HANA optimisation answers
BW on HANA optimisation answersAjay Kumar Uppal
 
Construction project management & risk mitigation
Construction project management & risk mitigationConstruction project management & risk mitigation
Construction project management & risk mitigationrajlaxmipardeshi
 
Geological modeling lessons learned
Geological modeling lessons learnedGeological modeling lessons learned
Geological modeling lessons learnedSerdar Kaya
 
Primavera P6 manual
Primavera P6 manual Primavera P6 manual
Primavera P6 manual Abbas Tahir
 

Tendances (20)

Project Manual
Project ManualProject Manual
Project Manual
 
Project Scheduling & Controls
Project Scheduling & ControlsProject Scheduling & Controls
Project Scheduling & Controls
 
Hidrotermalismo
HidrotermalismoHidrotermalismo
Hidrotermalismo
 
WBS PROJECT
WBS PROJECTWBS PROJECT
WBS PROJECT
 
50 Planning Fundamentals V5.0 - procedures only
50 Planning Fundamentals V5.0 - procedures only50 Planning Fundamentals V5.0 - procedures only
50 Planning Fundamentals V5.0 - procedures only
 
Ms project training ver 01
Ms project training ver 01Ms project training ver 01
Ms project training ver 01
 
Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...
Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...
Modularization: How Going Modular Can Mitigate Project Risk and Deliver Proje...
 
Turnaround Project Planner Primer
Turnaround Project Planner PrimerTurnaround Project Planner Primer
Turnaround Project Planner Primer
 
Detecting Anomalous Behavior with Surveillance​ Analytics​
Detecting Anomalous Behavior with Surveillance​ Analytics​Detecting Anomalous Behavior with Surveillance​ Analytics​
Detecting Anomalous Behavior with Surveillance​ Analytics​
 
Microsoft Projects; a step by-step guide for beginner's
Microsoft Projects; a step by-step guide for beginner'sMicrosoft Projects; a step by-step guide for beginner's
Microsoft Projects; a step by-step guide for beginner's
 
Software Project Management (SPM)
Software Project Management (SPM)Software Project Management (SPM)
Software Project Management (SPM)
 
Planning For Engineering
Planning For EngineeringPlanning For Engineering
Planning For Engineering
 
import data to model
import data to modelimport data to model
import data to model
 
BW on HANA optimisation answers
BW on HANA optimisation answersBW on HANA optimisation answers
BW on HANA optimisation answers
 
Construction project management & risk mitigation
Construction project management & risk mitigationConstruction project management & risk mitigation
Construction project management & risk mitigation
 
Geological modeling lessons learned
Geological modeling lessons learnedGeological modeling lessons learned
Geological modeling lessons learned
 
Ms Project 2010
Ms Project 2010Ms Project 2010
Ms Project 2010
 
Uses of Project Status field
Uses of  Project Status field Uses of  Project Status field
Uses of Project Status field
 
Anna Stork (University of Bristol) - Microseismic Monitoring at the Aquistore...
Anna Stork (University of Bristol) - Microseismic Monitoring at the Aquistore...Anna Stork (University of Bristol) - Microseismic Monitoring at the Aquistore...
Anna Stork (University of Bristol) - Microseismic Monitoring at the Aquistore...
 
Primavera P6 manual
Primavera P6 manual Primavera P6 manual
Primavera P6 manual
 

Similaire à “Adoption DSpace 7 and 8 Challenges and Solutions from Real Migration Experiences”.pdf

Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Vantara
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceSense Corp
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKRajesh Jayarman
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overviewvhrocca
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseDataWorks Summit
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the OrganizationSeeling Cheung
 
Solution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab AcceleratorSolution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab AcceleratorBlueData, Inc.
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouseStephen Alex
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouseStephen Alex
 
Achieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAchieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAlluxio, Inc.
 
Bridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven WorldBridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven WorldCA Technologies
 
QuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing WebinarQuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing WebinarRTTS
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationDenodo
 
How to Migrate, Manage and Centralize your Web Infrastructure with Drupal
How to Migrate, Manage and Centralize your Web Infrastructure with DrupalHow to Migrate, Manage and Centralize your Web Infrastructure with Drupal
How to Migrate, Manage and Centralize your Web Infrastructure with DrupalAcquia
 
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life EasierWebinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life EasierDataStax
 

Similaire à “Adoption DSpace 7 and 8 Challenges and Solutions from Real Migration Experiences”.pdf (20)

Hitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop SolutionHitachi Data Systems Hadoop Solution
Hitachi Data Systems Hadoop Solution
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RK
 
Hd insight overview
Hd insight overviewHd insight overview
Hd insight overview
 
Innovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data WarehouseInnovation in the Enterprise Rent-A-Car Data Warehouse
Innovation in the Enterprise Rent-A-Car Data Warehouse
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Hadoop and SQL: Delivery Analytics Across the Organization
Hadoop and SQL:  Delivery Analytics Across the OrganizationHadoop and SQL:  Delivery Analytics Across the Organization
Hadoop and SQL: Delivery Analytics Across the Organization
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Big data rmoug
Big data rmougBig data rmoug
Big data rmoug
 
Solution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab AcceleratorSolution Brief: Big Data Lab Accelerator
Solution Brief: Big Data Lab Accelerator
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 
Modern data warehouse
Modern data warehouseModern data warehouse
Modern data warehouse
 
Talend for big_data_intorduction
Talend for big_data_intorductionTalend for big_data_intorduction
Talend for big_data_intorduction
 
Achieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud WorldAchieving Separation of Compute and Storage in a Cloud World
Achieving Separation of Compute and Storage in a Cloud World
 
Bridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven WorldBridging the Big Data Gap in the Software-Driven World
Bridging the Big Data Gap in the Software-Driven World
 
QuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing WebinarQuerySurge Slide Deck for Big Data Testing Webinar
QuerySurge Slide Deck for Big Data Testing Webinar
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data Virtualization
 
How to Migrate, Manage and Centralize your Web Infrastructure with Drupal
How to Migrate, Manage and Centralize your Web Infrastructure with DrupalHow to Migrate, Manage and Centralize your Web Infrastructure with Drupal
How to Migrate, Manage and Centralize your Web Infrastructure with Drupal
 
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life EasierWebinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
Webinar: DataStax Enterprise 5.0 What’s New and How It’ll Make Your Life Easier
 

Plus de 4Science

From Digital Records to Digital Cultural Landscapes. Beyond Digital Library b...
From Digital Records to Digital Cultural Landscapes. Beyond Digital Library b...From Digital Records to Digital Cultural Landscapes. Beyond Digital Library b...
From Digital Records to Digital Cultural Landscapes. Beyond Digital Library b...4Science
 
DSpace-CRIS design & Implementation
DSpace-CRIS design & ImplementationDSpace-CRIS design & Implementation
DSpace-CRIS design & Implementation4Science
 
Status of discussions with repository platforms_ DSpace.pdf
Status of discussions with repository platforms_ DSpace.pdfStatus of discussions with repository platforms_ DSpace.pdf
Status of discussions with repository platforms_ DSpace.pdf4Science
 
DSpace GLAM Infographic.pdf
DSpace GLAM Infographic.pdfDSpace GLAM Infographic.pdf
DSpace GLAM Infographic.pdf4Science
 
DSpace CRIS EFS Miami.pdf
DSpace CRIS EFS Miami.pdfDSpace CRIS EFS Miami.pdf
DSpace CRIS EFS Miami.pdf4Science
 
IIIF and DSpace 7 - IIIF Conference 2023.pdf
IIIF and DSpace 7 - IIIF Conference 2023.pdfIIIF and DSpace 7 - IIIF Conference 2023.pdf
IIIF and DSpace 7 - IIIF Conference 2023.pdf4Science
 
DSpace-CRIS, anticipating innovation
DSpace-CRIS, anticipating innovationDSpace-CRIS, anticipating innovation
DSpace-CRIS, anticipating innovation4Science
 
DSpace 7 ORCID Integration
DSpace 7 ORCID IntegrationDSpace 7 ORCID Integration
DSpace 7 ORCID Integration4Science
 
Bringing IIIF to the DSpace community
Bringing IIIF to the DSpace communityBringing IIIF to the DSpace community
Bringing IIIF to the DSpace community4Science
 
Implementing the Notify protocol and standard practices in DSpace
Implementing the Notify protocol and standard practices in DSpaceImplementing the Notify protocol and standard practices in DSpace
Implementing the Notify protocol and standard practices in DSpace4Science
 
The EOSC DIH "ELD Advance" project
The EOSC DIH "ELD Advance" projectThe EOSC DIH "ELD Advance" project
The EOSC DIH "ELD Advance" project4Science
 
DSpace implementation of the COAR Notify Project - status update
DSpace implementation of the COAR Notify Project - status updateDSpace implementation of the COAR Notify Project - status update
DSpace implementation of the COAR Notify Project - status update4Science
 
Convegno Stelline 2020 - 4Science -16 settembre _ pubbliche
Convegno Stelline 2020 - 4Science -16 settembre _ pubblicheConvegno Stelline 2020 - 4Science -16 settembre _ pubbliche
Convegno Stelline 2020 - 4Science -16 settembre _ pubbliche4Science
 
Convegno Stelline 2020 - 4Science -16 settembre _ accademiche
Convegno Stelline 2020 - 4Science -16 settembre _ accademicheConvegno Stelline 2020 - 4Science -16 settembre _ accademiche
Convegno Stelline 2020 - 4Science -16 settembre _ accademiche4Science
 
Convegno Stelline 2020 - 4Science
Convegno Stelline 2020 - 4Science Convegno Stelline 2020 - 4Science
Convegno Stelline 2020 - 4Science 4Science
 
DSpace-CRIS 7: What is Coming? OR2020
DSpace-CRIS 7: What is Coming? OR2020DSpace-CRIS 7: What is Coming? OR2020
DSpace-CRIS 7: What is Coming? OR20204Science
 
News about DSpace-CRIS Anwendertreffen 2020
News about DSpace-CRIS Anwendertreffen 2020News about DSpace-CRIS Anwendertreffen 2020
News about DSpace-CRIS Anwendertreffen 20204Science
 
Digital library: riflessioni su scelte e obiettivi. Visibilità delle collezio...
Digital library: riflessioni su scelte e obiettivi. Visibilità delle collezio...Digital library: riflessioni su scelte e obiettivi. Visibilità delle collezio...
Digital library: riflessioni su scelte e obiettivi. Visibilità delle collezio...4Science
 
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...
How to enhance your DSpace repository: use cases for DSpace-CRIS, DSpace-RDM,...4Science
 
DSpace-CRIS ORCID Integration
DSpace-CRIS ORCID IntegrationDSpace-CRIS ORCID Integration
DSpace-CRIS ORCID Integration4Science
 

“Adoption DSpace 7 and 8 Challenges and Solutions from Real Migration Experiences”.pdf

  • 1. Adopting DSpace 7 and 8: Challenges and Solutions from Real Migration Experiences
  • 2. AGENDA: 4Science, who we are; It is not just an update, it is always a migration; A couple of hints about your data model; There are more data that need to be migrated than you expect; Plan, Do, Check, Finalize; Common pitfalls & solution strategies; Takeaways
  • 3. Today's speaker Susanna Mornati, Chief Operating Officer at 4Science susanna.mornati@4science.com
  • 4. Who we are. OUR AIM: to enable implementation of the transnationally important policies of Open Research, Research Impact and Digital Preservation. DSpace (CRIS/GLAM), OJS, Dataverse. Our solutions support compliance with key international standards: ✓ OpenAIRE ✓ ORCID ✓ CERIF ✓ IIIF. Certified Platinum Provider and leading contributor to DSpace 7. We provide solutions for research information & data management and for cultural heritage. Our services: • Installation • Configuration • Hosting and maintenance • System integration, customization and consultancy
  • 5. What we believe in. Security Certification is not a matter of compromise: our solutions are secure by design; openness without security would be counterproductive; security without openness would be unproductive. ISO/IEC 27001:2013, 27017:2015, 27018:2019, and ISO/IEC 9001:2015. Our solutions support the key defining transnational policies, Open Research and open digital cultural heritage, and are based on: open-source software, open standards, interoperability, preservation, collaboration, innovation.
  • 6. A fast-growing organization Over 100 clients in 5 continents worldwide https://www.4science.com/we-work-for/
  • 7. The context in which we operate since 2016. We are driven by serving the open knowledge ecosystem. Proprietary products often come with expensive licenses and pricing fluctuations, can become obsolescent and can result in vendor lock-in. Our open solutions (open standards, open protocols, open source) are designed to support open science. Open knowledge helps to solve, by collaboration, the world’s very pressing problems, and creates new opportunities, especially when cross-disciplinary.
  • 8. 4Science role in the Open Science and DSpace community. Certified Platinum Provider and leading contributor of DSpace. Our goal is to anticipate the future, making it more accessible. 2023 DSpace worldwide community leaders for hours donated for DSpace development. Experts in the field and enablers that can help with any situation. At 4Science we are driven by serving the open knowledge ecosystem. Open knowledge: empowering open access, supporting open science, advancing open scholarly communication. FAIR data: our solutions enable your data to be Findable, Accessible, Interoperable and Reusable. Interoperable solutions: ORCID and DataCite Certified Service Provider, CERIF and IIIF enabler. Compliance & Quality: COAR-NGR, OpenAIRE, Certified Platinum Provider of DSpace, ISO 9001:2015. Security: battle-tested solutions, secure by design; Trusted Providers of the Cloud Security Alliance.
  • 9. «Migration» or «update»? Not so different? In this session we will look at some insights from best practices that we have learned moving from DSpace 5, DSpace 6, EPrints, Digital Commons, OPUS or even custom solutions, but the first thing we would like to share is… Even when you are about to upgrade from an old version of DSpace to the new one, keep in mind that it has been completely re-engineered compared with previous ones: any update to a major release should therefore be understood (and planned with the appropriate timing) as if it were a migration to an entirely new platform, in addition to the integrations with systems already in your ecosystem. Consider it as if it were a migration to a completely different system, although the main paradigms and approaches are preserved.
  • 10. Entities are the foundation of the new data model. An effective data model should also be flexible. Entities are a pivotal part of defining a whole data model and contribute to its design; they enable flexibility to reflect your data in a more granular way. Your data model should be as close as possible to international standards to enhance interoperability. The current design of DSpace 7 provides the foundation for flexibility, ensuring that it can be tailored to your requirements. Relations complete the definition of your data model: authors, publications, organizations and more can be interconnected.
  • 11. Entities should reflect your data model, enabling relations and exploring connections. ENTITIES ARE A WAY OF REPRESENTING DATA AND THEIR RELATIONS IN A STRUCTURED MANNER. ENTITIES ARE CONSTITUTED BY RECORDS THAT CAN BE DESCRIBED, IDENTIFIED, AND RELATED TO OTHER RECORDS IN A REPOSITORY. ENTITIES ARE USED TO REPRESENT REAL-WORLD OBJECTS SUCH AS PEOPLE, ORGANIZATIONS, PUBLICATIONS, AS WELL AS ABSTRACT CONCEPTS SUCH AS SUSTAINABILITY GOALS, RESEARCH LINES, THEMATIC COLLECTIONS. ENTITIES ARE USED TO PROVIDE CONTEXT, CONNECTIONS, AND RELATIONSHIPS BETWEEN OBJECTS IN THE REPOSITORY, SUPPORTING DISCOVERY AND COMPREHENSION OF THE CONTEXT.
  • 12. But with a correct balance: when you’re about to migrate… • You could have processes that you would like to drop • Customizations that affect your maintenance costs • Metadata representing information that is no longer useful • And processes…you’d like to add, or change • New features that can substitute your old customizations • Opportunity to add new information to your repository
  • 13. How to enable entities during the upgrade: pt. 1
  • 14. How to enable entities during the upgrade: pt. 2. This step/job may be slow!
  • 15. How to enable entities during the “migration” from other platforms. Follow the DSpace documentation, YES, but... how to import all the metadata, relationships and files? • The SAF import could be an option (single records), BUT... you cannot set a relationship with not-yet-created entities: it is preferable to create all entities individually first, making sure to store a local.legacyid value for each • Use the CSV Bulk edit (manually or automatically updated) to create the relationship(s). Warning: CSV Bulk edit cannot manage ordering between entities and simple strings (e.g. the ordering of Authors when only a few of them have a profile)
  • 16. All of that is easier in DSpace-CRIS thanks to the possibility to use… • Denormalized tables where you can prepare your data for import (like the CSV, but in the database) → easier! • Enhanced Bulk import from Excel instead of CSV (yes, it is a non-standard format, but it is easier to work with and accessible to non-technical people → new lines can be created) • Promises of future references that will be resolved once the target item is created (e.g. you can say will be referenced:ORCID:XXXXX to create a relation with the AUTHOR item having person.identifier.orcid = XXXXX) • You can manage files directly by providing a remote URL (no SAF process needed) • Ordering between Entities and strings is supported (the column with the specific relationship.type can be ordered by value/promise)
  • 17. Not enough said, but… Do not customize your DSpace database tables/structure, nor backport any feature that changes it. Why? Because it could cause your automated database upgrade process to fail. Create new tables (instead of modifying existing ones). ALREADY DID? Consider replacing your additional contents (tables) → new entities enabled by DSpace 7
  • 18. Yes, your institution has a lot of data… and not all of it is visible in plain sight (as metadata of your items). More data will emerge than you imagined.
  • 19. So… please keep this in mind. OAI Identifiers should be preserved. This is currently not supported without code changes (we plan to generalize the solution and open a PR → DSpace 8). OAI URLs should be preserved as well: redirection is (almost) good, but you should check it at least with your known harvesters → easy to do in Apache or nginx (light web server). Statistics can be migrated. Upgrade procedures, if followed, will result in a full migration of the data... items / bitstreams that were not really deleted are lost. When you migrate from another platform you can bulk import your statistics data directly into Solr via CSV. Data need to be prepared, so a local.legacyid metadata value will be crucial to translate your legacy ID into the new one.
  • 20. Step 1: PLAN - ask yourself all relevant questions. Make sure to sync your activities and preparatory/interdependent tasks... Prepare a new, separate environment for DSpace 7. Do you use the Handle Server? Do you mint DOIs? Integration: what application extracts data from DSpace? What application pushes data into DSpace? Using which technology: SWORD, REST API? How much time will 3rd parties need to switch from the old integration to the new one? Plan to put your repository in READ ONLY mode for enough time to perform the final migration. Prepare your UATs, which should take into account your customizations, configurations and top-priority functionalities. You need to run the migration at least twice, and usually you cannot afford to have your current repository locked down for a long period. This means that the two runs will use slightly different data! Even if the repository is in READ ONLY mode, some data keep changing... Statistics will grow!
  • 21. Step 2: DO. Do: do your first test migration. Note: remember to keep track of all of your steps (you’ll have to repeat them exactly for the final migration). Verify: verify the timing for execution/import/indexing during this phase; you’ll benefit from it for the final migration.
  • 22. Step 3: CHECK. Perform UATs to validate and flag possible issues (and the related fixes you applied). If you notice something wrong that was not covered by UATs, you should not ignore it: UATs should be amended to cover that path. Verify that the timing of the first migration allows you to meet the deadlines you were expecting. Verify which tasks could be optimized/reviewed. Check data integrity: run the checksum checker (fixed by 4Science in 7.6). Temporarily disable indexing during intermediate milestones/steps to save some time… (…but be careful of the interdependencies in further steps and keep in mind that you’ll have to run a full indexing when needed). About the automatic initial reindexing: it is not recommended to skip it, unless you will manually reindex at a later time, or verify that a reindexing is not necessary. Forgetting to reindex your site after an upgrade may result in unexpected errors or instabilities.
  • 23. Step 4: FINALIZE. Put your current production environment in read-only mode before performing the final deployment. Alert the partners running integrated systems that the system is frozen. Extract your data from your frozen repository. Re-run the steps that you successfully ran during the first test migration: even small differences may lead to unexpected issues. Run the UAT books: if everything goes smoothly, make the final switch into production.
  • 24. DOs. Alert: give notice to your partners that they can resume ordinary activities on their 3rd party systems. Move: move your handle server to your new environment. Enable: enable all of your crontab jobs. Update: update ALL of your URLs to match the ones in production.
  • 25. More pitfalls and solutions we adopted with experience …from DSpace 5, DSpace 6, EPrints, Digital Commons, OPUS, Invenio…
  • 26. UATs, the world where the obvious is certainly not – guidelines. A plan should be prepared and followed methodically to test and verify consistency between the old system and the new one. A few examples: 1. How many items were visible in the old system? How many in the new one? 2. How many items were present in the users' workspace? How many in the new system? 3. Same for workflows: how many in the various steps, how many assigned to the various users? 4. Are any items restricted or embargoed? Are restrictions migrated correctly and working? 5. Are all protocols used by 3rd party systems enabled (SWORD? Legacy REST…)?
  • 27. Time spent in UATs is very well spent. Through these cross-checks we had the opportunity to discover inconsistencies between the database and the UI of older versions of DSpace: even fixing the problem in the new version did not always coincide with the user's desires (e.g., items previously not visible by mistake becoming visible in the new version and vice versa).
  • 28. Fun facts and unapparent trivia. Thumbnails in the new DSpace 7 are now larger than in the old versions. We learned that importing the old ones would compromise the layout. This resulted in the discovery of the century: all thumbnails had to be… regenerated. 4Science contributed the fix for the regeneration of the thumbnails ☺ The moral: consider every possible interaction!
  • 29. Fun facts and unapparent trivia. Most viewed item? OH YES PLEASE. …but the item in the new version turned out to be different from the item in the old version. Why? Because slightly different rules had simply been applied, which led to a different result. One can never be too cautious: watch out for inconsistencies and rule changes, even minimal ones.
  • 30. What about DSpace 8? • DSpace 8 is expected to go live in the spring/summer of 2024 • It will not be a major change like DSpace 7 was. Should I upgrade to DSpace 7 or wait for DSpace 8 to be released? • We suggest cautiously migrating/upgrading to the most stable version at the moment of the release, assessing what is better for your institution • The upgrade from DSpace 7 to DSpace 8 will not require such a big effort compared to the upgrade from DSpace 5 / 6 to 7 • Institutions upgrading from DSpace 7 to DSpace 8 will enjoy features already implemented in DSpace-CRIS 7, e.g. Notify protocol (contributed by 4Science + Harvard), Correction service to enhance data quality (4Science), Duplicate detection (ported by TLC from our implementation in DSpace-CRIS)
  • 31. Be sure to check every minimal step and take careful note of it. Time spent in analysis and double-checks is really well spent. We, at 4Science, would love to put our expertise at your service on behalf of the entire community. Contact us at: info@4Science.com. Visit our website: www.4science.com. Follow us on social media! 4Science International, 4ScienceDSpace, 4ScienceIT, 4Science. Join the 4Science newsletter to keep up to date with news about our contributions to DSpace and much more!
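
The cross-checks suggested for the UATs (visible items in the old and the new system, workspace and workflow items, embargoed items) lend themselves to a small script. What follows is only a minimal sketch in Python and is not part of the original slides: the check names and the numbers are made up for illustration, and how each count is obtained from the two systems (database query, REST API, Solr) is deliberately left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckResult:
    """One UAT cross-check: the same count taken on the old and the new system."""
    name: str
    old: int
    new: int

    @property
    def matches(self) -> bool:
        return self.old == self.new


def compare_counts(old_counts, new_counts):
    """Pair up every check found in either system; a missing count is treated as 0."""
    names = sorted(set(old_counts) | set(new_counts))
    return [
        CheckResult(name, old_counts.get(name, 0), new_counts.get(name, 0))
        for name in names
    ]


def mismatches(results):
    """Return only the checks whose counts differ and need a closer look."""
    return [r for r in results if not r.matches]


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    old_system = {"visible_items": 12450, "workspace_items": 37,
                  "workflow_items": 12, "embargoed_items": 210}
    new_system = {"visible_items": 12452, "workspace_items": 37,
                  "workflow_items": 12, "embargoed_items": 210}

    for result in compare_counts(old_system, new_system):
        status = "OK" if result.matches else "CHECK"
        print(f"{status:5} {result.name}: old={result.old} new={result.new}")
```

A mismatch is not necessarily a migration error: as noted in the slides, it may reveal an inconsistency that already existed in the old system, so each flagged check still needs a human decision.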