SlideShare une entreprise Scribd logo
1  sur  19
Télécharger pour lire hors ligne
Library Knowledge Graph
Editor Development
Simeon Warner (Cornell)
https://orcid.org/0000-0002-7970-7855
Reporting work from the LD4P2 project including contributions from: Steven
Folsom, Huda Khan, Lynette Rayle, Jason Kovari, Tim Worrall (Cornell), Astrid
Usong (Stanford), David Eichmann (Iowa), and others…
US2TS 2019, March 11-13, Duke University, Durham, NC
Library Knowledge Graph
~ Library Catalog
#1 - Facilitate discovery of resources
(find, identify, select, obtain)
#2 - Facilitate management of resources
Library Cataloging Background
Many practices developed in the era of card catalogs
MARC format developed in 1960's
Long history of linking entities, albeit with authorized
names rather than identifiers. Used for limited forms of
semantic browse
LD4 work and broader community moving from
MARC→RDF, from authorized names to URIs, and
toward better linking with the web
Henriette Avram 1919–2006,
American computer programmer
and systems analyst who
developed MARC
https://en.wikipedia.org/wiki/Henrie
tte_Avram
Production Scale
Cornell catalog has ~9M records
(~8M physical, ~1M electronic)
Cataloging staff must keep up with
new acquisitions. RSI is a real
Rarely start from scratch: base on
vendor supplied, community records
or record for similar resource
Specialists covering many
languages
Library Technical Services space in
OIin Library, Cornell University
MARC → RDF
Past work on ontology development but current
focus around BIBFRAME model from Library of
Congress (LC), still evolving
Conversions ~100 triples from each MARC record
Cornell: 9M records → ~1 billion triples (cf. WorldCat
scale: 440M bib records, 2.7G holdings)
Community will still rely on centralized services, but
opens possibility for other models too, and ad-hoc
links
Key entity types in BIBFRAME
Shapes
cf. Khan, Folsom, et al.,
poster at US2TS 2018
Want re-use and hence
interested in shared
shapes. Mechanics may
be mix of SHACL, ShEx,
schema
Currently no decoupling of
validation from forms, a
controlled environment
https://drive.google.com/file/d/1M_xhnG8qYL7M9akvIRSETfOgeSEfS9oh/view
Linking Our Data - Focus on Lookups
Build UI and infrastructure around discovery of related entities. We know:
➔ Evolving community norms: appetite for a variety of linked datasets and
associated lookup services; how to link each well and efficiently; sensitivity to
inclusive descriptions
➔ Complexity in how to search (recall/precision -- relevancy tests)
➔ Need context -- labels and types are nowhere near sufficient, what else to
display to enable human verification/selection?
➔ Multiple sources for same entity type (e.g. person in LC NAF, ISNI, ORCID)
➔ If available, hubs likely most efficient
➔ Largely untackled: maintenance and updates (traditional authorities have
strong policies and practices which have benefit but can be stifling)
Lookup Usability Experiments
● Building on VitroLib designs and results
○ Context generally useful and navigation to authoritative sources
important
● Current LD4P2 usability work around Sinopia editor development
○ 6 participants across different institutions
○ Prototype based on LC BIBFRAME Editor (BFE)
○ Contextual information for persons and genre forms
○ Links to Wikipedia, ISNI, VIAF where available
○ Additional mockups
Slides from SWIB18 presentation; Folsom, Khan, et al.
A cataloger has a copy of a film
"Nowhere Boy" by "Sam Taylor", a
British director
A cataloger is trying to add genre to a
record, is "humorous" fiction the right term?
Lookup Usability: Preliminary Results
● Contextual information useful
○ Should also include related works, more identifying info
○ Identify source of information
● External sources such as university profiles, genre or type-specific
sites (e.g. Discogs)
● Vocabularies such as MESH, AAT, Getty (depending on content)
● Links to Wikidata, ISNI, VIAF are useful to include
● Need consistent interface experience, use clearer icons
● Improve hierarchical navigation for subject areas/genre forms
Work Cycle I Data Flow Diagrams and Prototypes October 2018
Thanks to Astrid Usong, Stanford
Discogs -- External Source Data as Lookup
Recall - rarely start from scratch
Cataloging old 45's at Cornell
Exploring use of Discogs to generate
base record directly integrated with
the catalog editor tool
1
2
3
Community Scale Experiments & Challenges
➔ 15 organizations in LD4P2 cohort + project partners
➔ Test editor and lookup infrastructure in a number of cataloging projects
Caching needed because (most) authority sources don't provide sufficient and
stable infrastructure for lookups (also associated validation, cleaning,
transformation for non-LD sources)
Static vs dynamic
➔ caching for static but need live query if one expects catalogers to create new
entities in "real time" and then be able see them
➔ e.g. Wikidata - try against SPARQL API
Discovery Experiments
Primary purpose of library knowledge graph is to enable discovery of library
resources -- the benefits of linked data are so far unproven
➔ Parallels with ideas for lookups and linking
➔ Indexing -- already do some light inferencing from MARC into Solr (e.g.
broader terms, alternates). What other data inclusion or inference is useful?
➔ Individual libraries too small to develop search systems. Considerable effort
around a Solr/Ruby system called Blacklight where UI interactions
studied/improved together. What is broadly reusable?
➔ Most linked data UIs are awful! What good examples we might learn from?
LD4 Discovery Affinity Group having open biweekly calls
Thanks for listening!
http://ld4p.org/
simeon.warner@cornell.edu
@zimeon

Contenu connexe

Tendances

What do MARC, RDF, and OWL have in common?
What do MARC, RDF, and OWL have in common?What do MARC, RDF, and OWL have in common?
What do MARC, RDF, and OWL have in common?
Violeta Ilik
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
National Information Standards Organization (NISO)
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_Seneff
Heather Seneff
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817
Figoblog
 

Tendances (20)

Karma Data Modeling
Karma Data ModelingKarma Data Modeling
Karma Data Modeling
 
Integrating with others: Stable VIVO URIs for local authority records; linkin...
Integrating with others: Stable VIVO URIs for local authority records; linkin...Integrating with others: Stable VIVO URIs for local authority records; linkin...
Integrating with others: Stable VIVO URIs for local authority records; linkin...
 
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering EvidenceBIBFRAME and OCLC Works: Defining Models and Discovering Evidence
BIBFRAME and OCLC Works: Defining Models and Discovering Evidence
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
 
What do MARC, RDF, and OWL have in common?
What do MARC, RDF, and OWL have in common?What do MARC, RDF, and OWL have in common?
What do MARC, RDF, and OWL have in common?
 
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early AdoptersApril 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
April 8 NISO Webinar: Experimenting with BIBFRAME: Reports from Early Adopters
 
VRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_SeneffVRA_2015_CatalogingRoundup_Seneff
VRA_2015_CatalogingRoundup_Seneff
 
Cataloguer Makeover
Cataloguer MakeoverCataloguer Makeover
Cataloguer Makeover
 
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
It Takes a Village to Grow ORCIDs on Campus: Establishing and Integrating Uni...
 
DSpace-CRIS: An Open Source Solution for Research - @THETA15
DSpace-CRIS: An Open Source Solution for Research - @THETA15DSpace-CRIS: An Open Source Solution for Research - @THETA15
DSpace-CRIS: An Open Source Solution for Research - @THETA15
 
Shieh "Enabling Descriptive Data to be Linked at the Smithsonian Libraries"
Shieh "Enabling Descriptive Data to be Linked at the Smithsonian Libraries"Shieh "Enabling Descriptive Data to be Linked at the Smithsonian Libraries"
Shieh "Enabling Descriptive Data to be Linked at the Smithsonian Libraries"
 
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
Linked Data Principles and RDF: University of Florida Libraries, BIBFRAME Wor...
 
Godby "'What are the 'entities that matter?' And how much should we say about...
Godby "'What are the 'entities that matter?' And how much should we say about...Godby "'What are the 'entities that matter?' And how much should we say about...
Godby "'What are the 'entities that matter?' And how much should we say about...
 
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
Sparling and Cohen "BIBFRAME Implementation at the University of Alberta Libr...
 
Lauruhn-5-jun15
Lauruhn-5-jun15Lauruhn-5-jun15
Lauruhn-5-jun15
 
Snac webinar v3
Snac webinar v3Snac webinar v3
Snac webinar v3
 
Documents, services, and data on the web
Documents, services, and data on the webDocuments, services, and data on the web
Documents, services, and data on the web
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817
 
SWSIG wlic2016
SWSIG wlic2016SWSIG wlic2016
SWSIG wlic2016
 
Cultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data CollectionsCultural Heritage Insitutions and Big Data Collections
Cultural Heritage Insitutions and Big Data Collections
 

Similaire à LKG Editor Dev

Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Rafal Kasprowski
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
Lucy McKenna
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)
Bradley Allen
 
Repositories and the wider context
Repositories and the wider contextRepositories and the wider context
Repositories and the wider context
Julie Allinson
 
DLF Aquifer MODS Implementation Guidelines
DLF Aquifer MODS Implementation GuidelinesDLF Aquifer MODS Implementation Guidelines
DLF Aquifer MODS Implementation Guidelines
Sarah Shreeves
 
Reuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and RealizationReuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and Realization
andrea huang
 

Similaire à LKG Editor Dev (20)

Linked Open Data for Cultural Heritage
Linked Open Data for Cultural HeritageLinked Open Data for Cultural Heritage
Linked Open Data for Cultural Heritage
 
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAFWho's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
Who's the Author? Identifier soup - ORCID, ISNI, LC NACO and VIAF
 
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
Web-scale Discovery Implementation with the End User in Mind (SLA 2012)
 
Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind Webscale Discovery with the Enduser in Mind
Webscale Discovery with the Enduser in Mind
 
Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...Engaging Information Professionals in the Process of Authoritative Interlinki...
Engaging Information Professionals in the Process of Authoritative Interlinki...
 
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and StanfordLinked Data for Libraries: Experiments between Cornell, Harvard and Stanford
Linked Data for Libraries: Experiments between Cornell, Harvard and Stanford
 
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
Creating Sustainable Communities in Open Data Resources: The eagle-i and VIVO...
 
VIVO at the University of Idaho
VIVO at the University of IdahoVIVO at the University of Idaho
VIVO at the University of Idaho
 
Federating Research Profiling Data
Federating Research Profiling DataFederating Research Profiling Data
Federating Research Profiling Data
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Next Generation Repositories
Next Generation RepositoriesNext Generation Repositories
Next Generation Repositories
 
Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)Faceted Navigation (LACASIS Fall Workshop 2005)
Faceted Navigation (LACASIS Fall Workshop 2005)
 
Repositories and the wider context
Repositories and the wider contextRepositories and the wider context
Repositories and the wider context
 
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
OCLC Research @ U of Calgary: New directions for metadata workflows across li...OCLC Research @ U of Calgary: New directions for metadata workflows across li...
OCLC Research @ U of Calgary: New directions for metadata workflows across li...
 
DLF Aquifer MODS Implementation Guidelines
DLF Aquifer MODS Implementation GuidelinesDLF Aquifer MODS Implementation Guidelines
DLF Aquifer MODS Implementation Guidelines
 
Reuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and RealizationReuse of Structured Data: Semantics, Linkage, and Realization
Reuse of Structured Data: Semantics, Linkage, and Realization
 
Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Linked Data Workshop Stanford University
Linked Data Workshop Stanford University
 
NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)NISO access related projects (presented at the Charleston conference 2016)
NISO access related projects (presented at the Charleston conference 2016)
 
Digital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million BooksDigital Library Infrastructure for a Million Books
Digital Library Infrastructure for a Million Books
 
Towards an Open Research Knowledge Graph
Towards an Open Research Knowledge GraphTowards an Open Research Knowledge Graph
Towards an Open Research Knowledge Graph
 

Plus de Simeon Warner

Questioning Authority Lookup Service: Linking the Data
Questioning Authority Lookup Service: Linking the DataQuestioning Authority Lookup Service: Linking the Data
Questioning Authority Lookup Service: Linking the Data
Simeon Warner
 

Plus de Simeon Warner (20)

Questioning Authority Lookup Service: Linking the Data
Questioning Authority Lookup Service: Linking the DataQuestioning Authority Lookup Service: Linking the Data
Questioning Authority Lookup Service: Linking the Data
 
OCFL: A Shared Approach to Preservation Persistence
OCFL: A Shared Approach to Preservation PersistenceOCFL: A Shared Approach to Preservation Persistence
OCFL: A Shared Approach to Preservation Persistence
 
The Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservationThe Oxford Common File Layout: A common approach to digital preservation
The Oxford Common File Layout: A common approach to digital preservation
 
Welcome to the FOLIO Community
Welcome to the FOLIO CommunityWelcome to the FOLIO Community
Welcome to the FOLIO Community
 
Sinopia & FOLIO: Bridging the gap to linked data cataloging
Sinopia & FOLIO: Bridging the gap to linked data cataloging Sinopia & FOLIO: Bridging the gap to linked data cataloging
Sinopia & FOLIO: Bridging the gap to linked data cataloging
 
FOLIO and Linked Data
FOLIO and Linked DataFOLIO and Linked Data
FOLIO and Linked Data
 
OCFL v1.0
OCFL v1.0OCFL v1.0
OCFL v1.0
 
IIIF Technical Specification Status Update
IIIF Technical Specification Status UpdateIIIF Technical Specification Status Update
IIIF Technical Specification Status Update
 
Don't bold the field name!
Don't bold the field name!Don't bold the field name!
Don't bold the field name!
 
Samvera and IIIF 2018
Samvera and IIIF 2018Samvera and IIIF 2018
Samvera and IIIF 2018
 
Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)Oxford Common File Layout (OCFL)
Oxford Common File Layout (OCFL)
 
ORCID @ Cornell
ORCID @ CornellORCID @ Cornell
ORCID @ Cornell
 
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
From Open Annotations to W3C Web Annotations (and the impact on IIIF Present...
 
Introduction to the IIIF Presentation API (@SWIB17)
Introduction to the IIIF Presentation API (@SWIB17)Introduction to the IIIF Presentation API (@SWIB17)
Introduction to the IIIF Presentation API (@SWIB17)
 
Introduction to the International Image Interoperability Framework (IIIF)
Introduction to the International Image Interoperability Framework (IIIF)Introduction to the International Image Interoperability Framework (IIIF)
Introduction to the International Image Interoperability Framework (IIIF)
 
From Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and CollaborationsFrom Open Access to Open Standards, (Linked) Data and Collaborations
From Open Access to Open Standards, (Linked) Data and Collaborations
 
Mind the gap! Reflections on the state of repository data harvesting
Mind the gap! Reflections on the state of repository data harvestingMind the gap! Reflections on the state of repository data harvesting
Mind the gap! Reflections on the state of repository data harvesting
 
ORCID & other Person iDs
ORCID & other Person iDsORCID & other Person iDs
ORCID & other Person iDs
 
IIIF without an image server? No problem!
IIIF without an image server? No problem!IIIF without an image server? No problem!
IIIF without an image server? No problem!
 
IIIF Technical Specification Status Update
IIIF Technical Specification Status UpdateIIIF Technical Specification Status Update
IIIF Technical Specification Status Update
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 

LKG Editor Dev

  • 1. Library Knowledge Graph Editor Development Simeon Warner (Cornell) https://orcid.org/0000-0002-7970-7855 Reporting work from the LD4P2 project including contributions from: Steven Folsom, Huda Khan, Lynette Rayle, Jason Kovari, Tim Worrall (Cornell), Astrid Usong (Stanford), David Eichmann (Iowa), and others… US2TS 2019, March 11-13, Duke University, Durham, NC
  • 2. Library Knowledge Graph ~ Library Catalog #1 - Facilitate discovery of resources (find, identify, select, obtain) #2 - Facilitate management of resources
  • 3. Library Cataloging Background Many practices developed in the era of card catalogs MARC format developed in 1960's Long history of linking entities, albeit with authorized names rather than identifiers. Used for limited forms of semantic browse LD4 work and broader community moving from MARC→RDF, from authorized names to URIs, and toward better linking with the web Henriette Avram 1919–2006, American computer programmer and systems analyst who developed MARC https://en.wikipedia.org/wiki/Henrie tte_Avram
  • 4. Production Scale Cornell catalog has ~9M records (~8M physical, ~1M electronic) Cataloging staff must keep up with new acquisitions. RSI is a real Rarely start from scratch: base on vendor supplied, community records or record for similar resource Specialists covering many languages Library Technical Services space in OIin Library, Cornell University
  • 5. MARC → RDF Past work on ontology development but current focus around BIBFRAME model from Library of Congress (LC), still evolving Conversions ~100 triples from each MARC record Cornell: 9M records → ~1 billion triples (cf. WorldCat scale: 440M bib records, 2.7G holdings) Community will still rely on centralized services, but opens possibility for other models too, and ad-hoc links Key entity types in BIBFRAME
  • 6. Shapes cf. Khan, Folsom, et al., poster at US2TS 2018 Want re-use and hence interested in shared shapes. Mechanics may be mix of SHACL, ShEx, schema Currently no decoupling of validation from forms, a controlled environment https://drive.google.com/file/d/1M_xhnG8qYL7M9akvIRSETfOgeSEfS9oh/view
  • 7. Linking Our Data - Focus on Lookups Build UI and infrastructure around discovery of related entities. We know: ➔ Evolving community norms: appetite for a variety of linked datasets and associated lookup services; how to link each well and efficiently; sensitivity to inclusive descriptions ➔ Complexity in how to search (recall/precision -- relevancy tests) ➔ Need context -- labels and types are nowhere near sufficient, what else to display to enable human verification/selection? ➔ Multiple sources for same entity type (e.g. person in LC NAF, ISNI, ORCID) ➔ If available, hubs likely most efficient ➔ Largely untackled: maintenance and updates (traditional authorities have strong policies and practices which have benefit but can be stifling)
  • 8. Lookup Usability Experiments ● Building on VitroLib designs and results ○ Context generally useful and navigation to authoritative sources important ● Current LD4P2 usability work around Sinopia editor development ○ 6 participants across different institutions ○ Prototype based on LC BIBFRAME Editor (BFE) ○ Contextual information for persons and genre forms ○ Links to Wikipedia, ISNI, VIAF where available ○ Additional mockups Slides from SWIB18 presentation; Folsom, Khan, et al.
  • 9. A cataloger has a copy of a film "Nowhere Boy" by "Sam Taylor", a British director
  • 10.
  • 11.
  • 12. A cataloger is trying to add genre to a record, is "humorous" fiction the right term?
  • 13. Lookup Usability: Preliminary Results ● Contextual information useful ○ Should also include related works, more identifying info ○ Identify source of information ● External sources such as university profiles, genre or type-specific sites (e.g. Discogs) ● Vocabularies such as MESH, AAT, Getty (depending on content) ● Links to Wikidata, ISNI, VIAF are useful to include ● Need consistent interface experience, use clearer icons ● Improve hierarchical navigation for subject areas/genre forms
  • 14. Work Cycle I Data Flow Diagrams and Prototypes October 2018 Thanks to Astrid Usong, Stanford
  • 15. Discogs -- External Source Data as Lookup Recall - rarely start from scratch Cataloging old 45's at Cornell Exploring use of Discogs to generate base record directly integrated with the catalog editor tool
  • 16. 1 2 3
  • 17. Community Scale Experiments & Challenges ➔ 15 organizations in LD4P2 cohort + project partners ➔ Test editor and lookup infrastructure in a number of cataloging projects Caching needed because (most) authority sources don't provide sufficient and stable infrastructure for lookups (also associated validation, cleaning, transformation for non-LD sources) Static vs dynamic ➔ caching for static but need live query if one expects catalogers to create new entities in "real time" and then be able see them ➔ e.g. Wikidata - try against SPARQL API
  • 18. Discovery Experiments Primary purpose of library knowledge graph is to enable discovery of library resources -- the benefits of linked data are so far unproven ➔ Parallels with ideas for lookups and linking ➔ Indexing -- already do some light inferencing from MARC into Solr (e.g. broader terms, alternates). What other data inclusion or inference is useful? ➔ Individual libraries too small to develop search systems. Considerable effort around a Solr/Ruby system called Blacklight where UI interactions studied/improved together. What is broadly reusable? ➔ Most linked data UIs are awful! What good examples we might learn from? LD4 Discovery Affinity Group having open biweekly calls