11. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories shared innovation ™
12. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories something shared innovation ™ title author genre genre subject subject isbn
13. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories something shared innovation ™ title author genre genre subject subject isbn
14. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories shared innovation ™ something title author genre genre subject subject isbn
15. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories Juvenile fiction Rowling, J. K. Wizards Children's stories Potter, Harry (Fictitious character) Harry Potter and the Order of the Phoenix 0747551006 shared innovation ™ something something else
16. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories Juvenile fiction Rowling, J. K. Wizards Children's stories Potter, Harry (Fictitious character) Harry Potter and the Order of the Phoenix 0747551006 shared innovation ™ something something else
21. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories Juvenile fiction Rowling, J. K. Wizards Children's stories Potter, Harry (Fictitious character) Harry Potter and the Order of the Phoenix 0747551006 shared innovation ™ something something else
22. Rowling, J. K. shared innovation ™ something something else the author known as J. K. Rowling
23. Harry ... Phoenix Rowling, J. K. Potter, Harry (Fictitious character) Harry ... Azkaban wizards shared innovation ™ the words "Harry ... Phoenix" used as a title something something else the author known as J. K. Rowling the fictitious wizard known as Harry Potter the words "Harry ... Azkaban" used as a title the concept of magical wizards book
31. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories shared innovation ™
32. Rowling, J. K. Harry Potter and the Prisoner of Azkaban 0747542155 Potter, Harry (Fictitious character) Wizards Juvenile fiction Children's stories /people/36082b69-ba77-486b-b27d-bf3ac3f1bfe7 /titles/08944d4d-5b46-4bf5-9acf-3102b181de95 urn:isbn:0747542155 /character/e8b7ae0c-f465-4251-9bc9-bc4b6a61eb21 /topics/08f0fa23-0cb8-4a66-a310-dfd8ed95e0ae /genres/ea65a567-bc36-4a23-a9de-bad053d18568 /genres/f96eda4a-42ab-4d57-8fc9-96e6f6f81e98 shared innovation ™
33.
34.
35. Example... Image Credit goes here Rowling, J. K. R o w l i n g , J . K . r j k shared innovation ™
36. Example... Image Credit goes here Rowling, J. K. o w l i n g r j k /people/ shared innovation ™
37. Example... Image Credit goes here Prisoner of Azkaban , Harry Potter and The Harry Potter and The Prisoner of Azkaban shared innovation ™
38. Example... prisoner of azkaban harry potter and the harry potter and the prisoner of azkaban shared innovation ™
39. Example... prisoner of azkaban harry potter and the harry potter and the prisoner of azkaban shared innovation ™
40. Example... harry potter and the prisoner of azkaban shared innovation ™ Prisoner of Azkaban , Harry Potter and The Harry Potter and The Prisoner of Azkaban
42. Example data from Wikipedia disambiguation page for 'Stardust' shared innovation ™ adaptation thing one Novel Neil Gaiman Charles Vess thing two Film 2007 is not thing three Novel Robert B Parker 1990 thing four Film 1974 David Essex thing five Linda Darnell 1940 Film thing seven Computer Game thing six Magazine India /titles/stardust
49. Example data from The Library of Congress Name Authority File shared innovation ™ Jones, Adrian, 1845-1938 Jones, Adrian, 1955- Jones, Adrian, 1956- Jones, Adrian, 1961- Jones, Adrian, 1933- Jones, Adrian
It's running off this stuff We do a bit of work to it... Looking at it this makes my eyes go fuzzy
Is that better? What we do is we keep the field data...
We've done work on leader elements, indicators and fixed format fields, but today to keep things simple I'm just taking basic field data. Let's throw the rest away for now.
When you see white boxes like this I mean that the data model contains literal text strings. Let's move them around a bit to make space for something.
Because we're describing the thing the record was about, rather than the record itself, we can use different labels. Let's take just one of these relationships
The problem becomes clear if we move something out of the way, to make room for something else.
Something else has a title of "harry potter and the order of the phoenix" Unsurprisingly, something and something else have a lot in common
This approach is what I would call record-centric, we haven't got any more meaning than was in the original record. Let's talk that through as an example [click] The two books are both by JK Rowling, but this is not JK Rowling, this is a string of text. This is JK Rowling...
OK, so it's not JK Rowling, it's a picture of JK Rowling - I asked, but she wouldn't come. The point is that the words Rowling, JK and the author known as JK Rowling are different things Not a new thought I know...
This man made great use of new technology back in the 1800s, he represented authors, titles and subjects as abstract concepts, with each author being recognised even if they shared the same name. Subjects were recognised as different even if they shared common terms and more. He wrote about it
In this great book - anyone read it? The technology he was taking advantage of?
The card... What I'm trying to do with the data is not really different to the introduction of card catalogs over bound volumes So, let's get back to the present day...
Focussing on JK for the moment, let's throw the rest away
Remember I said a resource could represent anything? well, let's use one to represent the author...
Let's get rid of the strings, and just leave the concepts, that makes things look simpler. Now, one of the things that has gone unsaid so far is that something and something else came from monograph records, so we know that they are both books.
dbpedia, geonames (Embrun, France), CIA Factbook, MusicBrainz/Zitgist (Diana Ross), DBLP (Advances in Orthonormalizing Computation.), RDF BookMashup (Weaving the Web), RDF BookMashup
URI is a superset of what we can do with Primary keys
We take the data elements
We take the data elements
Take the algorithms used for lookups Turn them around to make hashes Keep the essence of the natural data Discard syntactic differences If too long (or you like GUIDs) MD5 them.
* Stardust (novel), an illustrated novel by Neil Gaiman and Charles Vess * Stardust (2007 film), the film adaptation of the above book * Stardust (1990 novel), a 1990 novel by Robert B. Parker * Stardust (1974 film), a film about a rock star, starring David Essex. * Stardust (1940 film), a film with Linda Darnell and John Payne * Stardust (game), a shoot 'em up computer game * Stardust (magazine), a popular Indian film magazine