SlideShare a Scribd company logo
1 of 21
Download to read offline
Deconstructing Freebase
Manuel de la Villa
Department of Information Technologies
University of Huelva                     Medical-Miner Project
                                         Huelva 22/7/2010
Deconstructing Freebase
• What is it
  – A net? A tool? A kind of Data? wikipedia-turned-
    database?
• The Schema,
  – domains, types, properties. Sources.
• How it works
• How can we use it
  – MQL, API, Acre, RDF

                                               Medical-Miner Project
                                               Huelva 22/7/2010
What is it
• Freebase is a large public database that collects three
  kinds of information:
   – data;
   – texts; and
   – media, that references…
• …entities or topics (≈ 12 million). An entity is a unique
  single person, place, or thing.
       • A single concept or real-world thing.
       • A topic could also be called an entity, resource or element or thing, it
         is a fundamental unit in Freebase.
       • /common/topic
       • Each topic has a Guid or globally unique ID
           – http://www.freebase.com/view/en/barack_obama
           – http://www.freebase.com/guid/9202a8c04000641f800000000029c277


                                                                        Medical-Miner Project
                                                                        Huelva 22/7/2010
What is it (and II)
• Freebase connects entities together as a graph,
   – defines its data structure as a set of nodes and a set of
     links that establish relationships between the nodes.
• Most of our topics are associated with one or
  more types (such as people, places, books,
  films, etc) and may have additional properties like "date of
  birth" for a person or latitude and longitude for a location.
  These types and properties and related concepts are called
  Schema.


                                                          Medical-Miner Project
                                                          Huelva 22/7/2010
The Schema
• We use the term Schema to refer to the way Freebase's data is laid out or
  structured. In the Semantic web world, this is referred to as Ontology.
• Schema in Freebase is expressed through Types and Properties. Types are
  grouped together in Domains.

              What is Schema?
                   Schema describes how data in Freebase is structured.

              Schema is made by creating:
                   Types, which define a particular kind of person, place or thing and contain

                   Properties, which define the unique qualities of a type.



                                       William Shakespeare
                                              is a  →      type     → Person
                                              has a →      property → Date of Birth


                                                                                          Medical-Miner Project
                                                                                          Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema (II)




                  Medical-Miner Project
                  Huelva 22/7/2010
The Schema: Medicine




                  Medical-Miner Project
                  Huelva 22/7/2010
Then… is an Ontology?
• Yes and No,

          is a kind of Folksonomy…




                                     Medical-Miner Project
                                     Huelva 22/7/2010
The sources
• Wikipedia, ChefMoz, NNDB, MusicBrainz…
• The Freebase community, with a 2.0 spirit, are:
       •   data contributors,
       •   schema builders,
       •   data curators,
       •   application developers,
       •   and many others who use Freebase
   – Expert status
   – Augmentative edition
• Creative Commons Attribution Licensing, which means
  that it's free for you to browse, query, copy, and even use the data
  in your own systems or software, even for commercial use; all we
  ask in return is that you mention that

                                                               Medical-Miner Project
                                                               Huelva 22/7/2010
How can we use it…
• As a reference or information source
• Create interesting Views and Visualizations and
  share them with others
• Embed Freebase data in your website
• Use our API or Acre, our hosted app development
  platform, to build apps that use Freebase data
• Download our Data dumps
• Use Freebase's RDF for Semantic Web applications


                                              Medical-Miner Project
                                              Huelva 22/7/2010
How can we use it…
• Ways to use Freebase:
  – Use Freebase's Ids to uniquely identify entities
    anywhere on the web
  – Query Freebase's data using MQL
  – Build applications using our API or Acre, the
    hosted development platform




                                                 Medical-Miner Project
                                                 Huelva 22/7/2010
MQL (Metaweb Query Language)
• http://api.freebase.com/api/service/mqlread?query={"query"
  :{"type":"/music/artist","name":"U2","album":[]}}
• Query Editor




                                                     Medical-Miner Project
                                                     Huelva 22/7/2010
MQL (and II)




               Medical-Miner Project
               Huelva 22/7/2010
Freebase API
Freebase's API allows you:
    –  to perform searches and queries against
      Freebase's data, or
    – to write data to Freebase.
    – There are Libraries available for many
      languages.

Searching with Search API
• The search API lets you do a fuzzily-matched
   search for a string that occurs in a topic's
   name, aliases, or description.
Reading
• The mqlread API is for doing structured
   queries against Freebase's schema.
Writing
• The mqlwrite API lets you write data to
   Freebase.


                                                     Medical-Miner Project
                                                     Huelva 22/7/2010
Acre
A hosted application
development platform,
Acre:
   • makes it easy to build and
   share Freebase applications.
   •It encapsulates many of the
   trickier steps (like OAuth
   access for writing to
   Freebase) and
   •lets you view other people's
   apps, clone them, modify
   them, and share your
   improvements.

                                          Medical-Miner Project
                                          Huelva 22/7/2010
Applications… with Acre
• Powerset (NLP search engine)

• Fmdb

• Freebase
     Sets



                                 Medical-Miner Project
                                 Huelva 22/7/2010
RDF




      Medical-Miner Project
      Huelva 22/7/2010
Summarizing
"Freebase is the bridge between the bottom up vision
  of Web 2.0 collective intelligence and the more
  structured world of the semantic web." Tim O'Reilly




                                                Medical-Miner Project
                                                Huelva 22/7/2010

More Related Content

Similar to Deconstructing Freebase: A Guide to its Schema, Data, and APIs

Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Oss and libraries enabling arabic libraries and creating opportunities
Oss and libraries   enabling arabic libraries and creating opportunitiesOss and libraries   enabling arabic libraries and creating opportunities
Oss and libraries enabling arabic libraries and creating opportunitiesMassoud AlShareef
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsJohn Breslin
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataMinerva Lin
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Jon Voss
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSWSören Auer
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and TechniquesBernhard Haslhofer
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012lljohnston
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsJon Voss
 

Similar to Deconstructing Freebase: A Guide to its Schema, Data, and APIs (20)

Linked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & MuseumsLinked Open Data in Libraries, Archives & Museums
Linked Open Data in Libraries, Archives & Museums
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
Oss and libraries enabling arabic libraries and creating opportunities
Oss and libraries   enabling arabic libraries and creating opportunitiesOss and libraries   enabling arabic libraries and creating opportunities
Oss and libraries enabling arabic libraries and creating opportunities
 
Breaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social SemanticsBreaking Down Walls in Enterprise with Social Semantics
Breaking Down Walls in Enterprise with Social Semantics
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 
Global lodlam_communities and open cultural data
Global lodlam_communities and open cultural dataGlobal lodlam_communities and open cultural data
Global lodlam_communities and open cultural data
 
Linked Data
Linked DataLinked Data
Linked Data
 
International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)
 
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti... NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 
Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.Intro to Linked Open Data in Libraries Archives & Museums.
Intro to Linked Open Data in Libraries Archives & Museums.
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Overview AG AKSW
Overview AG AKSWOverview AG AKSW
Overview AG AKSW
 
Open Data - Principles and Techniques
Open Data - Principles and TechniquesOpen Data - Principles and Techniques
Open Data - Principles and Techniques
 
CAEPIA 2011
CAEPIA 2011CAEPIA 2011
CAEPIA 2011
 
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012
 
International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)International Image Interoperability Framework (IIIF)
International Image Interoperability Framework (IIIF)
 
Linked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & MuseumsLinked Open Data in Libraries Archives & Museums
Linked Open Data in Libraries Archives & Museums
 
Digital Content Management
Digital Content ManagementDigital Content Management
Digital Content Management
 

More from Manuel de la Villa

Presentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta AutomáticosPresentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta AutomáticosManuel de la Villa
 
Presentación programa Social Media UHU
Presentación programa Social Media UHUPresentación programa Social Media UHU
Presentación programa Social Media UHUManuel de la Villa
 
Marca personal para community managers
Marca personal para community managersMarca personal para community managers
Marca personal para community managersManuel de la Villa
 
Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2Manuel de la Villa
 
Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1Manuel de la Villa
 
Taller de Presentaciones efectivas
Taller de Presentaciones efectivasTaller de Presentaciones efectivas
Taller de Presentaciones efectivasManuel de la Villa
 
Presentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHUPresentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHUManuel de la Villa
 
Curso personal branding profesores
Curso personal branding profesoresCurso personal branding profesores
Curso personal branding profesoresManuel de la Villa
 
Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...Manuel de la Villa
 
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...Manuel de la Villa
 
A critical and comparative study about ISO 9001, CMMI and ISO 15504
A critical and comparative study about  ISO 9001, CMMI and ISO 15504A critical and comparative study about  ISO 9001, CMMI and ISO 15504
A critical and comparative study about ISO 9001, CMMI and ISO 15504Manuel de la Villa
 

More from Manuel de la Villa (17)

Mantenimiento del software
Mantenimiento del softwareMantenimiento del software
Mantenimiento del software
 
Presentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta AutomáticosPresentación TFG Informes de Alta Automáticos
Presentación TFG Informes de Alta Automáticos
 
Presentación programa Social Media UHU
Presentación programa Social Media UHUPresentación programa Social Media UHU
Presentación programa Social Media UHU
 
Marca personal para community managers
Marca personal para community managersMarca personal para community managers
Marca personal para community managers
 
Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2Taller Facebook #SMUHU parte 2
Taller Facebook #SMUHU parte 2
 
Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1Taller Facebook #SMUHU parte 1
Taller Facebook #SMUHU parte 1
 
Personal branding
Personal brandingPersonal branding
Personal branding
 
Taller de Presentaciones efectivas
Taller de Presentaciones efectivasTaller de Presentaciones efectivas
Taller de Presentaciones efectivas
 
Presentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHUPresentacion Grado en Ingeniería Informática UHU
Presentacion Grado en Ingeniería Informática UHU
 
Curso personal branding profesores
Curso personal branding profesoresCurso personal branding profesores
Curso personal branding profesores
 
Herramientas web 2.0 parte 2
Herramientas web 2.0 parte 2Herramientas web 2.0 parte 2
Herramientas web 2.0 parte 2
 
Herramientas web 2.0 Parte 1
Herramientas web 2.0 Parte 1Herramientas web 2.0 Parte 1
Herramientas web 2.0 Parte 1
 
MVilla IUI 2012 Lisbon
MVilla IUI 2012 LisbonMVilla IUI 2012 Lisbon
MVilla IUI 2012 Lisbon
 
Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...Experiences on integrating explicit knowledge on information access tools in ...
Experiences on integrating explicit knowledge on information access tools in ...
 
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...A Biomedical Information Retrieval System  based on Clustering for Mobile Dev...
A Biomedical Information Retrieval System based on Clustering for Mobile Dev...
 
A critical and comparative study about ISO 9001, CMMI and ISO 15504
A critical and comparative study about  ISO 9001, CMMI and ISO 15504A critical and comparative study about  ISO 9001, CMMI and ISO 15504
A critical and comparative study about ISO 9001, CMMI and ISO 15504
 
Tesina08
Tesina08Tesina08
Tesina08
 

Recently uploaded

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 

Recently uploaded (20)

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 

Deconstructing Freebase: A Guide to its Schema, Data, and APIs

  • 1. Deconstructing Freebase Manuel de la Villa Department of Information Technologies University of Huelva Medical-Miner Project Huelva 22/7/2010
  • 2. Deconstructing Freebase • What is it – A net? A tool? A kind of Data? wikipedia-turned- database? • The Schema, – domains, types, properties. Sources. • How it works • How can we use it – MQL, API, Acre, RDF Medical-Miner Project Huelva 22/7/2010
  • 3. What is it • Freebase is a large public database that collects three kinds of information: – data; – texts; and – media, that references… • …entities or topics (≈ 12 million). An entity is a unique single person, place, or thing. • A single concept or real-world thing. • A topic could also be called an entity, resource or element or thing, it is a fundamental unit in Freebase. • /common/topic • Each topic has a Guid or globally unique ID – http://www.freebase.com/view/en/barack_obama – http://www.freebase.com/guid/9202a8c04000641f800000000029c277 Medical-Miner Project Huelva 22/7/2010
  • 4. What is it (and II) • Freebase connects entities together as a graph, – defines its data structure as a set of nodes and a set of links that establish relationships between the nodes. • Most of our topics are associated with one or more types (such as people, places, books, films, etc) and may have additional properties like "date of birth" for a person or latitude and longitude for a location. These types and properties and related concepts are called Schema. Medical-Miner Project Huelva 22/7/2010
  • 5. The Schema • We use the term Schema to refer to the way Freebase's data is laid out or structured. In the Semantic web world, this is referred to as Ontology. • Schema in Freebase is expressed through Types and Properties. Types are grouped together in Domains. What is Schema? Schema describes how data in Freebase is structured. Schema is made by creating: Types, which define a particular kind of person, place or thing and contain Properties, which define the unique qualities of a type. William Shakespeare is a → type → Person has a → property → Date of Birth Medical-Miner Project Huelva 22/7/2010
  • 6. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 7. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 8. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 9. The Schema (II) Medical-Miner Project Huelva 22/7/2010
  • 10. The Schema: Medicine Medical-Miner Project Huelva 22/7/2010
  • 11. Then… is an Ontology? • Yes and No, is a kind of Folksonomy… Medical-Miner Project Huelva 22/7/2010
  • 12. The sources • Wikipedia, ChefMoz, NNDB, MusicBrainz… • The Freebase community, with a 2.0 spirit, are: • data contributors, • schema builders, • data curators, • application developers, • and many others who use Freebase – Expert status – Augmentative edition • Creative Commons Attribution Licensing, which means that it's free for you to browse, query, copy, and even use the data in your own systems or software, even for commercial use; all we ask in return is that you mention that Medical-Miner Project Huelva 22/7/2010
  • 13. How can we use it… • As a reference or information source • Create interesting Views and Visualizations and share them with others • Embed Freebase data in your website • Use our API or Acre, our hosted app development platform, to build apps that use Freebase data • Download our Data dumps • Use Freebase's RDF for Semantic Web applications Medical-Miner Project Huelva 22/7/2010
  • 14. How can we use it… • Ways to use Freebase: – Use Freebase's Ids to uniquely identify entities anywhere on the web – Query Freebase's data using MQL – Build applications using our API or Acre, the hosted development platform Medical-Miner Project Huelva 22/7/2010
  • 15. MQL (Metaweb Query Language) • http://api.freebase.com/api/service/mqlread?query={"query" :{"type":"/music/artist","name":"U2","album":[]}} • Query Editor Medical-Miner Project Huelva 22/7/2010
  • 16. MQL (and II) Medical-Miner Project Huelva 22/7/2010
  • 17. Freebase API Freebase's API allows you: – to perform searches and queries against Freebase's data, or – to write data to Freebase. – There are Libraries available for many languages. Searching with Search API • The search API lets you do a fuzzily-matched search for a string that occurs in a topic's name, aliases, or description. Reading • The mqlread API is for doing structured queries against Freebase's schema. Writing • The mqlwrite API lets you write data to Freebase. Medical-Miner Project Huelva 22/7/2010
  • 18. Acre A hosted application development platform, Acre: • makes it easy to build and share Freebase applications. •It encapsulates many of the trickier steps (like OAuth access for writing to Freebase) and •lets you view other people's apps, clone them, modify them, and share your improvements. Medical-Miner Project Huelva 22/7/2010
  • 19. Applications… with Acre • Powerset (NLP search engine) • Fmdb • Freebase Sets Medical-Miner Project Huelva 22/7/2010
  • 20. RDF Medical-Miner Project Huelva 22/7/2010
  • 21. Summarizing "Freebase is the bridge between the bottom up vision of Web 2.0 collective intelligence and the more structured world of the semantic web." Tim O'Reilly Medical-Miner Project Huelva 22/7/2010