SlideShare une entreprise Scribd logo
1  sur  36
The Semantic Web and
why Wikipedia should bother
          Jakob Voß




                       Wikimania 2007
            Taipei, Taiwan, 2007-08-03
Agenda

(1) The Semantic Web
(2) Wikipedia’s contribution
(3) Examples and problems
(4) Possible solutions
The Semantic Web

    Everything can be linked via its URI
●



    Every data in triples with typed links
●




                      Image taken from: Semantic Wikipedia (2006)
The Semantic Web

    Ontologies define
●


    common structures and rules
    More data is generated by aggregation
●


    and reasoning on distributed data from
    several sources
    Software agents understand your
●


    commands, aggregate, reason, decide
    and act independently (at least in theory)
Wikipedia’s contribution

    Largest source of freely available
●


    non-specialized data
    Templates and categories
●


    contain structured data
        Persondata
    –

        DBpedia.org
    –

        Geodata
    –

        ...
    –

    Semantic MediaWiki
●


    adds typed links and attributes
Aggregating and Reasoning
Aggregating and Reasoning

Which polish authors are currently
   most published in Germany?
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●


        List of published books by book vendors
    –
        or by the German National Library
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●



    Authors
●


        National Library catalouge contains author
    –
        and uniquely identifies author by PND-ID
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●



    Authors
●



    Polish authors
●


        German Wikipedia contains PND => article
    –

        Article linked via Interwiki => more articles
    –

        Biographical articles contain place of birth
    –

        Place of birth linked to country via category
    –
Aggregating and Reasoning

subject       predicate       object
Publication   published-in   Germany
Publication   has-author      Person
Person          born-in         Town
Town            place-in      Poland
Where is Poland?
Where is Poland?


          Somewhere here
Where is Poland?


                        Somewhere here
Or five times here in
Maine, Ohia, or NY
Where is Poland?


                        Somewhere here
Or five times here in
Maine, Ohia, or NY




                               Or did you mean
                               Poland, Kiribati?
Poland around 1619




Polish-Lithuanian Commonwealth
Poland 1772...1793..1795
Poland 1945–
Where is Poland?

    Reality is complex, confusing, and fuzzy
●



    What’s the »default« Poland?
●



    Humans can look up context in Wikipedia
●



    Semantic Web only consists of statements
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●



    George W. Bush     2001-01-20 – 2009-01-20
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●



    George W. Bush     2001-01-20 – 2009-01-20
●



    Barack Obama       2009-01-20 –
●
Example #2

    Presidents of the United States
    Bill Clinton        1993-01-20 – 2001-01-20
●



    George W. Bush      2001-01-20 – 2009-01-20
●



    Barack Obama        2009-01-20 – 2013-01-20
●



    A. Schwarzenegger   2013-01-20 –
●
Presidents of the United States

    George W. Bush       2001-01-20 – 2002-06-29
●



    Dick Cheney               07:09 – 09:24 a.m.
●



    George W. Bush       2002-06-29 – 2007-07-21
●



    Dick Cheney               07:14 – 09:21 a.m.
●



    George W. Bush      2007–07-21 –
●




                  Twice president of the US
                     (see 25th amendment)
Presidents of the United States

    The devil is in the details ;-)
●



    Automatic reasoning will
●


    give you inconvenient results
Example #3

 Finally a clear division




                        女性
男性
                        XX
XY
So let’s formalize...

owl:disjointWith
”Classes may be stated to be disjoint from
 each other. For example, Man and Woman
 can be stated to be disjoint classes. [...] a
 reasoner can deduce that if A is an
 instance of Man, then A is not an instance
 of Woman.“
OWL Web Ontology Language Guide
 http://www.w3.org/TR/owl-guide/
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●




       Intersexuality, Hermaproditism
    Chromosomal sex inconsistent with phenotypic
●


    sex or phenotype is not just male or female
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●




       Intersexuality, Hermaproditism
    Chromosomal sex inconsistent with phenotypic
●


    sex or phenotype is not just male or female

                Gender identity
    Gender with which a person identifies
●


    independent from biological sex.
A clear division?

    Reality is far more complicated
●



    Many kinds of exceptions
●
Problems

    Clear divisions discriminate
●



    Discussion and context gets lost
●



    Example #4
●


     IF  your name = X
     AND X on a list of suspected terrorists
    THEN you have a problem
Not our problem?

    Ẁikipedia is already used as
●


    source by millions of people
    People can think, judge and ask,
●


    computers cannot
    We create definitions that will be used in
●


    thousands of applications
    Statistics lie
●


    Aggragation/resoning even lies better
Possible Solutions

    More of all (data, aggregation, reasoning)
●



    Less of all
●



    Statements about statements
●



    Fuzzy logic
●



    Data provenance / data lineage
●



    Allow exceptions
●



    Teach people to be careful
●



    Do not expect or believe simple answers
●



    It’s just dirty data
●
Summary

    Semantic Web is great
●



    Reality is based on exceptions
●



    Simplification is useful but dangerous
●



    Data POV != NPOV
●



    We also bear responsability for
●


    stupid use of Wikipedia data
    Never stop analyzing and thinking
●


    instead of relying on computers
More to read

    Shadbolt, Berners-Lee, and Hall: The Semantic Web
●


    Revisited. IEEE Intelligent Systems 21 (3) pp. 96-101.
    May/June 2006.
    http://eprints.ecs.soton.ac.uk/12614/01/Semantic_Web_Revisted.pdf

    Völkel, Krötzsch, Vrandecic, Haller, and Studer:
●


    Semantic Wikipedia. Proceedings of the WWW2006.
    http://www.aifb.uni-karlsruhe.de/Publikationen/showPublikation_english?publ_id=

    Doctorow: Metacrap: Putting the torch to seven straw-
●


    men of the meta-utopia. August 2001.
    http://www.well.com/~doctorow/metacrap.htm

    Geoffrey and Star: Sorting Things Out: Classification
●


    and Its Consequences. MIT Press, 1999.

Contenu connexe

Similaire à Jakob Voss Wikipedia2007

University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009Tom Moritz
 
Describing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDescribing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDan Brickley
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagementSTIinnsbruck
 
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Cartegraph
 
Argumentative Essay Structure
Argumentative Essay StructureArgumentative Essay Structure
Argumentative Essay StructureVeronica Withers
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysisLuke Czarnecki
 
Wikipedia and Civic Engagement
Wikipedia and Civic EngagementWikipedia and Civic Engagement
Wikipedia and Civic EngagementAndrew Lih
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsJohn Breslin
 
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritWho's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritDominik Lukes
 
The Potential of Web 3.0
The Potential of Web 3.0The Potential of Web 3.0
The Potential of Web 3.0Carsten Ullrich
 
Linked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchLinked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchEnrico Daga
 
The Persuasive Speech.ppt
The Persuasive Speech.pptThe Persuasive Speech.ppt
The Persuasive Speech.pptSupreethaS8
 
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionFrom Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionSTLab
 
Developmental Psychology Theoretical Approaches Essay
 Developmental Psychology Theoretical Approaches Essay Developmental Psychology Theoretical Approaches Essay
Developmental Psychology Theoretical Approaches EssayPatty Buckley
 
(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And UnderstandingAlexandre Linhares
 
Content - Cory Doctorow
Content - Cory DoctorowContent - Cory Doctorow
Content - Cory DoctorowGeorge Grayson
 

Similaire à Jakob Voss Wikipedia2007 (20)

University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009
 
Describing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDescribing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classification
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
 
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
 
Argumentative Essay Structure
Argumentative Essay StructureArgumentative Essay Structure
Argumentative Essay Structure
 
Infooverload
InfooverloadInfooverload
Infooverload
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysis
 
Essay Writing Examples Uk
Essay Writing Examples UkEssay Writing Examples Uk
Essay Writing Examples Uk
 
Wikipedia and Civic Engagement
Wikipedia and Civic EngagementWikipedia and Civic Engagement
Wikipedia and Civic Engagement
 
Coincidences
CoincidencesCoincidences
Coincidences
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for Semantics
 
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritWho's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
 
The Potential of Web 3.0
The Potential of Web 3.0The Potential of Web 3.0
The Potential of Web 3.0
 
Linked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchLinked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities research
 
The Persuasive Speech.ppt
The Persuasive Speech.pptThe Persuasive Speech.ppt
The Persuasive Speech.ppt
 
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionFrom Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
 
Developmental Psychology Theoretical Approaches Essay
 Developmental Psychology Theoretical Approaches Essay Developmental Psychology Theoretical Approaches Essay
Developmental Psychology Theoretical Approaches Essay
 
Class 4
Class 4Class 4
Class 4
 
(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding
 
Content - Cory Doctorow
Content - Cory DoctorowContent - Cory Doctorow
Content - Cory Doctorow
 

Plus de Bertalan Mesko, MD

Medical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comMedical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comBertalan Mesko, MD
 
Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Bertalan Mesko, MD
 
Medicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldMedicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldBertalan Mesko, MD
 
Practicing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraPracticing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraBertalan Mesko, MD
 
Jason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsJason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsBertalan Mesko, MD
 
Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Bertalan Mesko, MD
 
Medicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerMedicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerBertalan Mesko, MD
 
The impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareThe impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareBertalan Mesko, MD
 

Plus de Bertalan Mesko, MD (9)

Medical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comMedical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.com
 
Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma
 
Medicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldMedicine in Second Life, the virtual world
Medicine in Second Life, the virtual world
 
Practicing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraPracticing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 Era
 
Jason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsJason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired Patients
 
Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0
 
Medicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerMedicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student blogger
 
The impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareThe impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcare
 
Medicine 2.0
Medicine 2.0Medicine 2.0
Medicine 2.0
 

Dernier

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Karmanjay Verma
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...itnewsafrica
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessWSO2
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...itnewsafrica
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFMichael Gough
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 

Dernier (20)

A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
Irene Moetsana-Moeng: Stakeholders in Cybersecurity: Collaborative Defence fo...
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with Platformless
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...Zeshan Sattar- Assessing the skill requirements and industry expectations for...
Zeshan Sattar- Assessing the skill requirements and industry expectations for...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDF
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 

Jakob Voss Wikipedia2007

  • 1. The Semantic Web and why Wikipedia should bother Jakob Voß Wikimania 2007 Taipei, Taiwan, 2007-08-03
  • 2. Agenda (1) The Semantic Web (2) Wikipedia’s contribution (3) Examples and problems (4) Possible solutions
  • 3. The Semantic Web Everything can be linked via its URI ● Every data in triples with typed links ● Image taken from: Semantic Wikipedia (2006)
  • 4. The Semantic Web Ontologies define ● common structures and rules More data is generated by aggregation ● and reasoning on distributed data from several sources Software agents understand your ● commands, aggregate, reason, decide and act independently (at least in theory)
  • 5. Wikipedia’s contribution Largest source of freely available ● non-specialized data Templates and categories ● contain structured data Persondata – DBpedia.org – Geodata – ... – Semantic MediaWiki ● adds typed links and attributes
  • 7. Aggregating and Reasoning Which polish authors are currently most published in Germany?
  • 8. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● List of published books by book vendors – or by the German National Library
  • 9. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● Authors ● National Library catalouge contains author – and uniquely identifies author by PND-ID
  • 10. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● Authors ● Polish authors ● German Wikipedia contains PND => article – Article linked via Interwiki => more articles – Biographical articles contain place of birth – Place of birth linked to country via category –
  • 11. Aggregating and Reasoning subject predicate object Publication published-in Germany Publication has-author Person Person born-in Town Town place-in Poland
  • 13. Where is Poland? Somewhere here
  • 14. Where is Poland? Somewhere here Or five times here in Maine, Ohia, or NY
  • 15. Where is Poland? Somewhere here Or five times here in Maine, Ohia, or NY Or did you mean Poland, Kiribati?
  • 19. Where is Poland? Reality is complex, confusing, and fuzzy ● What’s the »default« Poland? ● Humans can look up context in Wikipedia ● Semantic Web only consists of statements ●
  • 20. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ●
  • 21. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ●
  • 22. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ● Barack Obama 2009-01-20 – ●
  • 23. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ● Barack Obama 2009-01-20 – 2013-01-20 ● A. Schwarzenegger 2013-01-20 – ●
  • 24. Presidents of the United States George W. Bush 2001-01-20 – 2002-06-29 ● Dick Cheney 07:09 – 09:24 a.m. ● George W. Bush 2002-06-29 – 2007-07-21 ● Dick Cheney 07:14 – 09:21 a.m. ● George W. Bush 2007–07-21 – ● Twice president of the US (see 25th amendment)
  • 25. Presidents of the United States The devil is in the details ;-) ● Automatic reasoning will ● give you inconvenient results
  • 26. Example #3 Finally a clear division 女性 男性 XX XY
  • 27. So let’s formalize... owl:disjointWith ”Classes may be stated to be disjoint from each other. For example, Man and Woman can be stated to be disjoint classes. [...] a reasoner can deduce that if A is an instance of Man, then A is not an instance of Woman.“ OWL Web Ontology Language Guide http://www.w3.org/TR/owl-guide/
  • 28. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ●
  • 29. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ● Intersexuality, Hermaproditism Chromosomal sex inconsistent with phenotypic ● sex or phenotype is not just male or female
  • 30. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ● Intersexuality, Hermaproditism Chromosomal sex inconsistent with phenotypic ● sex or phenotype is not just male or female Gender identity Gender with which a person identifies ● independent from biological sex.
  • 31. A clear division? Reality is far more complicated ● Many kinds of exceptions ●
  • 32. Problems Clear divisions discriminate ● Discussion and context gets lost ● Example #4 ● IF your name = X AND X on a list of suspected terrorists THEN you have a problem
  • 33. Not our problem? Ẁikipedia is already used as ● source by millions of people People can think, judge and ask, ● computers cannot We create definitions that will be used in ● thousands of applications Statistics lie ● Aggragation/resoning even lies better
  • 34. Possible Solutions More of all (data, aggregation, reasoning) ● Less of all ● Statements about statements ● Fuzzy logic ● Data provenance / data lineage ● Allow exceptions ● Teach people to be careful ● Do not expect or believe simple answers ● It’s just dirty data ●
  • 35. Summary Semantic Web is great ● Reality is based on exceptions ● Simplification is useful but dangerous ● Data POV != NPOV ● We also bear responsability for ● stupid use of Wikipedia data Never stop analyzing and thinking ● instead of relying on computers
  • 36. More to read Shadbolt, Berners-Lee, and Hall: The Semantic Web ● Revisited. IEEE Intelligent Systems 21 (3) pp. 96-101. May/June 2006. http://eprints.ecs.soton.ac.uk/12614/01/Semantic_Web_Revisted.pdf Völkel, Krötzsch, Vrandecic, Haller, and Studer: ● Semantic Wikipedia. Proceedings of the WWW2006. http://www.aifb.uni-karlsruhe.de/Publikationen/showPublikation_english?publ_id= Doctorow: Metacrap: Putting the torch to seven straw- ● men of the meta-utopia. August 2001. http://www.well.com/~doctorow/metacrap.htm Geoffrey and Star: Sorting Things Out: Classification ● and Its Consequences. MIT Press, 1999.