Optimized Index Structures
for Querying RDF from the
Web
Presented by : Mahdi Atawna
1
About the Paper
 The paper was published at the Third Latin American Web Congress (LA-WEB) in 2005.
 It has 56 citations.
Andreas Harth
National University of Ireland, Galway
Prof. Stefan Decker
National University of Ireland, Galway
2
Outline
 Overview of Semantic Web.
 Overview of Indexes.
 Paper motivation.
 Methodology.
 Experiment & Result.
 Conclusion.
3
Semantic Web
 Also called :
 Web 3.0.
 the Linked Data Web.
 the Web of Data…whatever you call it.
 the next major evolution in connecting information.
4
Why semantic web?
 It enables data to be linked from a source to any
other source.
 It can be understood by computers so that they can
perform increasingly sophisticated tasks on our
behalf.
5
6
Source: http://lod-cloud.net
Semantic Web Standards
 RDF (Resource Description Framework): the data modeling language for the Semantic Web (comparable to UML). All Semantic Web information is stored and represented in RDF.
 SPARQL: the query language of the Semantic Web.
 OWL (Web Ontology Language): the schema language, or knowledge representation (KR) language, of the Semantic Web.
7
What is RDF?
 RDF is the data model of the Semantic Web.
 That means that all data in Semantic Web
technologies is represented as RDF.
 If you store Semantic Web data, it's in RDF.
 If you query Semantic Web data (typically using
SPARQL), it's RDF data. If you send Semantic Web
data to your friend, it's RDF.
8
9
10
Source : http://www.w3.org/TR/rdf11-primer/
RDF triples
 are representations of graph edges.
11
Subject --Predicate--> Object
Mahdi --born in--> Hebron
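A minimal sketch of the "triple as graph edge" idea, using plain Python tuples. The helper name `edges_from` and the adjacency-map layout are assumptions made for illustration; they are not taken from the paper.

```python
from collections import defaultdict

# Each triple is a directed, labeled edge: subject --predicate--> object.
triples = [
    ("Mahdi", "born in", "Hebron"),
]


def edges_from(triples):
    """Group triples into an adjacency map: subject -> [(predicate, object), ...]."""
    graph = defaultdict(list)
    for subject, predicate, obj in triples:
        graph[subject].append((predicate, obj))
    return dict(graph)


graph = edges_from(triples)
print(graph)  # {'Mahdi': [('born in', 'Hebron')]}
```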
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
<http://example.org/bob#me>
foaf:topic_interest
<http://wikidata.org/entity/Q12418> .
12
RDF example
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
<http://example.org/bob#me>               # Subject
foaf:topic_interest                       # Predicate
<http://wikidata.org/entity/Q12418> .     # Object
13
SPARQL query language
SELECT ?p ?o
WHERE {
  <http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o
}
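What this query asks for can be sketched in a few lines of Python: fix the subject and return every predicate/object pair attached to it. This is only an illustration over an in-memory list. The predicate and object values are made up, and `select_p_o` is a hypothetical helper, not part of any SPARQL library.

```python
SPACECRAFT = "http://nasa.dataincubator.org/spacecraft/1968-089A"

# Hypothetical triples; predicates and objects are invented for the example.
triples = [
    (SPACECRAFT, "http://example.org/name", "Example craft"),
    (SPACECRAFT, "http://example.org/launched", "1968"),
    ("http://example.org/other", "http://example.org/name", "Other"),
]


def select_p_o(triples, subject):
    """Bind ?p and ?o for every triple whose subject equals `subject`."""
    return [(p, o) for s, p, o in triples if s == subject]


for p, o in select_p_o(triples, SPACECRAFT):
    print(p, o)
```

Without an index, this scans every triple. That linear scan is the cost the index structures in the paper are meant to avoid.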
14
What is a database index?
 A database index is a data structure that improves the
speed of data retrieval operations on a database table at
the cost of additional writes and storage space to maintain
the index data structure.
 Index goal : The index structure enables fast retrieval of
data
15
16
Index example
Key         Articles
Leonardo    [104, 70, 12, 98]
Mona Lisa   [2, 201, 7, 20, 12]
Francesco   [1, 8, 900, 104]
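The example index can be written as a Python dictionary that maps each key to a list of article ids (reading the numbers as article ids is an assumption). The helper names `lookup` and `lookup_all` are hypothetical; the sketch shows a single-key lookup and a multi-key lookup done by intersecting the lists.

```python
# Key -> list of article ids, copied from the example table.
index = {
    "Leonardo": [104, 70, 12, 98],
    "Mona Lisa": [2, 201, 7, 20, 12],
    "Francesco": [1, 8, 900, 104],
}


def lookup(index, key):
    """Return the article ids stored for one key, or an empty list."""
    return index.get(key, [])


def lookup_all(index, keys):
    """Return the sorted ids of articles that appear under every key."""
    lists = [set(lookup(index, key)) for key in keys]
    return sorted(set.intersection(*lists)) if lists else []


print(lookup(index, "Francesco"))                    # [1, 8, 900, 104]
print(lookup_all(index, ["Leonardo", "Mona Lisa"]))  # [12]
print(lookup_all(index, ["Leonardo", "Francesco"]))  # [104]
```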
17
Paper motivation
 Previous systems provide a storage infrastructure for RDF data, but their index structures do not support typical query scenarios for data from the Web, which results in poor query-answering performance in some cases.
18
Methodology
 The researchers present a new index structure designed to handle data from the Web.
 They implemented the index structure in a lightweight system called YARS (Yet Another RDF Store).
19
RDF Index structures
 The authors propose an index structure made up of two sets of indexes:
1. Lexicon: covers the string representations of the nodes of the RDF graph (resources r, literals l, blank nodes b).
2. Quad indexes: cover the quads (triples extended with a context).
20
1. Lexicon indexes
 NodeOID and OIDNode Index
 Keyword Index
21
1. Lexicon indexes
 NodeOID and OIDNode Index :
22
Key                                      Value
<http://www.harth.org/andreas/#me>       3
<http://decker.cn/stefan/>               14
<http://sw.deri.org/~aharth/foaf.rdf>    11
<http://www.deri.org/>                   1
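A minimal sketch of the NodeOID/OIDNode pair as two dictionaries kept in sync, one per lookup direction. The class name `Lexicon` and its method are assumptions for illustration; the source does not show how YARS stores these indexes.

```python
class Lexicon:
    """Bidirectional map between node strings and integer object IDs (OIDs)."""

    def __init__(self):
        self.node_to_oid = {}  # NodeOID index: string -> OID
        self.oid_to_node = {}  # OIDNode index: OID -> string

    def add(self, node, oid):
        """Register a node under an OID in both directions."""
        self.node_to_oid[node] = oid
        self.oid_to_node[oid] = node


lexicon = Lexicon()
lexicon.add("<http://www.harth.org/andreas/#me>", 3)
lexicon.add("<http://decker.cn/stefan/>", 14)
lexicon.add("<http://www.deri.org/>", 1)

print(lexicon.node_to_oid["<http://decker.cn/stefan/>"])  # 14
print(lexicon.oid_to_node[3])  # <http://www.harth.org/andreas/#me>
```

Storing fixed-size OIDs instead of full strings keeps the keys of the other indexes small, and the reverse map is only needed when results are turned back into strings.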
1. Lexicon indexes
 Keyword Index (Popular in search engines)
23
Key         No. of hits   List of hits
“Andreas”   1             3
“Decker”    1             11
“Harth”     1             3
“Stefan”    2             11, 13
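A keyword index of this shape can be built by splitting each text value into words and recording which OIDs contain each word. In the sketch below, the `literals` input is hypothetical and was chosen only so that the result reproduces the table; `build_keyword_index` is an illustrative helper, not the paper's implementation.

```python
from collections import defaultdict

# Hypothetical input: OID -> text attached to that node.
literals = {
    3: "Andreas Harth",
    11: "Stefan Decker",
    13: "Stefan",
}


def build_keyword_index(literals):
    """Map each word to (number of hits, sorted list of OIDs containing it)."""
    postings = defaultdict(list)
    for oid in sorted(literals):
        # set() so a word repeated inside one text is counted once per OID.
        for word in set(literals[oid].split()):
            postings[word].append(oid)
    return {word: (len(oids), oids) for word, oids in postings.items()}


keyword_index = build_keyword_index(literals)
print(keyword_index["Stefan"])   # (2, [11, 13])
print(keyword_index["Andreas"])  # (1, [3])
```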
2. Quad indexes
 Access Patterns
 Combined Indexes
 Occurrence Counts
24
2. Quad indexes
A- Access Patterns
25
No   Access pattern     No   Access pattern
1    (?:?:?:?)          9    (s:?:o:c)
2    (s:?:?:?)          10   (?:?:o:c)
3    (s:p:?:?)          11   (?:?:o:?)
4    (s:p:o:?)          12   (?:?:?:c)
5    (s:p:o:c)          13   (s:?:?:c)
6    (?:p:?:?)          14   (s:p:?:c)
7    (?:p:o:?)          15   (?:p:?:c)
8    (?:p:o:c)          16   (s:?:o:?)
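The count of 16 follows from the four quad positions (subject, predicate, object, context), each of which is either bound or a variable: 2^4 = 16. A short sketch that enumerates them is given below; the function name is hypothetical and the order of the output differs from the numbering in the table.

```python
from itertools import product


def access_patterns(positions="spoc"):
    """List every bound/unbound combination of the given quad positions."""
    patterns = []
    for bound in product([False, True], repeat=len(positions)):
        parts = [name if is_bound else "?" for name, is_bound in zip(positions, bound)]
        patterns.append("(" + ":".join(parts) + ")")
    return patterns


patterns = access_patterns()
print(len(patterns))              # 16
print(patterns[0], patterns[-1])  # (?:?:?:?) (s:p:o:c)
```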
2. Quad indexes
A- Access Patterns
26
No   Access pattern   Values
1    (?:?:?:?)        [1,5,3]
2    (s:?:?:?)        [2]
3    (s:p:?:?)        [9,8,2,3]
4    (s:p:o:?)        [1,3]
5    (s:p:o:c)        [1]
6    (?:p:?:?)        [76,9]
7    (?:p:o:?)        [2,3]
2. Quad indexes
B- Combined Indexes
27
spoc        poc         ocs         csp         cp          os
(?:?:?:?)   (?:p:?:?)   (?:?:o:?)   (?:?:?:c)   (?:p:?:c)   (s:?:o:?)
(s:?:?:?)   (?:p:o:?)   (?:?:o:c)   (s:?:?:c)
(s:p:?:?)   (?:p:o:c)   (s:?:o:c)   (s:p:?:c)
(s:p:o:?)
(s:p:o:c)
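The grouping above can be read as a prefix rule: an index whose key is an ordered concatenation of quad positions can answer any pattern whose bound positions form a prefix of that key. The sketch below applies that rule. It is an interpretation under stated assumptions, not the authors' code. The third index is written as `ocs` because its prefixes (o, oc, ocs) are exactly the three patterns listed under it. The function name `choose_index` is hypothetical.

```python
# Combined index keys; each key is an ordering of quad positions.
INDEXES = ["spoc", "poc", "ocs", "csp", "cp", "os"]


def choose_index(pattern, indexes=INDEXES):
    """Return the first index whose leading positions equal the bound positions.

    `pattern` uses the notation from the table, for example "(s:?:o:?)".
    """
    bound = {part for part in pattern.strip("()").split(":") if part != "?"}
    for index in indexes:
        if set(index[:len(bound)]) == bound:
            return index
    return None


print(choose_index("(s:?:o:?)"))  # os
print(choose_index("(?:p:?:c)"))  # cp
print(choose_index("(?:?:o:c)"))  # ocs
print(choose_index("(?:?:?:?)"))  # spoc
```

Under this rule, six indexes are enough to serve all 16 access patterns with a prefix lookup, rather than keeping one index per pattern.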
2. Quad indexes
C- Occurrence Counts
28
No   Access pattern   Values       Count
1    (?:?:?:?)        [1,5,3]      3
2    (s:?:?:?)        [2]          1
3    (s:p:?:?)        [9,8,2,3]    4
4    (s:p:o:?)        [1,3]        2
5    (s:p:o:c)        [1]          1
6    (?:p:?:?)        [76,9]       2
7    (?:p:o:?)        [2,3]        2
YARS
 A web application built in Java.
 Has two parts:
 a storage component that handles both persistent and in-
memory indexes.
 a query handler to perform query processing and evaluation.
29
Experiment
 They evaluated the performance based on a dataset of 2.8
million triples (293 MB).
 The testing server has :
 Pentium-4 2.4 GHz
 4 GB RAM
 running Debian Sarge .
30
Experiment
They considered the following RDF stores for evaluation:
 Sesame.
 Kowari (the authors could not get a running version).
 Redland.
 Jena2 ([9] shows that Sesame generally outperforms Jena).
31
Experiment
Experiment parts:
 Index Construction.
 Queries.
32
Result – index construction
System          Index size (bytes)
Redland         2,164,019,200
Sesame MySQL    340,381,636
Sesame native   39,997,992
YARS            1,090,002,944
33
Table 8: Index size for the synthetic Univ20 dataset.
Result – index construction
34
Result - queries
No   Query
1    ?x rdf:type univ:UndergraduateStudent
2    ?x ?p "UndergraduateStudent0"
3    <http://www.University965.edu> ?p ?o
4    ?x univ:worksFor ?y
35
Query   Redland   Sesame MySQL   Sesame Native   YARS
1       0:10.48   0:18.87        1:05.16         0:18.41
2       0:44.14   0:00.73        0:00.55         0:00.49
3       0:44.15   0:00.46        0:00.47         0:00.32
4       3:04.21   0:03.42        0:01.95         0:00.47
Performance results for quad queries.
Conclusion
 The authors addressed query processing for RDF, which is an important issue in the Semantic Web.
 Compared with the other systems, YARS incurs some overhead for resolving dependencies and ordering.
36
Criticism
 In the experiment, the researchers excluded the Kowari engine because they could not install it.
37
Questions?
Thank you
38

Optimized index structures for querying rdf from the web

  • 1. Optimized Index Structures for Querying RDF from the Web Presented by : Mahdi Atawna 1
  • 2. About the Paper  The paper was published at the Third Latin American Web Congress in 2005  Have 56 citation. Andreas Harth National University of Galway, Ireland Prof. Stefan Decker National University of Galway, Ireland 2
  • 3. Outline  Overview of Semantic Web.  Overview of Indexes.  Paper motivation.  Methodology.  Experiment & Result.  Conclusion. 3
  • 4. Semantic Web: Also called Web 3.0, the Linked Data Web, or the Web of Data. Whatever you call it, it is the next major evolution in connecting information.
  • 5. Why Semantic Web? It enables data to be linked from a source to any other source. It can be understood by computers so that they can perform increasingly sophisticated tasks on our behalf.
  • 7. Semantic Web Standards: RDF (Resource Description Framework): the data modeling language for the Semantic Web (like UML). All Semantic Web information is stored and represented in RDF. SPARQL: the query language of the Semantic Web. OWL (Web Ontology Language): the schema language, or knowledge representation (KR) language, of the Semantic Web.
  • 8. What is RDF? RDF is the data model of the Semantic Web. That means that all data in Semantic Web technologies is represented as RDF. If you store Semantic Web data, it's in RDF. If you query Semantic Web data (typically using SPARQL), it's RDF data. If you send Semantic Web data to your friend, it's RDF.
  • 11. RDF triples are representations of graph edges: Subject, Predicate, Object. Example: Mahdi (subject), born in (predicate), Hebron (object).
  • 12. RDF example: @prefix foaf: <http://xmlns.com/foaf/0.1/> . <http://example.org/bob#me> foaf:topic_interest <http://wikidata.org/entity/Q12418> .
  • 13. RDF example (annotated):
      @prefix foaf: <http://xmlns.com/foaf/0.1/> .
      <http://example.org/bob#me>                 # subject
          foaf:topic_interest                     # predicate
          <http://wikidata.org/entity/Q12418> .   # object
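A minimal Python sketch of the triple on slides 12 and 13, held as a plain (subject, predicate, object) tuple with the `foaf:` prefix expanded by hand. The names `PREFIXES` and `expand` are hypothetical helpers for illustration, not part of any RDF library.

```python
# Prefix table taken from the @prefix line on the slide.
PREFIXES = {"foaf": "http://xmlns.com/foaf/0.1/"}


def expand(term, prefixes=PREFIXES):
    """Turn <iri> or prefix:local into a full IRI string (illustrative helper)."""
    if term.startswith("<") and term.endswith(">"):
        return term[1:-1]
    prefix, _, local = term.partition(":")
    return prefixes[prefix] + local


# One RDF triple is one labeled edge: subject --predicate--> object.
triple = (
    expand("<http://example.org/bob#me>"),
    expand("foaf:topic_interest"),
    expand("<http://wikidata.org/entity/Q12418>"),
)
subject, predicate, obj = triple
```

Storing the edge as a tuple keeps the positions explicit, which is what the later index slides rely on.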
  • 14. SPARQL query language: SELECT ?p ?o { <http://nasa.dataincubator.org/spacecraft/1968-089A> ?p ?o }
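The query on slide 14 fixes the subject and leaves the predicate and object as variables. A rough Python sketch of that matching over an in-memory list of triples follows. The function `match` is a hypothetical helper, and the two sample triples (predicates and values) are made up for illustration; they are not data from the NASA dataset.

```python
def match(triples, s=None, p=None, o=None):
    """Yield triples that agree with every bound (non-None) position."""
    for triple in triples:
        if all(want is None or want == got
               for want, got in zip((s, p, o), triple)):
            yield triple


SPACECRAFT = "http://nasa.dataincubator.org/spacecraft/1968-089A"

# Made-up sample data: predicates and values are placeholders.
triples = [
    (SPACECRAFT, "http://example.org/name", "placeholder name"),
    (SPACECRAFT, "http://example.org/agency", "placeholder agency"),
    ("http://example.org/other", "http://example.org/name", "something else"),
]

# Equivalent of: SELECT ?p ?o { <SPACECRAFT> ?p ?o }
rows = [(p, o) for _, p, o in match(triples, s=SPACECRAFT)]
```

Without an index, `match` scans every triple, which is the cost the index structures on the following slides are meant to avoid.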
  • 15. What is a database index? A database index is a data structure that improves the speed of data retrieval operations on a database table at the cost of additional writes and storage space to maintain the index data structure. Index goal: the index structure enables fast retrieval of data.
  • 17. Index example:
      Key         Articles
      Leonardo    [104, 70, 12, 98]
      Mona Lisa   [2, 201, 7, 20, 12]
      Francesco   [1, 8, 900, 104]
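The key-to-articles table on slide 17 is an inverted index. A small Python sketch, using the slide's values, shows why it speeds up retrieval: finding the articles for one key is a dictionary lookup, and finding the articles that mention two keys is a set intersection. The function name `articles_with_all` is an assumption made for this example.

```python
# Inverted index copied from the slide: key -> list of article ids.
index = {
    "Leonardo": [104, 70, 12, 98],
    "Mona Lisa": [2, 201, 7, 20, 12],
    "Francesco": [1, 8, 900, 104],
}


def articles_with_all(index, *keys):
    """Return the sorted ids of articles listed under every given key."""
    postings = [set(index.get(key, [])) for key in keys]
    if not postings:
        return []
    return sorted(set.intersection(*postings))


both = articles_with_all(index, "Leonardo", "Mona Lisa")
```

The trade-off named on slide 15 shows up here too: every new article must also be appended to the list of each key it mentions.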
  • 18. Paper motivation: Previous systems provide a storage infrastructure for RDF data, but their index structures do not support typical query scenarios for data from the Web, which results in poor query answering performance in some cases.
  • 19. Methodology: The researchers present a new index structure that handles data from the Web. They implemented the index structure in a lightweight software system called YARS.
  • 20. RDF index structures: The authors suggested an index structure that contains two sets: 1. Lexicon: covers the string representations of the RDF graph nodes (r, l, b: resources, literals, blank nodes). 2. Quad indexes: cover the quads (triples plus context).
  • 21. 1. Lexicon indexes: NodeOID and OIDNode index; Keyword index.
  • 22. 1. Lexicon indexes: NodeOID and OIDNode index:
      Key                                    Value
      <http://www.harth.org/andreas/#me>     3
      <http://decker.cn/stefan/>             14
      <http://sw.deri.org/ aharth/foaf.rdf>  11
      <http://www.deri.org/>                 1
  • 23. 1. Lexicon indexes: Keyword index (popular in search engines):
      Key        No of hits  List of hits
      "Andreas"  1           3
      "Decker"   1           11
      "Harth"    1           3
      "Stefan"   2           11, 13
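The lexicon slides describe three maps: node string to object id (NodeOID), object id back to node string (OIDNode), and keyword to the ids that contain it. A minimal Python sketch under stated assumptions: the class name `Lexicon`, the sequential id assignment, and the lowercase word tokenizer are all choices made for this example, and the slides do not say how YARS assigns ids or tokenizes text.

```python
import re


class Lexicon:
    """Toy lexicon: NodeOID, OIDNode and keyword maps held in dictionaries."""

    def __init__(self):
        self.node_to_oid = {}   # NodeOID index
        self.oid_to_node = {}   # OIDNode index
        self.keywords = {}      # keyword -> list of OIDs containing it

    def add(self, node):
        """Return the OID of a node, assigning the next free one if new."""
        oid = self.node_to_oid.get(node)
        if oid is None:
            oid = len(self.node_to_oid) + 1
            self.node_to_oid[node] = oid
            self.oid_to_node[oid] = node
        return oid

    def add_literal(self, text):
        """Register a literal and index each of its words (assumed tokenizer)."""
        oid = self.add(text)
        for word in re.findall(r"\w+", text.lower()):
            hits = self.keywords.setdefault(word, [])
            if oid not in hits:
                hits.append(oid)
        return oid


lex = Lexicon()
andreas = lex.add_literal("Andreas Harth")
stefan = lex.add_literal("Stefan Decker")
```

Replacing long strings with small integer ids keeps the quad indexes compact, since each quad then stores four numbers rather than four strings.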
  • 24. 2. Quad indexes: Access patterns; Combined indexes; Occurrence counts.
  • 25. 2. Quad indexes: A - Access patterns:
      No  Access pattern    No  Access pattern
      1   (?:?:?:?)         9   (s:?:o:c)
      2   (s:?:?:?)         10  (?:?:o:c)
      3   (s:p:?:?)         11  (?:?:o:?)
      4   (s:p:o:?)         12  (?:?:?:c)
      5   (s:p:o:c)         13  (s:?:?:c)
      6   (?:p:?:?)         14  (s:p:?:c)
      7   (?:p:o:?)         15  (?:p:?:c)
      8   (?:p:o:c)         16  (s:?:o:?)
  • 26. 2. Quad indexes: A - Access patterns:
      No  Access pattern  Values
      1   (?:?:?:?)       [1, 5, 3]
      2   (s:?:?:?)       [2]
      3   (s:p:?:?)       [9, 8, 2, 3]
      4   (s:p:o:?)       [1, 3]
      5   (s:p:o:c)       [1]
      6   (?:p:?:?)       [76, 9]
      7   (?:p:o:?)       [2, 3]
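The 16 access patterns come from each of the four quad positions (subject, predicate, object, context) being either bound or a variable, so there are 2^4 combinations. A short Python sketch that enumerates them; the function name `access_patterns` is made up for this example, and the enumeration order differs from the numbering on the slide.

```python
from itertools import product


def access_patterns(positions="spoc"):
    """List every bound/unbound combination in the slide's (s:p:o:c) notation."""
    patterns = []
    for mask in product([False, True], repeat=len(positions)):
        parts = [pos if bound else "?" for pos, bound in zip(positions, mask)]
        patterns.append("(" + ":".join(parts) + ")")
    return patterns


patterns = access_patterns()
```

A store that kept one index per pattern would need 16 of them, which motivates the combined indexes on the next slide.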
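  • 27. 2. Quad indexes: B - Combined indexes (each index covers the access patterns whose bound positions form a prefix of its key order; the third index is written "osc" on the slide, but the patterns it covers correspond to the order o, c, s):
      spoc: (?:?:?:?), (s:?:?:?), (s:p:?:?), (s:p:o:?), (s:p:o:c)
      poc:  (?:p:?:?), (?:p:o:?), (?:p:o:c)
      ocs:  (?:?:o:?), (?:?:o:c), (s:?:o:c)
      csp:  (?:?:?:c), (s:?:?:c), (s:p:?:c)
      cp:   (?:p:?:c)
      os:   (s:?:o:?)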
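A rough Python sketch of how six combined indexes can serve all 16 access patterns: pick the index whose leading positions are exactly the bound ones, then scan the keys that share that prefix. Several things here are assumptions for illustration: the third index is spelled `ocs` (object, context, subject), which is the order implied by the patterns listed in its column; a sorted Python list stands in for whatever ordered structure YARS uses; and the quads are made-up integer ids.

```python
from bisect import bisect_left

INDEXES = ["spoc", "poc", "ocs", "csp", "cp", "os"]
POS = {"s": 0, "p": 1, "o": 2, "c": 3}   # position of each part in a quad


def choose_index(bound):
    """Return the first index whose leading positions are exactly `bound`."""
    bound = set(bound)
    for name in INDEXES:
        if set(name[:len(bound)]) == bound:
            return name
    raise ValueError(f"no index covers {sorted(bound)}")


def build(quads):
    """Build each index as a sorted list of (reordered key, full quad)."""
    return {
        name: sorted((tuple(q[POS[ch]] for ch in name), q) for q in quads)
        for name in INDEXES
    }


def lookup(indexes, **bound):
    """Answer an access pattern, e.g. lookup(indexes, o=11, c=1)."""
    name = choose_index(bound)
    prefix = tuple(bound[ch] for ch in name[:len(bound)])
    entries = indexes[name]
    results = []
    # Jump to the first key >= prefix, then read while the prefix matches.
    for key, quad in entries[bisect_left(entries, (prefix,)):]:
        if key[:len(prefix)] != prefix:
            break
        results.append(quad)
    return results


# Made-up quads of object ids in (s, p, o, c) order.
quads = [(3, 7, 11, 1), (3, 8, 14, 1), (14, 7, 11, 2)]
indexes = build(quads)
```

For example, `lookup(indexes, o=11, c=1)` is routed to `ocs` and returns the single quad whose object is 11 and context is 1, without touching the other entries.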
  • 28. 2. Quad indexes: C - Occurrence counts:
      No  Access pattern  Values        Count
      1   (?:?:?:?)       [1, 5, 3]     3
      2   (s:?:?:?)       [2]           1
      3   (s:p:?:?)       [9, 8, 2, 3]  4
      4   (s:p:o:?)       [1, 3]        2
      5   (s:p:o:c)       [1]           1
      6   (?:p:?:?)       [76, 9]       2
      7   (?:p:o:?)       [2, 3]        2
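On the slide, the count is simply the number of values stored under a pattern. One way to keep such counts, sketched in Python, is to count every key prefix while the index is built, so the number of matches for a pattern can be read off without scanning. The function name `prefix_counts` and the sample keys are assumptions for this example; the slides do not describe how YARS stores its counts.

```python
from collections import Counter


def prefix_counts(keys):
    """Count how many index keys share each prefix, including the empty one."""
    counts = Counter()
    for key in keys:
        for n in range(len(key) + 1):
            counts[key[:n]] += 1
    return counts


# Made-up keys of an spoc-ordered index (object ids).
spoc_keys = [(3, 7, 11, 1), (3, 8, 14, 1), (14, 7, 11, 2)]
counts = prefix_counts(spoc_keys)
```

Here `counts[(3,)]` is the number of quads with subject 3, and `counts[()]` is the total number of quads.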
  • 29. YARS: A web application built in Java. It has two parts: a storage component that handles both persistent and in-memory indexes, and a query handler that performs query processing and evaluation.
  • 30. Experiment: They evaluated the performance on a dataset of 2.8 million triples (293 MB). The test server had a Pentium 4 at 2.4 GHz and 4 GB RAM, running Debian Sarge.
  • 31. Experiment: They considered the following RDF stores for evaluation: Sesame; Kowari (failed to get a running version); Redland; Jena2 ([9] shows that Sesame generally outperforms Jena in performance results).
  • 32. Experiment parts: Index construction; Queries.
  • 33. Result - index construction. Table 8: Index size for the synthetic Univ20 dataset.
      System         Index size (bytes)
      Redland        2,164,019,200
      Sesame MySQL   340,381,636
      Sesame native  39,997,992
      YARS           1,090,002,944
  • 34. Result - index construction (figure only).
  • 35. Result - queries:
      No  Query
      1   ?x rdf:type univ:UndergraduateStudent
      2   ?x ?p "UndergraduateStudent0"
      3   <http://www.University965.edu> ?p ?o
      4   ?x univ:worksFor ?y

      Performance results for quad queries (times as min:sec):
      Query  Redland  Sesame MySQL  Sesame Native  YARS
      1      0:10.48  0:18.87       1:05.16        0:18.41
      2      0:44.14  0:00.73       0:00.55        0:00.49
      3      0:44.15  0:00.46       0:00.47        0:00.32
      4      3:04.21  0:03.42       0:01.95        0:00.47
  • 36. Conclusion: The authors introduced query processing for RDF, which is an important issue in the Semantic Web. Compared with the other systems, YARS has some overhead for resolving dependencies and ordering.
  • 37. Criticism: In the experiment, the researchers excluded the Kowari engine because they could not install it.