SlideShare a Scribd company logo
1 of 72
1 
From Search to Predictions in Tagged 
Information Spaces 
Christoph Trattner 
Know-Center 
ctrattner@know-center.at 
@Graz University of Technology, Austria 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
2 
Before start in this presentation I will talk a bit about 
myself, my background… 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
3 
Where do I come from (Austria)? 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
4 
Graz 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
5 
Academic Back-Ground? 
 Studied Computer Science at Graz University of 
Technology & University of Pittsburgh 
 Worked since 2009 as scientific researcher at the KMI & 
IICM (BSc 2008, MSc 2009) 
 My PhD thesis was on the Search & Navigation in Social 
Tagging Systems (defended 2012) 
 Since Feb. 2013 @ Know-Center 
 Leading the Social Computing Area 
 At TUG: 
 WebScience 
 Semantic Technologies 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
6 
My team 
2 Post-Docs, 5 Pre-Docs (2 more to join soon ) 
2 MSc student 
2 BSc student 
DI. Dieter 
Theiler 
DI. Dominik 
Kowald 
Dr. Peter 
Kraker 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
Dr. Elisabeth 
Lex 
Mag. Sebastian 
Dennerlein 
Mag. Matthias 
Rella 
DI. Emanuel 
Lacic 
DI. Ilire Hasani
7 
Thanks to my Collaborators 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
8 
What is my group doing? 
… we research on novel methods and tools that exploit 
social data to generate a greater value for the 
individual, communities, companies and the society as 
whole. 
Our competences: 
• Network & Web Science 
• Science 2.0 
• Predictive Modeling 
• Social Network Analysis 
• Information Quality Assessment 
• User Modeling 
• Machine Learning and Data Mining 
• Collaborative Systems 
Our Services: 
• Social Analytics: Hub-, Expert -, Community - 
, Influencer -, Information Flow-, Trend 
(Event) Detection, etc. 
• Information Quality Assessment 
• Social & Location-based Recommander 
Systems 
• Customer Segmentation 
• Social Systems Design 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
9 
Some industry partners... 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
10 
Current projects 
BlancNoir - “Towards a Big Data recommender engine for offline 
and online marketplaces” 
I2F - “Towards a Social Media and Online Marketing Manager 
Seminar” 
Automation-X - “Towards a scalable Graph-based Visual search 
solution” 
Styria - “Towards a scalable crowd-based hierarchical cluster 
labeling approach for willhaben.at” 
TripRebel - “Towards an engaging hybrid hotel recommender 
solution for triprebel.com” 
CDS - “Towards a scalable Entity & Graph-based Visual search 
solution for cds.at” 
Exthex - “Towards an efficient viral social media marketing 
champagne in Facebook and Twitter” 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
11 
The Projects 
Project 1: Mendeley – UK Startup (recently acquired by Elsevier): 
Interested in the problem of hirarchical concept-based search in 
tagged information spaces. 
Project 2: Tallinn University– Interested in the problem of 
recommending tags and items in tagged information spaces. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
12 
Ok, let’s start…. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
13 
Project 1 
Mendeley – UK Startup (recently acquired by Elsevier): 
Interested in the problem of hierarchical concept-based 
search. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
14 
Research Question 1: 
What kind of meta-data is more useful for search in 
information systems - tags or keywords? 
Externals involved: 
• Mendeley, London, UK 
Helic, D., Körner, C., Granitzer, M., Strohmaier, M. and Trattner, C. 2012. Navigational Efficiency of Broad vs. 
Narrow Folksonomies. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT 
2012), ACM, New York, NY, USA, pp. 63-72. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
15 
Mendeley 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
16 
 We 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
Tags 
Keywords 
Mendeley Desktop
17 
Task 
What is the best way to extract hirarchies from tagged 
information spaces? What is more useful for navigation – 
keyword or tag hierarchies? 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
18 
Different types of hierarchy induction 
algorithms 
Helic, D., Strohmaier, M., Trattner, C., Muhr M. and Lermann, K.: Pragmatic Evaluation of Folksonomies, In 
Proceedings of the 20th international conference on World Wide Web (WWW 2011), ACM, New York, NY, USA, 
417-426, 2011. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
19 
Issue (!!!) 
...no literature on what type of hierarchy is best suited 
for searching... 
D. J. Watts, P. S. Dodds, and M. E. J. Newman. Identity and 
search in social networks. Science, 296:1302–1305, 2002. 
J. M. Kleinberg. Navigation in a small world. Nature, 
406(6798):845, August 2000. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
20 
Stanley Milgram 
 A social psychologist 
 Yale and Harvard University 
 Study on the Small World Problem, 
beyond well defined communities 
and relations 
(such as actors, scientists, …) 
 „An Experimental Study of the Small World Problem” 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
1933-1984
21 
Set Up 
 Target person: 
 A Boston stockbroker 
 Three starting populations 
Nebraska 
random 
 100 “Nebraska stockholders” 
 96 “Nebraska Nebraska 
random” 
 100 “Boston stockholders 
random” 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
Target 
Boston 
stockbroker 
Boston 
random
22 
Results 
 How many of the starters would be able to establish 
contact with the target? 
 64 out of 296 reached the target 
 How many intermediaries would be required to link 
starters with the target? 
 Well, that depends: the overall mean 5.2 links 
 Through hometown: 6.1 links 
 Through business: 4.6 links 
 Boston group faster than Nebraska groups 
 Nebraska stockholders not faster than Nebraska random 
 What form would the distribution of chain lengths 
take? 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
23 
Hierarchical decentralized searcher 
Information 
Network 
Hierarchy 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
24 
Results 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
25 
Validation 
 We compared simulations with 
human click trails of the online Game – 
The Wiki Game (http://thewikigame.com/) 
 Contains 1,500,000 
click trails of more 
than 500,000 users with 
(start; target) information. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
Wikipedia Category Label Dataset: 
2,300,000 category labels, 
4,500,000 articles, 30,000,000 category 
label assignments 
26 
Hierachy Creation (1) 
Two types of hierarchies were evaluated 
1.) First type is based on our previous work 
 Categorial Concepts: 
 Tags from Delicious 
 Category labels from Wikipedia 
Similarity Graph 
Delicious Tag Dataset: 
440,000 tags, 580,000 articles and 
3,400,000 tag assignments 
Latent Hierarchical Taxonomy 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
27 
Hierarchy Creation (2) 
2.) Second type is based on the work of [Muchnik et al. 2007] 
Simple idea: Algorithm iterates through all 
links in the network and decides if that link is 
of a hierarchical type, in which case it 
remains in the network otherwise it is 
removed. 
Directed link-network dataset of the 
English-Wikipedia from February 
2012. 
All in all, the dataset includes 
around 10,000,000 articles and 
around 250,000,000 links 
Muchnik, L., Itzhack, R., Solomon S. and Louzoun Y.: Self-emergence of knowledge trees: Extraction 
of the Wikipedia hierarchies, PHYSICAL REVIEW E 76, 016106 (2007) 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
28 
Validation 
Human Searchers 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
29 
...ok let‘s come back to the Mendeley „problem“... 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
30 
Are keyword hierarchies better for search 
than social tag hierarchies? 
Results: 
With simulations we find that tag-based 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
Tags 
Keywords 
Results: Our Greedy Navigator (= Simulator) needs on average 1-click 
more with keywords to reach the target node than with tags 
hierarchies are more efficient 
for navigation than keywords
31 
...ok let‘s move on to some prediction stuff  
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
32 
Project 2 
Tallinn University – Interested in the problem of 
recommending items and tags to users in social 
tagging systems. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
33 
Research Question 2: 
To what extent is human cognition theory applicable to 
the problem of predicting tags and items to users? 
Externals involved: 
• PUC - Chile, UFCG – Brazil 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
34 
Motivation 
 They help you to classify Web content better [Zubiaga 2012] 
 They help people to navigate large knowledge repositories better 
[Helic et al. 2012] 
 They help people to search for information faster [Trattner et al. 2012] 
However, there is an issue with social tags… 
People are typically lazy to apply social tags(!!) 
Zubiaga, A. (2012). Harnessing Folksonomies for Resource Classification. arXiv preprint arXiv:1204.6521. 
Trattner, C., Lin, Y. L., Parra, D., Yue, Z., Real, W., & Brusilovsky, P. (2012, June). Evaluating tag-based information 
access in image collections. In Proceedings of the 23rd ACM conference on Hypertext and social media (pp. 113- 
122). ACM. 
Helic, D., Körner, C., Granitzer, M., Strohmaier, M., & Trattner, C. (2012, June). Navigational efficiency of broad vs. 
narrow folksonomies. In Proceedings of the 23rd ACM conference on Hypertext and social media (pp. 63-72). ACM. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
35 
Motivation 
To overcome that issue some smart people started to invent mechanisms that 
should help the user in applying tags, known as social tag recommender 
system based on: 
 Collaborative Filtering 
 User based- and item-based CF [Marinho et al. 2008] 
 Matrix Factorization 
 FM, PITF [Rendle et al. 2010, 2011, 2012] 
 Graph Structures 
 Adapted PageRank and FolkRank [Hotho et al. 2006] 
 Topic Models 
 Latent Dirichlet Allocation (LDA) [Krestel et al. 2009, 2010, 2011] 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
36 
Why do we need cognitive models? 
First answer: We do not like data data driven approaches… 
Me: OK 
Second answer: We can understand things better… 
…why is something happening and how… 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
37 
MINERVA2 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
38 
Approach 
 Based on a Human cognition (derived from MINERVA2 [Kruschke et al., 1992]) 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
39 
Evaluation 
 Wikipedia 
 p-core pruning (p = 14) 
 To finally measure to performance of our approach we split up our dataset in two 
sub-sets 80% for training and 20% for testing Training 
 Precision, Recall, F1-score, MRR, MAP 
 As Baseline algorithm we have chosen Latent Dirichlet Allocation (LDA) 
[Krestel et al. 2009] 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
40 
Results 
Results: 
3Layers reaches higher levels of 
estimate than the pure LDA 
approach. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
41 
ACT-R 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
42 
ACT-R 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
43 
Interestingly, when looking into the literatur of tagging 
systems - temporal processes are typically modeled 
with an exponential function... 
D. Yin, L. Hong, and B. D. Davison. Exploiting session-like behaviors in tag prediction. In 
Proceedings of the 20th international conference companion on World wide web, pages 
167–168. ACM, 2011. 
L. Zhang, J. Tang, and M. Zhang. Integrating temporal usage pattern into personalized tag 
prediction. In Web Technologies and Applications, pages 354–365. Springer, 2012 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
 Linear distribution with log-scale 
44 
Empirical Analysis: BibSonomy (1) 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
44 
on Y-axis  
exponential function 
 Linear distribution with log-scale 
on X- and Y-axes  
power function
45 
Empirical Analysis: BibSonomy (2) 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
45 
Exponential distribution 
R² = 31% 
Power distribution 
R² = 89%
46 
Results: 
Decay factor is better modeled as 
power-function rather than an ex-function 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
47 
Experiment 1: Predicting re-use of tags 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
48 
Results: Predicting re-use of tags 
BLLAC 
BLL 
MPU 
GIRP 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
49 
Results: Recall / Precision 
Results: 
BLLAC performs fairly well in 
predicting the re-use of tags 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
50 
Experiment 2: Recommending Tags 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
51 
Results: Recall-Precision plots 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
51 
 The time-depended 
approaches outperform the 
state-of-the-art 
 BLL+MPr reaches the 
highest level of accuracy 
CiteULike
BLL approaches outperform current 
state-of-the-art tag recommender 
approaches. 
52 
Results: Recall  Precision 
Results: 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
53 
...how about runtime? 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
54 
Results: Runtime 
BLL+C needs only around 1s to generate tag-recommendations 
for 5,500 users in BibSonomy 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
55 
Results: Runtime 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
56 
...predicting (re-ranking) items with ACT-R 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
57 
Our Approach 
= CIRTT  2 main steps 
First step: 
– User-based Collaborative Filtering (CF) to get 
candidate items of similar users 
Second step: 
– Item-based CF to rank these candidate items using 
the BLL equation to integrate tag and time 
information: 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
57
IR metrics: nDCG@20, MAP@20, Recall@20, Diversity and 
58 
How does it perform? 
3 freely-available folksonomy datasets 
– BibSonomy (~ 340,000 tag assignments) 
– CiteULike (~ 100.000 tag assignments) 
– MovieLens (~ 100.000 tag assignments) 
Original datasets (no p-core pruning) Doerfel et al. (2013) 
80/20 split (for each user 20% most recent bookmarks/posts 
in test-set, rest in training-set) 
User Coverage 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
58
59 
Baseline Methods 
• Most Popular (MP) 
• User-based Collaborative Filtering (CF) 
• Two alternative approaches based on tag and time 
information 
– Zheng et al. (2011)  exponential function 
– Huang et al. (2014)  linear function 
(remember: our CIRTT uses a power function) 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
59
60 
Results: nDCG plots 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
60 
CIRTT reaches the highest level of accuracy
61 
Results: Recall plots 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 
61 
CIRTT reaches the highest level of accuracy
CIRTT works quite well compared to 
the current state-of-the-art in tag-based 
62 
Results 
Results: 
item recommender systems 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
63 
What are we... 
...currently working on... 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
64 
MINERVA2 + ACT-R 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
65 
Time in Semantic vs. Lexical Memory 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
66 
Topical vs. Lexical shift in time 
Topics 
Tags 
Results: 
Topical shift in time is less 
pronounced than lexical shift 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
67 
Results: Recall / Precision 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
68 
Describer vs. Categorizer 
M. Strohmaier, C. Koerner, and R. Kern. Understanding why users tag: A survey of tagging motivation 
literature and results from an empirical study. Journal of Web Semantics, 17:1–11, 2012. 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
69 
Results: Categorizer vs. Describer 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
70 
... ok that‘s basically it  
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
71 
Code and Framework 
https://github.com/learning-layers/TagRec/ 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
72 
Thank you! 
Christoph Trattner 
Email: trattner.christoph@gmail.com 
Web: christophtrattner.info 
Twitter: @ctrattner 
Sponsors: 
. Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona

More Related Content

What's hot

Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Vrije Universiteit Amsterdam
 
WAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshopWAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshopSara Day Thomson
 
SSSW 2013 - Feeding Recommender Systems with Linked Open Data
SSSW 2013 - Feeding Recommender Systems with Linked Open DataSSSW 2013 - Feeding Recommender Systems with Linked Open Data
SSSW 2013 - Feeding Recommender Systems with Linked Open DataPolytechnic University of Bari
 
Deep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profilesDeep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profilesTraian Rebedea
 
Interpreting Data Mining Results with Linked Data for Learning Analytics
Interpreting Data Mining Results with Linked Data for Learning AnalyticsInterpreting Data Mining Results with Linked Data for Learning Analytics
Interpreting Data Mining Results with Linked Data for Learning AnalyticsMathieu d'Aquin
 
Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...
Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...
Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...Polytechnic University of Bari
 
Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignJonathan Stray
 
Frontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text AnalysisFrontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text AnalysisJonathan Stray
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Jonathan Stray
 
Loops of humans and bots in Wikidata
Loops of humans and bots in WikidataLoops of humans and bots in Wikidata
Loops of humans and bots in WikidataElena Simperl
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the WebRinke Hoekstra
 
Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?Rich Heimann
 
Predicting Communication Intention in Social Media
Predicting Communication Intention in Social MediaPredicting Communication Intention in Social Media
Predicting Communication Intention in Social MediaCharalampos Chelmis
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsPaul Groth
 
Data Science Workshop - day 2
Data Science Workshop - day 2Data Science Workshop - day 2
Data Science Workshop - day 2Aseel Addawood
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platformsYannis Charalabidis
 
Content + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learningContent + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learningPaul Groth
 
Looking for Experts? What can Linked Data do for you?
Looking for Experts? What can Linked Data do for you?Looking for Experts? What can Linked Data do for you?
Looking for Experts? What can Linked Data do for you?Hypios Research
 
The many ways of research in semantic technologies
The many ways of research in semantic technologiesThe many ways of research in semantic technologies
The many ways of research in semantic technologiesMathieu d'Aquin
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and TextNBER
 

What's hot (20)

Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
Using Linked Data Traversal to Label Academic Communities - SAVE-SD2015
 
WAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshopWAPWG Jan 2020 Sloan cosmos workshop
WAPWG Jan 2020 Sloan cosmos workshop
 
SSSW 2013 - Feeding Recommender Systems with Linked Open Data
SSSW 2013 - Feeding Recommender Systems with Linked Open DataSSSW 2013 - Feeding Recommender Systems with Linked Open Data
SSSW 2013 - Feeding Recommender Systems with Linked Open Data
 
Deep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profilesDeep neural networks for matching online social networking profiles
Deep neural networks for matching online social networking profiles
 
Interpreting Data Mining Results with Linked Data for Learning Analytics
Interpreting Data Mining Results with Linked Data for Learning AnalyticsInterpreting Data Mining Results with Linked Data for Learning Analytics
Interpreting Data Mining Results with Linked Data for Learning Analytics
 
Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...
Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...
Tutorial - Recommender systems meet linked open data - ICWE 2016 - Lugano - 0...
 
Frontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter DesignFrontiers of Computational Journalism week 3 - Information Filter Design
Frontiers of Computational Journalism week 3 - Information Filter Design
 
Frontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text AnalysisFrontiers of Computational Journalism week 2 - Text Analysis
Frontiers of Computational Journalism week 2 - Text Analysis
 
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
Loops of humans and bots in Wikidata
Loops of humans and bots in WikidataLoops of humans and bots in Wikidata
Loops of humans and bots in Wikidata
 
Knowledge Representation on the Web
Knowledge Representation on the WebKnowledge Representation on the Web
Knowledge Representation on the Web
 
Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?Why L-3 Data Tactics Data Science?
Why L-3 Data Tactics Data Science?
 
Predicting Communication Intention in Social Media
Predicting Communication Intention in Social MediaPredicting Communication Intention in Social Media
Predicting Communication Intention in Social Media
 
From Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge GraphsFrom Text to Data to the World: The Future of Knowledge Graphs
From Text to Data to the World: The Future of Knowledge Graphs
 
Data Science Workshop - day 2
Data Science Workshop - day 2Data Science Workshop - day 2
Data Science Workshop - day 2
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platforms
 
Content + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learningContent + Signals: The value of the entire data estate for machine learning
Content + Signals: The value of the entire data estate for machine learning
 
Looking for Experts? What can Linked Data do for you?
Looking for Experts? What can Linked Data do for you?Looking for Experts? What can Linked Data do for you?
Looking for Experts? What can Linked Data do for you?
 
The many ways of research in semantic technologies
The many ways of research in semantic technologiesThe many ways of research in semantic technologies
The many ways of research in semantic technologies
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
 

Viewers also liked

Crowd-Based Evaluation Methods
Crowd-Based Evaluation MethodsCrowd-Based Evaluation Methods
Crowd-Based Evaluation MethodsChristoph Trattner
 
On the Navigability of Social Tagging Systems
On the Navigability of Social Tagging SystemsOn the Navigability of Social Tagging Systems
On the Navigability of Social Tagging SystemsChristoph Trattner
 
Enhancing the Navigability of Social Tagging Systems with Tag Taxonomies
Enhancing the Navigability of Social Tagging Systems with Tag TaxonomiesEnhancing the Navigability of Social Tagging Systems with Tag Taxonomies
Enhancing the Navigability of Social Tagging Systems with Tag TaxonomiesChristoph Trattner
 
Towards a Big Data Recommender Engine for Online and Offline Marketplaces
Towards a Big Data Recommender Engine for Online and Offline MarketplacesTowards a Big Data Recommender Engine for Online and Offline Marketplaces
Towards a Big Data Recommender Engine for Online and Offline MarketplacesChristoph Trattner
 
Evaluating Tag-Based Information Access in Image Collections
Evaluating Tag-Based Information Access in Image CollectionsEvaluating Tag-Based Information Access in Image Collections
Evaluating Tag-Based Information Access in Image CollectionsChristoph Trattner
 
Networks Navigability: Theory and Applications
Networks Navigability: Theory and ApplicationsNetworks Navigability: Theory and Applications
Networks Navigability: Theory and ApplicationsChristoph Trattner
 
On the Utility of Tags for Search and Navigation in Online Information Systems
On the Utility of Tags for Search and Navigation in Online Information SystemsOn the Utility of Tags for Search and Navigation in Online Information Systems
On the Utility of Tags for Search and Navigation in Online Information SystemsChristoph Trattner
 
Pragmatic Evaluation of Concept Hierarchies
Pragmatic Evaluation of Concept HierarchiesPragmatic Evaluation of Concept Hierarchies
Pragmatic Evaluation of Concept HierarchiesChristoph Trattner
 
Studying Online Food Consumption and Production Patterns: Recent Trends and C...
Studying Online Food Consumption and Production Patterns: Recent Trends and C...Studying Online Food Consumption and Production Patterns: Recent Trends and C...
Studying Online Food Consumption and Production Patterns: Recent Trends and C...Christoph Trattner
 
Understanding the Impact of Weather for POI Recommendations
Understanding the Impact of Weather for POI RecommendationsUnderstanding the Impact of Weather for POI Recommendations
Understanding the Impact of Weather for POI RecommendationsChristoph Trattner
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Denis Parra Santander
 
Why Social Network Research is good for you!
Why Social Network Research is good for you!Why Social Network Research is good for you!
Why Social Network Research is good for you!Christoph Trattner
 

Viewers also liked (13)

Crowd-Based Evaluation Methods
Crowd-Based Evaluation MethodsCrowd-Based Evaluation Methods
Crowd-Based Evaluation Methods
 
On the Navigability of Social Tagging Systems
On the Navigability of Social Tagging SystemsOn the Navigability of Social Tagging Systems
On the Navigability of Social Tagging Systems
 
Enhancing the Navigability of Social Tagging Systems with Tag Taxonomies
Enhancing the Navigability of Social Tagging Systems with Tag TaxonomiesEnhancing the Navigability of Social Tagging Systems with Tag Taxonomies
Enhancing the Navigability of Social Tagging Systems with Tag Taxonomies
 
Towards a Big Data Recommender Engine for Online and Offline Marketplaces
Towards a Big Data Recommender Engine for Online and Offline MarketplacesTowards a Big Data Recommender Engine for Online and Offline Marketplaces
Towards a Big Data Recommender Engine for Online and Offline Marketplaces
 
Evaluating Tag-Based Information Access in Image Collections
Evaluating Tag-Based Information Access in Image CollectionsEvaluating Tag-Based Information Access in Image Collections
Evaluating Tag-Based Information Access in Image Collections
 
Networks Navigability: Theory and Applications
Networks Navigability: Theory and ApplicationsNetworks Navigability: Theory and Applications
Networks Navigability: Theory and Applications
 
On the Utility of Tags for Search and Navigation in Online Information Systems
On the Utility of Tags for Search and Navigation in Online Information SystemsOn the Utility of Tags for Search and Navigation in Online Information Systems
On the Utility of Tags for Search and Navigation in Online Information Systems
 
Pragmatic Evaluation of Concept Hierarchies
Pragmatic Evaluation of Concept HierarchiesPragmatic Evaluation of Concept Hierarchies
Pragmatic Evaluation of Concept Hierarchies
 
The Smart Pharma Wiki
The Smart Pharma WikiThe Smart Pharma Wiki
The Smart Pharma Wiki
 
Studying Online Food Consumption and Production Patterns: Recent Trends and C...
Studying Online Food Consumption and Production Patterns: Recent Trends and C...Studying Online Food Consumption and Production Patterns: Recent Trends and C...
Studying Online Food Consumption and Production Patterns: Recent Trends and C...
 
Understanding the Impact of Weather for POI Recommendations
Understanding the Impact of Weather for POI RecommendationsUnderstanding the Impact of Weather for POI Recommendations
Understanding the Impact of Weather for POI Recommendations
 
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
 
Why Social Network Research is good for you!
Why Social Network Research is good for you!Why Social Network Research is good for you!
Why Social Network Research is good for you!
 

Similar to From Search to Predictions in Tagged Information Spaces

Applying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainApplying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainAngelo Salatino
 
Tags, Networks, Narrative
Tags, Networks, NarrativeTags, Networks, Narrative
Tags, Networks, NarrativeBruce Mason
 
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...University of Groningen (The Netherlands)
 
Twitterdatabase for academics proposal
Twitterdatabase for academics proposalTwitterdatabase for academics proposal
Twitterdatabase for academics proposalMattias Östmar
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-ResearchEric Meyer
 
The Case for Applied Digital Humanities in Scholarly Communications
The Case for Applied Digital Humanities in Scholarly CommunicationsThe Case for Applied Digital Humanities in Scholarly Communications
The Case for Applied Digital Humanities in Scholarly CommunicationsAlex Humphreys
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search petermurrayrust
 
Anatomy of Social Networks, a guide for social media strategists
Anatomy of Social Networks, a guide for social media strategistsAnatomy of Social Networks, a guide for social media strategists
Anatomy of Social Networks, a guide for social media strategistsPaolo Nesi
 
Brain juicer digividuals_research_robots_paper
Brain juicer digividuals_research_robots_paperBrain juicer digividuals_research_robots_paper
Brain juicer digividuals_research_robots_paperPhilter Phactory
 
Cartography of Controversies about MOOCs
Cartography of Controversies about MOOCsCartography of Controversies about MOOCs
Cartography of Controversies about MOOCsKhalid Md Saifuddin
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTAMartin Wynne
 
2009 10 01 Rapid Combinatorial Innovation V Acf 1
2009 10 01 Rapid Combinatorial Innovation  V Acf 12009 10 01 Rapid Combinatorial Innovation  V Acf 1
2009 10 01 Rapid Combinatorial Innovation V Acf 1Augustine Fou
 
I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...
I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...
I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...fridolin.wild
 
Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams Amparo Elizabeth Cano Basave
 
Insights From Social Media
Insights From Social MediaInsights From Social Media
Insights From Social MediaDr Wasim Ahmed
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Stella Wisdom
 
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Jean Burgess
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network AnalysisMarc Smith
 
Gatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the Evidence
Gatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the EvidenceGatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the Evidence
Gatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the EvidenceAxel Bruns
 

Similar to From Search to Predictions in Tagged Information Spaces (20)

Applying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domainApplying machine learning techniques to big data in the scholarly domain
Applying machine learning techniques to big data in the scholarly domain
 
Tags, Networks, Narrative
Tags, Networks, NarrativeTags, Networks, Narrative
Tags, Networks, Narrative
 
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
Automated Analysis of Journalists' and Politicians' Online Behavior on Social...
 
Twitterdatabase for academics proposal
Twitterdatabase for academics proposalTwitterdatabase for academics proposal
Twitterdatabase for academics proposal
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-Research
 
The Case for Applied Digital Humanities in Scholarly Communications
The Case for Applied Digital Humanities in Scholarly CommunicationsThe Case for Applied Digital Humanities in Scholarly Communications
The Case for Applied Digital Humanities in Scholarly Communications
 
Who are you and makes you special?
Who are you and makes you special?Who are you and makes you special?
Who are you and makes you special?
 
Rapid biomedical search
Rapid biomedical search Rapid biomedical search
Rapid biomedical search
 
Anatomy of Social Networks, a guide for social media strategists
Anatomy of Social Networks, a guide for social media strategistsAnatomy of Social Networks, a guide for social media strategists
Anatomy of Social Networks, a guide for social media strategists
 
Brain juicer digividuals_research_robots_paper
Brain juicer digividuals_research_robots_paperBrain juicer digividuals_research_robots_paper
Brain juicer digividuals_research_robots_paper
 
Cartography of Controversies about MOOCs
Cartography of Controversies about MOOCsCartography of Controversies about MOOCs
Cartography of Controversies about MOOCs
 
Forty Years of the OTA
Forty Years of the OTAForty Years of the OTA
Forty Years of the OTA
 
2009 10 01 Rapid Combinatorial Innovation V Acf 1
2009 10 01 Rapid Combinatorial Innovation  V Acf 12009 10 01 Rapid Combinatorial Innovation  V Acf 1
2009 10 01 Rapid Combinatorial Innovation V Acf 1
 
I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...
I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...
I-Know'11: Track: Recommendation, Data Sharing, and Research Practices in Sci...
 
Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams Entity-Based Semantics Emerging from Personal Awareness Streams
Entity-Based Semantics Emerging from Personal Awareness Streams
 
Insights From Social Media
Insights From Social MediaInsights From Social Media
Insights From Social Media
 
Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods Digital research: Collections, data, tools and methods
Digital research: Collections, data, tools and methods
 
Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...Tracing Publics in the Australian Blogosphere: New Methods for International ...
Tracing Publics in the Australian Blogosphere: New Methods for International ...
 
2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis2010 Catalyst Conference - Trends in Social Network Analysis
2010 Catalyst Conference - Trends in Social Network Analysis
 
Gatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the Evidence
Gatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the EvidenceGatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the Evidence
Gatewatching 11: Echo Chambers? Filter Bubbles? Reviewing the Evidence
 

Recently uploaded

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptxSherlyMaeNeri
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parentsnavabharathschool99
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxnelietumpap1
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 

Recently uploaded (20)

Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptxYOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
YOUVE GOT EMAIL_FINALS_EL_DORADO_2024.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Judging the Relevance and worth of ideas part 2.pptx
Judging the Relevance  and worth of ideas part 2.pptxJudging the Relevance  and worth of ideas part 2.pptx
Judging the Relevance and worth of ideas part 2.pptx
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Choosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for ParentsChoosing the Right CBSE School A Comprehensive Guide for Parents
Choosing the Right CBSE School A Comprehensive Guide for Parents
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Q4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptxQ4 English4 Week3 PPT Melcnmg-based.pptx
Q4 English4 Week3 PPT Melcnmg-based.pptx
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 

From Search to Predictions in Tagged Information Spaces

  • 1. 1 From Search to Predictions in Tagged Information Spaces Christoph Trattner Know-Center ctrattner@know-center.at @Graz University of Technology, Austria . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 2. 2 Before start in this presentation I will talk a bit about myself, my background… . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 3. 3 Where do I come from (Austria)? . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 4. 4 Graz . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 5. 5 Academic Back-Ground?  Studied Computer Science at Graz University of Technology & University of Pittsburgh  Worked since 2009 as scientific researcher at the KMI & IICM (BSc 2008, MSc 2009)  My PhD thesis was on the Search & Navigation in Social Tagging Systems (defended 2012)  Since Feb. 2013 @ Know-Center  Leading the Social Computing Area  At TUG:  WebScience  Semantic Technologies . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 6. 6 My team 2 Post-Docs, 5 Pre-Docs (2 more to join soon ) 2 MSc student 2 BSc student DI. Dieter Theiler DI. Dominik Kowald Dr. Peter Kraker . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona Dr. Elisabeth Lex Mag. Sebastian Dennerlein Mag. Matthias Rella DI. Emanuel Lacic DI. Ilire Hasani
  • 7. 7 Thanks to my Collaborators . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 8. 8 What is my group doing? … we research on novel methods and tools that exploit social data to generate a greater value for the individual, communities, companies and the society as whole. Our competences: • Network & Web Science • Science 2.0 • Predictive Modeling • Social Network Analysis • Information Quality Assessment • User Modeling • Machine Learning and Data Mining • Collaborative Systems Our Services: • Social Analytics: Hub-, Expert -, Community - , Influencer -, Information Flow-, Trend (Event) Detection, etc. • Information Quality Assessment • Social & Location-based Recommander Systems • Customer Segmentation • Social Systems Design . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 9. 9 Some industry partners... . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 10. 10 Current projects BlancNoir - “Towards a Big Data recommender engine for offline and online marketplaces” I2F - “Towards a Social Media and Online Marketing Manager Seminar” Automation-X - “Towards a scalable Graph-based Visual search solution” Styria - “Towards a scalable crowd-based hierarchical cluster labeling approach for willhaben.at” TripRebel - “Towards an engaging hybrid hotel recommender solution for triprebel.com” CDS - “Towards a scalable Entity & Graph-based Visual search solution for cds.at” Exthex - “Towards an efficient viral social media marketing champagne in Facebook and Twitter” . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 11. 11 The Projects Project 1: Mendeley – UK Startup (recently acquired by Elsevier): Interested in the problem of hirarchical concept-based search in tagged information spaces. Project 2: Tallinn University– Interested in the problem of recommending tags and items in tagged information spaces. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 12. 12 Ok, let’s start…. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 13. 13 Project 1 Mendeley – UK Startup (recently acquired by Elsevier): Interested in the problem of hierarchical concept-based search. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 14. 14 Research Question 1: What kind of meta-data is more useful for search in information systems - tags or keywords? Externals involved: • Mendeley, London, UK Helic, D., Körner, C., Granitzer, M., Strohmaier, M. and Trattner, C. 2012. Navigational Efficiency of Broad vs. Narrow Folksonomies. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT 2012), ACM, New York, NY, USA, pp. 63-72. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 15. 15 Mendeley . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 16. 16  We . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona Tags Keywords Mendeley Desktop
  • 17. 17 Task What is the best way to extract hirarchies from tagged information spaces? What is more useful for navigation – keyword or tag hierarchies? . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 18. 18 Different types of hierarchy induction algorithms Helic, D., Strohmaier, M., Trattner, C., Muhr M. and Lermann, K.: Pragmatic Evaluation of Folksonomies, In Proceedings of the 20th international conference on World Wide Web (WWW 2011), ACM, New York, NY, USA, 417-426, 2011. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 19. 19 Issue (!!!) ...no literature on what type of hierarchy is best suited for searching... D. J. Watts, P. S. Dodds, and M. E. J. Newman. Identity and search in social networks. Science, 296:1302–1305, 2002. J. M. Kleinberg. Navigation in a small world. Nature, 406(6798):845, August 2000. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 20. 20 Stanley Milgram  A social psychologist  Yale and Harvard University  Study on the Small World Problem, beyond well defined communities and relations (such as actors, scientists, …)  „An Experimental Study of the Small World Problem” . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 1933-1984
  • 21. 21 Set Up  Target person:  A Boston stockbroker  Three starting populations Nebraska random  100 “Nebraska stockholders”  96 “Nebraska Nebraska random”  100 “Boston stockholders random” . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona Target Boston stockbroker Boston random
  • 22. 22 Results  How many of the starters would be able to establish contact with the target?  64 out of 296 reached the target  How many intermediaries would be required to link starters with the target?  Well, that depends: the overall mean 5.2 links  Through hometown: 6.1 links  Through business: 4.6 links  Boston group faster than Nebraska groups  Nebraska stockholders not faster than Nebraska random  What form would the distribution of chain lengths take? . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 23. 23 Hierarchical decentralized searcher Information Network Hierarchy . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 24. 24 Results . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 25. 25 Validation  We compared simulations with human click trails of the online Game – The Wiki Game (http://thewikigame.com/)  Contains 1,500,000 click trails of more than 500,000 users with (start; target) information. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 26. Wikipedia Category Label Dataset: 2,300,000 category labels, 4,500,000 articles, 30,000,000 category label assignments 26 Hierachy Creation (1) Two types of hierarchies were evaluated 1.) First type is based on our previous work  Categorial Concepts:  Tags from Delicious  Category labels from Wikipedia Similarity Graph Delicious Tag Dataset: 440,000 tags, 580,000 articles and 3,400,000 tag assignments Latent Hierarchical Taxonomy . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 27. 27 Hierarchy Creation (2) 2.) Second type is based on the work of [Muchnik et al. 2007] Simple idea: Algorithm iterates through all links in the network and decides if that link is of a hierarchical type, in which case it remains in the network otherwise it is removed. Directed link-network dataset of the English-Wikipedia from February 2012. All in all, the dataset includes around 10,000,000 articles and around 250,000,000 links Muchnik, L., Itzhack, R., Solomon S. and Louzoun Y.: Self-emergence of knowledge trees: Extraction of the Wikipedia hierarchies, PHYSICAL REVIEW E 76, 016106 (2007) . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 28. 28 Validation Human Searchers . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 29. 29 ...ok let‘s come back to the Mendeley „problem“... . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 30. 30 Are keyword hierarchies better for search than social tag hierarchies? Results: With simulations we find that tag-based . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona Tags Keywords Results: Our Greedy Navigator (= Simulator) needs on average 1-click more with keywords to reach the target node than with tags hierarchies are more efficient for navigation than keywords
  • 31. 31 ...ok let‘s move on to some prediction stuff  . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 32. 32 Project 2 Tallinn University – Interested in the problem of recommending items and tags to users in social tagging systems. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 33. 33 Research Question 2: To what extent is human cognition theory applicable to the problem of predicting tags and items to users? Externals involved: • PUC - Chile, UFCG – Brazil . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 34. 34 Motivation  They help you to classify Web content better [Zubiaga 2012]  They help people to navigate large knowledge repositories better [Helic et al. 2012]  They help people to search for information faster [Trattner et al. 2012] However, there is an issue with social tags… People are typically lazy to apply social tags(!!) Zubiaga, A. (2012). Harnessing Folksonomies for Resource Classification. arXiv preprint arXiv:1204.6521. Trattner, C., Lin, Y. L., Parra, D., Yue, Z., Real, W., & Brusilovsky, P. (2012, June). Evaluating tag-based information access in image collections. In Proceedings of the 23rd ACM conference on Hypertext and social media (pp. 113- 122). ACM. Helic, D., Körner, C., Granitzer, M., Strohmaier, M., & Trattner, C. (2012, June). Navigational efficiency of broad vs. narrow folksonomies. In Proceedings of the 23rd ACM conference on Hypertext and social media (pp. 63-72). ACM. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 35. 35 Motivation To overcome that issue some smart people started to invent mechanisms that should help the user in applying tags, known as social tag recommender system based on:  Collaborative Filtering  User based- and item-based CF [Marinho et al. 2008]  Matrix Factorization  FM, PITF [Rendle et al. 2010, 2011, 2012]  Graph Structures  Adapted PageRank and FolkRank [Hotho et al. 2006]  Topic Models  Latent Dirichlet Allocation (LDA) [Krestel et al. 2009, 2010, 2011] . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 36. 36 Why do we need cognitive models? First answer: We do not like data data driven approaches… Me: OK Second answer: We can understand things better… …why is something happening and how… . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 37. 37 MINERVA2 . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 38. 38 Approach  Based on a Human cognition (derived from MINERVA2 [Kruschke et al., 1992]) . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 39. 39 Evaluation  Wikipedia  p-core pruning (p = 14)  To finally measure to performance of our approach we split up our dataset in two sub-sets 80% for training and 20% for testing Training  Precision, Recall, F1-score, MRR, MAP  As Baseline algorithm we have chosen Latent Dirichlet Allocation (LDA) [Krestel et al. 2009] . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 40. 40 Results Results: 3Layers reaches higher levels of estimate than the pure LDA approach. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 41. 41 ACT-R . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 42. 42 ACT-R . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 43. 43 Interestingly, when looking into the literatur of tagging systems - temporal processes are typically modeled with an exponential function... D. Yin, L. Hong, and B. D. Davison. Exploiting session-like behaviors in tag prediction. In Proceedings of the 20th international conference companion on World wide web, pages 167–168. ACM, 2011. L. Zhang, J. Tang, and M. Zhang. Integrating temporal usage pattern into personalized tag prediction. In Web Technologies and Applications, pages 354–365. Springer, 2012 . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 44.  Linear distribution with log-scale 44 Empirical Analysis: BibSonomy (1) . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 44 on Y-axis  exponential function  Linear distribution with log-scale on X- and Y-axes  power function
  • 45. 45 Empirical Analysis: BibSonomy (2) . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 45 Exponential distribution R² = 31% Power distribution R² = 89%
  • 46. 46 Results: Decay factor is better modeled as power-function rather than an ex-function . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 47. 47 Experiment 1: Predicting re-use of tags . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 48. 48 Results: Predicting re-use of tags BLLAC BLL MPU GIRP . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 49. 49 Results: Recall / Precision Results: BLLAC performs fairly well in predicting the re-use of tags . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 50. 50 Experiment 2: Recommending Tags . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 51. 51 Results: Recall-Precision plots . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 51  The time-depended approaches outperform the state-of-the-art  BLL+MPr reaches the highest level of accuracy CiteULike
  • 52. BLL approaches outperform current state-of-the-art tag recommender approaches. 52 Results: Recall Precision Results: . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 53. 53 ...how about runtime? . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 54. 54 Results: Runtime BLL+C needs only around 1s to generate tag-recommendations for 5,500 users in BibSonomy . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 55. 55 Results: Runtime . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 56. 56 ...predicting (re-ranking) items with ACT-R . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 57. 57 Our Approach = CIRTT  2 main steps First step: – User-based Collaborative Filtering (CF) to get candidate items of similar users Second step: – Item-based CF to rank these candidate items using the BLL equation to integrate tag and time information: . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 57
  • 58. IR metrics: nDCG@20, MAP@20, Recall@20, Diversity and 58 How does it perform? 3 freely-available folksonomy datasets – BibSonomy (~ 340,000 tag assignments) – CiteULike (~ 100.000 tag assignments) – MovieLens (~ 100.000 tag assignments) Original datasets (no p-core pruning) Doerfel et al. (2013) 80/20 split (for each user 20% most recent bookmarks/posts in test-set, rest in training-set) User Coverage . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 58
  • 59. 59 Baseline Methods • Most Popular (MP) • User-based Collaborative Filtering (CF) • Two alternative approaches based on tag and time information – Zheng et al. (2011)  exponential function – Huang et al. (2014)  linear function (remember: our CIRTT uses a power function) . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 59
  • 60. 60 Results: nDCG plots . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 60 CIRTT reaches the highest level of accuracy
  • 61. 61 Results: Recall plots . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona 61 CIRTT reaches the highest level of accuracy
  • 62. CIRTT works quite well compared to the current state-of-the-art in tag-based 62 Results Results: item recommender systems . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 63. 63 What are we... ...currently working on... . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 64. 64 MINERVA2 + ACT-R . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 65. 65 Time in Semantic vs. Lexical Memory . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 66. 66 Topical vs. Lexical shift in time Topics Tags Results: Topical shift in time is less pronounced than lexical shift . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 67. 67 Results: Recall / Precision . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 68. 68 Describer vs. Categorizer M. Strohmaier, C. Koerner, and R. Kern. Understanding why users tag: A survey of tagging motivation literature and results from an empirical study. Journal of Web Semantics, 17:1–11, 2012. . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 69. 69 Results: Categorizer vs. Describer . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 70. 70 ... ok that‘s basically it  . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 71. 71 Code and Framework https://github.com/learning-layers/TagRec/ . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona
  • 72. 72 Thank you! Christoph Trattner Email: trattner.christoph@gmail.com Web: christophtrattner.info Twitter: @ctrattner Sponsors: . Christoph Trattner 30.10.2014 – Yahoo! Labs, Barcelona