My keynote at the 1st International Workshop on Social Multimedia Computing (SMC), Melbourne, Australia, 9 July 2012.
see: http://www.icme2012.org or
http://smc2012.idm.pku.edu.cn/
1. Security and trust in social media networks
Prof. Touradj Ebrahimi
touradj.ebrahimi@epfl.ch
The First International Workshop on Social Multimedia Computing (SMC2012)
Melbourne, Australia
9 July 2012
Multimedia Signal Processing Group
Swiss Federal Institute of Technology
2. Social media landscape 2012
http://www.fredcavazza.net/2012/02/22/social-media-landscape-2012/
3. Popularity of social networks
March 2012
May 2012
http://blog.tweetsmarter.com/social-media/spring-2012-social-media-user-statistics/
4. Popularity of social networks (cont.)
1+ billion photos
7+ billion photos
~140 billion photos
72 hours of video
every minute
April 2012
5. Popularity of social networks (cont.)
February 2012
http://www.trendweek.com/en/60-seconds-of-social-media-sharing/
6. Popularity of social networks (cont.)
January 2012
http://online.wsj.com/article/SB10001424052970204653604577249341403742390.html
7. Trust & security issues in social networks
8. Trust & security issues in social networks (cont.)
• The Socialcam application automatically shares the videos
you watch with your Facebook friends
• It may be doing you more harm than good!
9. Trust & security issues in social networks (cont.)
• Spotify and Yahoo News Activity automatically publish the
songs you have listened to and the news you have read on your
profile page
10. Trust & security issues in social networks (cont.)
• “Responsible” sharing (e.g. a friend posted an embarrassing
photo of young political candidate Emma Kiernan enjoying a party)
Anybody can post any picture of you on Facebook at any time!
Even if deleted, it is there forever, stored in the vast Internet
memory bank to be found again and again!
http://newsbizarre.com/2009/05/emma-kiernan-facebook-photo-puts-young.html
11. Trust & security issues in social networks (cont.)
• “Responsible” sharing (e.g. top model Rosanagh Wypych
posted photos of herself and friends drinking alcohol and
consuming cannabis)
http://www.3news.co.nz/Top-Model-Rosanagh-worried-after-Facebook-pot-pic/tabid/418/articleID/224047/Default.aspx
12. Trust & security issues in social networks (cont.)
• Wrong tags in Flickr
Challenges:
• Identify the most appropriate tags
• Eliminate noisy or spam tags
Only around 50% of tags are truly related to an image
[Kennedy et al., ACM MIR 2006]
13. Trust & security issues in social networks (cont.)
• Spam bookmarks in Delicious
14. Trust & security solutions in social networks
15. Trust & security solutions in social networks (cont.)
• Limit the number of people you share content and
communications with by creating private groups
(e.g. Google+ Circles, Facebook Smart Lists)
16. Trust & security solutions in social networks (cont.)
• Limit the time to view shared content
(e.g. the SnapChat application – though it cannot
guarantee that your messaging data will be deleted
in all instances)
17. Trust & security solutions in social networks (cont.)
• Manage your account (friends, photos, comments, posts)
• Make sure that what you are sharing or posting is not going
to cause regret
http://www.huffingtonpost.com/2012/03/20/social-media-privacy-infographic_n_1367223.html
18. Trust & security before social media
• E-mail
• Web search
• Web videos
• Blogs
• Online shopping systems
• Peer-to-peer (P2P) networks
19. Anti-spam strategies in online systems
[Heymann, Koutrika, Garcia-Molina, IEEE IC 2007]
20. Trust and reputation in online systems
21. Trust and reputation in online systems (cont.)
Email spam filtering
22. Model of a social tagging system
[Ivanov, Vajda, Lee, Ebrahimi, IEEE SPM 2012]
23. Categorization of trust models
[Ivanov, Vajda, Lee, Ebrahimi, IEEE SPM 2012]
24. User versus content trust modeling
• User trust modeling is more popular than content trust
modeling:
• User trust models are less complex than content trust models
• User trust models can quickly adapt to the constantly evolving and
changing environment in social systems, due to the type of features
used for modeling, and thus remain applicable longer than content
trust models, without the need to create new models
• User trust modeling has the disadvantage of a “broad brush”,
i.e. it may be excessively strict if a user happens to post one
bit of questionable content among otherwise legitimate content
• “Subjectivity” in classifying spam and non-spam content/
users remains a fundamental issue, i.e. what is spam content or
a spammer to one person may be interesting to another
25. Summary of representative recent techniques
Reference | Trust model | Media | Method | Dataset
Gyongyi et al. 2004 | content | web pages | an iterative approach, called TrustRank, to propagate trust scores to all nodes in the graph by splitting the trust score of a node among its neighbors according to a weighting scheme | AltaVista, real
Koutrika et al. 2008 | content | bookmarks | a coincidence-based model for query-by-tag search which estimates the level of agreement among different users in the system for a given tag | Delicious, real & simulated
Wu et al. 2005 | content | images | a computer vision technique based on low-level image features to detect embedded text and computer-generated graphics | SpamArchive & Ling-Spam, real
Liu et al. 2009 | content & user | bookmarks | an iterative approach to identify spam content by its information value extracted from the collaborative knowledge | Delicious, real
Bogers and Van den Bosch 2008 | content & user | bookmarks | KL-divergence to measure the similarity between user language models and new posts | BibSonomy & CiteULike, real
Ivanov et al. 2012 | user | images | an approach based on the feedback from other users who agree or disagree with a tag associated with an image | Panoramio, real
[Ivanov, Vajda, Lee, Ebrahimi, IEEE SPM 2012]
26. Summary of representative recent techniques (cont.)
Reference | Trust model | Media | Method | Dataset
Xu et al. 2006 | user | bookmarks | an iterative approach to compute the goodness of each tag with respect to a content and the authority scores of the users | MyWeb 2.0, real
Krestel and Chen 2008 | user | bookmarks | a TrustRank-based approach using features which model tag co-occurrence, content co-occurrence and co-occurrence of tag-content | BibSonomy, real
Benevenuto et al. 2009 | user | videos | a supervised learning approach applied on features that reflect users’ behavior through video responses | YouTube, real
Lee et al. 2010 | user | tweets | a machine learning approach applied on social honeypots including users’ profile and tweets’ features | Twitter, real & simulated
Krause et al. 2008 | user | bookmarks | a machine learning approach applied on a user’s profile, bookmarking activity and context-of-tags features | BibSonomy, real
Markines et al. 2009 | user | bookmarks | a machine learning approach applied on tag-, content- and user-based features | BibSonomy, real
Caverlee et al. 2008 | user | user profiles | an approach to compute a dynamic trust score, called SocialTrust, depending on the quality of the relationship and personalized feedback ratings | MySpace, real
[Ivanov, Vajda, Lee, Ebrahimi, IEEE SPM 2012]
27. User trust modeling
• User reliability based model [Ivanov et al. 2012]
• User trust modeling in Panoramio images
Tagged incorrectly?
28. User trust modeling
• User reliability based model (cont.) [Ivanov et al. 2012]
• Idea:
true flag (correct tag)
false flag (wrong tag + suggest a correct tag)
• The more misplacements a user has, the less trusted he/she is
• User trust ratio: the number of true flags divided by the total
number of flags over all images tagged by that user
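The user trust ratio on this slide can be sketched in a few lines; the function name and the sample flag lists below are illustrative assumptions, not part of the actual system.

```python
# Hypothetical sketch of the user trust ratio: the fraction of "true"
# flags among all flags that other users assigned to this user's tags.

def trust_ratio(flags):
    """flags: one boolean per flag a user's tags received
    (True = another user confirmed the tag, False = disputed it)."""
    if not flags:
        return 0.0  # no feedback yet: treat the user as untrusted
    return sum(flags) / len(flags)

# Alice's tags were confirmed 9 times and disputed once -> high trust.
alice = trust_ratio([True] * 9 + [False])
# Eve's tags were disputed 8 times out of 10 -> low trust.
eve = trust_ratio([True] * 2 + [False] * 8)
print(alice, eve)  # 0.9 0.2
```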
29. User trust modeling
• User reliability based model (cont.) [Ivanov et al. 2012]
• Users can collaboratively eliminate a spammer
30. Example: Geotag propagation with trust modeling
• Geotag propagation = geotagging
• GPS coordinates are not available for the majority of web
photos
• The GPS sensor in a camera provides only the location of the
photographer, not that of the captured landmark
31. Example: Geotag propagation with trust modeling
• GPS and Wi-Fi geotagging sometimes determine a wrong
location due to noise in the communication channel
32. Example: Geotag propagation with trust modeling
[Ivanov, Vajda, Lee, Ebrahimi, MTAP 2012]
33. Geotag propagation with trust modeling
http://www.flickr.com/photos/gustavog/9708628
34. Performance evaluation
• Dataset
• 1320 images ‒ 22 cities, 3 sublocations per city, 20 sample
images per sublocation
• 47 users; 3295 tags (658 unique)
35. The distribution of the normalized trust values
36. The distribution of the normalized trust values for all users
37. The recognition rate of the geotag propagation system
38. Open issues & challenges
• Publicly available datasets:
• Datasets from different social networks (or even multiple
datasets from one social network) for evaluating trust modeling
approaches are rarely published, which makes it difficult to
compare the results and performance of different trust models
• Most datasets support evaluating only one aspect of trust
modeling, either user or content trust modeling, while evaluating
the other aspect requires introducing simulated objects into
real-world social tagging datasets
39. Open issues & challenges (cont.)
• Dynamics of trust:
• A user’s trust tends to vary over time with the user’s
experience and the evolution of social networks
• Only a few approaches deal with the dynamics of trust by
distinguishing between recent and old tags
40. Open issues & challenges (cont.)
• Multilingualism:
• Most existing trust modeling approaches based on text
information assume monolingual environments
• Some text information may be regarded as wrong due to
language differences – users come from various countries, so
multiple languages appear simultaneously in tags and comments
41. Open issues & challenges (cont.)
• Interaction across social networks:
• How can trust models be effectively connected and shared
across domains? E.g. users can use their Facebook accounts to
log in to other social network services
42. Open issues & challenges (cont.)
• Multimodal analysis:
• Most current techniques for noise and spam reduction focus
only on textual tag processing and user profile analysis, while
audio and visual features of multimedia content can also
provide useful information about the relevance of the content
and the content-tag relationship
43. References
B. Sigurbjornsson, and R. van Zwol, "Flickr tag recommendation based on collective knowledge," in Proc.
WWW, Apr. 2008, pp. 327–336.
L. S. Kennedy, S.-F. Chang, and I. V. Kozintsev, “To search or to label?: Predicting the performance of search-
based automatic image classifiers,” in Proc. ACM MIR, Oct. 2006, pp. 249–258.
K. Liu, B. Fang, and Y. Zhang, “Detecting tag spam in social tagging systems with collaborative knowledge,” in
Proc. IEEE FSKD, Aug. 2009, pp. 427–431.
P. Heymann, G. Koutrika, and H. Garcia-Molina, “Fighting spam on social web sites: A survey of approaches
and future challenges,” IEEE Internet Comput., vol. 11, no. 6, pp. 36–45, Nov. 2007.
X. Li, C. Snoek, and M. Worring, "Learning tag relevance by neighbor voting for social image retrieval," in
Proc. ACM MIR, Oct. 2008, pp. 180-187.
B. Markines, C. Cattuto, and F. Menczer, “Social spam detection,” in Proc. ACM AIRWeb, Apr. 2009, pp. 41–
48.
B. Krause, C. Schmitz, A. Hotho, and G. Stumme, “The anti-social tagger: Detecting spam in social bookmarking
systems,” in Proc. ACM AIRWeb, Apr. 2008, pp. 61–68.
G. Koutrika, F. A. Effendi, Z. Gyongyi, P. Heymann, and H. Garcia-Molina, “Combating spam in tagging
systems: An evaluation,” ACM TWEB, vol. 2, no. 4, pp. 22:1–22:34, Oct. 2008.
Z. Gyongyi, H. Garcia-Molina, and J. Pedersen, “Combating web spam with TrustRank,” in Proc. VLDB, Aug.
2004, pp. 576–587.
C.-T. Wu, K.-T. Cheng, Q. Zhu, and Y.-L. Wu, “Using visual features for anti-spam filtering,” in Proc. IEEE ICIP,
Sep. 2005, vol. 3, pp. III-509–512.
44. References (cont.)
I. Ivanov, P. Vajda, J.-S. Lee, and T. Ebrahimi, "In tags we trust: Trust modeling in social tagging of multimedia
content," in IEEE SPM, vol. 29, no. 2, pp. 98-107, Mar. 2012.
I. Ivanov, P. Vajda, J.-S. Lee, L. Goldmann, and T. Ebrahimi, “Geotag propagation in social networks based on
user trust model,” MTAP, vol. 56, no. 1, pp. 155-177, Jan. 2012.
I. Ivanov, P. Vajda, L. Goldmann, J.-S. Lee, and T. Ebrahimi, "Object-based tag propagation for semi-
automatic annotation of images," in Proc. ACM MIR, Mar. 2010, pp. 497-506.
T. Bogers and A. Van den Bosch, “Using language models for spam detection in social bookmarking,” in Proc.
ECML PKDD, Sep. 2008, pp. 1–12.
Z. Xu, Y. Fu, J. Mao, and D. Su, “Towards the semantic web: Collaborative tag suggestions,” in Proc. ACM
WWW, May 2006.
R. Krestel and L. Chen, “Using co-occurrence of tags and resources to identify spammers,” in Proc. ECML
PKDD, Sep. 2008, pp. 38–46.
F. Benevenuto, T. Rodrigues, V. Almeida, J. Almeida, and M. Gonçalves, “Detecting spammers and content
promoters in online video social networks,” in Proc. ACM SIGIR, Jul. 2009, pp. 620–627.
K. Lee, J. Caverlee, and S. Webb, “Uncovering social spammers: social honeypots + machine learning,” in
Proc. ACM SIGIR, Jul. 2010, pp. 435–442.
M. G. Noll, C. A. Yeung, N. Gibbins, C. Meinel, and N. Shadbolt, “Telling experts from spammers: Expertise
ranking in folksonomies,” in Proc. ACM SIGIR, Jul. 2009, pp. 612–619.
J. Caverlee, L. Liu, and S. Webb, “SocialTrust: Tamper-resilient trust establishment in online communities,” in
Proc. ACM JCDL, Jun. 2008, pp. 104–114.
A. Hotho, D. Benz, R. Jäschke, and B. Krause, Eds., ECML PKDD Discovery Challenge, Sep. 2008.
Available at: http://www.kde.cs.uni-kassel.de/ws/rsdc08.
45. Any questions?
Prof. Touradj Ebrahimi
Touradj.Ebrahimi@epfl.ch
MMSPG EPFL
The contribution of Ivan Ivanov, who diligently prepared many of the materials discussed, is hereby acknowledged.
http://nokiaconnects.com/2012/03/22/photo-mad-why-are-we-so-obsessed-with-taking-pictures/
http://blog.1000memories.com/94-number-of-photos-ever-taken-digital-and-analog-in-shoebox
http://www.youtube.com/t/press_statistics
Tags, when combined with search technologies, are essential in resolving user queries targeting shared content.
However, the countermeasures developed for e-mail and web spam do not directly apply to social networks.
http://blog.jungletechnology.com/wp-content/uploads/2012/01/spam.jpg
http://gigaom.com/video/estimate-20-of-web-videos-are-spam/
http://www.findmysoft.com/img/news/inside/Kanye-West-Dead_1256213554.png
http://www.free-web-submission.co.uk/blog/wp-content/uploads/2011/08/blog-comment-spam.jpg
In order to reduce or eliminate spam, various anti-spam methods have been proposed in state-of-the-art research. Heymann et al. [6] classified anti-spam strategies into three categories: prevention, detection, and demotion. Prevention-based approaches aim at making it difficult for spam content to enter social tagging systems by restricting certain access types through interfaces (such as CAPTCHA [7] or reCAPTCHA [8]) or through usage limits (such as a tagging quota, e.g., Flickr introduced a limit of 75 tags per photo [9]). Detection approaches identify likely spam either manually or automatically, making use of, for example, machine learning (such as text classification) or statistical analysis (such as link analysis), and then delete the spam content or visibly mark it as hidden to users. Finally, demotion-based approaches reduce the prominence of content likely to be spam. For instance, rank-based methods produce an ordering of a system’s content, tags or users based on their trust scores. Prevention-based approaches can be considered a type of precaution against spammers. However, they cannot completely secure a social tagging system. Some studies, e.g. [10], showed that CAPTCHA systems can be defeated by computers with around 90% accuracy, using, for example, optical character recognition or shape context matching. Even if prevention methods were perfect, there would still be a possibility that social systems get polluted with spam (malicious) or irrelevant tags. Therefore, detection and demotion via trust modeling are required to keep a system free of noise and spam.
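The demotion strategy above can be sketched in a few lines: instead of deleting likely spam, content is simply ranked by trust score so low-trust items sink. The items and trust values below are invented for illustration.

```python
# Rank-based demotion: order content by descending trust score so that
# likely spam falls to the bottom instead of being deleted outright.
# The tags and trust scores here are made-up example data.

items = [
    {"tag": "eiffel tower", "trust": 0.92},
    {"tag": "buy cheap pills", "trust": 0.05},
    {"tag": "paris", "trust": 0.71},
]

ranked = sorted(items, key=lambda it: it["trust"], reverse=True)
print([it["tag"] for it in ranked])
# -> ['eiffel tower', 'paris', 'buy cheap pills']
```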
http://www.wpconfig.com/2010/07/04/tips-to-increase-your-pagerank
Some examples of trust and reputation systems:
Google’s web page rank – each web page is ranked depending on the number of links pointing at it from different sites.
Amazon – members can submit a review and a rating in the range of 1 to 5 stars, and users can vote on reviews as being helpful or not helpful. Disadvantage: there are many reports of attacks, since anybody can vote without becoming a member.
http://www.spam-filter-express.com/images/bayesian.gif
Some examples of trust and reputation systems:
eBay – buyer and seller rate each other as positive, negative or neutral after completion of a transaction; even though it is very primitive, studies show that ratings on eBay have a surprisingly positive influence.
Email spam filtering – a naïve Bayesian classifier learns the probability of each word within a training set of spam and non-spam emails, and then uses these probabilities to calculate the probability that a test email is spam, depending on whether it contains particular words or not.
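A toy version of such a naïve Bayesian filter might look as follows; the tiny corpus is invented for illustration, and a real filter would of course train on thousands of messages.

```python
# Toy naive Bayesian spam filter: learn per-word probabilities from a
# labelled corpus, then compare log-probabilities of a new message under
# the spam and ham models. Laplace smoothing handles unseen words.
import math
from collections import Counter

spam = ["cheap pills online", "cheap replica watches", "online casino win"]
ham = ["project meeting notes", "lunch tomorrow", "meeting agenda attached"]

def word_counts(docs):
    return Counter(w for d in docs for w in d.split())

spam_c, ham_c = word_counts(spam), word_counts(ham)
vocab = set(spam_c) | set(ham_c)

def log_prob(msg, counts, prior):
    # Add-one smoothing so unseen words do not zero out the probability.
    n = sum(counts.values())
    return math.log(prior) + sum(
        math.log((counts[w] + 1) / (n + len(vocab))) for w in msg.split()
    )

def is_spam(msg):
    return log_prob(msg, spam_c, 0.5) > log_prob(msg, ham_c, 0.5)

print(is_spam("cheap pills"))       # True
print(is_spam("meeting tomorrow"))  # False
```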
The countermeasures developed for e-mail and web spam do not directly apply to social networks.

In a social tagging system, spam or noise can be injected at three different levels: spam content, spam tag-content association and spammer [18]. Trust modeling can be performed at each level separately (e.g. [18]), or different levels can be considered jointly to produce trust models; for example, to assess a user’s reliability, one can consider not only the user profile, but also the content that the user uploaded to a social system (e.g. [19]). We categorize trust modeling approaches into two classes according to the target of trust, i.e. user and content trust modeling (shown in Figure 3).
A user’s trust in a social tagging system is dynamic, i.e. it changes over time. It is therefore better to consider the tagging history of a user, because consistently good behavior in the past can suddenly change with a few mistakes, which consequently ruins his/her trust in tagging.
It is noticeable that user trust modeling is more popular than content trust modeling. One reason for this is that the former is less complex than the latter, i.e. the number of models required is usually much larger in content trust modeling. The other reason is that user trust models can quickly adapt to the constantly evolving and changing environment in social systems, due to the type of features used for modeling, and thus remain applicable longer than content trust models, without the need to create new models. On the other hand, user trust modeling has the disadvantage of a “broad brush”, i.e. it may be excessively strict if a user happens to post one bit of questionable content among otherwise legitimate content. The trustworthiness of a user is often judged based on the content that the user uploaded to a social system, and thus “subjectivity” in discriminating spammers from legitimate users remains an issue for user trust modeling, as in content trust modeling.
Table I summarizes representative recent approaches for trust modeling in social tagging. The presented approaches are sorted by complexity from simple to advanced, separately for content and user trust models.
In this section, we describe our own approach for user trust modeling in image tagging, proposed in Ivanov et al. [8]. It consists of three steps:

User evaluation – we evaluate the trust or reliability of users from their past tagging behavior, in order to distinguish users who provide reliable geotags from those who do not. Assuming that there are L users who tag M training images, a matrix R(i, u), with i ∈ [1…M] and u ∈ [1…L], records whether user u tagged image i correctly. Comparing the propagated tags to ground truth tags can be done automatically using tag similarity measures, for example WordNet [26] or Google distance [27]; nevertheless, we considered only manually defined ground truth in our experiments.

Trust model creation – a trust value for user u, trust_Ivanov(u), is computed as the percentage of correctly tagged images among all images tagged by user u.

Tag propagation – only tags from trusted users are propagated to other photos in the dataset: if the user trust value trust_Ivanov(u) exceeds a predefined threshold T̂, then all of his/her tags are propagated; otherwise, none of his/her tags are propagated.

In this approach, ground truth data are used to estimate the user trust value. However, for a practical photo sharing system, such as Panoramio, it is not necessary to collect ground truth data, since user feedback can replace it.
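These three steps can be sketched roughly as follows; the matrix values, threshold and user indices are invented, and the real system compares tags against ground truth or user feedback rather than a hand-filled table.

```python
# Sketch of the trust computation above: R[i][u] is 1 if user u tagged
# training image i correctly, 0 if incorrectly, and None if u did not
# tag image i. trust(u) is the fraction of correctly tagged images among
# all images tagged by u; tags propagate only when trust(u) > T_hat.

R = [
    # user:  0   1   2
    [1,    0,  None],   # image 0
    [1,    0,  1],      # image 1
    [1,    1,  0],      # image 2
]

def trust(u):
    tagged = [R[i][u] for i in range(len(R)) if R[i][u] is not None]
    return sum(tagged) / len(tagged) if tagged else 0.0

T_hat = 0.5  # assumed threshold for this illustration
trusted = [u for u in range(3) if trust(u) > T_hat]
print(trusted)  # -> [0]  (only user 0's tags get propagated)
```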
The main idea is that users evaluate tagged images by assigning a true or a false flag to the tag associated with an image. If a user assigns a false flag, then he/she needs to suggest a correct tag for the image. The more misplacements a user has, the less trusted he/she is. By applying this method, spammers and unreliable users can be efficiently detected and eliminated. The user trust value is therefore calculated as the ratio between the number of true flags and all associated flags over all images tagged by that user. The number of misplacements in Panoramio is analogous to the number of wrongly tagged images in our approach.
In case a spammer attacks the system, other users can collaboratively eliminate the spammer. First, the spammer wants to make other users untrusted, so he/she assigns many false flags to the tags given by those users and sets new, wrong tags on these images. In this way, the spammer becomes trusted. Then, other users correct the tags given by the spammer, so that the spammer becomes untrusted and all of his/her feedback in the form of flags is no longer considered in the whole system. Finally, previously trusted users, who were untrusted due to the spammer attack, recover their status. Following this scenario, the user trust ratio can be constructed by making use of the feedback from other users who agree or disagree with the tagged location. However, due to the lack of a suitable dataset which provides user feedback, the evaluation of the user trust scenario is based on a simulation of the social network environment.
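One way to simulate this recovery is to weight each flag by the current trust of the user who cast it and iterate to a fixed point; the users, flags and update rule below are our own illustrative assumptions, not the evaluation protocol from the paper.

```python
# Rough simulation of collaborative spammer elimination: Eve false-flags
# Alice's and Bob's correct tags; Alice and Bob flag Eve's wrong tags.
# Each flag is weighted by the flagger's current trust, so once Eve's
# trust collapses, her attack flags stop counting and the honest users
# recover. All names, flags and the update rule are invented.

# flags[target] = list of (flagger, is_true_flag) judgements on target's tags
flags = {
    "alice": [("bob", True), ("eve", False), ("eve", False)],   # Eve attacks
    "bob":   [("alice", True), ("eve", False), ("eve", False)],
    "eve":   [("alice", False), ("bob", False)],                # corrections
}

def step(trust):
    new = {}
    for user, judgements in flags.items():
        weight = sum(trust[f] for f, _ in judgements)
        agree = sum(trust[f] for f, ok in judgements if ok)
        # No credible flags yet -> keep full trust by default.
        new[user] = agree / weight if weight else 1.0
    return new

trust = {u: 1.0 for u in flags}  # start by trusting everyone
for _ in range(10):
    trust = step(trust)
print(trust)  # Alice and Bob recover full trust; Eve drops to zero
```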
http://www.placecast.net
In this section, an example system using trust modeling is described in order to demonstrate how a social tagging system can benefit from trust modeling. In particular, the geotagging scenario is considered, which is nowadays a very popular application, since a large portion of Internet images in social networks are related to travel. Travel is an important type of event for which people like to share, annotate and search pictures. For the majority of travel images on the Internet, however, proper geographical annotations are not available. In most cases, the images are annotated manually by users. In order to speed up this time-consuming manual tagging process, geotags can be propagated based on the similarity between image content (usually famous landmarks) and context (associated geotags).

Ivanov et al. [24] developed an efficient system for automatic geotag propagation in images by associating locations with distinctive landmarks and using object duplicate detection. The system overview is shown in Figure 4. The robust graph-based object duplicate detection approach reliably establishes the correspondence between a small set of tagged images and a large set of untagged images by searching for the same landmark depicted in different images, in order to propagate geotags from the former to the latter. In tag propagation, trust modeling is especially crucial, because propagating a wrong or spam tag can easily damage the integrity and reliability of the whole system. A user trust model derived for each user is introduced in the geotagging system of [24] by making use of the feedback from other users who agree or disagree with a tag associated with an image.
A new dataset was created in order to evaluate the proposed geotag propagation method. We are interested in images that depict geographically unique landmarks. For instance, pictures taken by tourists are ideal, because they often focus on the unique and interesting landmarks of a place. The dataset was obtained from Google Image Search, Flickr and Wikipedia by querying the tags associated with famous landmarks. The resulting dataset consists of 1320 images: 22 cities (such as Amsterdam, Barcelona, London, Moscow, Paris) and 3 landmarks for each of them (objects or areas in those cities, such as Bird’s Nest Stadium, Sagrada Familia, Reichstag, Golden Gate Bridge, Eiffel Tower). Each landmark has 20 image samples. Fig. 3 shows a single image for a single landmark from each of the 22 considered cities, while Fig. 4 provides several images for 3 selected landmarks (e.g. Berlin – Reichstag, San Francisco – Golden Gate Bridge and Paris – Eiffel Tower). As can be seen from these samples, images with a large variety of viewpoints and distances are considered for each landmark. Fig. 5 shows all cities and landmarks from our dataset.
To compare different user trust models, we first analyze the distribution of their trust values given the tags manually assigned by the human participants (see Section V-B for more details). The values for each trust model were computed as described in Section III. The obtained user trust values were normalized to 1. Then, the trust values were split into five equally spaced histogram bins with the following ranges: 0–0.2, 0.2–0.4, 0.4–0.6, 0.6–0.8, and 0.8–1. Figure 7 shows the distribution of the total number of users with trust values in different bins for each trust model. From the results, it can be noted that the distributions for most of the user trust models are not uniform. However, the tags assigned to our dataset by the human participants can be regarded as following a uniform distribution, assuming participants tagged the depicted, generally well-known landmarks without bias. Therefore, a useful, adequate and practical user trust model should also reflect this uniformity in the tags gathered from the participants. From Figure 7, we can see that only two of the five compared user trust models, Koutrika et al. [21] and Ivanov et al. [8], demonstrate this uniformity in their assignment of trust values to the participating users, while the rest of the models mark the majority of users as untrusted.
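The normalize-and-bin step described above can be sketched as follows; the 47 trust values are synthetic stand-ins, since the real values come from the compared trust models.

```python
# Normalize user trust values to [0, 1] and count users in five
# equal-width histogram bins. The values themselves are synthetic.
import random

random.seed(0)
values = [random.random() for _ in range(47)]  # stand-in trust values

m = max(values)
normalized = [v / m for v in values]

bins = [0] * 5
for v in normalized:
    bins[min(int(v * 5), 4)] += 1  # v == 1.0 falls into the last bin

print(bins)  # counts for [0-0.2), [0.2-0.4), [0.4-0.6), [0.6-0.8), [0.8-1]
```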
To understand the reasons for such bias in some of the user trust models, we plot in Figure 8 the trust values given by each approach to all users. The figure shows that trust values vary considerably not only between different users, but also across different models. For example, the trust values of the users numbered 2, 16 and 47 span almost the entire range of the normalized trust value, namely from 0 to 1, across all selected models. Since it is difficult to compare the selected trust models for each user separately, we also grouped the participants according to their background. In our experiments, users are split into 6 categories: researchers (13 users), architects (7 users), engineers (12 users), doctors (4 users), high school students (2 users), and others (9 users who did not indicate their background).
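Grouping the participants by background and aggregating their trust values, as done above, amounts to a simple group-by-and-average; a hedged sketch with hypothetical (category, trust) pairs:

```python
from collections import defaultdict

def mean_trust_by_category(users):
    """users: iterable of (category, trust_value) pairs, e.g.
    ('researcher', 0.8). Returns the mean trust value per category."""
    groups = defaultdict(list)
    for category, trust in users:
        groups[category].append(trust)
    return {c: sum(vals) / len(vals) for c, vals in groups.items()}
```

Comparing the per-category means across models makes the models easier to contrast than inspecting the 47 users individually.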
To analyze the performance of the studied user trust models in the tag propagation scenario, we plot the accuracy of propagation versus the number of propagated tags in Figure 10. We vary the number of propagated tags by adjusting the threshold on the acceptable user trust value for each model. The accuracy of tag propagation is computed as the ratio of the correct tags (based on the ground truth) to the total number of tags assigned (propagated) to images. The maximum number of propagated tags can be much higher than the number of images, since several tags can be assigned to an image by different users, and each tag is propagated to different images. The curves in Figure 10 therefore show a trade-off between propagating tags to more images, but less accurately, and propagating tags to fewer images, but more accurately. The black marker indicates the average tagging accuracy of the system when neither a user trust model nor automated tag propagation is used. In our experiments, it corresponds to the users assigning 47 × 66 = 3102 tags to images (47 users, each of them tagging 66 images at least once). The resulting average tagging accuracy is 52%. This accuracy is equivalent to what Flickr or Panoramio currently achieve, where users simply tag photos independently and no propagation is used.

However, by using automated tag propagation that relies on a trust model based on user feedback, we can improve the accuracy of the tagging system and propagate more tags to the untagged images in the dataset. This improvement is illustrated by the left part of the blue curve (our method), which lies above the average accuracy of 52%. It means that more than 6600 tags (see Figure 10), twice as many as without a trust model, can be propagated from the trusted users while keeping the accuracy above 52%.
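The trade-off curve described above is obtained by sweeping the trust threshold: for each threshold, only tags from sufficiently trusted users are propagated, and the accuracy is the fraction of those tags that match the ground truth. A minimal sketch, with invented (trust, correctness) records rather than the paper's data:

```python
def propagation_tradeoff(tag_records, thresholds):
    """tag_records: list of (user_trust, is_correct) pairs, one per
    candidate propagated tag. For each trust threshold, keep only tags
    from users at or above it and report (tags kept, accuracy)."""
    curve = []
    for t in thresholds:
        kept = [ok for trust, ok in tag_records if trust >= t]
        if kept:
            # Accuracy = correct kept tags / all kept tags.
            curve.append((len(kept), sum(kept) / len(kept)))
    return curve
```

Lower thresholds propagate more tags at lower accuracy; higher thresholds propagate fewer tags at higher accuracy, which is exactly the trade-off the curves in Figure 10 depict.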
Other trust-based methods, such as that of Koutrika et al., also perform well, although they show less impressive results than tag propagation based on our user trust model. The advantage of the algorithm by Koutrika et al., however, is that it is simple and does not need any ground truth or seed data. The methods of Liu et al., Xu et al., and Krestel and Chen are not able to perform well in the tag propagation scenario. Our method shows good performance in this simulated social network environment because the algorithm includes users' tagging behavior, captured through feedback from other people, as an important factor in calculating the trust value, rather than simply relying on the user-contributed tags.
A variety of datasets from different social networks, and even different datasets from the same social network, have been used for the evaluation of trust modeling approaches, as shown in Section III. However, such datasets are rarely published, which makes it difficult to compare the results and performance of different trust modeling approaches. It would therefore be desirable to encourage researchers to make their datasets publicly available to the research community, so that they can be used for comparison and benchmarking of different approaches. Furthermore, most of the datasets provide data for evaluating only one aspect of trust modeling, either user or content trust modeling, while evaluation of the other aspect requires introducing simulated objects into real-world social tagging datasets (e.g. [20], [29]). For a thorough evaluation of a trust model, however, it is necessary that real-world datasets provide ground-truth data for both users and content.
We already noted that a user's trust tends to vary over time according to the user's experience and the evolution of social networks. However, only a few approaches (e.g. [29], [30]) deal with the dynamics of trust, by distinguishing between recent and old tags. Future work on the dynamics of trust would lead to better modeling of this phenomenon in real-world applications.
Most of the existing trust modeling approaches based on text information assume monolingual environments. However, many social network services are used by people from various countries, so that various languages appear simultaneously in tags and comments. In such cases, some text information may be wrongly regarded as incorrect due to the language difference. Incorporating multilingualism into trust modeling would therefore help solve this problem.
Nowadays, interaction across social networks is becoming popular. For example, users can use their Facebook accounts to log in to other social network services. Thus, a future challenge in trust modeling is to investigate how trust models can be effectively connected and shared across domains.
Most of the current techniques for noise and spam reduction focus only on textual tag processing and user profile analysis, while audio and visual content features of multimedia content can also provide useful information about the relevance of the content and the content-tag relationship (e.g. [22]). In the future, a promising research direction would be to combine multimedia content analysis with conventional tag processing and user profile analysis.