SlideShare a Scribd company logo
1 of 37
SOCIAL NETWORK
ANALYSIS
Caleb Jones
{
“email” : “calebjones@gmail.com”,
“website” : “http://calebjones.info”,
“twitter” : “@JonesWCaleb”
}
Overview
•  Network Analysis – Crash Course
•  Degree
•  Components
•  Modularity
•  Ranking
•  Resiliency
•  Gephi – Intro
•  Loading data (Facebook)
•  Navigation
•  Statistics
•  Exporting
•  Filtering
•  Resiliency
Resources
SNA Coursera Course
(next being taught October 2013)
Linked by
Albert-László Barabási
Network Analysis – Crash Course
•  Degree (n): The number of connections a node has.
•  Node A has in-degree 3 and out-degree 1
•  Node B has degree 4
A
B
Network Analysis – Crash Course
•  Component (n): A a maximally connected subgraph
(undirected).
•  Giant component is largest component
component (giant) component
Graph with nodes { A, B, C, X, Y, Z }
Network Analysis – Crash Course
•  Modularity (n) ~ Division of a graph into communities
(modules/classes/cliques) with dense interconnection with
the network having relatively sparse interconnection
between communities.
Community 1 Community 2
Graph with nodes { A, B, C, X, Y, Z }
Network Analysis – Crash Course
• Ranking: A measure of a node’s
“importance”
• Many different methods for determining
“importance”
• Degree, Centrality, Closeness, Betweenness,
Eigenvector, HITS, PageRank, Erdös Number
• Which one to consider depends on the
question being asked
• Precursor to identifying network resilience,
diffusion, and vulnerability
Network Analysis – Crash Course
• Degree ranking: Quantity over quality
Node Score
A 3
B 3
C 1
D 1
X 1
Y 1
Z 3
Q 1
Network Analysis – Crash Course
• Betweeness Ranking: How frequently a
node appears on shortest paths.
Node Score
A 15
B 11
C 0
D 0
X 0
Y 0
Z 11
Q 0
Network Analysis – Crash Course
• Closeness Ranking: Average number of
hops from a node to rest of network.
Node Score
A 1.571
B 1.857
C 2.714
D 2.714
X 2.714
Y 2.714
Z 1.857
Q 2.429
Note: Smaller is (usually) better
Network Analysis – Crash Course
• Eigenvector Ranking: A node’s “influence”
on the network (accounts for who you know)
Node Score
A 1
B 0.836
C 0.392
D 0.392
X 0.392
Y 0.392
Z 0.836
Q 0.465
Google’s PageRank is a variant of this
Based on eigenvector of adjacency matrix
Network Analysis – Crash Course
• Erdös Ranking: Number of hops to
specific node (degrees of separation).
Node Score
A 0
B 1
C 2
D 2
X 2
Y 2
Z 1
Q 1
Note: Smaller is (usually) better
What if “Erdös” is an influential CEO?
What if “Erdös” has bird flu?
Erdös
Network Analysis – Crash Course
• Erdös Ranking: Number of hops to
specific node (degrees of separation).
Node Score
A 2
B 1
C 2
D 0
X 4
Y 4
Z 3
Q 3
Note: Smaller is (usually) better
What if “Erdös” is an influential CEO?
What if “Erdös” has bird flu?
Erdös
Network Analysis – Crash Course
• Limitations:
• Only considered undirected networks (directed
is more complicated)
• Treated all edges as equal. Many networks
have a weight or cost associated to edges (e.g.
distance)
• Treated all nodes as equal. A node’s importance
may be inherent based on attributes separate
from its position in network (e.g. dating sites)
Network Analysis – Crash Course
• Resiliency (removing nodes/links):
• Target nodes based on their “importance”
• High degree nodes more likely to affect
local communities
• High betweeness/Eigenvector nodes
more likely to fragment communities
Gephi Introduction
•  Platform for visualizing and analyzing networks
•  https://gephi.org/
•  Cross-platform
•  Plugin model
Facebook Dataset
•  Download your data (gml)
•  http://snacourse.com/getnet/
•  Import into Gephi
•  File -> Open -> Select downloaded
.gml file
•  Choose “undirected”
for “Graph Type”
Layout
Layout -> Fruchterman Reingold
Partitioning Communities
1.  Statistic -> Modularity -> Run (use defaults)
2.  Partition -> Nodes (refresh) -> Modularity class -> Apply
Degree Distribution
1.  Statistic -> Average Degree -> Run
2.  Partition -> Nodes (refresh) -> Modularity class -> Apply
Lots of nodes with
few connections
Only a few with a large
number of connections
Power law distribution?
Node Ranking by Degree
1.  Ranking -> Nodes (refresh) -> Degree -> Apply
(try tweaking min/max size and Spline for desired emphasis)
Filtering Isolated Nodes (“noise”)
1.  Statistics -> Connected
Components -> Run
2.  Filters -> Attributes -> Partition
Count -> Component ID
3.  Drag “Component ID” down into
“Queries” section
4.  Click on “Partition Count”, slide the
settings bar, and click “Filter” –
adjust to remove isolated nodes
Can be important step when dealing with very
large data sets. Depending on degree
distribution, filter can be set quite high.
Re-adjust after Filtering
• Need to re-run previous steps to refresh
calculated values now that filtering has been
done.
• Statistics -> Average degree, modularity,
connected components
•  How did these numbers change?
• Re-partition node color by modularity class now
that modularity has been recalculated
• Run Fruchterman Reingold layout again to fill
space left over from filtered nodes
Have you saved yet!?
Node Ranking by Centrality
1.  Statistics -> Network Diameter -> Run
2.  Ranking -> Betweeness Centrality -> Apply
Erdös Number
•  You may have noticed a key node which both has the
highest degree and betweeness ranking.
•  Click on the “Edit” button and select that node
(note the name)
•  Statistics -> Erdös Number -> Select that name -> OK
•  What will happen if you select a less conspicuous node?
Data Lab
•  Go to “Data Laboratory”
•  All node information as well as calculated statistics appear
here in a spreadsheet.
•  Sort by “Erdös Number” (descending)
•  What is the largest Erdös Number? N degrees of ________ .
•  Try sorting by other values (degree, closeness, betweeness)
Max is 7 degrees
of separation
Node Ranking by Eigenvector Centrality
1.  Statistics -> Eigenvector Centrality -> Run
2.  Ranking -> Eigenvector Centrality -> Apply
Node Ranking by PageRank
1.  Statistics -> PageRank -> Run
2.  Ranking -> PageRank -> Apply
Export to Image
•  Go to “Preview” mode
•  Click “Refresh” to see what you have now
•  Add node labels
•  “Node Labels” -> “Show Labels”
•  Adjust font size to avoid label overlapping
•  If Node Labels are overlapping, try expanding layout
•  Back to “Overview” -> Layout -> Fruchterman Reingold
•  Increase the “Area” parameter and re-run the layout
•  Then go back to “Preview” mode and click “Refresh”
•  May need to re-adjust Node Label text size
•  Experiment with “Curved” edges
labels omitted in slidedeck for privacy
Before we attack the network, save!
Network Resiliency
•  How can we fragment the network or increase the
separation between nodes?
•  Which nodes, if removed/influenced, would most greatly
impact the network?
•  What information have we learned already that could be
used?
Network Resiliency
•  Go to “Data Laboratory” -> sort by “PageRank descending
•  Select top 5 rows and delete them (did you save first!!!)
•  Note their names – Are these people influential in your life? sort
Top 5
Network Resiliency
•  Go back to statistics and note the following:
•  Average Degree, Network Diameter, Modularity, Connected
Components, Average Path Length
•  Also note how the network visually has changed
•  Re-run the statistics above and note how the numbers
changed
•  Did you successfully fragment the network (did # of connected
components increase)? (disrupting communications)
•  How many nodes do you think you’d have to remove if you
removed by lowest PageRank scores first? (robustness of network)
•  What if links represented load distributed across network? How
would the network load change after removing these key nodes?
(cascading failure)
Review
•  Network Analysis – Crash Course
•  Degree
•  Components
•  Modularity
•  Ranking
•  Resiliency
•  Gephi – Intro
•  Loading data (Facebook)
•  Navigation
•  Statistics
•  Exporting
•  Filtering
•  Resiliency
Questions?

More Related Content

What's hot

Social network analysis intro part I
Social network analysis intro part ISocial network analysis intro part I
Social network analysis intro part ITHomas Plotkowiak
 
Social network analysis part ii
Social network analysis part iiSocial network analysis part ii
Social network analysis part iiTHomas Plotkowiak
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis WorkshopData Works MD
 
The Basics of Social Network Analysis
The Basics of Social Network AnalysisThe Basics of Social Network Analysis
The Basics of Social Network AnalysisRory Sie
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Xiaohan Zeng
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social MediaSymeon Papadopoulos
 
Network centrality measures and their effectiveness
Network centrality measures and their effectivenessNetwork centrality measures and their effectiveness
Network centrality measures and their effectivenessemapesce
 
Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewSatyaki Sikdar
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networksFrancisco Restivo
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network AnalysisPatti Anklam
 
Social Media Mining - Chapter 6 (Community Analysis)
Social Media Mining - Chapter 6 (Community Analysis)Social Media Mining - Chapter 6 (Community Analysis)
Social Media Mining - Chapter 6 (Community Analysis)SocialMediaMining
 
CS6010 Social Network Analysis Unit III
CS6010 Social Network Analysis   Unit IIICS6010 Social Network Analysis   Unit III
CS6010 Social Network Analysis Unit IIIpkaviya
 
Social Network Analysis power point presentation
Social Network Analysis power point presentation Social Network Analysis power point presentation
Social Network Analysis power point presentation Ratnesh Shah
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisSujoy Bag
 
Social Network Analysis (SNA) Made Easy
Social Network Analysis (SNA) Made EasySocial Network Analysis (SNA) Made Easy
Social Network Analysis (SNA) Made EasyJeff Mohr
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISrathnaarul
 
Social Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online CommunitiesSocial Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online Communitiessubby88
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network AnalysisMichel Bruley
 

What's hot (20)

Social network analysis intro part I
Social network analysis intro part ISocial network analysis intro part I
Social network analysis intro part I
 
Social network analysis part ii
Social network analysis part iiSocial network analysis part ii
Social network analysis part ii
 
Social Network Analysis Workshop
Social Network Analysis WorkshopSocial Network Analysis Workshop
Social Network Analysis Workshop
 
3 Centrality
3 Centrality3 Centrality
3 Centrality
 
The Basics of Social Network Analysis
The Basics of Social Network AnalysisThe Basics of Social Network Analysis
The Basics of Social Network Analysis
 
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
 
Community Detection in Social Media
Community Detection in Social MediaCommunity Detection in Social Media
Community Detection in Social Media
 
Network centrality measures and their effectiveness
Network centrality measures and their effectivenessNetwork centrality measures and their effectiveness
Network centrality measures and their effectiveness
 
Community Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief OverviewCommunity Detection in Social Networks: A Brief Overview
Community Detection in Social Networks: A Brief Overview
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networks
 
06 Community Detection
06 Community Detection06 Community Detection
06 Community Detection
 
Introduction to Social Network Analysis
Introduction to Social Network AnalysisIntroduction to Social Network Analysis
Introduction to Social Network Analysis
 
Social Media Mining - Chapter 6 (Community Analysis)
Social Media Mining - Chapter 6 (Community Analysis)Social Media Mining - Chapter 6 (Community Analysis)
Social Media Mining - Chapter 6 (Community Analysis)
 
CS6010 Social Network Analysis Unit III
CS6010 Social Network Analysis   Unit IIICS6010 Social Network Analysis   Unit III
CS6010 Social Network Analysis Unit III
 
Social Network Analysis power point presentation
Social Network Analysis power point presentation Social Network Analysis power point presentation
Social Network Analysis power point presentation
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Social Network Analysis (SNA) Made Easy
Social Network Analysis (SNA) Made EasySocial Network Analysis (SNA) Made Easy
Social Network Analysis (SNA) Made Easy
 
NE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSISNE7012- SOCIAL NETWORK ANALYSIS
NE7012- SOCIAL NETWORK ANALYSIS
 
Social Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online CommunitiesSocial Network Analysis To Blog Based Online Communities
Social Network Analysis To Blog Based Online Communities
 
Big Data: Social Network Analysis
Big Data: Social Network AnalysisBig Data: Social Network Analysis
Big Data: Social Network Analysis
 

Viewers also liked

Social Network Analysis & an Introduction to Tools
Social Network Analysis & an Introduction to ToolsSocial Network Analysis & an Introduction to Tools
Social Network Analysis & an Introduction to ToolsPatti Anklam
 
Social Networks and Social Capital
Social Networks and Social CapitalSocial Networks and Social Capital
Social Networks and Social CapitalGiorgos Cheliotis
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...Jeromy Anglim
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Lauri Eloranta
 
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Lauri Eloranta
 
Facebook Network Analysis using Gephi
Facebook Network Analysis using GephiFacebook Network Analysis using Gephi
Facebook Network Analysis using GephiSarah Joy Murray
 
Social Network Analysis in R
Social Network Analysis in RSocial Network Analysis in R
Social Network Analysis in RIan Cook
 
Social Network Analysis With R
Social Network Analysis With RSocial Network Analysis With R
Social Network Analysis With RDavid Chiu
 
Social network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and moreSocial network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and moreWael Elrifai
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network AnalysisWael Elrifai
 
Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...
Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...
Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...Lauri Eloranta
 
Social Network Analysis and Visualization
Social Network Analysis and VisualizationSocial Network Analysis and Visualization
Social Network Analysis and VisualizationAlberto Ramirez
 
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...Lauri Eloranta
 
Social Network Analysis, Semantic Web and Learning Networks
Social Network Analysis, Semantic Web and Learning NetworksSocial Network Analysis, Semantic Web and Learning Networks
Social Network Analysis, Semantic Web and Learning NetworksRory Sie
 
A comparative study of social network analysis tools
A comparative study of social network analysis toolsA comparative study of social network analysis tools
A comparative study of social network analysis toolsDavid Combe
 
Social Network Visualization 101
Social Network Visualization 101Social Network Visualization 101
Social Network Visualization 101librarianrafia
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Lauri Eloranta
 

Viewers also liked (20)

Social Network Analysis & an Introduction to Tools
Social Network Analysis & an Introduction to ToolsSocial Network Analysis & an Introduction to Tools
Social Network Analysis & an Introduction to Tools
 
Social Networks and Social Capital
Social Networks and Social CapitalSocial Networks and Social Capital
Social Networks and Social Capital
 
How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...How to conduct a social network analysis: A tool for empowering teams and wor...
How to conduct a social network analysis: A tool for empowering teams and wor...
 
I Social Network
I Social NetworkI Social Network
I Social Network
 
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
 
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
Ethical and Legal Issues in Computational Social Science - Lecture 7 in Intro...
 
Facebook Network Analysis using Gephi
Facebook Network Analysis using GephiFacebook Network Analysis using Gephi
Facebook Network Analysis using Gephi
 
Social Network Analysis in R
Social Network Analysis in RSocial Network Analysis in R
Social Network Analysis in R
 
Social Network Analysis With R
Social Network Analysis With RSocial Network Analysis With R
Social Network Analysis With R
 
Social network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and moreSocial network analysis & Big Data - Telecommunications and more
Social network analysis & Big Data - Telecommunications and more
 
Social Network Analysis
Social Network AnalysisSocial Network Analysis
Social Network Analysis
 
Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...
Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...
Basics of Computation and Modeling - Lecture 2 in Introduction to Computation...
 
Social Network Analysis and Visualization
Social Network Analysis and VisualizationSocial Network Analysis and Visualization
Social Network Analysis and Visualization
 
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...Simulation in Social Sciences -  Lecture 6 in Introduction to Computational S...
Simulation in Social Sciences - Lecture 6 in Introduction to Computational S...
 
Graph Evolution Models
Graph Evolution ModelsGraph Evolution Models
Graph Evolution Models
 
DREaM Event 2: Louise Cooke
DREaM Event 2: Louise CookeDREaM Event 2: Louise Cooke
DREaM Event 2: Louise Cooke
 
Social Network Analysis, Semantic Web and Learning Networks
Social Network Analysis, Semantic Web and Learning NetworksSocial Network Analysis, Semantic Web and Learning Networks
Social Network Analysis, Semantic Web and Learning Networks
 
A comparative study of social network analysis tools
A comparative study of social network analysis toolsA comparative study of social network analysis tools
A comparative study of social network analysis tools
 
Social Network Visualization 101
Social Network Visualization 101Social Network Visualization 101
Social Network Visualization 101
 
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
Big Data and Data Mining - Lecture 3 in Introduction to Computational Social ...
 

Similar to Social network analysis

Practice discovering biological knowledge using networks approach.
Practice discovering biological knowledge using networks approach.Practice discovering biological knowledge using networks approach.
Practice discovering biological knowledge using networks approach.Elena Sügis
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learningsun peiyuan
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.pptchatbot9
 
Advanced c c++
Advanced c c++Advanced c c++
Advanced c c++muilevan
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.pptcongtran88
 
6.1-Cassandra.ppt
6.1-Cassandra.ppt6.1-Cassandra.ppt
6.1-Cassandra.pptDanBarcan2
 
Lecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdfLecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdfssuser4c50a9
 
Mining the social web 6
Mining the social web 6Mining the social web 6
Mining the social web 6HyeonSeok Choi
 
Dynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theoremDynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theoremGrisha Weintraub
 
Preprocessing
PreprocessingPreprocessing
Preprocessingmmuthuraj
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 TutorialAlexander Pico
 
Storage Systems For Scalable systems
Storage Systems For Scalable systemsStorage Systems For Scalable systems
Storage Systems For Scalable systemselliando dias
 

Similar to Social network analysis (20)

Practice discovering biological knowledge using networks approach.
Practice discovering biological knowledge using networks approach.Practice discovering biological knowledge using networks approach.
Practice discovering biological knowledge using networks approach.
 
network mining and representation learning
network mining and representation learningnetwork mining and representation learning
network mining and representation learning
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
Advanced c c++
Advanced c c++Advanced c c++
Advanced c c++
 
Preprocessing.ppt
Preprocessing.pptPreprocessing.ppt
Preprocessing.ppt
 
6.1-Cassandra.ppt
6.1-Cassandra.ppt6.1-Cassandra.ppt
6.1-Cassandra.ppt
 
6.1-Cassandra.ppt
6.1-Cassandra.ppt6.1-Cassandra.ppt
6.1-Cassandra.ppt
 
Cassandra
CassandraCassandra
Cassandra
 
Lecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdfLecture 5 Decision tree.pdf
Lecture 5 Decision tree.pdf
 
Pathway and network analysis
Pathway and network analysisPathway and network analysis
Pathway and network analysis
 
Birch1
Birch1Birch1
Birch1
 
Mining the social web 6
Mining the social web 6Mining the social web 6
Mining the social web 6
 
Dynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theoremDynamo and BigTable in light of the CAP theorem
Dynamo and BigTable in light of the CAP theorem
 
Preprocessing
PreprocessingPreprocessing
Preprocessing
 
2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial2016 Cytoscape 3.3 Tutorial
2016 Cytoscape 3.3 Tutorial
 
Storage Systems For Scalable systems
Storage Systems For Scalable systemsStorage Systems For Scalable systems
Storage Systems For Scalable systems
 
social.pptx
social.pptxsocial.pptx
social.pptx
 

Recently uploaded

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integrationmarketing932765
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...amber724300
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessWSO2
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...BookNet Canada
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...Karmanjay Verma
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Français Patch Tuesday - Avril
Français Patch Tuesday - AvrilFrançais Patch Tuesday - Avril
Français Patch Tuesday - AvrilIvanti
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 

Recently uploaded (20)

Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS:  6 Ways to Automate Your Data IntegrationBridging Between CAD & GIS:  6 Ways to Automate Your Data Integration
Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
JET Technology Labs White Paper for Virtualized Security and Encryption Techn...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Accelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with PlatformlessAccelerating Enterprise Software Engineering with Platformless
Accelerating Enterprise Software Engineering with Platformless
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Français Patch Tuesday - Avril
Français Patch Tuesday - AvrilFrançais Patch Tuesday - Avril
Français Patch Tuesday - Avril
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 

Social network analysis

  • 1. SOCIAL NETWORK ANALYSIS Caleb Jones { “email” : “calebjones@gmail.com”, “website” : “http://calebjones.info”, “twitter” : “@JonesWCaleb” }
  • 2. Overview •  Network Analysis – Crash Course •  Degree •  Components •  Modularity •  Ranking •  Resiliency •  Gephi – Intro •  Loading data (Facebook) •  Navigation •  Statistics •  Exporting •  Filtering •  Resiliency
  • 3. Resources SNA Coursera Course (next being taught October 2013) Linked by Albert-László Barabási
  • 4. Network Analysis – Crash Course •  Degree (n): The number of connections a node has. •  Node A has in-degree 3 and out-degree 1 •  Node B has degree 4 A B
  • 5. Network Analysis – Crash Course •  Component (n): A a maximally connected subgraph (undirected). •  Giant component is largest component component (giant) component Graph with nodes { A, B, C, X, Y, Z }
  • 6. Network Analysis – Crash Course •  Modularity (n) ~ Division of a graph into communities (modules/classes/cliques) with dense interconnection with the network having relatively sparse interconnection between communities. Community 1 Community 2 Graph with nodes { A, B, C, X, Y, Z }
  • 7. Network Analysis – Crash Course • Ranking: A measure of a node’s “importance” • Many different methods for determining “importance” • Degree, Centrality, Closeness, Betweenness, Eigenvector, HITS, PageRank, Erdös Number • Which one to consider depends on the question being asked • Precursor to identifying network resilience, diffusion, and vulnerability
  • 8. Network Analysis – Crash Course • Degree ranking: Quantity over quality Node Score A 3 B 3 C 1 D 1 X 1 Y 1 Z 3 Q 1
  • 9. Network Analysis – Crash Course • Betweeness Ranking: How frequently a node appears on shortest paths. Node Score A 15 B 11 C 0 D 0 X 0 Y 0 Z 11 Q 0
  • 10. Network Analysis – Crash Course • Closeness Ranking: Average number of hops from a node to rest of network. Node Score A 1.571 B 1.857 C 2.714 D 2.714 X 2.714 Y 2.714 Z 1.857 Q 2.429 Note: Smaller is (usually) better
  • 11. Network Analysis – Crash Course • Eigenvector Ranking: A node’s “influence” on the network (accounts for who you know) Node Score A 1 B 0.836 C 0.392 D 0.392 X 0.392 Y 0.392 Z 0.836 Q 0.465 Google’s PageRank is a variant of this Based on eigenvector of adjacency matrix
  • 12. Network Analysis – Crash Course • Erdös Ranking: Number of hops to specific node (degrees of separation). Node Score A 0 B 1 C 2 D 2 X 2 Y 2 Z 1 Q 1 Note: Smaller is (usually) better What if “Erdös” is an influential CEO? What if “Erdös” has bird flu? Erdös
  • 13. Network Analysis – Crash Course • Erdös Ranking: Number of hops to specific node (degrees of separation). Node Score A 2 B 1 C 2 D 0 X 4 Y 4 Z 3 Q 3 Note: Smaller is (usually) better What if “Erdös” is an influential CEO? What if “Erdös” has bird flu? Erdös
  • 14. Network Analysis – Crash Course • Limitations: • Only considered undirected networks (directed is more complicated) • Treated all edges as equal. Many networks have a weight or cost associated to edges (e.g. distance) • Treated all nodes as equal. A node’s importance may be inherent based on attributes separate from its position in network (e.g. dating sites)
  • 15. Network Analysis – Crash Course • Resiliency (removing nodes/links): • Target nodes based on their “importance” • High degree nodes more likely to affect local communities • High betweeness/Eigenvector nodes more likely to fragment communities
  • 16. Gephi Introduction •  Platform for visualizing and analyzing networks •  https://gephi.org/ •  Cross-platform •  Plugin model
  • 17. Facebook Dataset •  Download your data (gml) •  http://snacourse.com/getnet/ •  Import into Gephi •  File -> Open -> Select downloaded .gml file •  Choose “undirected” for “Graph Type”
  • 19. Partitioning Communities 1.  Statistic -> Modularity -> Run (use defaults) 2.  Partition -> Nodes (refresh) -> Modularity class -> Apply
  • 20. Degree Distribution 1.  Statistic -> Average Degree -> Run 2.  Partition -> Nodes (refresh) -> Modularity class -> Apply Lots of nodes with few connections Only a few with a large number of connections Power law distribution?
  • 21. Node Ranking by Degree 1.  Ranking -> Nodes (refresh) -> Degree -> Apply (try tweaking min/max size and Spline for desired emphasis)
  • 22. Filtering Isolated Nodes (“noise”) 1.  Statistics -> Connected Components -> Run 2.  Filters -> Attributes -> Partition Count -> Component ID 3.  Drag “Component ID” down into “Queries” section 4.  Click on “Partition Count”, slide the settings bar, and click “Filter” – adjust to remove isolated nodes Can be important step when dealing with very large data sets. Depending on degree distribution, filter can be set quite high.
  • 23. Re-adjust after Filtering • Need to re-run previous steps to refresh calculated values now that filtering has been done. • Statistics -> Average degree, modularity, connected components •  How did these numbers change? • Re-partition node color by modularity class now that modularity has been recalculated • Run Fruchterman Reingold layout again to fill space left over from filtered nodes
  • 24. Have you saved yet!?
  • 25. Node Ranking by Centrality 1.  Statistics -> Network Diameter -> Run 2.  Ranking -> Betweeness Centrality -> Apply
  • 26. Erdös Number •  You may have noticed a key node which both has the highest degree and betweeness ranking. •  Click on the “Edit” button and select that node (note the name) •  Statistics -> Erdös Number -> Select that name -> OK •  What will happen if you select a less conspicuous node?
  • 27. Data Lab •  Go to “Data Laboratory” •  All node information as well as calculated statistics appear here in a spreadsheet. •  Sort by “Erdös Number” (descending) •  What is the largest Erdös Number? N degrees of ________ . •  Try sorting by other values (degree, closeness, betweeness) Max is 7 degrees of separation
  • 28. Node Ranking by Eigenvector Centrality 1.  Statistics -> Eigenvector Centrality -> Run 2.  Ranking -> Eigenvector Centrality -> Apply
  • 29. Node Ranking by PageRank 1.  Statistics -> PageRank -> Run 2.  Ranking -> PageRank -> Apply
  • 30. Export to Image •  Go to “Preview” mode •  Click “Refresh” to see what you have now •  Add node labels •  “Node Labels” -> “Show Labels” •  Adjust font size to avoid label overlapping •  If Node Labels are overlapping, try expanding layout •  Back to “Overview” -> Layout -> Fruchterman Reingold •  Increase the “Area” parameter and re-run the layout •  Then go back to “Preview” mode and click “Refresh” •  May need to re-adjust Node Label text size •  Experiment with “Curved” edges
  • 31. labels omitted in slidedeck for privacy
  • 32. Before we attack the network, save!
  • 33. Network Resiliency •  How can we fragment the network or increase the separation between nodes? •  Which nodes, if removed/influenced, would most greatly impact the network? •  What information have we learned already that could be used?
  • 34. Network Resiliency •  Go to “Data Laboratory” -> sort by “PageRank descending •  Select top 5 rows and delete them (did you save first!!!) •  Note their names – Are these people influential in your life? sort Top 5
  • 35. Network Resiliency •  Go back to statistics and note the following: •  Average Degree, Network Diameter, Modularity, Connected Components, Average Path Length •  Also note how the network visually has changed •  Re-run the statistics above and note how the numbers changed •  Did you successfully fragment the network (did # of connected components increase)? (disrupting communications) •  How many nodes do you think you’d have to remove if you removed by lowest PageRank scores first? (robustness of network) •  What if links represented load distributed across network? How would the network load change after removing these key nodes? (cascading failure)
  • 36. Review •  Network Analysis – Crash Course •  Degree •  Components •  Modularity •  Ranking •  Resiliency •  Gephi – Intro •  Loading data (Facebook) •  Navigation •  Statistics •  Exporting •  Filtering •  Resiliency