PRS: Practicing Research Skills

Computational Approaches

in Political and Social

Sciences

Week 3A: Data Analysis / Skills tutorial on network analysis. Mon 17 June

Diliara Valeeva and Eelke Heemskerk

Plan for Week 3

Mon 17 June: Advanced Gephi Tutorial

                     + How to write 'Data Analysis'

 

Wed 19 June: Group meetups

 

Fri 21 June: Group meetups

=> Submitting 'Data Analysis' assignment

 

Plan for today

1. Reflections

2. Skills tutorial on network analysis - 2

3. How to write 'Data Analysis'

 

1. Reflections

2. Skills tutorial on

network analysis - 2

Data on

co-authorship (2015)

Canvas => Modules => Week 3 => 2015_coauthorship.gexf

2 main types of measures

Network-related

Node-related

1. Network Measures

Average Degree

  • Degree is the number of ties a node has to other nodes
  • High degree: nodes have a large number of ties with others in a network 

Graph Density

  • Density indicates how densely nodes are connected in a network
  • = Number of actual ties / Number of potential ties
  • High density: everyone knows each other in a network

Network Diameter

  • Diameter shows how far apart are the most distant nodes  from each other
  • High diameter: nodes are far away, it can take time and effort to reach each other

Betweenness Centrality

  • Betwenness show whether a node obtains "a bridge" positions between others
  • High betweenness: node has access to diverse information and resources, connects non-connected groups

Closeness Centrality

  • Closeness shows how close is the node to all others
  • High closeness: a node can reach others in a network very easily

Modularity

  • Shows the network communities
  • Nodes are in one community if they are densely connected with each other
  • High modularity: dense intra-community, sparse inter-community ties
  • Higher is better

Connected components

  • Nodes are in one component if they are connected only with each other
  • A large number of components: network is highly disconnected

2. Node Measures

Clustering coefficient

  • It is a measure of how complete the neighborhood of a node is
  • The friend of my friend is also my friend" effect
  • High clustering: everyone knows each other

Eigenvector Centrality

  • Shows how a node is connected with other influential nodes
  • High eigenvector: the node has a large number of ties and its neighbors also have a large number of ties

Where to find all the results?

  • Gephi returns reports
  • Network-related measures are also saved in 'Data Laboratory' tab.
  • Export results using "Export Table" and explore in Excel

 

3. Data Analysis

Data Analysis

1. Introduction

 

2. Description and interpretation of the results

 

3. Conclusions

* max 2000 words

Deadline: Friday 21 June, 19.00

1. Introduction

  • Research question(s) and hypothesis(es):   A brief reminder

  • Reformulate if they have changed

  • ​Data and methods: Which dataset(s) and method(s) you used?

2. Results

  • What are your main findings?

  • How would you explain these findings? What do they mean?​ Did you obtain what you expect or not?

 

* Think about the research question

* Return to the literature review if needed

 

 

3. Conclusions

  • A summary of the main findings

 

  • Did you answer your research question or not? Did you confirm your hypothesis(es) or not?

  • What would you recommend for future studies? What can be explored more? What would you like to study more on this topic if you would have more time / better datasets?

Group meetups

Wednesday 19 June

 

Success offline : 9.00 - 9.20

Success online : 9.20 - 9.40

Past and Present: 9.40 - 10.00

Neighbors: 10.10 - 10.30

Key players : 10.30 - 10.50

 

PRS / Week 3A: Data analysis / Skills tutorial on network analysis

By Diliara Valeeva

PRS / Week 3A: Data analysis / Skills tutorial on network analysis

PRS Practicing Research Skills. Week 3A: Sills tutorial on network analysis in Gephi (advanced) and Data Analysis. 17 June, Wednesday.

  • 296