June 2017

Big Data Analysis for Social Scientists Masterclass - National Chengchi University, Taipei


Note: "vosonSML" is the new name for the R package that was previously called "SocialMediaLab". These archived documents may contain materials (e.g. R scripts) that refer to the old package. For up-to date references please visit the Training workshops page or the vosonSML page.

This page provides resources for participants in the Big Data Analysis for Social Scientists 1-day course that will be held at National Chengchi University, Taipei on 26 June 2017.  The course will be taught by Robert Ackland (Australian National University).

If you would like to start looking at related content, try the SocialMediaLab page. Also, a 5-day course run earlier this year will give you some idea about what we will cover in the course (we will cover less, as we only have one day).

SocialMediaLab collects Twitter, YouTube and Facebook* data from the respective free APIs. If you want to collect data you will need to arrange API access. You can find details on how to get API access here.

Slides

VOSON Lab Introduction and SocialMediaLab

R scripts:

  • Introduction to R and RStudio: Rmd, pdf
  • SocialMediaLab Tutorial - Data Collection: Rmd, pdf
  • Introduction to SNA with igraph: Rmd, pdf
  • Analysing the "Divided They Blog" network: Rmd, pdf
  • Introduction to Text Analysis in R - Environmental Activist Websites: Rmd, pdf
  • 1-mode projections from 2-mode graphs (Facebook): using igraph, using Jaccard threshold

Datasets

  • Facebook data collection:
    • 2015-06-23_to_2015-07-23_StopCoalSeamGasBlueMountains_FacebookData.csv
    • g_bimodal_facebook_csg.graphml
  • Twitter collection:
    • Jun_26_03_02_01_2017_AEST_auspol_TwitterData.csv
    • g_twitter_actor.graphml
  • YouTube collection:
    • Jun_26_03_12_16_2017_AEST_YoutubeData.csv
    • g_youtube_actor.graphml
  • For Introduction to SNA with igraph: Sat_Jun_27_13-04-34_2015_EST_YoutubeActorNetwork_FiveFracking.graphml
  • For Introduction to Text Analysis in R: nano2seeds_v2.csv
  • For exercises:
    • Divided They Blog (40 A-listers) hyperlink network:
      edges, vertices
    • Abortion debate WWW hyperlink network csv files: edges, vertices
    • Abortion debate WWW hyperlinks and meta keywords

Exercises