• Multiprocessing in Python

  • Intro to data analysis using machine learning (50 min)

  • Attitude analysis and corpus analytics of 1M tweets about feminism (summary)

    We retrieved a corpus of 988,000 tweets containing the words "feminism", "feminist" or "feminists" through Twitter's Search API from January to April 2015. We trained a classifier to label each tweet as pro-feminist, anti-feminist or neither (regardless of sentiment), and identified the words most characteristic of each group using the log-likelihood keyness method.

  • Attitude analysis and corpus analytics of 1M tweets about feminism

    We retrieved a corpus of 988,000 tweets containing the words "feminism", "feminist" or "feminists" through Twitter's Search API from January to April 2015. We trained a classifier to label each tweet as pro-feminist, anti-feminist or neither (regardless of sentiment), and identified the words most characteristic of each group using the log-likelihood keyness method.

  • Intro to data analysis using machine learning (presentation version)

  • Intro to data analysis using machine learning (standalone version)