Join the community!
Visit our GitHub or LinkedIn page and join the Tilburg Science Hub community.
Want to change something or add new content? Click the Contribute button!
Let’s modify the analysis
We’re done with the data-preparation
pipeline for now, and turn our
attention to the analysis
part of our project.
Admittedly, there’s not much here yet. Try fiddling around with the files a bit when you proceed to our practice questions and answers below.
Now let’s continue with a couple of modifications. You can directly start working on the practice questions below.
Practice questions and answers
-
Recall the powerful
makefile
we’ve introduced to you a while ago? Well, open themakefile
insrc/analysis/
now and try to understand the steps of this stage of the pipeline! What happens, exactly? -
Let’s open
preclean.R
(e.g., in RStudio). Try to understand what this script does. Then, filter the data only for tweets withpolarity>0
. -
Last, provide some summary statistics (
summary(dt$nwords)
) of the word count, and produce a histogramhist(dt$nwords)
of it in the RMarkdown document.