Summarizing data with dplyr

This example uses the dplyr package. To run it on your own computer, you may first need to install the package from CRAN with install.packages("dplyr"). (The package is already installed in the notebook container.)


library(dplyr)

Attaching package: ‘dplyr’

The following objects are masked from ‘package:stats’:

    filter, lag

The following objects are masked from ‘package:base’:

    intersect, setdiff, setequal, union
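
The masking messages mean that, once dplyr is attached, unqualified calls such as filter() or union() resolve to the dplyr versions. The originals remain reachable through their namespace prefix. A minimal sketch, assuming a small made-up vector `x` and data frame `df` that are purely illustrative and not part of the data used here:

```r
library(dplyr)

# Illustrative data, not part of the BES data set
x  <- c(1, 2, 3, 4, 5)
df <- data.frame(x = x)

# stats::filter() applies a linear filter to a series;
# here a centred 3-point moving average (the two ends are NA)
ma <- stats::filter(x, rep(1/3, 3))

# dplyr::filter() selects rows of a data frame
big <- dplyr::filter(df, x > 3)

# The base set operations are still available by namespace
u <- base::union(c(1, 2), c(2, 3))
```

Writing the prefix explicitly is mainly useful in scripts that need both versions; in the rest of this example the dplyr versions are the ones intended.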

Here we use data from the British Election Study 2010. The data set bes2010feelings.RData was prepared from the original data by removing identifying information and scrambling the data.


# A convenience function
Mean <- function(x,...) mean(x,na.rm=TRUE,...)

bes2010feelings %>% group_by(wave,region) %>%
`summarise()` regrouping output by 'wave' (override with `.groups` argument)

  wave region   Brown    Cameron  Clegg    N
1 Pre  England  4.092674 5.284810 4.618690 1159
2 Pre  Scotland 5.395000 4.502591 4.405229  207
3 Pre  Wales    4.328244 4.774194 4.592233  132
4 Pre  NA       4.507143 4.929870 4.426573  437
5 Post England  4.140990 5.441454 5.160313 2175
6 Post Scotland 5.510769 4.539075 4.513793  665
7 Post Wales    4.307692 4.855895 4.814480  235
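
If the data file is not at hand, the same kind of grouped summary can be tried on a small made-up data frame. The following is only a sketch under stated assumptions: the column names `flng_brown` and `flng_cameron` and all values are hypothetical and are not taken from bes2010feelings. It illustrates how Mean() ignores missing values within each group, how n() counts the rows per group, and how the `.groups` argument mentioned in the message above controls the grouping of the result.

```r
library(dplyr)

# The convenience function from the text: mean() that drops NAs
Mean <- function(x, ...) mean(x, na.rm = TRUE, ...)

# Hypothetical toy data; names and values are made up for illustration
toy <- data.frame(
  wave         = c("Pre", "Pre", "Pre", "Post", "Post", "Post"),
  region       = c("England", "England", "Wales", "England", "Wales", "Wales"),
  flng_brown   = c(4, NA, 6, 5, 3, 7),
  flng_cameron = c(5, 7, 3, 6, NA, 2)
)

res <- toy %>%
  group_by(wave, region) %>%
  summarise(
    Brown   = Mean(flng_brown),    # group mean, NAs ignored
    Cameron = Mean(flng_cameron),
    N       = n(),                 # number of rows in the group
    .groups = "drop"               # return an ungrouped result, no message
  )

print(res)
```

Setting `.groups = "drop"` is one possible choice; leaving it out keeps the result grouped by `wave`, which is what the message reports.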

