Statistical Methods for Studying Literature Using R
R is a powerful programming language for statistical analysis and visualization that can be used for many applications in the digital humanities. As with any programming language, getting started with R involves a steep initial learning curve before it produces useful results. In its current form, this blog contains the notes from a hands-on workshop that I initially ran at the University of Kansas’s Digital Humanities Forum/THATCamp Representing Knowledge in the Digital Humanities in September of 2011 and expanded with a more literary focus at the University of Kansas 2012 Digital Humanities Forum. It was further revised for an additional workshop at the University of Iowa Obermann Center for Advanced Studies in the fall of 2014. The purpose of these workshops was to introduce the R environment, describe data structures in R, show ways to format data about literary texts for statistical analysis, and provide practical examples of ways to use R to answer questions about literature.