Projects
Record Linkage and the Downstream Task
Linking data from multiple databases increases both the size and scope of a dataset, enabling post-processing tasks such as linear regression or capture-recapture to be performed. This work focused on novel approaches to multiple downstream tasks that allow linkage errors to propagate through the analyses and still provide accurate inference.
Restricted Boltzmann Machines
Steps toward a thorough understanding of the RBM model class and its behavior from the perspective of statistical theory, and an exploration of the possibility of a rigorous fitting methodology via MCMC.
[paper][paper][shiny apps][github]
Protoshiny
Shiny app to interactively visualize large dendrograms resulting from hierarchical clustering with prototypes. Joint work with Jacob Bien.
[github]
NCS Dataviz
A graphical tool that allows users to understand variable relationships within the 2012 NCS Vanguard Study dataset. Built using D3. Joint work with Yongeng Lin from NORC.
CommuniD3
Exploratory tool based on 'Soul of the Community' data generated by the Knight Foundation in cooperation with Gallup. Winner of the 2013 Data Exposition. Created using Shiny and D3. Joint work with Eric Hare.
Election 2012
Examined campaign contribution and political action committee spending data from the Federal Election Commission and interpreted them for the 2012 election cycle. Joint work with Di Cook, Heike Hofmann, Eric Hare, and Susan VanderPlas.
[paper][supplement][site][github]