The Federalist Papers: Author Identification Through K-Means Clustering – JonLuca’s Blog

My goal is to recreate the results found by Mosteller and Wallace through modern statistical methods – namely K-Means clustering and TFIDF.