Isotonic regression is a great tool to keep in your repertoire; it’s like weighted least-squ...
Measuring the Complexity of the Law: The U.S. Code
 Four years ago, Dan Katz and I began working on a project to measure the complexity of the law. Â...
Git Repository for Congressional Bill Statistics
 After a nice twitter conversation this morning, I finally got the impetus to release the source f...
Summary of community detection algorithms in igraph 0.6
 Based on Launchpad traffic and mailing list responses, Gabor and Tamas will soon be releasing igr...
Building Python pandas from development source
 I first heard about Python pandas from a friend at RenTech or AQR in the early summer of last yea...
Grexit stage left: visualizing the online discussion around Greece’s possible Euro exit
 While Tsipras and his Syriza coalition have been busy in Greek parliament, the Internet has been ...
Visualizing the #nonato Twitter hashtag – time series and top users
 The NATO summit is currently being held in Chicago, and, as is typical for NATO or G# summits, th...
“Google” for subpoenaed emails: AWS CloudSearch for eDiscovery
 In the last post on AWS CloudSearch, I provided a tutorial on the creation of a simple CloudSear...
Visualization of Reading Level Frequency by Congressional Bill Stage
 Here’s a fun example of how you might use my data on Congressional bill length and complexi...
Updates to data and statistics on Congressional bill complexity
 When I put together my original post on the length and complexity of Congressional bills, I was h...