Last week, I shared that Dan Katz and I had finally published a draft of our paper, Measuring t...
Measuring the Complexity of the Law: The U.S. Code
 Four years ago, Dan Katz and I began working on a project to measure the complexity of the law. Â...
Building an AWS CloudSearch domain for the Supreme Court
 It should be pretty clear by now that two things I’m very interested in are cloud computing...
Updates to data and statistics on Congressional bill complexity
 When I put together my original post on the length and complexity of Congressional bills, I was h...
Now in print: An Empirical Survey of the Population of U.S. Tax Court Written Decisions
When someone brings up the empirical study of legal citation, most people think of the w...
Upcoming post series: Building a better legal search engine
Later this month, I’ll be giving a keynote at a meeting on Law and Computation at ...
Pre-processing text: R/tm vs. python/NLTK
Let’s say that you want to take a set of documents and apply a computational lingu...
Dataset: 5 Days of #25bahman
What do 88,831 tweets about protest and revolution in Iran look like? Â Following in the success of ...
Twitter Hashtag Battle Royale – #(feb|fev)12 vs. #12(feb|fev)
Algeria and Yemen seem to be pushing a #feb12 revolution hashtag like Egypt’s #jan25 tag. In t...