I first heard about Python pandas from a friend at RenTech or AQR in the early summer of last yea...
Visualization of Reading Level Frequency by Congressional Bill Stage
Here’s a fun example of how you might use my data on Congressional bill length and complexi...
Installing AWS Cloud Search Command Line Tools
In case you’re too lazy to dig into the full CloudSearch developer guide, here’s a qu...
Natty Narwhal on the Precision M4600
Since there are always questions of support for newly released models, I thought I’d put up a ...
Upcoming post series: Building a better legal search engine
Later this month, I’ll be giving a keynote at a meeting on Law and Computation at ...
Dataset: Wisconsin Union Protester Tweets #wiunion
I’ve been playing with Twitter data over the last week, archiving Algerian, Egyptian, Ira...
Dataset: Tweets from the Chinese Protests #cn220
Earlier this week, I posted a ~100k tweet dataset on the #25bahman protests in Iran. The corre...
R Bloggers: The Site I Wish Existed in 2007
My first experience with R was in 2007 as a sophomore in undergrad. As part of a l...
Most Contacted HBGary Emails and Domains
You may have heard about the recently leaked presentation on combating Wikileaks that was produc...
Plotting a Revolution: Time Series Comparison of #feb12 vs. #fev12
I wondered yesterday whether one of the Algeria/Yemen hashtags would dominate. In order ...