This term, I’m teaching Complex Systems 530 – Computer Modeling for Complex Systems a...
Advanced approximate sentence matching in Python
In our last post, we went over a range of options to perform approximate sentence matching in Python...
Fuzzy match sentences in Python
Let’s imagine you have a sentence of interest. Â You’d like to find all occurrences of t...
Isotonic Regressions in scikit-learn
Isotonic regression is a great tool to keep in your repertoire; it’s like weighted least-squ...
Is the Tax Code the longest Title?
  Last week, I shared that Dan Katz and I had finally published a draft of our paper, Measuring t...
Measuring the Complexity of the Law: The U.S. Code
 Four years ago, Dan Katz and I began working on a project to measure the complexity of the law. Â...
Revisiting text processing with R and Python
 Back in 2011, I covered the relative performance difference of the most popular libraries for tex...
Generating SSH config from AWS hosts using boto
 As a consultant and advisor to many firms running on or investigating AWS, I find SSH host and ke...
Git Repository for Congressional Bill Statistics
 After a nice twitter conversation this morning, I finally got the impetus to release the source f...
Summary of community detection algorithms in igraph 0.6
 Based on Launchpad traffic and mailing list responses, Gabor and Tamas will soon be releasing igr...