Tracking the Frequency of Twitter Hashtags with R

I’ve posted three examples of Twitter hashtag datasets in the last week: one on China, one on Iran, and one on Algeria. To build these datasets, I needed to obtain older tweets; this is slightly more difficult than simply filtering the streaming feed for your hashtag of choice. The original code I wrote

February 21st, 2011 | Programming | 7 Comments

Dataset: Tweets from the Chinese Protests #cn220

Earlier this week, I posted a ~100k tweet dataset on the #25bahman protests in Iran. The corresponding figure of frequencies showed a strong presence on Twitter, with over 500 tweets per 5-minute period at peak. You can download the dataset or check out the figure in that post. I decided to take a quick

February 20th, 2011 | Programming, Society | 0 Comments
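
The "tweets per 5 minute period" figure mentioned in the excerpt above comes down to bucketing tweet timestamps into fixed windows and counting each bucket. The following is only a minimal sketch of that idea in Python, not the code behind the original posts; the function name `count_per_window` and the sample timestamps are hypothetical.

```python
from collections import Counter
from datetime import datetime, timedelta


def count_per_window(timestamps, minutes=5):
    """Count how many timestamps fall into each fixed-width window."""
    width = timedelta(minutes=minutes)
    counts = Counter()
    for ts in timestamps:
        # Floor the timestamp to the start of the window that contains it.
        offset = (ts - datetime.min) % width
        counts[ts - offset] += 1
    # Return the windows in chronological order.
    return dict(sorted(counts.items()))


# Hypothetical timestamps for illustration; not taken from any dataset.
sample = [
    datetime(2011, 2, 20, 14, 1, 10),
    datetime(2011, 2, 20, 14, 3, 59),
    datetime(2011, 2, 20, 14, 5, 0),
    datetime(2011, 2, 20, 14, 12, 30),
]
windows = count_per_window(sample)
for start, n in windows.items():
    print(start.isoformat(), n)
```

Windows with no tweets do not appear in the result, so a plot built from it would need the gaps filled with zeros first.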

R Bloggers: The Site I Wish Existed in 2007

My first experience with R was in 2007 as a sophomore in undergrad. As part of a larger project on pricing day-ahead electricity futures, I wanted to cluster locational marginal price (LMP) data from the ISO-NE. Something like k-means is easy to plot and visualize in low dimensions, but this data was better approached by hierarchical methods.

February 19th, 2011 | Programming | 1 Comment

Most Contacted HBGary Emails and Domains

You may have heard about the recently leaked presentation on combating WikiLeaks that was produced by employees of HBGary Federal, Palantir Tech, and Berico Tech. You may have also heard that Anonymous retaliated against HBGary Federal for threatening to release their identities. I thought it would be interesting to run some analysis of the email networks

February 19th, 2011 | Programming, Technology | 0 Comments

Pre-processing text: R/tm vs. python/NLTK

Let's say that you want to take a set of documents and apply a computational linguistic technique. If your method is based on the bag-of-words model, you probably need to pre-process these documents first by segmenting, tokenizing, stripping, stopwording, and stemming each one (phew, that's a lot of -ing's). In the past, I've relied

February 16th, 2011 | Programming | 15 Comments
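
The pre-processing steps named in the excerpt above (segmenting, tokenizing, stripping, stopwording, stemming) can be sketched with nothing but the Python standard library. This is an illustration under stated assumptions, not the R/tm or python/NLTK code the post compares: the stopword list is a tiny hypothetical sample, and `crude_stem` is a toy suffix stripper standing in for a real stemmer such as Porter's.

```python
import re

# Tiny illustrative stopword list (an assumption); real lists are much longer.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "is", "are", "in"}


def crude_stem(token):
    """Toy suffix stripper for illustration; NOT the Porter algorithm."""
    for suffix in ("ing", "ed", "es", "s"):
        # Only strip when at least three characters of stem remain.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(document):
    """Turn raw text into a list of bag-of-words tokens."""
    # Segment: split into sentences after terminal punctuation.
    sentences = re.split(r"(?<=[.!?])\s+", document.strip())
    tokens = []
    for sentence in sentences:
        # Tokenize and strip: lowercase, keep only alphabetic runs.
        words = re.findall(r"[a-z]+", sentence.lower())
        # Stopword removal, then stemming of what is left.
        tokens.extend(crude_stem(w) for w in words if w not in STOPWORDS)
    return tokens


tokens = preprocess("The cats are sleeping. Dogs barked!")
print(tokens)  # ['cat', 'sleep', 'dog', 'bark']
```

A real pipeline would swap in a proper sentence segmenter, a full stopword list, and an established stemmer; the order of the steps is the part this sketch is meant to show.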
