Livejournal Blog Analysis



Mar 24, 2008

In the Spring of 2008, I set out to take the pulse of the internet. My goal was to examine the ways in which the internet changed on a regular basis. The object of this research

After only a small amount of effort I had created a script that was capable of searching thousands of blogs at a time for an array of about 12 regular expressions each corresponding to a particular type of sentence in which I was interested. The script would run autonomously 4 times each day for an indefinite period of time.

I have visited over 150,000 blogs to date and have collected close to 400,000 distinct feelings (among many many other things). It is difficult to determine what exactly to do with such information, but I do plan on doing an extensive visualization at some point.

To date the most I've done with this dataset was to simply plot the linkages between individual blogs in GraphViz. The image for this test is shown at right.

Related files:

Related links:

LiveJournal Network
Direct hyperlinks between thousands of blogs.