BlogScope Offers Unique Ways to Search the Blogosphere (blogspot for now)

One of the papers presented at the WWW 2007 involves the efforts of researchers at the University of Toronto to provide some interesting ways to search blogs – BlogScope: Spatio-temporal Analysis of the Blogosphere

The application described is online in “preview” form – BlogScope. It’s presently tracking 9.90 million blogs with 76.78 million posts from Blogspot.

In addition to searching these Blogspot posts, it provides some interesting ways of looking at information in its index that we really aren’t seeing from any other blog search engine:

Popularity Curve – provides the ability to visualize the popularity of query terms as a function of time

Burst Detection – identifies and marks bursts–interesting events –on popularity curve in red.

Information Analysis – enables users to zoom in various parts of popularity curve, and can be restricted to a time interval, which lets you view bursts and evolution of various topics.

Correlation Discovery – shows in search results a list of keywords closely associated with the query terms over the selected time window.

Geo Locator – the search results for a query can be visualized on a world map.

I liked the Geo Search a lot. As they tell us in the paper about what they are trying to do:

The analysis paradigm that BlogScope facilitates is segmented in four steps. BlogScope identifies what is ‘interesting’, when it was ‘interesting’, why it is ‘interesting’, and where it is ‘interesting’.

It would be great to see this expanded to blogs outside of Blogspot. I hope that some of the other folks in the blog search space are paying attention. Some great ideas in here.

An expanded paper from a couple of the inventors – Searching the Blogosphere provides some more insight into efforts being made by the team behind this blog search engine.

Share

5 thoughts on “BlogScope Offers Unique Ways to Search the Blogosphere (blogspot for now)”

  1. Some very, very interesting techniques used there. I suppose having a corpus that consists of just one source makes it more technically feasible to achieve this, but you have to imagine that similar functionality could be built into any of the big blog search properties (thinking technorati or feedburner?).

    The UI is a bit cluttered (IMO), but this is a very refreshing way to search through result sets.

    Rgds
    Richard

  2. Hi Richard,

    Limiting yourself to one data source at first does makes sense for a research project.

    It becomes a whole different project when you work on adding other data sources.

    The additional types of searches are pretty interesting. Hopefully some of the other blog searches are paying attention.

  3. The curve is interesting.

    The geographic results would be awesome, if you could then click upon a point on the map, and see the results tied to those locations.

Comments are closed.