New Papers at Google Labs

From the nice-to-finally-see-an-update department comes news that Google has updated the list of papers from Googlers over at Google Labs.

The good folks over at ResourceShelf noted the appearance of the new papers earlier today. Sadly, many of the listed papers aren’t available; their entries instead link to Google searches for the names of the documents.

A New Personalized Recommendation System

One paper that is available, and looks to be worth a read, is Retroactive Answering of Search Queries (pdf), from Beverly Yang and Glen Jeh, which was presented at WWW 2006 in Edinburgh, Scotland.

Become Explores Spring Networks to Rank Pages and Avoid Spam

It’s fun to see something interesting come from a search engine that isn’t one of the big names.

A new patent application from Become, Inc. looks at links in a different way: links still play a role in the rankings of pages, but not every link holds the same value.

The summary of the patent application points out some of the problems with on-page factor analysis and link structure analysis. The authors also write about the “artificial web,” which involves the use of scripts to write:

…millions or billions of simple web pages that contain links to a few websites to be promoted. As the number of these artificial web pages can be comparable to that of the major portion of the real Web, the spammers can wield undue influence in manipulating the link structure of the entire Web, thereby affecting the computation of PageRank.

We’ve seen this “artificial web” become a significant issue recently for Google, as reported on Search Engine Watch in Google Yanks Sites 5 Billion Pages After Spam Complaint. Does Become have a solution to this type of problem?
