Google was granted a new patent involving pagerank last week, which appears to focus upon a paper from April of 2003, Adaptive Methods for the Computation of PageRank.
The paper does a nice job of describing what they were aiming at with this patent – a faster way of assigning query-independent rankings of value to pages on the web, based upon links to those pages. This work doesn’t aim at changing rankings, or determinations of relevance of web pages, but is instead aimed at making the computational elements of calculating rankings cheaper and faster.
While the methods and processes described in the patent aren’t really anything new, it’s interesting to see how the ideas in the paper can be implemented as part of a search engine. The patent provides a nice overview of how a search engine functions, covering topics such as crawling, filtering, indexing, index partitioning, caching, mapping links, and serving pages.
It then dives into the method they have devised to take advantage of the observation they made that some pages take less time, and less recalculations of pagerank to reach their final pagerank. As the authors note in the paper:
That is, many pages converge quickly, with a few pages taking much longer to converge. Furthermore, the pages that converge slowly are generally those pages with high PageRank.
The patent cited as references a number of other papers and patents, and I’ve linked to those below if you would like to follow the evolution of the work that helped lead to this patent and paper. While the patent mentions personalization, it doesn’t cover the topic in much depth. It also doesn’t address such things as historical data, seasonality, burstiness, user behavior, and other areas which we’ve seen included in other papers and patent applications from academia and patent filings.
Adaptive computation of ranking
Inventors: Sepandar D. Kamvar, Taher Haveliwala, and Gene H. Golub
Assigned to Google Inc.
US Patent 7,028,029
Granted April 11, 2006
Filed: August 23, 2004
Abstract
A system and method is disclosed in which a ranking function for a set of document rank values is iteratively solved with respect to a set of linked documents until a first stability condition is satisfied. After such condition is satisfied, some of the ranks will have converged. The ranking function is modified to take into account these converged ranks so as to reduce the ranking function’s computation cost. The modified ranking function is then solved until a second stability condition is satisfied. After such condition is satisfied more of the ranks will have converged. The ranking function is again modified and process continues until complete.
The patent also notes that it is related to Methods for ranking nodes in large directed graphs, published February 10, 2005, and invented by Sepandar D. Kamvar, Taher Haveliwala, Glen Jeh, and Gene H. Golub
U.S. patents and patent applications cited:
- Method and system for identifying authoritative information resources in an environment with content-based links between information resources
Patent number 6112202
Granted August 2000
Inventor Jon Michael Kleinberg - Method for node ranking in a linked database
Patent number 6285999
Granted September 2001
Inventor Lawrence Page - Method and apparatus for ranking Web page search results
Patent number 6560600
Granted May 2003
Inventor Andrei Z. Broder - Method and apparatus to retrieve information from a network
Patent number 6584468
Granted June 2003
Inventors Kaigham J. Gabriel, Evan M. Indianer, Christopher M. Umbel, and Joel Lenhart - Method for scoring documents in a linked database
Patent number 6799176
Granted September 2004
Inventor Lawrence Page - Method and apparatus for ranking web page search results
Patent number 6871202
Granted March 2005
Inventor Andrei Z. Broder - System and method for rapid computation of PageRank
Published October 2003
Inventors John Anthony Tomlin, Andrew S. Tomkins, and Arvind Arasu - Directory services searching system and methods
Published November 2003
Inventor Richard Hans Harvey - Systems and methods of retrieving relevant information
Published, November 2003
Inventors Brian S. Kim, Sudong Chung, Anurag Dod, Michael Kim, and Yeogirl Yun - Method and apparatus for search ranking using human input and automated ranking
Published February 2004
Inventors Udi Manber and Chi-Chao Chang - Method and apparatus for ranking web page search results
Published June 2004
Inventor Andrei Z. Broder
Other cited references:
- PageRank Computation and the Structure of the Web: Experiments and Algorithms (pdf)
- Improved Algorithms for Topic Distillation in a Hyperlinked Environment
- Efficient Computation of PageRank
- Topic Sensitive PageRank
- The Second Eigenvalue of the Google Matrix
- Scaling Personalized Web Search
- Exploiting the Block Structure of the Web for Computing PageRank
- Extrapolation Methods for Accelerating PageRank Computations
- The PageRank Citation Ranking: Bringing Order to the Web
[...] You did meet Bill at the conferences to talk about random [Google ranking patents] stuff [...]
[...] In April of 2006, I wrote a post on Google’s adaptive pagerank patent, which looks at Adaptive computation of ranking. It appears to focus upon the processes described in the “Adaptive Methods for the Computation of PageRank” that I also link to above. [...]