Last October, I made a list of 20 Ways Search Engines May Rerank Search Results, which was well-received (thank you!), and it was suggested that I come up with an updated list. There’s also a follow-up post to this one now at Another 10 Ways Search Engines May Rerank Search Results.
When someone searches, the conventional approach a search engine takes is to find pages with the keywords. It may then rank and serve them to combine an information retrieval relevance score for the pages with some importance metric, such as a PageRank Score. There are times when Google may then rerank search results based on other signals getting involved.
This list contains links to several patent applications and a few papers involving ways to rerank search results. Most of these got published after creating my previous reranking search results post.
The approaches in some may overlap a little with some on the previous list in terms of topics covered, but these are new documents discussing how search results might get reranked.
Some of the methods described here may not presently get used, but it might not hurt to think about them.
1. Desktop Search Influenced by the Contents of an Active Window
Granted on Tuesday, this Google patent describes how a search of the Web may get altered based upon an active document, such as a text document or email or IM message, in an open window on a person’s computer at the time that they are searching while using a desktop search application. That is a compelling reason to rerank search results because it is very timely information.
2. Expanded and Adjacent Queries from User Logs
Another recent patent filing from Microsoft looks at user query sessions to log files. It looks at the terms used in a query and accepts aggregated queries of others who have used the same search terms and other words in their questions to create rewritten queries.
It also looks at related queries during sessions (adjacent queries) from other searchers who have searched for the same question. Results from these rewritten and adjacent queries may be used to rerank search results that show up in response to the original query.
3. Social Network Endorsements
Suppose you are a social network member, and the network allows you to rate and endorse web pages and let your friends do the same. In that case, the endorsements made by you and from your friends may cause the results that you see from a search to get reranked, according to the following Google patent application:
4. Personalized Anchor Text Relevance
Links from pages that contain some anchor text that seems related to information found in an explicit or implicit profile of your interests may weigh more heavily in the rankings of pages under the following recently granted Google patent.
Explicit information is information related to something that you have expressly stated that you have some interest in. Implicit information involves information that may get inferred that you are interested in based upon such things as pages that you have bookmarked or visited in the past. In that instance, Google may rerank search results that you, and only you, see.
5. Recognizing Semantically Meaningful Compounds
A search for more than one term may result in a search engine searching for sets of pages that use all of those terms. Treating some of the words within a query as semantically meaningful compounds and may rerank search results based upon pages containing such a compound may mean that more relevant documents are returned to a searcher.
6. Use of Trends and Bursty Topics
From an Ask.com patent filing, fresh, highly topical, and popular content related to a query may make its way into search results and push down other relevant impacts.
Correlated top gainer events can be used to improve the ranking of search engines and predict search trends. This is used for adding freshness to the Web index. Those Web pages that contain fresh topicsâ€ “identified over the stream of newsâ€ “are boosted in ranking for the period of observation. After a certain amount of time (e.g., a week, a month, etc.), if the topic is no longer fresh, the boosting effect is subject to a decay rule.
7. User Distributed Search Results
A Google patent application describes a method of letting people insert search results into their blogs, emails, and instant messages. Reputation scores may get created for people who do this, and the higher their reputation score, the higher that result might rank for a relevant query searched for by someone else. So reputation can cause Google to rerank search results.
8. Advanced Search Users
A Microsoft paper looks at how advanced users of search engines search and browser results to understand how results might improve for all searchers.
9. Dual Trustrank
You used community endorsements and ratings of endorsers and link-based TrustRank in a dual TrustRank process from Yahoo to rerank pages. The idea is that there are members of your social network whom you trust, and if they endorse a page, then it is likely to become more trustworthy.
Couple that with a link analysis approach to finding webspam, and this “dual” method of identifying trust can become used to show a searcher more trustworthy pages.
10. Web Traffic
By looking at real-time, or near real-time web traffic and activity, including search results selections at other search engines, results can get reranked under the methods described in this Ask.com patent application.
11. Different Queries, Similar Results and Selections
A Yahoo patent application looks at query histories for different queries that provide similar results and similar selections amongst the searchers who enter those queries. This may allow a search engine to broaden result sets to include results from those different queries.
- Using matrix representations of search engine operations to make inferences about documents in a search engine corpus
12. Understanding Timely Topics through Alerts
The frequency and timeliness of alert sign-ups
- for different topics could affect this Google patent application rankings.
13, 14, 15. Similar Users with Similar Interest and Their Selections
Three methods of clustering users with similar interests to rerank search results based on those other users selected. There are some similarities under these approaches and some significant differences. But it seemed reasonable when I wrote this to cluster them together.
- Scalable user clustering based on set similarity (Google)
- Augmenting user, query, and document triplets using singular value decomposition (Microsoft)
- Methods and systems for providing a response to a query (IAC Search & Media, Inc.)
16. Paid and Organic Results on the Same Page
The appearance of results in both paid search and organic might cause the organic results to get removed, as described in this Microsoft patent application. Not a major “reranking,” but an interesting one. An instance where you see Microsoft rerank search results by removing one.
17. High Confidence Spelling Corrections
Spelling corrections where the search engine believes with a high degree of confidence that the query included a misspelling may result in pages being included in results that use the correct spelling. The results for what the search engine believes are the misspelling may then get pushed back under this Google patent application.
18. Language Match Between Query and Pages Returned
If the language used in the query doesn’t match the language used on the page being returned (except for English language pages), the page may get moved down in search results:
19. Labels of Custom Search Results
Google may rerank search results to create and use custom search engines on different topics, with labels relevant to the queries used. Rather than pointing at the patent filing for this one, this paper on indexing data structures was pretty interesting:
In the case of Google Co-op, customized search engines can specify query patterns that trigger specific facets as well as provide hints for reranking search results. The annotations that any customized search engine specifies are visible only within the context of that search engine. However, as we start seeing more custom search engines, it would become desirable to point users to different engines that might be relevant.
- Structured Data Meets the Web: A Few Observations (no longer available)
20. Agent Rank
Rankings based upon an author’s reputation, under a system that ranks different parts of pages based upon verifiable authorship of those sections.
I wrote about this way to rerank search results more at Search Engine Land – Google’s Agent Rank Patent Application
Other Factors, like Universal Search
There are probably many other factors that may influence and cause search results to be reranked. I didn’t even include Universal and Blended search results in this list, though I probably could and should have.
Again, some of these reranking methods may be presently getting used, some may be used in the future, and some may not be used at all. It’s even more likely that as we move forward, two people performing the same search in different locations, at different times, or both, will see different results from the search engines in response to the same queries.